<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Published | Macro Paper Warehouse</title><link>https://macropaperwarehouse.com/status/published/</link><atom:link href="https://macropaperwarehouse.com/status/published/index.xml" rel="self" type="application/rss+xml"/><description>Published</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Thu, 01 Jan 2026 00:00:00 +0000</lastBuildDate><item><title>Adverse Selection and Small Business Finances</title><link>https://macropaperwarehouse.com/papers/adverse-selection-and-small-business-finances/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/adverse-selection-and-small-business-finances/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper asks why small firms hold large quantities of liquid assets — cash and cash equivalents that earn low or negative real returns — even when external credit is available. The conventional answer is a precautionary motive: liquidity buffers the risk of being shut out of credit markets. Liang proposes a second, complementary motive: a signaling motive, whereby firms hold liquid assets specifically to pledge as collateral and credibly signal their repayment ability to lenders, thereby obtaining better loan terms. The empirical backdrop is striking: about 28% of small business assets are cash and cash equivalents (Kauffman Firm Survey 2011 wave); about 7% of commercial business loans are secured by liquid collateral (SSBF 2003); and 43% of small firms sought a commercial business loan in 2020.&lt;/p&gt;
&lt;p&gt;The theoretical framework embeds directed search (Guerrieri, Shimer, and Wright 2010, hereafter GSW) and asymmetric information inside a Lagos-Wright general equilibrium monetary model. There are two types of entrepreneurs — low types (success probability δ_L) and high types (δ_H &amp;gt; δ_L) — who privately know their own type. Bankers post loan contracts specifying a down payment d, loan amount ℓ, and repayment R, and then entrepreneurs direct their search to contracts. Investment opportunities arrive stochastically. Entrepreneurs who fail to match with a banker self-finance from their liquid holdings; this endogenous outside option gives liquidity value and generates a precautionary demand for it. The opportunity cost of holding liquidity equals the policy rate i (equivalently, the inflation rate π).&lt;/p&gt;
&lt;p&gt;The main equilibrium characterization (Proposition 2) shows that as the policy rate rises, the economy passes through four regimes: (1) no participation in the credit market; (2) only high types borrow, no screening needed; (3) both types borrow, bankers screen using down payment only; (4) both types borrow, bankers screen using both down payment and loan approval rate (market tightness). The key distortion is in the extensive margin: under adverse selection with binding incentive constraints, high-type borrowers must pledge more liquid assets (dH = zH &amp;gt; z*_H) and face a tighter loan market (θ_H &amp;lt; θ*_H) than under complete information, but the loan size is undistorted (ℓ_H = ℓ*_H, Proposition 3). Low-type borrowers&amp;rsquo; allocations are never distorted by adverse selection.&lt;/p&gt;
&lt;p&gt;The interest rate pass-through from the policy rate to the real lending rate on high-type loans can be negative (Proposition, Section 4 and Figure 5). With an urn-ball matching function, γ_H (the real lending rate for high types) falls in i when screening is active, even as the aggregate lending rate rises monotonically. With a Cobb-Douglas matching function, lending rates always increase in i. Whether negative pass-through obtains therefore depends on the matching technology.&lt;/p&gt;
&lt;p&gt;Screening intensity — the degree to which high-type borrowers must hold excess liquidity and accept lower loan approval odds — is non-monotone in the low types&amp;rsquo; success probability δ_L (Proposition 4). When δ_L is very small or very close to δ_H, a small down payment suffices. Distortions are largest for intermediate values of δ_L, where the low types have large incentives to misreport but the cost of mimicry is neither trivially high nor trivially low.&lt;/p&gt;
&lt;p&gt;Without the self-finance channel — the endogenous outside option — both the precautionary and signaling motives vanish entirely, and liquid assets become redundant (Proposition 5). Bankers then use only market tightness to screen, which is less costly than using both down payment and approval rate. This result cleanly isolates why self-finance is the structural ingredient making liquidity essential.&lt;/p&gt;
&lt;p&gt;On policy, the competitive equilibrium is generically constrained inefficient when both screening tools are used, because bankers in one submarket do not internalize the externality they impose on the other submarket through the binding incentive constraint. A utilitarian social planner who faces the same information and search frictions can restore the complete information allocation by taxing high types and subsidizing low types, under a sufficient condition (Proposition 6): the high types&amp;rsquo; surplus from borrowing relative to self-finance exceeds the low types&amp;rsquo; net gain from misreporting, scaled by the population ratio and inverse success probability ratio. This condition is more likely to hold when i is large, when there are few low types (small ν_L), or when the low types&amp;rsquo; net gain from misreporting is small. Conversely (Proposition 7), the competitive equilibrium is constrained efficient — and no transfers are needed — if δ_L/δ_H + ν_H/ν_L &amp;lt; 1, which obtains when the low types are very risky (low δ_L) or very numerous (high ν_L), making subsidization costly.&lt;/p&gt;
&lt;p&gt;Empirically, Liang estimates a dynamic panel model of liquidity-to-assets ratios using the Kauffman Firm Survey (KFS), a longitudinal survey of 4,928 new U.S. firms from 2004-2011 (660 in the balanced panel after cleaning). Using a first-difference transformation with Anderson-Hsiao IV (instrumenting lagged differenced liquidity-to-assets with its second lag and differenced liquid collateral with its own lag), the preferred estimate (column 5) shows that firms holding liquid collateral to obtain loans hold on average 19.83% more liquid assets as a share of total assets before the loan application than do comparable firms that pledge illiquid or no collateral. This is treated as evidence for the signaling motive. The precautionary motive is confirmed: firms reporting credit difficulties hold an additional 9.93% of total assets in liquid form, and a one-percentage-point increase in R&amp;amp;D-to-assets (proxy for growth opportunities) is associated with 0.09% higher liquidity-to-assets. The transaction motive is confirmed: a one-percentage-point increase in total assets is associated with 0.09% lower liquidity-to-assets. The tax and agency motives are not statistically significant for small firms.&lt;/p&gt;
&lt;p&gt;A moral hazard extension (Appendix E) relaxes the assumption that banknotes can only be used to purchase capital. When entrepreneurs can divert loan proceeds to consumption (at cost), a third screening tool is added — loan size — and equilibria are more distorted and more likely to be distorted (Propositions 8-10). The threshold i above which two-tool screening kicks in falls, and loan amounts are reduced below the complete information optimum, which does not occur in the baseline.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-papers-core-identification-challenge-in-the-empirical-section-and-how-does-it-address-it"&gt;Q1. What is the paper&amp;rsquo;s core identification challenge in the empirical section, and how does it address it?&lt;/h3&gt;
&lt;p&gt;The main challenge is that the decision to pledge liquid collateral is endogenous to unobserved firm characteristics that also affect liquidity holdings. OLS suffers from omitted variable bias (the lagged liquidity-to-assets ratio is correlated with the error). Fixed effects corrects for firm heterogeneity but introduces Nickell (1981) downward bias in the lagged dependent variable. The first-difference transformation removes fixed effects but creates a mechanical correlation between the differenced lagged liquidity variable and the differenced error. The Anderson-Hsiao IV strategy instruments the differenced lagged liquidity-to-assets with its second lag in levels (column 4) and additionally instruments differenced future liquid collateral with its own lagged difference (column 5), addressing the endogeneity of the collateral-pledging decision. The Cragg-Donald Wald F-statistic is 62.056, exceeding the Stock-Yogo weak instrument threshold of 7.03, supporting instrument relevance.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-signaling-mechanism-in-precise-terms-and-how-does-it-differ-from-leland-pyle-1977"&gt;Q2. What is the signaling mechanism in precise terms, and how does it differ from Leland-Pyle (1977)?&lt;/h3&gt;
&lt;p&gt;In the model, high-type entrepreneurs hold excess liquid assets (beyond what precaution alone requires) and pledge them as down payments on bank loans. Because the precautionary marginal benefit of holding liquid assets is higher for high types (they have better investment projects and thus more to gain from self-financing), the cost of holding the additional liquidity required by a high-type loan contract is lower for high types than for low types. This makes the down-payment requirement a credible separating device: low types will not mimic high types by holding the required level of liquidity because the cost of doing so outweighs the savings on repayment. The marginal benefit of liquidity thus includes both a precautionary term (gain when unmatched) and a signaling term (relaxes the incentive compatibility constraint on low types). Leland-Pyle (1977) also features signaling through self-finance, but obtains a continuum of signaling equilibria. The present model has a unique separating equilibrium because directed search imposes bilateral matching and a capacity constraint on bankers, eliminating the equilibrium multiplicity.&lt;/p&gt;
&lt;h3 id="q3-how-are-the-four-equilibrium-regimes-generated-and-what-determines-which-one-prevails"&gt;Q3. How are the four equilibrium regimes generated and what determines which one prevails?&lt;/h3&gt;
&lt;p&gt;The regime depends on the opportunity cost of holding liquidity i (equivalently, the policy rate) relative to three cutoffs i &amp;lt; i-bar &amp;lt; i-double-bar. At low i, both types prefer self-finance (high net return on liquidity, so the gain from a bank loan is small). As i rises, high types enter the credit market first because they have a larger surplus from obtaining a bank loan; low types follow at a higher cutoff. Once both types are in the market, the incentive compatibility constraint for low types (IC-LH) may or may not bind. When IC-LH is slack, only a small down payment is needed, and the allocation is undistorted (regime 3). When IC-LH binds — at yet higher i because holding large amounts of liquidity becomes even more attractive to misreporting low types as the precautionary value of liquidity falls — bankers must use both down payment and market tightness, distorting the allocation (regime 4). The policy rate thus operates on the outside option, reshaping the credit market structure endogenously.&lt;/p&gt;
&lt;h3 id="q4-why-is-the-loan-size-intensive-margin-undistorted-even-when-the-extensive-margin-market-tightness-and-down-payment-is-distorted"&gt;Q4. Why is the loan size (intensive margin) undistorted even when the extensive margin (market tightness and down payment) is distorted?&lt;/h3&gt;
&lt;p&gt;Once bankers successfully screen out low types using down payment and market tightness, they have no further incentive to distort the loan amount issued upon matching. The first-order condition for loan size in the high-type contract remains δ_H f&amp;rsquo;(ℓ_H) = 1 (Equation 8), which is the complete information optimum. The logic is that down payment and market tightness are the instruments that affect the incentive compatibility constraint, and once these are set at levels that prevent mimicry, the loan size can be set efficiently to maximize surplus from the match. This is a standard feature of competitive screening equilibria in the GSW framework and contrasts with the moral hazard extension, where the loan size is distorted because diversion of funds is possible.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-key-externality-that-makes-the-competitive-equilibrium-constrained-inefficient-and-how-does-the-planner-correct-it"&gt;Q5. What is the key externality that makes the competitive equilibrium constrained inefficient, and how does the planner correct it?&lt;/h3&gt;
&lt;p&gt;Bankers in the high-type submarket post contracts taking the payoff of low-type entrepreneurs (in the low-type submarket) as given. But the low-type payoff enters their incentive compatibility constraint (IC-LH), which governs how much down payment and rationing they must impose. When the planner raises the low-type payoff (by subsidizing low types), the IC-LH constraint relaxes: the low types are already better off and have less incentive to mimic. This allows bankers to offer high types smaller down payments and more loan supply, increasing high-type welfare. If the benefit to high types (lower screening cost) exceeds the tax cost, a Pareto improvement is possible. The planner implements this through type-contingent transfers: taxing bankers who serve high types, subsidizing bankers who serve low types. The planner can internalize the cross-submarket externality because it controls both submarkets simultaneously, whereas competitive bankers each maximize their own submarket&amp;rsquo;s contracts taking the other as given.&lt;/p&gt;
&lt;h3 id="q6-what-is-the-non-monotonicity-of-screening-intensity-in-δ_l-and-what-is-the-intuition"&gt;Q6. What is the non-monotonicity of screening intensity in δ_L, and what is the intuition?&lt;/h3&gt;
&lt;p&gt;Proposition 4 shows that the equilibrium high-type liquidity holding z_H and market tightness θ_H are non-monotone in δ_L (the low type success probability), with a cutoff δ-bar_L. For low δ_L: either the low types are not in the loan market at all, or they would not want to mimic the high types even if the down payment is small, because the precautionary value of holding so much liquidity outside the loan market is very low for low types with poor prospects. As δ_L rises (low types become moderately good), they want to mimic high types more aggressively (higher repayment savings) while the cost of mimicry remains moderate, so down payment and rationing must both be higher. At very high δ_L (low types nearly as good as high types), the types are similar and a small amount of screening suffices again. Distortions peak at intermediate δ_L where the benefit-cost ratio of misreporting for low types is maximized.&lt;/p&gt;
&lt;h3 id="q7-how-does-the-moral-hazard-extension-change-the-results-compared-with-the-baseline"&gt;Q7. How does the moral hazard extension change the results compared with the baseline?&lt;/h3&gt;
&lt;p&gt;In the baseline, banknotes can only purchase capital (observable investment). In the extension (Appendix E), banknotes can also buy consumption goods at unit cost C(χ), introducing dual deviation: a low-type entrepreneur who misreports can both obtain a high-type loan and divert some of the proceeds to consumption. This raises the low types&amp;rsquo; payoff from misreporting (U^mh_LH &amp;gt; U_LH), tightening the incentive constraint. As a result: (i) a third screening tool is deployed — bankers reduce the loan size below the complete information optimum (ℓ^mh_H &amp;lt; ℓ*_H); (ii) the threshold i above which multi-tool screening kicks in is lower (i-double-bar^mh ≤ i-double-bar), so distorted equilibria occur over a larger parameter space; (iii) in the distorted region, allocations are more distorted along all three margins (loan size, liquidity, market tightness). When χ ≤ δ_L/δ_H (the cost of diverting banknotes to consumption is high enough that low types prefer to invest all proceeds), the extension coincides exactly with the baseline.&lt;/p&gt;
&lt;h3 id="q8-how-does-this-paper-relate-to-guerrieri-shimer-and-wright-2010-and-what-does-it-add"&gt;Q8. How does this paper relate to Guerrieri, Shimer, and Wright (2010) and what does it add?&lt;/h3&gt;
&lt;p&gt;GSW show that directed search with adverse selection generates a unique separating equilibrium in which market tightness (loan approval rate) is the dominant screening device, while down payment (liquidity) is not used when the self-finance option is absent. In GSW&amp;rsquo;s setup applied to credit markets, liquid assets are redundant — without an endogenous outside option, there is no precautionary demand and no signaling demand for liquidity (Proposition 5 of this paper). Liang&amp;rsquo;s contribution is to introduce the self-finance channel as an endogenous outside option to the GSW framework. This makes liquidity valuable both outside the credit market (precautionary motive) and inside it (signaling/screening device). The result is that both down payment and market tightness are used as screening instruments in the fully distorted regime, whereas GSW uses only market tightness. This also changes the constrained efficiency analysis: Liang shows that the planner can fully undo adverse selection under certain conditions, a result that does not arise in the vanilla GSW model.&lt;/p&gt;
&lt;h3 id="q9-what-robustness-and-consistency-checks-are-run-in-the-empirical-section"&gt;Q9. What robustness and consistency checks are run in the empirical section?&lt;/h3&gt;
&lt;p&gt;The empirical section runs OLS (column 1), one-way fixed effects (column 2), first-difference transformation OLS (column 3), Anderson-Hsiao IV with one instrument (column 4), and Anderson-Hsiao IV with two instruments (column 5, the preferred specification). The consistency of the lagged liquidity estimator is checked against the Nickell bounds: Bond (2002) recommends the consistent estimate should lie between the OLS and FE estimates (0.4920 and -0.1833); the preferred IV estimate (0.2766) satisfies this. Instrument strength is verified with the Cragg-Donald Wald F-statistic (62.056 vs. threshold 7.03). The paper acknowledges that the liquid collateral coefficient may be biased in either direction: upward if firms that plan to pledge liquid collateral but fail to obtain loans are misclassified as non-signalers, or downward if ineligible firms (with insufficient liquid assets to pledge) are misclassified as non-signalers. The direction of bias is ambiguous, which limits the paper&amp;rsquo;s ability to bound the true signaling motive magnitude.&lt;/p&gt;
&lt;h3 id="q10-what-are-the-policy-implications-and-their-scope-conditions"&gt;Q10. What are the policy implications and their scope conditions?&lt;/h3&gt;
&lt;p&gt;First, the paper recommends cross-subsidization — taxing high-type borrowers and subsidizing low-type borrowers — to restore the complete information allocation when the equilibrium is distorted. This is implementable through type-contingent tax policies on bank loans. The scope condition (Proposition 6) is that the high types&amp;rsquo; net surplus from borrowing must exceed the low types&amp;rsquo; scaled gain from misreporting (Equation 11); this is more likely to hold when i is large (high policy rate), ν_L is small (few low types), or δ_L/δ_H is very small or very close to 1 (extreme types). Second, and more restrictively, if δ_L/δ_H + ν_H/ν_L &amp;lt; 1 (low types are very risky or very numerous), the competitive equilibrium is already constrained efficient and no transfers are needed. Third, on monetary policy: a rise in the policy rate can trigger a transition from an undistorted to a distorted equilibrium, causing welfare to fall. The paper interprets this as a caution against using high policy rates when credit market adverse selection is a concern. The paper also connects to loan guarantee programs (analogous to low-type subsidies), citing Chilean evidence (Cowan et al. 2015) showing that guarantees increase both guaranteed and non-guaranteed credit supply, consistent with the model&amp;rsquo;s cross-submarket externality mechanism.&lt;/p&gt;
&lt;h3 id="q11-what-are-the-main-data-limitations-acknowledged-in-the-empirical-analysis"&gt;Q11. What are the main data limitations acknowledged in the empirical analysis?&lt;/h3&gt;
&lt;p&gt;The KFS records the type of debt collateral only in the last three years of the survey (2009-2011), severely limiting the time dimension for liquid collateral analysis. This prevents the use of GMM estimators (Arellano-Bond 1991) that require different lag instruments across periods. The KFS does not record ex post loan outcomes (interest rates, default rates), so the paper cannot directly test the model&amp;rsquo;s prediction that loans with liquid collateral carry lower interest rates and lower default rates (unlike Berger et al. 2016 using Bolivian data). Loan application outcomes are also not available, preventing a sample restriction to successful applicants, which would resolve one direction of bias in the signaling motive estimator. The liquid collateral variable encompasses all debt types (business loans, credit cards, lines of credit), not only commercial bank loans, which is the model&amp;rsquo;s focus.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Signaling motive for liquidity&lt;/strong&gt;: In the paper&amp;rsquo;s sense: small firms hold liquid assets specifically to satisfy bank down payment requirements, thereby credibly signaling their investment quality (high success probability) to lenders who cannot observe borrower type. This is distinct from the textbook corporate finance definition of signaling; here the signal operates through costly liquid collateral pledged inside the credit contract, not through equity stakes or dividends.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Self-finance channel&lt;/strong&gt;: In the paper&amp;rsquo;s sense: the outside option to bank borrowing, in which an entrepreneur uses accumulated liquid holdings to directly purchase capital and invest when she either fails to match with a banker or prefers not to. The channel is endogenous — its value depends on the entrepreneur&amp;rsquo;s liquidity holdings z and investment success probability δ_j — and is the structural ingredient that makes liquidity valuable both inside and outside the credit market.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Market tightness (θ) as a screening device&lt;/strong&gt;: In the paper&amp;rsquo;s sense: bankers deliberately make high-type loan contracts scarce (low θ_H, i.e., few bankers per entrepreneur in the high-type submarket), reducing the loan approval probability µ(θ_H). Because low types have a lower surplus from obtaining a high-type loan than high types do, they are disproportionately discouraged by a low approval probability. Market tightness is the extensive-margin screening instrument in the GSW framework; this paper adds down payment as a second instrument.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Down payment (d) as inside collateral&lt;/strong&gt;: In the paper&amp;rsquo;s sense: liquid assets pledged at the time of loan application, paid from the entrepreneur&amp;rsquo;s own liquid holdings z. Called &amp;lsquo;inside collateral&amp;rsquo; because the pledged assets (liquidity) are used in financing the project, as opposed to &amp;lsquo;outside collateral&amp;rsquo; (equipment, inventory) not used in the financed project. The down payment is the intensive-margin screening instrument; high types pledge d_H = z_H, their full liquid holdings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Constrained efficiency with adverse selection&lt;/strong&gt;: In the paper&amp;rsquo;s sense: the best allocation achievable by a social planner who faces the same information asymmetry (types are private) and the same search frictions as agents, and who maximizes a welfare-weighted sum of entrepreneur payoffs subject to incentive compatibility, participation, and budget balance constraints. The paper shows the competitive equilibrium may fail constrained efficiency due to a cross-submarket externality not internalized by individual bankers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dual deviation (moral hazard extension)&lt;/strong&gt;: In the paper&amp;rsquo;s sense (Appendix E): when loan proceeds (banknotes) can be used to purchase consumption goods as well as capital, a low-type entrepreneur who misreports her type faces two deviation margins — misreporting her type (adverse selection) and diverting loan proceeds to consumption rather than investment (moral hazard). Dual deviation raises the low types&amp;rsquo; payoff from mimicry and forces bankers to add loan size as a third screening tool, at the cost of an inefficiently small loan.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Opportunity cost of liquidity (i) and regime transitions&lt;/strong&gt;: In the paper&amp;rsquo;s sense: i = 1/(β(1+r_z)) − 1, the per-period cost of holding one unit of liquid assets, which equals the inflation rate π in steady state. As i increases, it simultaneously raises the self-finance outside option (liquidity becomes a better investment channel) and affects the low types&amp;rsquo; incentive to mimic high types, triggering discrete transitions between four equilibrium regimes from no credit market participation through increasingly distorted screening configurations.&lt;/p&gt;</description></item><item><title>An Analytical Model of Behavior and Policy in an Epidemic</title><link>https://macropaperwarehouse.com/papers/an-analytical-model-of-behavior-and-policy-in-an-epidemic/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/an-analytical-model-of-behavior-and-policy-in-an-epidemic/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper builds a tractable, fully analytical version of the workhorse macro-epidemiology (&amp;ldquo;econ-epi&amp;rdquo;) model and uses it to characterize how susceptible individuals behave during a deadly epidemic, how a social planner would have them behave, and the externality that separates the two. The motivation is that prior macro-SIR results came almost entirely from numerical simulation; a closed-form treatment can expose general insights those simulations missed and provide a transparent benchmark for any future epidemic. The model appends the standard Kermack-McKendrick SIR system (susceptible S, infected I, recovered R, deceased D, with transmission rate β, recovery rate γr, death rate γd, and γ := γr + γd) with forward-looking agents who choose an activity level λ ∈ [0,1] that scales transmission via β = βa·λ + βo. The single key modeling departure is LINEAR (rather than convex) costs of mitigation, microfounded by indivisible activity choices in the spirit of Rogerson (1988); this makes the optimal control bang-bang or singular and yields closed-form solutions. Three constants organize the analysis: the herd immunity threshold S̄ := γ/β, the basic reproduction number R0 := 1/S̄, and the infection fatality rate IFR := γd/γ. A central composite statistic is the cost-benefit ratio of mitigation κ := (uW − uL)/(βa·IFR·VSL), where VSL := uW/ρ is the value of statistical life in utility terms.\n\nMain results. (1) Decentralized equilibrium (Proposition 1): there is no mitigation at the very start and the very end of the epidemic; mitigation occurs only over an interval [t0, t1). Susceptibles begin mitigating just below full susceptibility, the infection rate peaks exactly at t0 (when precautions are greatest), and from then on the effective reproduction number sits slightly below one, producing a gently declining infection path — a pattern the author notes is broadly consistent with first-wave Covid-19 data. The equilibrium infection trajectory is approximated by the simple ray I(t) ≈ (S(t)/S̄)·κ, and the equilibrium steady-state susceptibility is S∞ ≈ S̄ − S̄·√(2κR0). A higher κ and lower S̄ both reduce mitigation and raise infections (a &amp;ldquo;fatalism effect&amp;rdquo;). (2) Socially optimal behavior (Propositions 2-3): optimal policy is bang-bang (λ* ∈ {0,1}) — no mitigation at start and end, full mitigation in a single intermediate interval. The planner &amp;ldquo;holds fire,&amp;rdquo; lets infections climb high, then imposes maximal restrictions late, driving the system quickly to herd immunity. The optimal long-run susceptibility is S∞* ≈ S̄ − S̄·2κR0/(κR0 − 1)². (3) The externality: contrary to the conventional view, susceptibles&amp;rsquo; privately optimal behavior is EXCESSIVELY cautious — the equilibrium infection rate lies below the optimal infection rate for any S above herd immunity — yet cumulative deaths are HIGHER in equilibrium than under the planner. Mitigation by susceptibles mostly substitutes infection risk intertemporally (&amp;ldquo;flattening the curve also makes it fatter&amp;rdquo;); beyond eliminating epidemic overshoot it cannot prevent the inevitable share 1 − S̄ from being infected. The planner&amp;rsquo;s late-strong-short lockdown comes close to implementing a lottery that randomly selects who gets sick.\n\nImplications. Because the externality runs in the opposite direction to standard intuition, optimal policy can call for the government to INCREASE interaction (the paper cites the UK&amp;rsquo;s 2020 &amp;ldquo;Eat Out To Help Out&amp;rdquo; subsidy as an analogue). Results are framed as technical/foundational insights, not direct prescriptions: the benchmark abstracts from reinfection, variants, vaccines/cures, healthcare capacity limits, and endogenous IFR, all of which can shift specific recommendations while leaving the underlying forces intact.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-identification-or-solution-strategy-and-what-makes-the-analytical-characterization-possible"&gt;Q1. What is the &amp;lsquo;identification&amp;rsquo; or solution strategy, and what makes the analytical characterization possible?&lt;/h3&gt;
&lt;p&gt;This is a theory paper, so the relevant strategy is solving the dynamic optimization analytically rather than empirically. The enabling assumption is LINEAR costs of mitigation (instantaneous utility u = λ·uW + (1−λ)·uL), microfounded by indivisible activity choices as in Rogerson (1988), where λ is the probability of being active in a mixed-strategy equilibrium. Linearity makes the current-value Hamiltonian linear in the control λ, so the optimal control is bang-bang or singular with switching function ψ(t) := uW − uL − (ηs(t) − ηi)·βa·I(t). This permits closed-form characterization of switching points and trajectories. The main &amp;rsquo;threat&amp;rsquo; the author addresses is generality: does linearity drive the conclusions? Section VI shows numerically that convex costs (U = uL + λ^(1−α)·(uW − uL), with α the convexity degree) merely smooth out the kinks and corners without changing qualitative features — passing what the author calls the &amp;lsquo;Solow test.&amp;rsquo;&lt;/p&gt;
&lt;h3 id="q2-what-is-the-core-economic-mechanism-behind-excessive-caution-and-the-two-ways-the-paper-frames-the-externality"&gt;Q2. What is the core economic mechanism behind &amp;rsquo;excessive caution,&amp;rsquo; and the two ways the paper frames the externality?&lt;/h3&gt;
&lt;p&gt;In equilibrium, the singular-control optimality condition equates a constant marginal cost of mitigation (uW − uL) to a marginal benefit (ηs(t) − ηi)·βa·I(t). The shadow value of being susceptible ηs(t) rises over time (cumulative future infection risk and cumulative future mitigation effort both decline as the epidemic progresses), while ηi is constant. To keep the equation balanced, βa·I(t) must fall, so agents become more cautious over time. First framing of the externality: the planner recognizes that at least 1 − S̄ of the population must eventually be infected (and a share IFR of those die); individuals recognize this too (perfect foresight) but each wants to avoid being in the infected group, so they over-mitigate, merely delaying rather than preventing infections. Second framing: stronger mitigation today lowers near-term infections but raises later infections — &amp;lsquo;flattening the curve also makes it fatter&amp;rsquo; — so beyond removing overshoot, mitigation only substitutes infection risk intertemporally. The planner internalizes the whole time path; individuals take the aggregate infection rate as given.&lt;/p&gt;
&lt;h3 id="q3-why-is-the-optimal-lockdown-late-strong-and-short-rather-than-gradual"&gt;Q3. Why is the optimal lockdown &amp;rsquo;late, strong, and short&amp;rsquo; rather than gradual?&lt;/h3&gt;
&lt;p&gt;From the planner&amp;rsquo;s law of motion, the velocity Ṡ/S is proportional to I. An interior λ would lower instantaneous costs proportionately but increase the duration of mitigation more than proportionately (since both λ and I are lower), so gradualism is dominated. This makes optimal policy bang-bang with a single interval of maximal restriction. The planner therefore holds fire, lets I climb high (where the system moves fast), then imposes λ=0 to drive the trajectory quickly to herd immunity — minimizing cumulative deaths at minimum cost rather than flattening the curve.&lt;/p&gt;
&lt;h3 id="q4-how-do-equilibrium-and-optimal-cumulative-deaths-compare-and-why-does-the-more-cautious-equilibrium-produce-more-deaths"&gt;Q4. How do equilibrium and optimal cumulative deaths compare, and why does the more cautious equilibrium produce MORE deaths?&lt;/h3&gt;
&lt;p&gt;Cumulative deaths equal IFR·(1 − S∞). The equilibrium steady-state susceptibility S∞ ≈ S̄ − S̄·√(2κR0) lies below the planner&amp;rsquo;s S∞* ≈ S̄ − S̄·2κR0/(κR0 − 1)², meaning the equilibrium overshoots herd immunity by more, so 1 − S∞ (cumulative infections) and hence deaths are higher in equilibrium. The equilibrium&amp;rsquo;s caution lowers the infection rate at each S above herd immunity and stretches the epidemic out (raising economic cost), but does not prevent the inevitable infections and in fact allows more overshoot than the planner&amp;rsquo;s quick-to-herd-immunity strategy. Cumulative death toll is increasing in R0 and in κ.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-role-of-the-cost-benefit-ratio-κ-and-the-fatalism-effect"&gt;Q5. What is the role of the cost-benefit ratio κ and the &amp;lsquo;fatalism effect&amp;rsquo;?&lt;/h3&gt;
&lt;p&gt;κ := (uW − uL)/(βa·IFR·VSL) combines preferences, epidemiology, and policy effectiveness: the numerator is the utility cost of mitigation; the denominator is the benefit (lower activity reduces transmission by βa, preventing deaths by IFR, each life worth VSL = uW/ρ). A higher κ lowers mitigation and raises the equilibrium infection rate, starts mitigation later (lower S(t0)), and raises cumulative deaths. The &amp;lsquo;fatalism effect&amp;rsquo; has two parts: a lower S̄ (greater lifetime chance of falling ill) dissuades mitigation today; and the high expected cumulative future mitigation effort at the epidemic&amp;rsquo;s start lowers the value of staying alive, further tempering precaution. The simple approximation I(t) ≈ (S(t)/S̄)·κ captures the first part but omits the second.&lt;/p&gt;
&lt;h3 id="q6-what-is-the-practical-back-of-the-envelope-contribution"&gt;Q6. What is the practical &amp;lsquo;back-of-the-envelope&amp;rsquo; contribution?&lt;/h3&gt;
&lt;p&gt;The paper provides a recipe to trace the equilibrium epidemic path without solving the full dynamic model: (1) compute the thresholds S(t0) ≈ 1 − κ/(√(2κR0)·(1−S̄))·S̄(1−S̄), S(t1) ≈ S̄ − ρ/(βo + βa), and S∞ ≈ S̄ − S̄·√(2κR0); (2) plot the ray I = (S/S̄)·κ between the thresholds; (3) splice it on both sides with the no-mitigation (λ=1) trajectory I = −S + S̄·log S + C0. This rivals running the naive SIR model in simplicity but is grounded in optimizing behavior, giving a more plausible benchmark for human populations. The author intends it for forecasting any future epidemic.&lt;/p&gt;
&lt;h3 id="q7-how-do-the-results-relate-to-and-differ-from-prior-numerical-econ-epi-work"&gt;Q7. How do the results relate to and differ from prior numerical econ-epi work?&lt;/h3&gt;
&lt;p&gt;The equilibrium characterization is qualitatively consistent with Farboodi et al. (2021) — little mitigation at the start, then a jump keeping the effective reproduction number just below 1 — the only difference being their path is smoother due to convex costs. Eichenbaum-Rebelo-Trabandt (2021) get a qualitatively different, still hump-shaped equilibrium infection path because in their calibration mitigation is too weak to push the effective reproduction number below 1 (so βo is not &amp;lsquo;sufficiently low&amp;rsquo;). For the planner, the paper&amp;rsquo;s late-strong-short lockdown differs from work finding early/strong responses (Farboodi et al.) or intermediate restrictions (Alvarez et al. 2021; Eichenbaum et al. 2021), for two reasons: (1) this model rules out suppression/vaccine arrival as a feasible endgame, whereas papers allowing vaccine arrival find early strong suppression optimal; (2) the planner here controls only susceptibles&amp;rsquo; behavior with linear costs, whereas broader instruments and convex costs make intermediate restrictions more attractive. The paper is, to the author&amp;rsquo;s knowledge, the first to derive equilibrium and optimal behavior fully analytically and to show the susceptibles&amp;rsquo; externality makes the infection rate too LOW socially.&lt;/p&gt;
&lt;h3 id="q8-what-do-the-costate-shadow-value-dynamics-reveal"&gt;Q8. What do the costate (shadow-value) dynamics reveal?&lt;/h3&gt;
&lt;p&gt;The private value of infection ηi = (uI + (γr/ρ)·uW)/(ρ+γ) is time-invariant (payoffs while ill/recovered/dead don&amp;rsquo;t depend on timing). The social value of an infected person η&lt;em&gt;i is time-varying because the planner internalizes onward transmission via a (η&lt;/em&gt;i − η&lt;em&gt;s)(βaλ&lt;/em&gt; + βo)S* term. η&lt;em&gt;i is deeply negative at the epidemic&amp;rsquo;s start (diverging as I→0, because an infinitesimal seed inflicts unboundedly large relative damage), rises sharply and roughly tracks the private value during the bulk of the epidemic (e.g. when S ∈ [0.5, 0.9]), and settles just above zero in the long run. In the long run the social value of an additional infected person can even be negative when γd is high, because the value of that person&amp;rsquo;s life is below the welfare loss from infections they spread. The social value of a susceptible η&lt;/em&gt;s is always below the private value (except converging to uW/ρ in the long run), reflecting unpriced future contagion.&lt;/p&gt;
&lt;h3 id="q9-what-robustnessextension-checks-does-the-paper-run"&gt;Q9. What robustness/extension checks does the paper run?&lt;/h3&gt;
&lt;p&gt;Section VI: (1) Convex costs (numerical, α=0.3) smooth kinks but preserve qualitative features. (2) Broader planner instruments — controlling susceptibles AND infected (without distinguishing them), or restricting everyone identically — are &amp;lsquo;double-edged&amp;rsquo;: more costly (especially late when many are recovered) but more effective because they also restrict the infected; effectiveness gains peak at intermediate restrictions (around λ=1/2) due to the quadratic contact function, which makes intermediate restrictions and earlier/longer lockdowns more attractive, moving results toward Alvarez et al. (2021). Section VII discusses healthcare/ICU capacity constraints (optimal to hold infections at the capacity level until near herd immunity; endogenous IFR brings equilibrium and optimal paths closer but doesn&amp;rsquo;t change the externality&amp;rsquo;s nature), feasible suppression (optimal policy becomes a discrete choice between herd-immunity and best suppression strategy; equilibrium behavior is largely insensitive to suppression feasibility), and temporary immunity/endemicity (strengthens the fatalism effect, raising equilibrium infections; optimal policy still rushes to steady state, now also to avoid costly multiple waves).&lt;/p&gt;
&lt;h3 id="q10-what-is-the-calibration-used-for-the-figures-and-is-it-meant-to-be-quantitatively-serious"&gt;Q10. What is the calibration used for the figures, and is it meant to be quantitatively serious?&lt;/h3&gt;
&lt;p&gt;The calibration resembles Covid-19 but is explicitly illustrative, not a serious quantitative calibration. A model period is a week. Epidemiological parameters: βo = 0.7, βa = 1.24, γr = 0.77, γd = 0.0078, implying R0 = 2.5, S̄ = 0.4, IFR = 1%, and average disease duration of 9 days; under full mitigation (λ=0) R0 falls to 0.9. Annual discount rate is 4% (weekly ρ = 0.96^(−1/52) − 1). Utility is logarithmic; weekly consumption is $60,000/52 ≈ $1,250 so uW = log(1250) ≈ 7; full lockdown cuts consumption 20%, giving uL = 6.6, (uW − uL)/uL = 3.2%. With VSL = $10 million, κ = 0.002 (0.2%).&lt;/p&gt;
&lt;h3 id="q11-what-are-the-key-caveats-and-the-scope-of-the-policy-implications"&gt;Q11. What are the key caveats and the scope of the policy implications?&lt;/h3&gt;
&lt;p&gt;The author stresses the model is a stripped-down BENCHMARK: no reinfection, no variants, constant IFR, no cure or vaccine (so herd immunity pins down minimum feasible deaths). Specific results are &amp;rsquo;technical contributions, not direct normative prescriptions.&amp;rsquo; The striking implication that a planner might subsidize interaction (forcing susceptibles to interact, since optimal activity sometimes exceeds equilibrium activity) faces an implementability problem — restricting activity is easier than increasing it. The herd-immunity-quick strategy ceases to be optimal once suppression is feasible (vaccine/cure expected), ICU constraints bind with endogenous IFR, or immunity is only temporary; but the underlying forces (the susceptibles&amp;rsquo; intertemporal infection-substitution externality) continue to operate in all these richer settings.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Herd immunity threshold (S̄)&lt;/strong&gt;: S̄ := γ/β, the level of susceptibility below which the infected pool shrinks; in this model, because there is no cure or vaccine, it pins down the minimum feasible deaths and is the endgame both equilibrium and planner converge toward.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cost-benefit ratio of mitigation (κ)&lt;/strong&gt;: κ := (uW − uL)/(βa·IFR·VSL), a composite statistic combining preferences, epidemiology, and policy effectiveness; the numerator is the utility cost of mitigation and the denominator the benefit (transmission reduction βa times deaths averted IFR times value of statistical life). Higher κ means less mitigation and more infections.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Excessive caution / susceptibles&amp;rsquo; externality&lt;/strong&gt;: The paper&amp;rsquo;s central finding that privately optimal mitigation by susceptibles is too cautious socially — the equilibrium infection rate lies below the optimal rate for any S above herd immunity — because each individual wants to avoid being in the inevitable infected share, merely substituting infection risk intertemporally rather than preventing it; the conventional one-way infected-spreader externality view is therefore incomplete.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Linear costs of mitigation / singular control&lt;/strong&gt;: The assumption (microfounded by indivisible activity choices à la Rogerson 1988) that utility is linear in activity λ, making the Hamiltonian linear in the control so the optimum is bang-bang or singular; this delivers sharp closed-form solutions whose intuitions survive under convex costs (the &amp;lsquo;Solow test&amp;rsquo;).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Late-strong-short lockdown&lt;/strong&gt;: The socially optimal policy in this benchmark: hold fire while infections climb high, then impose maximal restrictions (λ=0) in a single intermediate interval that quickly drives the system to herd immunity — minimizing cumulative deaths at minimum cost rather than flattening the curve.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Costates (ηs, ηi)&lt;/strong&gt;: Shadow values of being in the susceptible and infected states. ηi (private) is constant since the payoffs of being ill are timing-independent; the planner&amp;rsquo;s η*i is time-varying because it internalizes onward transmission and can even be negative in the long run when the death rate is high.&lt;/p&gt;</description></item><item><title>Central Banks as Dollar Lenders of Last Resort: Implications for Regulation and Reserve Holdings</title><link>https://macropaperwarehouse.com/papers/central-banks-as-dollar-lenders-of-last-resort-implications-for-regulation-and-reserve-holdings/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/central-banks-as-dollar-lenders-of-last-resort-implications-for-regulation-and-reserve-holdings/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper investigates why non-U.S. central banks accumulate large holdings of dollar-denominated foreign exchange reserves, focusing on a previously under-emphasized motive: the currency mismatch of private-sector non-financial firms. When domestic firms borrow heavily in dollars despite having predominantly local operating revenues, the central bank faces potential liability as a dollar lender of last resort (DOLLR) in the event of a banking crisis coinciding with a dollar appreciation. The paper combines motivating empirical evidence with a formal theoretical model to analyze the optimal policy mix between ex ante financial regulation (bank capital requirements) and ex post reserve accumulation, and then extends the model to characterize global externalities arising from decentralized reserve-holding decisions.&lt;/p&gt;
&lt;p&gt;The empirical work uses an unbalanced panel of 52 non-U.S., non-Eurozone countries (excluding Hong Kong as an extreme outlier) with 357 observations covering 2013-2020. The sample includes 12 advanced economies, 29 emerging economies, and 11 developing economies. The key dependent variable is central bank dollar reserves as a share of GDP; the key right-hand-side variable is cross-border dollar-denominated bank loans to non-financial corporations (NFC), also as a share of GDP, drawn from BIS Locational Banking Statistics. Because banks tightly offset their own currency exposures (dollar assets and liabilities correlate at 0.965 in the panel), the relevant mismatch resides on NFC balance sheets, not bank balance sheets. Cross-border NFC dollar lending proxies for total NFC dollar lending, with correlations of 0.66 overall, 0.89 for advanced economies, and 0.73 for emerging economies in the 21-country subsample where total data are available.&lt;/p&gt;
&lt;p&gt;In the full 53-country univariate regression including Hong Kong, the R-squared is 0.53 and the slope coefficient is 5.3 (t-statistic 7.6): a one-percentage-point increase in NFC dollar loans to GDP is associated with a 5.3-percentage-point increase in dollar reserves to GDP. Excluding Hong Kong, the R-squared falls to 0.083 and the slope to 1.3 (t-statistic 2.5). Splitting by income group, the relationship holds for advanced economies (coefficient 3.7, t-statistic 2.2, R-squared 0.31) and emerging economies (coefficient 2.4, t-statistic 2.5, R-squared 0.18) but is absent and wrongly signed for developing economies. Panel regressions with standard reserve-accumulation controls (M2/GDP, financial openness, bilateral trade with the U.S., GDP per capita, log population) and country fixed effects leave the key coefficient broadly stable and significant at the 5% level for both advanced and emerging economies.&lt;/p&gt;
&lt;p&gt;The theoretical framework models a two-period small open economy in which households have an exogenous preference for dollar-denominated safe assets (capturing the dollar&amp;rsquo;s special status), banks intermediate between these households and a fixed investment project, and banking crises occur with probability q. When the home currency depreciates, currency-mismatched NFC borrowers incur liquidity costs that are quadratic in the share of dollar funding; these costs flow through to the banking system. The central bank can respond with two instruments: (i) accumulate dollar reserves R$ at a carrying cost equal to the dollar-domestic interest rate spread S; (ii) impose capital requirements, which crowd out home-currency deposits but cannot directly control dollar deposits (since mismatch resides off the bank balance sheet in the NFC sector). The optimal level of dollar reserves is decreasing in S and increasing in the fraction of failing banks’ dollar liabilities (pB$). When banking crises and exchange rate depreciations are correlated — as is empirically documented — dollar reserves serve an additional hedging function, because the central bank is more likely to need dollar liquidity precisely when the dollar is strong.&lt;/p&gt;
&lt;p&gt;The paper’s primary normative contribution is to show that decentralized central banks over-accumulate reserves relative to a global planner’s optimum. Each central bank, acting as a price-taker in the market for safe dollar assets, ignores that its own reserve hoarding reduces the global supply of dollar-denominated safe assets, driving down the dollar interest rate. A lower dollar rate, in turn, widens the dollar-domestic rate spread S and makes dollar borrowing more attractive to NFCs, amplifying the very mismatch the reserves are supposed to hedge. A global planner internalizes this feedback and therefore prefers lower reserve accumulation combined with tighter capital requirements. This result (Proposition 1) holds for all values of the households’ discount factor beta above a threshold that is shown to be below zero under the natural condition that reserve holdings do not exceed the supply of safe dollar assets — meaning the proposition holds robustly for any realistic calibration, including in extensive numerical experimentation where the threshold never exceeds 0.5. In the paper’s global numerical example, the global planner’s equilibrium has dollar reserves fall from 54.62 to 27.99, capital requirements rise from K=7.61 to K=23.77, dollar borrowing B$ fall from 59.99 to 42.98, and the interest-rate spread S narrow by approximately one percentage point, relative to the decentralized outcome. The welfare decomposition shows that bank profits decline but are more than offset by gains in household utility from dollar deposits and reductions in carrying costs, taxation deadweight costs, and liquidity costs from mismatch.&lt;/p&gt;
&lt;p&gt;A further extension examines global risk-sharing. When banking crises are imperfectly correlated across countries, a supranational pooling of reserves (e.g., through the IMF) allows reserves to be reallocated ex post to countries in crisis, reducing total required reserve holdings. This risk-sharing motive reinforces the case for international coordination but raises additional institutional challenges around moral hazard and monitoring. The paper concludes that, analogously to the Basel process for capital regulation, an international coordination mechanism for reserve holdings would be globally welfare-improving, but this potential benefit is less widely recognized.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-papers-core-empirical-identification-strategy-and-what-are-the-main-limitations"&gt;Q1. What is the paper’s core empirical identification strategy and what are the main limitations?&lt;/h3&gt;
&lt;p&gt;The empirical strategy is correlational: the paper regresses central bank dollar reserves (as a share of GDP) on cross-border NFC dollar loans (as a share of GDP) in a panel of 52 countries over 2013-2020, progressively adding controls (M2/GDP, financial openness, bilateral trade with the U.S., GDP per capita, log population, nominal exchange rate) and country fixed effects. The authors are explicit that the regressions cannot establish causality and should be interpreted as suggestive motivating patterns rather than tight causal tests. The main data limitation is that the BIS only provides complete cross-border NFC dollar lending data, not total (cross-border plus local) NFC dollar lending; total data are available for only 21 countries (10 advanced, 11 emerging), and the correlation between the two measures is 0.66 overall (0.89 advanced, 0.73 emerging). Additionally, dollar-denominated bond-market borrowing by NFCs is excluded. The paper also cannot cleanly separate dollar borrowing by exporters (who are naturally hedged) from dollar borrowing by purely domestic non-tradable firms (who are genuinely mismatched).&lt;/p&gt;
&lt;h3 id="q2-what-is-the-mechanism-through-which-reserve-accumulation-creates-a-global-externality"&gt;Q2. What is the mechanism through which reserve accumulation creates a global externality?&lt;/h3&gt;
&lt;p&gt;Central banks collectively purchase large quantities of dollar-denominated safe assets (e.g., U.S. Treasuries). Each individual central bank takes the dollar interest rate as given (price-taking assumption) and does not account for the effect of its own purchases on the aggregate supply of dollar safe assets in global markets. In the global equilibrium, however, central bank reserve accumulation reduces the net supply of dollar safe assets available to private households, pushing up dollar asset prices and lowering the dollar interest rate. A lower dollar interest rate narrows the dollar-domestic rate spread S, making dollar borrowing cheaper for NFCs, and therefore encouraging greater currency mismatch of private-sector liabilities. This increased mismatch is the very risk that motivated reserve accumulation in the first place, creating a self-defeating dynamic: decentralized reserve hoarding amplifies the aggregate fragility it seeks to hedge. The global planner internalizes this feedback and prefers less reserve accumulation to let the dollar interest rate remain higher, which discourages NFC dollar borrowing even without direct regulatory control over the NFC funding mix.&lt;/p&gt;
&lt;h3 id="q3-what-roles-do-capital-requirements-and-funding-mix-regulation-play-in-the-model-and-how-do-they-differ"&gt;Q3. What roles do capital requirements and funding-mix regulation play in the model, and how do they differ?&lt;/h3&gt;
&lt;p&gt;Capital requirements (equity capital mandates) act by crowding out home-currency bank deposits; they do not directly affect dollar deposits because the interior optimum for dollar borrowing by banks is independent of total deposit funding in the baseline model without crisis-exchange rate correlation. Thus in the baseline model, capital requirements do not change dollar borrowing and do not change optimal reserve holdings. When banking crises and exchange rate depreciations are positively correlated, however, capital requirements that reduce total deposits (both home-currency and dollar) do reduce optimal reserve holdings, because holding dollar reserves hedges the need to bail out both types of deposits when crises concentrate in strong-dollar states. Funding-mix regulation (direct control over the proportion of dollar versus home-currency deposits) more directly reduces dollar mismatch and allows the central bank to cut reserves substantially further. In the numerical example with capital-only regulation, reserves fall from 56.9 to 54.6; with both capital and funding-mix regulation, reserves fall to 38.5. The paper notes, however, that funding-mix regulation is unlikely to be empirically relevant because currency mismatch resides predominantly on NFC balance sheets outside the regulatory perimeter, not on bank balance sheets.&lt;/p&gt;
&lt;h3 id="q4-under-what-conditions-does-the-global-planner-prefer-more-reserves-than-the-decentralized-outcome-the-wrong-way-effect"&gt;Q4. Under what conditions does the global planner prefer more reserves than the decentralized outcome (the ‘wrong-way’ effect)?&lt;/h3&gt;
&lt;p&gt;There is one channel through which a global planner might want more reserves than individual central banks: by holding more reserves, the planner would depress the dollar interest rate and thereby increase bank profitability (banks can borrow cheaply in dollars and earn the spread). This ‘wrong-way’ bank-profit effect is captured by the term (Q$ - beta) in the global planner’s first-order condition and grows when the spread between the cost of equity capital and the dollar deposit rate is large — i.e., when beta (the discount factor, or equivalently the inverse of the gross cost of equity) is very low. Proposition 1 establishes that the global planner prefers fewer reserves than the decentralized outcome for all beta above a threshold beta-hat. Under the natural constraint that reserves cannot exceed the total supply of dollar Treasury securities, beta-hat is shown to be negative, meaning the global-planner-prefers-fewer-reserves result holds for all positive values of beta. In extensive numerical experimentation, the threshold was never found to exceed 0.5, implying that the wrong-way effect would only dominate if the cost of equity capital exceeded 100% — an implausible calibration.&lt;/p&gt;
&lt;h3 id="q5-how-does-the-paper-handle-the-correlation-between-banking-crises-and-exchange-rate-depreciations"&gt;Q5. How does the paper handle the correlation between banking crises and exchange rate depreciations?&lt;/h3&gt;
&lt;p&gt;The baseline model assumes crisis probability is independent of the exchange rate. The paper then extends to allow a positive correlation: the probability of a banking crisis rises to (q + h) when the home currency depreciates (dollar strengthens) and falls to (q - h) when it appreciates. This setup nests the baseline as h = 0. With h &amp;gt; 0, two new effects arise. First, dollar borrowing by banks increases because their effective cost of dollar debt is reduced by the implicit put option they have when the dollar appreciates: they default more in the appreciation state, and dollar depositors bear losses. Second, the central bank’s optimal reserve holdings increase substantially, because holding dollars hedges not only future dollar-denominated bailout costs but also home-currency-denominated bailout costs (since crises cluster in dollar-appreciation states where home-currency deposits are worth less in dollars). The formula for optimal reserves gains an additional term proportional to (ph/qz)(Bh + B$) — meaning total bank deposits, not just dollar deposits, now motivate reserve holdings. In this richer environment, any capital regulation that reduces total bank deposits will also reduce optimal reserve holdings, which was not true in the baseline.&lt;/p&gt;
&lt;h3 id="q6-what-does-the-risk-sharing-extension-section-5-contribute"&gt;Q6. What does the risk-sharing extension (Section 5) contribute?&lt;/h3&gt;
&lt;p&gt;Section 5 asks what happens when banking crises are imperfectly correlated across countries, creating scope for risk-pooling. The paper reverts to h = 0 (no exchange rate-crisis correlation) and an inelastic dollar safe asset supply (theta_$2 = 0) to isolate the risk-sharing effect. If a mass q of countries experience crises independently each period, and a supranational institution (like the IMF) can hold a common pool of reserves and allocate them to countries in crisis, then each dollar of pooled reserves provides 1/q times the crisis coverage of a dollar held at the individual-country level. This multiplier means the total required pool of reserves is dramatically smaller: optimal pooled reserves scale with pqB$ rather than pB$. However, the carrying-cost term in the FOC is also reduced by q^2, which partly offsets the coverage multiplier. For empirically relevant small values of the interest-rate spread S, the coverage effect dominates and pooled reserves are substantially lower than individual-country reserves. The extension reinforces the paper’s main message — international coordination reduces required reserve holdings — but also highlights additional institutional challenges: pooling requires the supranational institution to be able to reallocate reserves away from countries not currently in crisis, raising serious moral hazard and monitoring issues.&lt;/p&gt;
&lt;h3 id="q7-how-does-this-paper-relate-to-bocola-and-lorenzoni-2020-and-what-is-the-key-theoretical-distinction"&gt;Q7. How does this paper relate to Bocola and Lorenzoni (2020), and what is the key theoretical distinction?&lt;/h3&gt;
&lt;p&gt;Bocola and Lorenzoni (2020) is the closest antecedent: it also models reserve accumulation as driven by currency mismatch in the private sector and the central bank’s role as a dollar lender of last resort. The current paper’s key additions are: (i) it explicitly introduces financial regulation (capital requirements, and hypothetically funding-mix regulation) as an alternative or complementary tool to reserve accumulation, showing how the optimal mix depends on the carrying cost of reserves relative to the welfare cost of stringent regulation; (ii) it develops the global externality argument — that decentralized reserve accumulation depresses the dollar rate and thereby endogenously exacerbates the mismatch the reserves are intended to hedge — and shows that a global planner prefers a different mix (more regulation, fewer reserves); and (iii) it provides explicit cross-country empirical evidence linking central bank dollar reserve holdings to NFC dollar borrowing to motivate the mechanism.&lt;/p&gt;
&lt;h3 id="q8-how-does-this-paper-relate-to-the-literature-on-mercantilist-versus-precautionary-motives-for-reserve-accumulation"&gt;Q8. How does this paper relate to the literature on ‘mercantilist’ versus ‘precautionary’ motives for reserve accumulation?&lt;/h3&gt;
&lt;p&gt;The paper classifies its motive as falling within the broad ‘precautionary’ view, alongside the sudden-stops literature and the banking-system flight-to-dollar-assets literature (Obstfeld, Shambaugh and Taylor 2010, who use M2/GDP as their key proxy). The paper differs from M2-based frameworks by focusing specifically on corporate-sector dollar mismatch rather than the risk of domestic depositor flight. The paper distinguishes itself from the mercantilist view (Dooley et al. 2003; Aizenman and Lee 2010; Benigno and Fornaro 2012), which attributes reserve accumulation to exchange rate management and trade surplus recycling. The normative contribution also relates to Fanelli and Straub (2021), who find that individual countries over-accumulate reserves relative to a global planner; however, that paper’s mechanism is mercantilist (exchange rate stabilization) whereas this paper’s is precautionary (dollar LOLR).&lt;/p&gt;
&lt;h3 id="q9-how-does-this-paper-connect-to-the-literature-on-international-coordination-of-financial-regulation"&gt;Q9. How does this paper connect to the literature on international coordination of financial regulation?&lt;/h3&gt;
&lt;p&gt;The paper shares with Clayton and Schaab (2022) the conclusion that countries acting individually impose insufficiently stringent capital requirements relative to the global optimum, motivating the Basel Process of international regulatory cooperation. However, the paper argues that even if capital regulation is fully coordinated internationally, this is not sufficient to achieve the global optimum — there additionally needs to be a separate mechanism to restrain reserve accumulation, because excess reserve holding depresses the dollar interest rate and exacerbates corporate dollar mismatch through a general-equilibrium channel that capital regulation alone cannot offset. The paper thus identifies reserve coordination as a distinct policy dimension that has received less policy attention than capital coordination.&lt;/p&gt;
&lt;h3 id="q10-why-are-eurozone-countries-excluded-from-the-empirical-sample"&gt;Q10. Why are Eurozone countries excluded from the empirical sample?&lt;/h3&gt;
&lt;p&gt;Eurozone member countries benefit from either explicit or implicit ECB support in dollar markets. Measuring dollar reserve holdings at the individual country level (e.g., on the Bank of Italy’s balance sheet) and relating them to that country’s corporate-sector dollar borrowing would be conceptually misleading, because the relevant backstop is the ECB at the union level rather than the national central bank. The relevant LOLR function is pooled across Eurozone members. Including them would therefore introduce a systematic bias in the proxy for the dollar LOLR motive.&lt;/p&gt;
&lt;h3 id="q11-what-are-the-scope-conditions-on-the-empirical-results"&gt;Q11. What are the scope conditions on the empirical results?&lt;/h3&gt;
&lt;p&gt;The significant positive association between NFC dollar borrowing and central bank dollar reserve holdings holds for advanced economies (coefficient 3.7, t-statistic 2.2) and emerging economies (coefficient 2.4, t-statistic 2.5) but is absent and correctly (negatively) signed but insignificant for developing economies. The authors note that for advanced economies, the result for the subsample is sensitive to removing both Hong Kong (already excluded from the baseline) and Switzerland, given the small number of countries. The results are presented as suggestive correlations rather than causal estimates; missing data on local-currency NFC dollar lending (available for only 21 countries) and on dollar bond-market borrowing are acknowledged as limitations. The theoretical results apply most cleanly when the interest-rate spread S is not too large (so that the small-S configuration is empirically relevant) and when the discount factor beta is above a threshold that is never found to exceed 0.5 in calibrations.&lt;/p&gt;
&lt;h3 id="q12-what-is-the-models-treatment-of-the-dollar-interest-rate-and-safe-asset-scarcity"&gt;Q12. What is the model’s treatment of the dollar interest rate and safe asset scarcity?&lt;/h3&gt;
&lt;p&gt;In the small open economy version, the dollar interest rate (equivalently, the price of dollar safe assets Q$) is exogenously given, consistent with the small-country price-taking assumption. In the global model, Q$ is endogenized: households have a quadratic extra utility from holding dollar safe assets, so Q$ = beta + theta_d + theta_$1 - theta_$2 * D$, where theta_$2 governs the sensitivity of the dollar rate to the total supply of dollar assets (D$). The spread S = Q$/Q_h - 1 becomes endogenous and falls when central banks absorb dollar assets (reserves R$), since this reduces the net supply available to private households. The externality is zero when theta_$2 = 0 (perfectly elastic supply), and increasing in theta_$2. The paper thus situates the externality squarely in the ‘global safe asset scarcity’ framework originating with Caballero, Farhi and Gourinchas (2008) and Bernanke (2005).&lt;/p&gt;
&lt;h3 id="q13-what-is-the-welfare-decomposition-from-the-global-numerical-example"&gt;Q13. What is the welfare decomposition from the global numerical example?&lt;/h3&gt;
&lt;p&gt;Table 5 normalizes total welfare in the no-regulation, no-reserve benchmark to 100. Moving from no-regulation to the local-planner outcome (with capital requirements and reserves) raises total welfare from 100 to 113.4, driven largely by a reduction in the deadweight costs of taxation (from -131.9 to -70.7) as reserves substitute for costly fiscal bailouts, despite increased carrying costs of reserves (-18.6) and higher liquidity costs due to unchanged dollar borrowing. Moving from the local-planner to the global-planner outcome raises welfare further to 120.4. This additional gain comes from: a large reduction in carrying costs of reserves (from -18.6 to -5.8), reduced deadweight taxation costs (from -70.7 to -61.3), reduced liquidity costs from mismatch (from -13.8 to -7.1), and increased household utility from dollar deposits (55.8 vs. 43.9) — all more than offsetting a decline in bank profits (138.8 vs. 172.6).&lt;/p&gt;
&lt;h3 id="q14-what-policy-implications-does-the-paper-draw-and-how-are-they-scoped"&gt;Q14. What policy implications does the paper draw, and how are they scoped?&lt;/h3&gt;
&lt;p&gt;First, international coordination of reserve holdings — analogous to the Basel Process for capital regulation — would improve global welfare by internalizing the safe-asset-scarcity externality. The paper frames itself as initiating a conversation about what such a coordination process might look like; it does not propose a specific mechanism. Second, tighter capital regulation combined with reduced reserve accumulation is the globally optimal policy mix, but individual central banks will not choose this combination unilaterally because they do not internalize the general-equilibrium impact of their reserve holdings on global dollar rates. Third, the risk-sharing extension implies that pooled supranational reserve management (e.g., through the IMF) could substantially reduce the total quantity of reserves needed globally, but this requires the supranational institution to have significant powers to reallocate reserves across countries mid-crisis, raising governance challenges around moral hazard and monitoring. Fourth, the paper does not advocate for coordinating away all reserve holdings — it acknowledges other legitimate reserve motives (sudden stops, domestic bank runs, exchange rate management) not modeled here.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Dollar lender of last resort (DOLLR)&lt;/strong&gt;: A central bank that stands ready to supply dollar liquidity to its domestic banking system during a crisis in which currency-mismatched borrowers face distress because the home currency has depreciated against the dollar. The DOLLR role motivates holding dollar reserves in advance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Currency mismatch&lt;/strong&gt;: A situation in which non-financial corporations (and, by extension, the banking sector that lends to them) have liabilities denominated in dollars while their revenues and assets are predominantly in home currency, creating exposure to losses when the home currency depreciates. In this paper’s framework, mismatch is measured by the ratio of cross-border NFC dollar bank borrowing to GDP.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Carrying cost of reserves&lt;/strong&gt;: The expected negative return earned by the central bank on its dollar reserve holdings, equal to the spread S between the domestic interest rate (what the central bank pays on the government bonds it issues to finance reserve purchases) and the dollar interest rate (what the reserves earn). A higher S makes reserves more costly to hold and tilts the optimal policy toward financial regulation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Safe dollar asset scarcity externality&lt;/strong&gt;: The general-equilibrium feedback by which individual central banks’ reserve accumulation reduces the net supply of dollar-denominated safe assets available to private households, lowers the dollar interest rate, and thereby makes dollar borrowing cheaper for NFCs — amplifying the currency mismatch that motivated reserve accumulation in the first place. Individual price-taking central banks do not internalize this externality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Decentralized vs. global-planner equilibrium&lt;/strong&gt;: The decentralized equilibrium is one where each country’s central bank sets capital requirements and reserve holdings to maximize own-country welfare, taking the dollar interest rate as given. The global-planner equilibrium internalizes the impact of aggregate reserve accumulation on the endogenous dollar interest rate. The paper establishes (Proposition 1) that the global planner chooses strictly fewer dollar reserves and strictly higher capital requirements than the decentralized equilibrium, for all empirically plausible parameter values.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Precautionary reserve motive&lt;/strong&gt;: The class of explanations for foreign exchange reserve holdings based on self-insurance against adverse future shocks, including sudden stops, domestic depositor flight, and (in this paper) the need to serve as dollar lender of last resort when corporate currency mismatch generates systemic banking distress. Contrasted with the ‘mercantilist’ motive based on exchange rate management and trade surplus recycling.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Risk-sharing (pooled reserves)&lt;/strong&gt;: The efficiency gain achievable when banking crises are imperfectly correlated across countries and a supranational institution holds reserves centrally and redistributes them to countries experiencing crises. Each dollar of pooled reserves provides 1/q times the crisis coverage of a dollar held by an individual country, where q is the fraction of countries in crisis at any given time, enabling total reserve requirements to be substantially smaller.&lt;/p&gt;</description></item><item><title>Deciphering Federal Reserve Communication via Text Analysis of Alternative FOMC Statements</title><link>https://macropaperwarehouse.com/papers/deciphering-federal-reserve-communication-via-text-analysis-of-alternative-fomc-statements/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/deciphering-federal-reserve-communication-via-text-analysis-of-alternative-fomc-statements/</guid><description>&lt;p&gt;This paper proposes a text-based measure of monetary policy stance by modelling FOMC post-meeting statements as convex combinations of the staff-drafted dovish (&amp;ldquo;alternative A&amp;rdquo;) and hawkish (&amp;ldquo;alternative C/D&amp;rdquo;) versions that accompany each meeting, providing a transparent and adaptive reference spectrum. The authors fine-tune the Universal Sentence Encoder—a pre-trained language model—using synthetic examples that mirror numerical information in policy actions, enabling the model to capture both semantic tone and quantitative precision. Stance is defined as the product of tone (alignment with the dovish/hawkish alternatives) and novelty (semantic shift from the previous statement), and is decomposed into expected and surprise components using intraday financial data. Surprises arise from shifts in tone relative to market expectations or from statement novelty. The resulting surprise measure aligns closely (correlations of 70–80%) with established high-frequency measures (Swanson 2017, Nakamura-Steinsson 2018, Bauer-Swanson 2023), and the framework enables counterfactual analysis of how alternative communication could have moved markets.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-are-the-alternative-fomc-statements-and-how-are-they-used"&gt;Q1. What are the alternative FOMC statements and how are they used?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;For each FOMC meeting, staff draft multiple versions of the policy statement—typically a more dovish &amp;ldquo;alternative A,&amp;rdquo; a baseline &amp;ldquo;alternative B,&amp;rdquo; and a more hawkish &amp;ldquo;alternative C&amp;rdquo; or &amp;ldquo;D&amp;rdquo;—and the paper uses these pre-structured alternatives as a reference spectrum against which to position the released statement.&lt;/strong&gt; This institutional feature provides a transparent, adaptive measure of tone that evolves with the policy environment and internal deliberations, avoiding the rigidity of pre-fixed tone definitions. The released statement&amp;rsquo;s embedding in the language model space is compared to the dovish and hawkish alternatives to determine its location on the policy spectrum.&lt;/p&gt;
&lt;h3 id="q2-how-is-the-policy-stance-measure-constructed"&gt;Q2. How is the policy stance measure constructed?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Stance is defined as the product of novelty and tone: novelty captures semantic shifts from previous statements, and tone reflects the alignment of the released statement with the dovish or hawkish alternatives; the observed stance reflects both the content of each position and the relative positioning of the Committee along the policy spectrum.&lt;/strong&gt; A second, structural interpretation models the released statement as the outcome of internal deliberation—a weighted average of dovish and hawkish stances—linking textual variation to shifts in the internal balance of influence within the Committee.&lt;/p&gt;
&lt;h3 id="q3-how-is-the-stance-decomposed-into-expected-and-surprise-components"&gt;Q3. How is the stance decomposed into expected and surprise components?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The decomposition into expected and surprise components uses intraday bond price movements to recover the market-expected dovish weight of the released statement, then defines the surprise as the deviation between the realized stance and the market-expected stance.&lt;/strong&gt; Surprises arise from two sources: deviations in tone relative to expectations, and statement novelty. This framework shows that monetary policy surprises are not just about what the Fed did but also about how it communicated—capturing interpretable surprises that reveal shifts in the Committee&amp;rsquo;s internal balance.&lt;/p&gt;
&lt;h3 id="q4-how-is-the-measure-validated-and-what-are-its-macroeconomic-effects"&gt;Q4. How is the measure validated and what are its macroeconomic effects?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The surprise measure aligns closely with established high-frequency measures (correlations of 70–80% with Swanson 2017, Nakamura-Steinsson 2018, and Bauer-Swanson 2023); surprise tightenings reduce stock prices, raise short-term Treasury yields, dampen real activity and inflation, and raise credit risk premia.&lt;/strong&gt; Local projection estimates corroborate that surprise contractionary shocks have the expected macroeconomic effects, providing a validation that the text-based measure captures meaningful monetary policy information beyond what is already priced in.&lt;/p&gt;
&lt;h3 id="q5-what-counterfactual-analysis-does-the-framework-enable"&gt;Q5. What counterfactual analysis does the framework enable?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The framework enables counterfactual analysis of how alternative FOMC communication could have moved markets—for example, estimating what asset price movements would have occurred had the Committee released the more dovish or hawkish alternative statement rather than the actual release.&lt;/strong&gt; This counterfactual capability stems from the explicit modelling of stance as a position on a spectrum defined by the staff-drafted alternatives, so the market impact of any point on that spectrum can be estimated.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;alternative FOMC statements&lt;/strong&gt; : staff-drafted dovish (&amp;ldquo;alternative A&amp;rdquo;) and hawkish (&amp;ldquo;alternative C/D&amp;rdquo;) versions of the FOMC post-meeting statement prepared for each meeting; used as the reference spectrum for measuring the tone and position of the released statement.
&lt;strong&gt;monetary policy stance&lt;/strong&gt; : as defined in this paper, the product of tone (alignment with the dovish/hawkish alternatives) and novelty (semantic shift from the previous statement); captures both the direction and the information content of the released statement.
&lt;strong&gt;tone&lt;/strong&gt; : the alignment of a released FOMC statement with the dovish or hawkish alternative drafts in the Universal Sentence Encoder embedding space; reflects the direction of the Committee&amp;rsquo;s communication along the policy spectrum.
&lt;strong&gt;novelty&lt;/strong&gt; : the semantic distance of the released FOMC statement from the previous statement in the embedding space; captures how much new information or emphasis the statement introduces.
&lt;strong&gt;Universal Sentence Encoder (USE)&lt;/strong&gt; : the pre-trained language model applied by the paper; fine-tuned on synthetic examples that mirror numerical information in policy actions (e.g., rate-hike sizes) to capture both semantic tone and quantitative policy precision.&lt;/p&gt;</description></item><item><title>Did the US Really Grow Out of Its World War II Debt?</title><link>https://macropaperwarehouse.com/papers/did-the-us-really-grow-out-of-its-world-war-ii-debt/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/did-the-us-really-grow-out-of-its-world-war-ii-debt/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation. The fall in the US federal debt-held-by-the-public/GDP ratio from a postwar peak of 106% in fiscal year 1946 to a trough of 23% in 1974 is widely cited (Elmendorf-Mankiw, Krugman) as evidence that an economy &amp;ldquo;grows out of&amp;rdquo; debt because the GDP growth rate exceeds the interest rate on government debt (r &amp;lt; g). That narrative underpins the modern view (Blanchard 2019; Furman-Summers 2020) that high public debt &amp;ldquo;may have no fiscal cost.&amp;rdquo; Acalin and Ball ask how much of the postwar debt decline was genuinely due to growth exceeding undistorted real interest rates, versus three other factors: primary budget surpluses, the Fed&amp;rsquo;s 1942-1951 interest-rate peg before the Fed-Treasury Accord, and surprise inflation.&lt;/p&gt;
&lt;p&gt;Method and data. The authors simulate counterfactual debt/GDP paths from the standard debt-dynamics identity D_t = (1+i_t)D_{t-1} - P_t, starting from the actual 1946 debt level and holding nominal GDP fixed at its historical path. They build three counterfactuals: (i) &amp;ldquo;primary balance&amp;rdquo; (set primary surplus to zero each year); (ii) &amp;ldquo;adjusted interest rate&amp;rdquo; (remove distortions from both the peg and surprise inflation); and (iii) &amp;ldquo;combined&amp;rdquo; (both), whose path is driven purely by r* - g, the undistorted real rate minus growth. A key innovation is measuring the &amp;ldquo;reverse maturity structure&amp;rdquo; — the fractions of currently outstanding debt issued in each past year — using Hall-Payne-Sargent (2018) data for 1942-1960 and CRSP thereafter. They construct a term structure of inflation expectations from one-year (Livingston, SPF) and ten-year (FRB/US) survey data, and estimate undistorted peg-era real rates from ex-ante real rates on securities issued in 1952-1961. T-bills and TIPS are assumed unaffected by inflation surprises (conservative). Debt is par value, held by the public, by fiscal year.&lt;/p&gt;
&lt;p&gt;Main quantitative findings. In the combined counterfactual, debt/GDP falls only to 74% in 1974 (vs. 23% actual); the individual counterfactuals give 40% (primary balance) and 51% (adjusted rate) in 1974. Of the actual 83-point fall (106 to 23), 51 points are explained by surpluses plus rate distortions, decomposed as 17 points from surpluses alone, 28 from rate distortions alone, and 6 from their interaction; only 32 points (the fall to 74%) reflect growth net of undistorted rates. Extending to the present, the combined counterfactual ratio starts rising in 1980, dipping to 70% in 1979 before climbing to 84% in 2022 — only 22 points below the 1946 level of 106. Over the full 76 years, undistorted growth alone would have cut debt/GDP by just 22 points. The post-1979 reversal reflects a sign change in r* - g: average r* rose from 2.3% (1947-1979) to 2.8% (1980-2022) while average g fell from 3.5% to 2.6%. The estimated undistorted real-rate term structure is 1.7% (1yr), 2.2% (5yr), 2.5% (10yr), 2.7% (30yr).&lt;/p&gt;
&lt;p&gt;Mechanisms and implications. Primary surpluses averaged 1.1% of GDP over 1947-1974 (peaking at 6.3% in 1948), then turned to persistent deficits. The peg (caps of 0.375% on bills to 2.5% on 30-year bonds) combined with post-1946 inflation surges (CPI averaging 7.1% in FY1947-1951) produced deeply negative ex-post real rates; the aggregate interest-rate adjustment x_t reached 13 points in 1947 and 8 points in 1951. Policy implication: the distortions are unlikely to recur (no peg/price controls planned, Fed committed to low inflation, shorter average maturity — down from 4.4 years in 1951 to 2.2 years in 2022 — blunts inflation&amp;rsquo;s effect), so substantially reducing today&amp;rsquo;s 97% (FY2022) ratio will likely require primary surpluses, which CBO projections suggest are not forthcoming.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-identificationcounterfactual-strategy-and-what-are-its-main-threats"&gt;Q1. What is the identification/counterfactual strategy and what are its main threats?&lt;/h3&gt;
&lt;p&gt;There is no causal identification in the econometric sense; the strategy is an accounting simulation of the debt-dynamics identity under counterfactual interest rates and primary balances, holding nominal GDP (and real GDP and undistorted real rates) fixed at historical values. Threats: (1) the undistorted peg-era real rates are unobserved and must be guessed from 1952-1961 ex-ante real rates; (2) the reverse maturity structure (weights w) is held at historical levels even though higher counterfactual debt would alter issuance; (3) general-equilibrium feedback is ignored — higher counterfactual debt would raise real rates and crowd out capital, lowering GDP, both of which would push debt/GDP even higher, so the authors interpret their paths as LOWER BOUNDS; (4) pre-1943 debt is not adjusted for surprise inflation because long-term expectations data do not exist before 1943, which the authors argue biases against finding a large inflation role.&lt;/p&gt;
&lt;h3 id="q2-how-are-the-effects-of-the-peg-and-surprise-inflation-distinguished-and-can-they-be-separated"&gt;Q2. How are the effects of the peg and surprise inflation distinguished, and can they be separated?&lt;/h3&gt;
&lt;p&gt;The adjusted-interest-rate scenario removes both jointly. The authors state it would be difficult to separate them cleanly because that requires measures of expected inflation during the peg period (1942-1951), and there are no data on long-term inflation expectations before 1951 or short-term expectations before 1947 (start of Livingston). For post-1952 debt, the surprise-inflation adjustment is pi_t minus the expectation formed when the security was issued; for peg-era debt the adjustment is the gap between the ex-post real rate and the assumed undistorted real rate.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-decomposition-relative-to-hall-and-sargent-2011"&gt;Q3. What is the decomposition relative to Hall and Sargent (2011)?&lt;/h3&gt;
&lt;p&gt;Hall-Sargent decompose the 1946-1974 debt/GDP change into r-g and primary surpluses but do not ask how interest-rate distortions shape r-g. Replicating their approach (Table 2A), the authors attribute -48.1 points to r-g and -29.6 points to primary surpluses (the terms sum to -78 points, less than the actual -82.9 because of the debt-dynamics residual). The paper&amp;rsquo;s extension (Table 2B) splits the -48.1 r-g contribution into only -11.7 points from r*-g (undistorted) and -36.3 points from the distortion r-r*, with surpluses still -29.6. So most of the apparent &amp;lsquo;growth out of debt&amp;rsquo; was actually interest-rate distortion.&lt;/p&gt;
&lt;h3 id="q4-why-do-the-table-2-surplus-contributions-differ-from-the-table-1-scenario-differences"&gt;Q4. Why do the Table 2 surplus contributions differ from the Table 1 scenario differences?&lt;/h3&gt;
&lt;p&gt;In Table 2 surpluses contribute -29.6 points, larger than the 17-point effect implied by the Table 1 difference between actual 1974 debt/GDP and the primary-balance scenario. The reason is an interaction: eliminating surpluses raises the debt path d_{t-1}, which magnifies the r-g term, so additional debt is partly eroded by r-g. The authors call the Figure 7 / Table 1 scenario paths the more precise representation.&lt;/p&gt;
&lt;h3 id="q5-how-do-the-findings-reconcile-with-blanchards-2019-claim-that-r--g-since-1979"&gt;Q5. How do the findings reconcile with Blanchard&amp;rsquo;s (2019) claim that r &amp;lt; g since 1979?&lt;/h3&gt;
&lt;p&gt;The authors find r &amp;gt; g on average since 1979 (even in the primary-balance counterfactual with actual ex-post rates), so debt/GDP would rise. The difference from Blanchard is purely measurement: (1) they use the government&amp;rsquo;s interest payments on outstanding debt — the rates set at issuance — whereas Blanchard uses current market yields (a weighted average of 1- and 10-year Treasury rates), which since 1979 have been lower because rates trended down; (2) the authors use pre-tax rates while Blanchard uses after-tax rates. Figure A.11 confirms: with the authors&amp;rsquo; measure debt/GDP rises 1979-2022; with Blanchard&amp;rsquo;s pre-tax market yields it rises then falls back near its 1979 level; with his after-tax rates it falls significantly. The authors argue the rate paid by the government is the relevant one for the debt-dynamics identity, and that a natural baseline assumes debt has no net effect on tax revenue (so pre-tax rates apply).&lt;/p&gt;
&lt;h3 id="q6-what-is-a-notable-nuance-about-the-post-1979-period-in-the-primary-balance-counterfactual"&gt;Q6. What is a notable nuance about the post-1979 period in the primary-balance counterfactual?&lt;/h3&gt;
&lt;p&gt;The post-1979 rise in debt/GDP is LARGER in the primary-balance counterfactual (19 points, from 34% to 53%) than in the combined counterfactual (14 points). This is because inflation surprises since 1979 have on average been negative (post-Volcker disinflation, actual below expected), raising ex-post real rates and thus debt/GDP. It confirms that actual r has exceeded g since 1979.&lt;/p&gt;
&lt;h3 id="q7-what-robustness-checks-are-run"&gt;Q7. What robustness checks are run?&lt;/h3&gt;
&lt;p&gt;(1) Undistorted peg-era real rates shifted by +/-0.5% and +/-1% across the whole term structure: 1974 combined debt/GDP ranges from 67% (-1%) to 81% (+1%) around the 74% baseline; 2022 ranges from 78% to 91% around 84% (Table A.2). (2) Pre-1962 interest measured by net interest times 1.1; using net interest directly gives 73% in 1974 and 83% in 2022 vs. 74% and 84% baseline. (3) The debt-dynamics residual epsilon (mainly Treasury cash balances) is held at historical values; setting it to zero gives a combined counterfactual of 78% in 1974 and 77% in 2022, showing the residual contributed -0.19% GDP/year on average over 1947-1974 and +0.25% over 1975-2022. (4) Term-structure shape assumptions and the GDP-deflator-vs-CPI expectation-error approximation are checked in the Appendix as reasonable.&lt;/p&gt;
&lt;h3 id="q8-what-heterogeneity-across-the-debt-structure-matters"&gt;Q8. What heterogeneity across the debt structure matters?&lt;/h3&gt;
&lt;p&gt;The reverse maturity structure is central: the share of debt with reverse maturities above five years peaked at 48% in 1951 (long-term WWII bonds), then fell, fluctuating between 10% and 25% from 1975-2022; average reverse maturity fell from 4.4 years in 1951 to 2.2 years in 2022. Shorter maturity means inflation surprises erode less debt — a reason later inflation surprises had smaller effects than the 1940s-1970s ones. T-bills (assumed unaffected by surprise inflation since rolled over at adjusting rates) and TIPS (post-1997, indexed) are excluded from the inflation-surprise adjustment. Non-marketable debt fell from 23% of total in 1960 to 3% in 2022; its reverse maturity structure is assumed constant after 1960.&lt;/p&gt;
&lt;h3 id="q9-what-are-the-timingmeasurement-complications"&gt;Q9. What are the timing/measurement complications?&lt;/h3&gt;
&lt;p&gt;Unit is fiscal year (July-June before FY1977, October-September after), creating a &amp;lsquo;Transitional Quarter&amp;rsquo; in Q3 1976 requiring special handling. Inflation is GDP-deflator growth. Pre-1970 deflator expectations are proxied from Livingston CPI forecasts assuming equal expectation errors for CPI and deflator. Ten-year expectations before 1968 are fitted from one-year expectations via a regression (1968-1997) with a negative coefficient (-1.549) on the change in smoothed one-year expectations, capturing long-term expectations lagging short-term moves.&lt;/p&gt;
&lt;h3 id="q10-what-are-the-policy-implications-and-their-scope-conditions"&gt;Q10. What are the policy implications and their scope conditions?&lt;/h3&gt;
&lt;p&gt;Because the postwar debt reduction came largely from one-off distortions (the peg with price controls, and surprise inflation) unlikely to recur — and the Fed is committed to low inflation while shorter average maturity weakens inflation&amp;rsquo;s erosive power — economic growth alone is unlikely to resolve the current ~97% (FY2022) ratio. Substantial reduction will probably require primary surpluses, which CBO projects will not occur under current policy (large primary deficits forecast for three decades). Scope conditions: results are lower bounds (GE crowding-out omitted); they depend on the assumed undistorted real-rate term structure; the 2021-2022 inflation surge is again temporarily reducing debt/GDP.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Expecting Floods: Firm Entry, Employment, and Aggregate Implications</title><link>https://macropaperwarehouse.com/papers/expecting-floods-firm-entry-employment-and-aggregate-implications/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/expecting-floods-firm-entry-employment-and-aggregate-implications/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper studies how the &lt;em&gt;expectation&lt;/em&gt; of rising flood risk — distinct from realized flood events — reshapes where firms locate, where workers live and how much they work, and what this implies for U.S. aggregate output. The motivation is climate-driven: roughly 6 million Americans lived within a 100-year flood zone in 1998, rising to 13 million by 2018, and FEMA floodplains are projected to grow about 45% by century&amp;rsquo;s end. Prior work largely studied actual floods or housing-price effects; this is among the first to examine firm entry and employment responses to anticipated risk.&lt;/p&gt;
&lt;p&gt;Data and design: The authors digitize FEMA Special Flood Hazard Zone maps (historic Q3 maps tied to 1998 Flood Insurance Rate Maps, and 2018 National Flood Hazard Layer), measuring flood risk as the share of land area within flood zones at the county and ZIP-code (ZCTA) level. Average flood-zone share rose 1.5 percentage points from 1998 to 2018, with a 20-pp increase at the 90th percentile of ZIP-level changes. Firm entry/exit, employment, population and county real GDP come from Census Business Dynamics Statistics, ZIP Codes Business Patterns, and BEA; actual flood events come from the Dartmouth Flood Observatory. The baseline specification is a two-period (1998, 2018) fixed-effects regression with county (or ZCTA) fixed effects, state-by-year fixed effects, demographic/economic controls (female labor share, manufacturing share, population density, China import-penetration change), and a control for actual flooded area.&lt;/p&gt;
&lt;p&gt;Main reduced-form findings: A one-standard-deviation (7-percentage-point) increase in flood risk over 1998-2018 reduced firm entry by 1.2%, employment by 1.2%, population by 0.8% (smaller than employment, implying both relocation and labor-supply margins), and real GDP by 2.4%. Firm exits also &lt;em&gt;declined&lt;/em&gt; with higher risk (smaller magnitude), reflecting reduced business dynamism. A county at the 90th percentile of risk increase saw a 3.3% drop in firm entry. ZIP-level estimates are similar. An IV using the interaction of rest-of-state risk change with local geo-climatic conditions (rainfall, temperature, evaporation) yields comparable magnitudes (entry -1.2%, employment -1.4%, GDP -2.2%); a placebo (1990-1998 outcomes) test is insignificant. In sharp contrast, actual flood &lt;em&gt;events&lt;/em&gt; had negligible effects on entry, exit, employment and population, but a one-SD (0.4) increase in flooded-area share lowered real GDP by 0.2% in the same year, driven by current-year shocks (lagged effects negligible).&lt;/p&gt;
&lt;p&gt;Model and quantification: The authors build a spatial-equilibrium model (McFadden 1978 location choice, Krugman 1980 monopolistic competition) with M = 2,772 counties (96% of 2018 GDP), σ = 5, exit rate κ = 0.08. Flood risk operates through three channels: direct damage, an employment channel (relocation + endogenous labor supply), and a love-of-variety channel (fewer firms). Damage parameters are disciplined by reduced-form evidence (δ = 0.005, δκ = 0.003) and Barrage (2020) (η = 0.002); labor-supply elasticities φL = 1.55, φM = 0.83 are set by indirect inference targeting employment and population responses. Non-targeted moments (output, entry, exit) match the data.&lt;/p&gt;
&lt;p&gt;Counterfactuals: Eliminating 2018 flood risk shows it reduced aggregate output by 0.52% (employment -0.31%, firm entry -0.30%, welfare -0.51%). Decomposition: direct damage -0.11% (21%), labor relocation 0%, labor supply -0.33% (63%), variety -0.08% (15%) — so about 80% of the loss is expectation-driven and 20% direct damage. Effects are highly unequal: top-5% and top-1% counties (by output loss) lost 7.9% and 13.9% of output. A projected 4.5% rise in at-risk properties (2020-2050) would cut output 0.12%. Extensions (entry costs in goods, interregional trade, capital and land) yield somewhat larger losses (0.57%, 0.62%, 0.67%). Policy implication: counting only direct damages badly understates disaster costs and the social cost of carbon, because firms and workers rationally adjust to anticipated risk.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-identification-strategy-and-what-are-the-main-threats-to-it"&gt;Q1. What is the identification strategy, and what are the main threats to it?&lt;/h3&gt;
&lt;p&gt;The core design is a two-period (1998 and 2018) fixed-effects regression of log outcomes (firm entry, exit, employment, population, real GDP) on the share of land in FEMA flood zones, absorbing locality fixed effects (time-invariant characteristics like industry composition), state-by-year fixed effects (statewide growth/business cycles), demographic/economic controls, and a control for actual flooded area. The main threat is measurement error in FEMA risk maps: some underlying data are outdated, and political-economy incentives lead politicians and homeowners to resist map updates to avoid higher insurance premiums, so designations may reflect politics rather than true risk. A second threat is omitted local economic trends correlated with both risk and outcomes. The authors address measurement error with a Bartik-type IV (rest-of-state average risk change interacted with own geo-climatic features — satellite temperature, cumulative rainfall, evaporation), controlling for cumulative past flooded area. IV estimates are close to the fixed-effects ones (entry -1.2%, employment -1.4%, GDP -2.2%), with first-stage KP F-statistics around 63-66. A placebo/pre-trend test (regressing 1990-1998 changes on 1998-2018 risk changes, following Goldsmith-Pinkham et al. 2020) yields small, insignificant coefficients, arguing against omitted-trend confounding.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-main-mechanisms-and-how-are-they-distinguished-empirically-and-in-the-model"&gt;Q2. What are the main mechanisms, and how are they distinguished empirically and in the model?&lt;/h3&gt;
&lt;p&gt;Three channels: (1) direct damage — realized floods lower firm productivity and firm survival; (2) employment channel — anticipated risk lowers real wages/amenities, prompting out-migration and reduced labor supply per household; (3) love-of-variety — fewer firms enter, reducing the variety component of welfare/output. Empirically, the authors distinguish &lt;em&gt;flood risk&lt;/em&gt; (long-run anticipation) from &lt;em&gt;flood events&lt;/em&gt; (short-run realization) by estimating both: risk hits entry/employment/population strongly while events do not, but events hit current-year GDP (productivity) while risk hits it more through adjustment. In the model, direct damages are calibrated from the actual-flood GDP and exit responses (δ, δκ); the employment and variety channels are separated in the counterfactual by sequentially allowing population shares, then labor supply, then variety to respond. The decomposition attributes -0.11% to direct damage, ~0% to labor relocation (offsetting in- and out-migration), -0.33% to labor supply, and -0.08% to variety.&lt;/p&gt;
&lt;h3 id="q3-why-does-population-fall-less-than-employment-and-why-do-firm-exits-decline"&gt;Q3. Why does population fall less than employment, and why do firm exits decline?&lt;/h3&gt;
&lt;p&gt;Employment falls 1.2% while population falls only 0.8% for a one-SD risk increase, implying the response is not purely relocation — remaining households also reduce labor supply. This motivates introducing a positive labor-supply elasticity φL alongside migration elasticity φM, capturing &amp;lsquo;immobile labor&amp;rsquo; (as in Autor et al. 2013) where some workers cut hours rather than move. Firm exits decline with higher risk even though floods mechanically raise closures, because higher risk deters entry so much that the stock of firms shrinks, lowering the base of firms that can exit — reflecting reduced business dynamism rather than greater firm survival.&lt;/p&gt;
&lt;h3 id="q4-what-heterogeneity-is-documented"&gt;Q4. What heterogeneity is documented?&lt;/h3&gt;
&lt;p&gt;Large regional dispersion. While national output fell 0.52%, the top-5% and top-1% counties by output loss lost 7.9% and 13.9% of output respectively (the abstract describes top-5% losses of 7-14%). The hardest-hit counties — coastal and riverine areas in southern and eastern regions (e.g., Cape May NJ, Marion County FL, Sharkey County MS) — lost population, labor supply per household, and firms (top-1% counties: -6.1% population, -4.7% labor supply per household, -10.8% firms). Conversely, mildly affected counties (some Midwestern) were &amp;lsquo;winners,&amp;rsquo; gaining in-migration, more firm entry, and higher labor supply per worker. For the 2020-2050 projection, direct damages play a &lt;em&gt;smaller&lt;/em&gt; relative role (12% vs 21% for 2018) because projected risk increases are more positively correlated with regional productivity, amplifying aggregate adjustment effects.&lt;/p&gt;
&lt;h3 id="q5-what-robustness-checks-are-run"&gt;Q5. What robustness checks are run?&lt;/h3&gt;
&lt;p&gt;(1) Controlling vs. not controlling for actual flooded area leaves risk estimates stable. (2) ZIP-code-level regressions exploiting finer spatial variation give similar magnitudes (establishments -0.233, employment -0.240, payroll -0.221). (3) Restricting to counties with available Q3 (1998) FEMA maps gives qualitatively similar, slightly larger estimates (Appendix Table A.2); the authors conservatively use baseline estimates for calibration. (4) IV estimation and (5) placebo pre-trend tests as above. (6) Lagged flood shocks (Appendix A.4) have negligible effects, confirming floods act through current-year productivity. (7) Model non-targeted moments (output, entry, exit) match data, and model-data correlations of regional GDP, population, emp-to-pop ratio, and firm count are near unity. (8) The implied regional-population-to-real-wage elasticity φM(1+φL) ≈ 2.1 lies within the 1.1-2.5 range from Fajgelbaum et al. (2018).&lt;/p&gt;
&lt;h3 id="q6-what-model-extensions-are-explored-and-how-do-results-change"&gt;Q6. What model extensions are explored and how do results change?&lt;/h3&gt;
&lt;p&gt;Four extensions, all yielding somewhat larger output losses than the 0.52% baseline: (1) entry costs paid partly/fully in final goods rather than labor — with α=1 the loss is 0.57%, because final-goods prices respond more to risk than wages; (2) interregional trade with traded/nontraded sectors — requires a larger labor-supply elasticity (φL=1.72) to match data, giving a 0.62% loss; (3) capital (mobile, rented at constant global rate) and land (fixed, congestion force) in production — 0.67% loss, since risk also lowers the capital-to-labor ratio (by 0.34%) as capital becomes relatively more expensive, outweighing land congestion (small land share). The authors read the modest size of these differences as evidence the simplified baseline captures the key forces.&lt;/p&gt;
&lt;h3 id="q7-how-does-this-paper-relate-to-and-differ-from-closely-related-prior-work"&gt;Q7. How does this paper relate to and differ from closely related prior work?&lt;/h3&gt;
&lt;p&gt;It contributes to climate-spatial-economics work (Costinot et al. 2016, Desmet et al. 2021, Alvarez &amp;amp; Rossi-Hansberg 2021, Rudik et al. 2021). Closest are three flood-aggregate studies: Desmet et al. (2021) on coastal-flooding costs via migration and local technology investment; Balboni (2019) on infrastructure misallocation under sea-level risk; Lin et al. (2021) on coastal housing construction. Differences: prior work focuses mainly on coastal land inundation from sea-level rise, whereas this paper uses historic flood-zone designation maps capturing overall flood risk and studies production damage rather than land loss; and it reconciles structural estimates with reduced-form evidence showing firm/worker responses to &lt;em&gt;risk&lt;/em&gt; differ from responses to &lt;em&gt;actual floods&lt;/em&gt;. Relative to Kocornik-Mina et al. (2020) (satellite-nightlight evidence that floods reduce output transiently), this paper confirms the short-run finding but shows risk has larger, longer-run effects via behavioral adjustment. It relates to Hino &amp;amp; Burke (2020) (same risk data; floods cut property values 1-2%), interpreting housing-price effects as amenity changes; their estimate implies a 0.3-0.6% utility loss, comparable to the paper&amp;rsquo;s calibrated amenity loss of 0.2%.&lt;/p&gt;
&lt;h3 id="q8-what-are-the-policy-implications-and-their-scope-conditions"&gt;Q8. What are the policy implications and their scope conditions?&lt;/h3&gt;
&lt;p&gt;The central implication is that evaluations counting only direct flood damages substantially understate true costs, since about 80% of the 0.52% 2018 output loss comes from expectation-driven adjustments (labor supply, migration, fewer firms) rather than the 20% direct damage. Direct damages (-0.11%) match FEMA&amp;rsquo;s ~$17B/year (~0.1% of GDP) estimate, validating the model&amp;rsquo;s lower bound. Policies addressing climate damage — and estimates of the social cost of carbon — should incorporate firms&amp;rsquo; and workers&amp;rsquo; long-run general-equilibrium adjustments. Scope conditions: the analysis is U.S.-specific (chosen for systematic flood-risk data), uses establishments as &amp;lsquo;firms,&amp;rsquo; abstracts from flood insurance (justified by near-actuarially-fair pricing evidence) and from explicit housing, treats unmapped areas as zero-risk, and assumes observed FEMA designations are the risk signal agents act on despite measurement error. The authors note the approach generalizes to other natural disasters.&lt;/p&gt;
&lt;h3 id="q9-what-are-notable-caveats-or-limitations"&gt;Q9. What are notable caveats or limitations?&lt;/h3&gt;
&lt;p&gt;GDP data do not capture variety/welfare changes, so the love-of-variety channel matters for welfare but is invisible in GDP-based estimates. The amenity parameter η is not directly estimated but imported from Barrage (2020) (output-to-utility damage ratio ~3); the authors note η has little effect on national productivity impact because amenity mostly drives offsetting migration. Labor supply is assumed fixed before shocks (micro-founded by job-search frictions). Flood insurance and housing are not modeled explicitly. Risk is measured by flood-zone land share, which is converted to flood probabilities {rm} via a regression of 2015-2019 actual flooded shares on 2018 zone shares. The two-period long-run design limits dynamics, and counties without FEMA maps are assigned zero risk.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Flood risk vs. flood events&lt;/strong&gt;: The paper sharply separates anticipated flood risk (the share of local land in FEMA Special Flood Hazard Zones, a long-run signal firms/workers observe and act on) from realized flood events (the share of area actually flooded in a given year, from Dartmouth data). Risk drives firm-entry and employment relocation; events drive transient productivity/GDP losses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Expectation effects (vs. direct damages)&lt;/strong&gt;: Output losses arising because firms and workers rationally adjust location, entry, and labor supply in anticipation of flood risk — comprising the employment and variety channels. In 2018 these accounted for about 80% (the employment channel 0.33% plus variety 0.08% of the 0.52% loss), four times the 20% from direct physical damage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Employment channel&lt;/strong&gt;: In the model, the mechanism by which higher flood risk lowers real wages and amenities, inducing both out-migration (relocation, ~0% net aggregate effect due to offsetting regions) and reduced labor supply per household (the dominant -0.33% component), governed by elasticities φM (migration) and φL (labor supply).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Love-of-variety channel&lt;/strong&gt;: The output/welfare loss from fewer firms entering under higher risk, operating through the CES variety term (agglomeration force 1/(σ-1)). It reduced 2018 output by 0.08% and matters for welfare but is not captured in GDP data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Direct damage channel&lt;/strong&gt;: The component of flood losses from realized floods lowering firm productivity (parameter δ=0.005) and destroying a fraction of firms (δκ=0.003) plus amenity loss (η=0.002), calibrated from the short-run actual-flood reduced-form estimates; it caused a 0.11% output decline in 2018 (21% of the total).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Indirect inference calibration&lt;/strong&gt;: The simulated-method-of-moments procedure (Gouriéroux &amp;amp; Monfort 1996) used to set labor-supply elasticities φL=1.55 and φM=0.83: running the same 1998-vs-2018 panel regressions on model-generated data and choosing elasticities so model employment and population responses to flood risk match the empirical coefficients.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Immobile labor&lt;/strong&gt;: Following Autor et al. (2013), the model feature that some households respond to local flood risk by reducing labor supply rather than relocating, which is why employment falls more (1.2%) than population (0.8%) and motivates a positive labor-supply elasticity φL.&lt;/p&gt;</description></item><item><title>Firm dynamics, monopsony, and aggregate productivity differences</title><link>https://macropaperwarehouse.com/papers/firm-dynamics-monopsony-and-aggregate-productivity-differences/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/firm-dynamics-monopsony-and-aggregate-productivity-differences/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question and motivation.&lt;/strong&gt; Firms are larger and grow faster over the life cycle in high-income countries, while labor markets in poorer countries are less competitive (employers hold more wage-setting power). The paper asks how important employer labor market power (monopsony) is for explaining cross-country differences in firm dynamics and aggregate productivity. The novelty is that beyond the standard static misallocation-of-workers channel, monopsony also distorts &lt;em&gt;selection into entrepreneurship&lt;/em&gt; and &lt;em&gt;productivity-enhancing technology adoption&lt;/em&gt;, potentially making the losses larger than prior static estimates suggest.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and setup.&lt;/strong&gt; Stylized facts come from the World Bank Enterprise Surveys (WBES), an establishment-level survey of non-agricultural, non-financial private firms with at least 5 full-time permanent employees, covering more than 90 countries from 2006 to 2021, merged with World Development Indicators GDP per capita (2017 constant USD). The estimation sample restricts to countries that ever had GDP per capita above 25,000 USD and to manufacturing firms with non-missing sales/workers/material/capital data, yielding 37,096 firm-year observations across 31 middle- and high-income countries (poorest: Kazakhstan, 19,615 USD in 2009; richest: Ireland, 91,791 USD in 2020). Local labor markets are defined as location-industry (2-digit ISIC v3.1) pairs. The model is a dynamic general-equilibrium neoclassical-monopsony model with occupational choice (entrepreneur vs. wage worker), endogenous productivity investment, and Card-et-al.-style taste-for-employer (amenity) differentiation that gives firms wage-setting power. It is calibrated to the Netherlands (GDP per capita 54,275 USD; median wage markdown 1.301, implying firm-level labor supply elasticity 3.318) via method of simulated moments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main quantitative findings.&lt;/strong&gt; Empirically, moving from poorer to richer countries in the sample, average firm age triples from 11 to nearly 30 years; annualized firm growth rises ~1.6 percentage points per year per doubling of GDP per capita; the share of firms doing R&amp;amp;D more than doubles (from ~15% to &amp;gt;40%); product innovation rises from 20% to 80% and process innovation from 20% to 50%; and median wage markdowns fall (from ~2.25 at 25,000 USD GDP per capita — workers paid ~55% below marginal product — to ~1.25 at 60,000 USD — paid 20-25% below). The calibrated model matches a right-skewed firm-size distribution, life-cycle growth, employer turnover, age distribution, and R&amp;amp;D share (sum of squared deviations between empirical and simulated moments = 1.7%). In counterfactuals raising the markdown from 1.2 to 3, average firm growth shrinks by more than half (from ~150% to ~50%), average firm size falls from ~60 to ~45 employees, the innovating share halves (from ~40% to ~25%), and average firm productivity is ~20% higher in competitive markets. Differences in wage markdown alone account for &lt;strong&gt;25%&lt;/strong&gt; of observed cross-country TFP variation (model TFP std dev 0.051 vs. data 0.201), and &lt;strong&gt;no less than 11%&lt;/strong&gt; across robustness checks. In a Netherlands-vs-Greece decomposition, about &lt;strong&gt;85%&lt;/strong&gt; of the model-implied TFP gap is attributable to lower technology adoption, ~9% to distorted selection into entrepreneurship, and ~6% to static employment reallocation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanisms and implications.&lt;/strong&gt; Labor market competition acts as a “skill-biased” force favoring high-productivity firms through three channels: (i) static labor reallocation toward high-productivity, low-amenity firms; (ii) improved selection into entrepreneurship (low-productivity high-amenity agents stop being able to profitably attract workers as ϵL rises); and (iii) higher returns to innovation. The policy implication is that raising labor market competition in less-developed economies could yield substantial productivity gains, and that prior static studies understate the cost of monopsony because they omit the dynamic investment/selection channels.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-identificationcalibration-strategy-and-what-are-the-main-threats-to-it"&gt;Q1. What is the identification/calibration strategy and what are the main threats to it?&lt;/h3&gt;
&lt;p&gt;The model is calibrated to the Netherlands using a mix of externally set and internally estimated (method-of-simulated-moments) parameters. Externally: model period = 1 year; σν (Gumbel scale) normalized to 1; β = 0.961 (4% annual rate); δw = 0.025 (40-year working life); revenue elasticity of labor ξ = 0.333 (estimated via control function in Section 2); labor supply elasticity ϵL = 3.318 backed out from median markdown 1.301 via ϵL = 1/(µ−1). Six parameters {c_f, c_x, p_i, p_n, σ_z, σ_a} are estimated by MSM. The markdown itself is a key input and is estimated as the ratio of marginal revenue product of labor to wage, with revenue elasticity ξ from a standard control-function approach. Threats: the markdown estimate drives the whole quantitative exercise; the WBES sample is truncated at firms with ≥5 employees (biasing toward larger firms), addressed by re-estimating with imputed moments; and the cross-country counterfactual attributes all variation in ϵL to labor market power while holding all other parameters at Netherlands values, so other cross-country differences are not separately identified.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-three-mechanisms-and-how-are-they-distinguished-quantitatively"&gt;Q2. What are the three mechanisms and how are they distinguished quantitatively?&lt;/h3&gt;
&lt;p&gt;(1) Static labor allocation: lower competition raises marginal factor cost only for sufficiently high-productivity firms, reallocating employment toward less-productive, lower-paying employers. (2) Selection into entrepreneurship: when ϵL is low, amenities matter more for profits, letting low-productivity high-amenity agents profitably self-select into entrepreneurship. (3) Technology adoption: returns to innovation increase with ϵL, so weak competition lowers the share of firms investing. They are distinguished via a decomposition that sequentially fixes policy functions at benchmark levels: ~6% of the TFP loss is from employment allocation alone, ~85% from the distortion to innovation policy, and ~9% from distorted selection into entrepreneurship.&lt;/p&gt;
&lt;h3 id="q3-what-heterogeneity-across-firms-is-documented"&gt;Q3. What heterogeneity across firms is documented?&lt;/h3&gt;
&lt;p&gt;Firms differ in entrepreneurial productivity z and amenity a. Average revenue product of labor rises with productivity and falls with amenities, and this dispersion is much steeper under weak competition: the elasticity of APL with respect to productivity is 0.31 in the baseline (Netherlands) vs 0.79 in the counterfactual (Greece), and with respect to amenities -0.28 vs -0.81. High-productivity, low-amenity firms face the biggest barriers in less-competitive markets and stay inefficiently small; low-productivity, high-amenity firms are propped up. Innovation distortion is concentrated among high-productivity firms.&lt;/p&gt;
&lt;h3 id="q4-what-robustness-checks-are-run-and-what-do-they-show"&gt;Q4. What robustness checks are run and what do they show?&lt;/h3&gt;
&lt;p&gt;Four main checks, each reported as the share of cross-country TFP variation explained (data std dev 0.201): (1) Productivity-amenity correlation — allowing entrants to draw correlated (z,a) with σ_za = 0.296 (matching Sockin 2024’s 0.622 wage-satisfaction correlation) lowers explained variation to ~15% (model std dev 0.030), because correlation reduces scope for reallocation. (2) Costs in terms of labor instead of final goods (per Klenow and Li 2025) gives ~22% (std dev 0.044). (3) Imputed firm-level moments covering all firms (not just ≥5 employees) gives ~14% (std dev 0.028). (4) Over-identified alternative identification using size/age/R&amp;amp;D shares and annualized growth gives ~11% (std dev 0.023). The headline range is therefore 25% baseline, no less than 11% across checks.&lt;/p&gt;
&lt;h3 id="q5-how-does-this-paper-relate-to-and-differ-from-closely-related-prior-work"&gt;Q5. How does this paper relate to and differ from closely related prior work?&lt;/h3&gt;
&lt;p&gt;It builds on static monopsony cost estimates: Berger et al. (2022, eliminating US labor market power raises average wage 48%, welfare +6% of lifetime consumption); Armangüé-Jubert et al. (2025, labor market power explains 15% of GDP-per-capita gap over development); Deb et al. (2022, less competition lowered US low/high-skill wages 12% and 11%); Amodio et al. (2025b, eliminating monopsony in Peru raises earnings 26%); Bachmann et al. (2022, monopsony caused a 10% aggregate productivity loss in East Germany). Its contribution is to add the entrepreneurial-selection and innovation channels, yielding larger losses than static studies, and to bridge the monopsony-cost literature with the misallocation literature (Restuccia-Rogerson, Guner et al., Hsieh-Klenow).&lt;/p&gt;
&lt;h3 id="q6-what-are-the-policy-implications-and-their-scope-conditions"&gt;Q6. What are the policy implications and their scope conditions?&lt;/h3&gt;
&lt;p&gt;Raising labor market competition (higher firm-level labor supply elasticity) improves allocative efficiency, selection into entrepreneurship, and innovation, raising firm growth and aggregate productivity. Scope conditions: the quantitative results apply to middle- and high-income countries (sample restricted to those ever above 25,000 USD GDP per capita); the 25% headline depends on the assumption that initial productivity and amenities are independent (falls to ~15% under positive correlation); and the decomposition attributing 85% to innovation is specific to the Netherlands-vs-Greece comparison. The model treats labor supply elasticity differences as the sole varying parameter, so the counterfactuals isolate the labor-market-power channel rather than reproducing total cross-country income gaps.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-netherlands-vs-greece-comparison-specifically"&gt;Q7. What is the Netherlands-vs-Greece comparison specifically?&lt;/h3&gt;
&lt;p&gt;Greece has roughly half the GDP per capita of the Netherlands (29,000 vs 54,000 USD) and much weaker competition (wage markdown 2.623 vs 1.301, labor supply elasticity 0.616 vs 3.318). In the Greece counterfactual, average firm size is 26 vs 59 employees, life-cycle growth 84.5% vs 153%, average age 22.5 vs 30 years, and R&amp;amp;D investing share 18% vs 41%. Labor market competition differences explain 29% of the firm-size gap, 27% of the firm-age gap, and 74% of the R&amp;amp;D-share gap between the two countries.&lt;/p&gt;
&lt;h3 id="q8-what-does-the-model-get-right-that-was-not-targeted"&gt;Q8. What does the model get right that was not targeted?&lt;/h3&gt;
&lt;p&gt;The firm size and age distributions are not targeted yet are matched: in the data ~57.6% of firms have &amp;lt;20 employees and ~6.2% have &amp;gt;100; ~60% of firms are under 30 years old and ~10% over 60. The estimated parameters imply investing firms are 15% more likely to grow (p_i=0.649 vs p_n=0.499); innovation and operating costs equal ~43% and ~8% of average incumbent profits respectively; standard errors are small, indicating informative moments.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Import Liberalization as Export Destruction? Evidence from the United States</title><link>https://macropaperwarehouse.com/papers/import-liberalization-as-export-destruction-evidence-from-the-united-states/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/import-liberalization-as-export-destruction-evidence-from-the-united-states/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question and motivation.&lt;/strong&gt; How does import liberalization affect a country&amp;rsquo;s &lt;em&gt;export&lt;/em&gt; performance and welfare? Economic theory (Graham 1923, Ethier 1982, Krugman 1984) shows the answer hinges on whether production exhibits increasing returns to scale at the sector level. Krugman (1984) argued that with scale economies, import protection can be export-promoting because a protected industry expands, exploits scale economies, becomes more productive, and exports more — so conversely import liberalization is &amp;ldquo;export destroying.&amp;rdquo; The paper turns this logic into an empirical test: the sign of the import-liberalization-to-export relationship discriminates between constant-returns and increasing-returns trade models. Researchers otherwise lack tools to choose between these model classes, yet the choice matters greatly for multi-sector trade policy analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and data.&lt;/strong&gt; The authors build a multi-sector general-equilibrium gravity model generalizing Krugman (1980) to many countries/sectors with input-output linkages (as in Caliendo-Parro 2015). The model nests constant returns (Armington, σ→∞) and increasing returns. The &amp;ldquo;scale elasticity&amp;rdquo; is 1/(σ−1); the &amp;ldquo;output elasticity&amp;rdquo; of exports equals the trade elasticity (ε−1) times the scale elasticity, and is positive iff there are increasing returns. The empirical application exploits US Permanent Normal Trade Relations with China (PNTR), passed Oct 2000, which removed tariff-revocation uncertainty. Exposure is measured by Pierce-Schott&amp;rsquo;s NTR gap (log gap between non-NTR and NTR tariffs; mean 0.23, SD 0.13, range 0–0.59). Trade data are from CEPII BACI; the baseline sample covers exports from 23 OECD countries (including the US) to 141 importers across 444 NAICS goods industries, in long differences (1995–2000 pre-period vs 2000–07 post-period).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings.&lt;/strong&gt; Reduced-form: US export growth fell in higher-NTR-gap industries after PNTR. The raw Figure 1 slope is −0.51 (SE 0.057); a 10-log-point NTR-gap increase is associated with 5.0 log points lower annual export growth, and the NTR gap explains 18% of cross-industry variation. This is inconsistent with constant returns and implies increasing returns in US goods production. An offsetting &lt;em&gt;input cost effect&lt;/em&gt; (lower imported-input costs) raises exports: PNTR reduced 2007 exports by 13% more for a 75th- vs 25th-percentile NTR-gap industry, but raised them 20% more for a 75th- vs 25th-percentile input-cost-shock industry; net effects range from −18% (Cigarettes) to +56% (Automobiles). A structural IV (NTR gap instrumenting output growth) yields an output elasticity of 0.74 (SE 0.41, preferred column).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative GE results.&lt;/strong&gt; Calibrating the output elasticity to 0.821 (matching the −0.10 conditional NTR-gap effect; trade elasticity set to 5), PNTR raised aggregate US exports/GDP by 3.2%, decomposed into −1.8% real market potential (export destruction), +2.4% input cost, and +2.7% foreign demand. Aggregate export growth is 28% larger with scale economies than without, because scale economies make the input-cost effect almost five times stronger (2.4% vs 0.5%). Exports nevertheless declined in the most exposed sectors (Textiles &amp;amp; Leather, Other Manufacturing), shifting US comparative advantage away from high-NTR-gap sectors. Welfare: PNTR raised US real income 0.068% (real expenditure 0.087%); gains are ~30% smaller than under constant returns because a negative specialization effect (−0.15%) offsets a larger ACR openness gain (0.22%). Chinese gains exceed US gains tenfold.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-theoretical-test-and-why-does-the-sign-of-the-import-liberalization-to-export-relationship-identify-returns-to-scale"&gt;Q1. What is the core theoretical test and why does the sign of the import-liberalization-to-export relationship identify returns to scale?&lt;/h3&gt;
&lt;p&gt;From the bilateral trade equation, the elasticity of exports to output equals the output elasticity (ε−1)/(σ−1), which is strictly positive iff there are increasing sector-level returns. Under constant returns (Proposition 1), conditional on foreign demand and domestic input costs, import liberalization does not affect exports (α1=0). Under increasing returns (Proposition 2), import liberalization shrinks domestic real market potential, lowers output, and — because productivity falls with output under scale economies — reduces exports to ALL destinations (α1&amp;lt;0), with the effect&amp;rsquo;s magnitude strictly increasing in the output elasticity. So estimating whether export growth falls in more-liberalized industries distinguishes the two model classes.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-identification-strategy-and-its-main-threats"&gt;Q2. What is the identification strategy and its main threats?&lt;/h3&gt;
&lt;p&gt;A triple-difference: changes in US bilateral export growth by sector after PNTR relative to changes in other OECD exporters&amp;rsquo; growth, identified from the NTR gap interacted with Post and a US-exporter dummy. The estimating equation (12) uses importer-exporter-industry, importer-exporter-period, and importer-industry-period fixed effects to absorb importer demand, common-across-exporter technology shocks, and industry trends in supply capacity and trade costs. The NTR gap is plausibly exogenous because variation stems mostly from Smoot-Hawley (1930) non-NTR tariffs, unlikely related to economic conditions 70 years later; any endogeneity from NTR tariffs being higher in weak-growth industries would bias against finding a negative effect. Threat 1: unobserved US-specific technology shocks negatively correlated with the NTR gap not captured by input/skill/capital intensity controls. Addressed by re-estimating at HS 6-digit level with NAICS-industry-exporter-period fixed effects (Table 3), still finding negative effects. Threat 2: US-China competition in third markets — if PNTR shifted China&amp;rsquo;s export basket toward US-type products in high-NTR-gap industries. Tested by interacting with China&amp;rsquo;s market share (Table 4); the quadruple interaction is positive and insignificant, ruling this out.&lt;/p&gt;
&lt;h3 id="q3-what-are-the-three-mechanisms-and-how-are-they-distinguished-empirically-and-quantitatively"&gt;Q3. What are the three mechanisms and how are they distinguished empirically and quantitatively?&lt;/h3&gt;
&lt;p&gt;(1) Real market potential / export destruction: import liberalization lowers the US price index, makes the domestic market more competitive, shrinks real market potential and output, and (under scale economies) cuts productivity and exports — identified by the negative α1 on the NTR gap. (2) Input cost effect: lower imported-input costs cut production costs and raise exports — identified by α2 on the input-output-weighted upstream NTR gap (CostShock), found negative and significant (lower input costs → higher exports). (3) Foreign demand effect: GE expansion of global demand and the trade-balance link between imports and exports — absorbed by fixed effects in the regression but recovered in the calibrated model&amp;rsquo;s decomposition (equation 16). In GE: −1.8% (market potential), +2.4% (input cost), +2.7% (foreign demand), netting +3.2%.&lt;/p&gt;
&lt;h3 id="q4-what-heterogeneity-is-documented"&gt;Q4. What heterogeneity is documented?&lt;/h3&gt;
&lt;p&gt;Sector-level: the real market potential effect is negative in all goods sectors and stronger where the NTR gap is higher; the input cost effect is positively correlated with the NTR gap (due to heavy diagonal weight in the I-O table); the foreign demand effect is positive everywhere but uncorrelated with the NTR gap. Net exports/GDP rise in 12 of 15 goods sectors but fall in the highest-NTR-gap sectors — Textiles &amp;amp; Leather falls 22% (−32% market potential, +8.5% input cost, +4.6% foreign demand) and exports decline in 3 of the 4 highest-NTR-gap sectors. Under constant returns, by contrast, export growth is positive in all sectors and weakly POSITIVELY correlated with the NTR gap — qualitatively opposite. The correlation between sector-level export growth with vs without scale economies is insignificant (excluding Textiles &amp;amp; Leather) or significantly negative (including it).&lt;/p&gt;
&lt;h3 id="q5-what-robustness-checks-are-run"&gt;Q5. What robustness checks are run?&lt;/h3&gt;
&lt;p&gt;Appendix C checks robustness to: starting the post-period in 2001 instead of 2000; alternative NTR-gap definitions; aggregating exports across destinations; varying the exporter/importer/industry samples; allowing PNTR to affect domestic expenditure; and controlling for China import growth driven by non-PNTR shocks. An event study (equation 13, Figure 2) shows no NTR-gap/export relationship before 2000 and a negative one from 2001 until the 2007–08 financial crisis, ruling out pre-trends. The first-stage (Table 5) confirms higher-NTR-gap industries had lower OUTPUT growth (paralleling Pierce-Schott&amp;rsquo;s employment result). Alternative calibrations (Appendix D.5): without I-O linkages the market potential effect weakens but total export growth is roughly unchanged; allowing services scale economies raises US gains; combining Textiles &amp;amp; Leather with Other Manufacturing preserves results; using Bartelme et al. (2019) sector-varying elasticities still yields a negative specialization effect.&lt;/p&gt;
&lt;h3 id="q6-how-is-the-output-elasticity-calibrated-and-how-does-it-compare-to-the-structural-estimate"&gt;Q6. How is the output elasticity calibrated and how does it compare to the structural estimate?&lt;/h3&gt;
&lt;p&gt;The output elasticity for goods is calibrated to 0.821 by matching the simulated NTR-gap effect to the −0.10 conditional reduced-form estimate (Table 2, column i), with services output elasticity set to zero and trade elasticity (ε−1) set to 5 (Head-Mayer 2014). This is below the value of 1 implied by Krugman (1980) or the Pareto-Melitz model but close to the Bartelme et al. (2019) mean of 0.83. It is reassuringly close to the independent structural IV estimate of 0.74 (SE 0.41). The simulated effect is decreasing in the output elasticity (consistent with Proposition 2 part ii) and rises sharply as the elasticity approaches one; the model has a unique solution for output elasticities below 0.95.&lt;/p&gt;
&lt;h3 id="q7-how-does-the-welfare-decomposition-work-and-why-are-gains-smaller-with-scale-economies"&gt;Q7. How does the welfare decomposition work and why are gains smaller with scale economies?&lt;/h3&gt;
&lt;p&gt;Following Costinot-Rodríguez-Clare (2014), real-income gains decompose into an ACR term (changes in domestic expenditure share / trade openness) and a specialization term that exists only with scale economies (welfare from sectoral reallocation of employment, weighted by adjusted Leontief forward-linkage coefficients). With scale economies the ACR effect is +0.22% (vs +0.10% without), but it is more than offset by a −0.15% specialization effect, netting +0.068% real income — about 30% below the constant-returns gain. The specialization effect is negative because PNTR shifted resources toward services (weaker scale economies; goods output −0.55%, services +0.11%) and, more importantly per Appendix D.5, toward sectors with weaker FORWARD input-output linkages; cross-sectoral heterogeneity in scale economies alone contributes negligibly.&lt;/p&gt;
&lt;h3 id="q8-how-does-this-relate-to-and-differ-from-closely-related-prior-work"&gt;Q8. How does this relate to and differ from closely related prior work?&lt;/h3&gt;
&lt;p&gt;It extends Krugman (1984)&amp;rsquo;s partial-equilibrium oligopoly mechanism to a class of quantitative GE trade models (love-of-variety, external economies, Melitz-Pareto, or endogenous innovation — shown equivalent in Appendix A.3). Unlike prior scale-economy estimates (Antweiler-Trefler 2002, Lashkaripour-Lugovskyy 2018, Bartelme et al. 2019) and home-market-effect tests (Davis-Weinstein 2003, Costinot et al. 2019), it uses TRADE POLICY variation (not factor content, market size, or exchange rates) for identification and performs an ex-post policy analysis (echoing Goldberg-Pavcnik 2016). Relative to the PNTR/China-shock literature (Pierce-Schott 2016, Handley-Limão 2017, Autor-Dorn-Hanson 2013), it adds a new outcome — US EXPORTS and comparative advantage — and argues the &amp;lsquo;surprisingly swift&amp;rsquo; manufacturing decline would have been smaller absent scale economies. It complements Juhász (2018)&amp;rsquo;s infant-industry evidence (Napoleonic France) by quantifying the export-destruction cost while showing PNTR&amp;rsquo;s net effect on exports and welfare is positive. Dick (1994) tested the same hypothesis cross-sectionally for 1970 US data but found little support.&lt;/p&gt;
&lt;h3 id="q9-what-are-the-policy-implications-and-their-scope-conditions"&gt;Q9. What are the policy implications and their scope conditions?&lt;/h3&gt;
&lt;p&gt;The findings support the existence of the scale-economies channel traditionally invoked to justify protection: pre-PNTR import protection shifted US comparative advantage toward the most-protected industries, and in the calibrated model targeted import protection CAN promote sector-level exports — but not under constant returns. However, the export-destruction effect is dominated, for most sectors and in aggregate, by export-promoting channels (input cost, foreign demand); total export growth is even greater WITH scale economies; and the negative specialization effect is more than offset by traditional gains from trade, so US gains from PNTR remain positive (+0.068% real income). Scope conditions: results rest on the calibrated output elasticity (0.821) and trade elasticity (5); the model assumes constant markups and full employment, so welfare excludes pro-competitive effects (Jaravel-Sager 2020, Amiti et al. 2020) and employment effects (Autor-Dorn-Hanson 2013); it studies a single liberalization episode; and the analysis cannot distinguish among alternative SOURCES of increasing returns. The authors stress accounting for scale economies (or their absence) is a prerequisite for correctly evaluating sector-level trade flows and welfare.&lt;/p&gt;
&lt;h3 id="q10-what-other-notable-findings-or-caveats-appear"&gt;Q10. What other notable findings or caveats appear?&lt;/h3&gt;
&lt;p&gt;PNTR is calibrated as a reduced-form openness shock (α5=0.43; equation 15), equivalent to a 13% average trade-cost reduction on US imports from China (SD 6.6% across industries) given trade elasticity 5 — matching Handley-Limão&amp;rsquo;s 13-percentage-point estimate. The calibrated economy has 12 economies and 24 sectors (15 goods). Chinese gains exceed US gains more than tenfold (because the US was much larger in 2000, so PNTR was a bigger shock to China), and China&amp;rsquo;s nominal wage rose 6.0% relative to the US, contributing to factor-price convergence. For comparison, Caliendo-Parro (2015) find NAFTA raised US welfare 0.08% and Fajgelbaum et al. (2020) find the Trump trade war cut US real income 0.04%. The model in changes is solved via exact hat algebra, holding each country&amp;rsquo;s trade deficit as a constant share of global value-added (which induces the positive import-export link in the foreign-demand term).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Means-Tested Transfers in the US: Facts and Parametric Estimates</title><link>https://macropaperwarehouse.com/papers/means-tested-transfers-in-the-us-facts-and-parametric-estimates/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/means-tested-transfers-in-the-us-facts-and-parametric-estimates/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Guner, Rauh, and Ventura document the scope, generosity, distributional impact, and time evolution of means-tested transfers to working-age US households, and provide parametric estimates of transfer functions for use in applied macroeconomics and public finance. The paper addresses three questions: How large are these transfers? How do they affect income inequality? How have they changed over time? The contribution is descriptive and empirical rather than structural; the paper does not estimate behavioral effects but rather characterizes the effective transfer schedule that households face.&lt;/p&gt;
&lt;p&gt;The data source is the Survey of Income and Program Participation (SIPP), using five waves spanning 1998 to 2016. The benchmark analysis uses the 2014 wave (years 2013–2016). The sample is restricted to household-years in which the head is aged 25–54, is not self-employed, and does not switch marital status within the year — yielding 18,612 households and 38,375 household-year observations. Six programs are covered: TANF, SNAP, WIC, SSI, housing assistance, and Medicaid. For TANF, SNAP, WIC, and SSI, transfer values are observed directly. Medicaid values are imputed using regional HMO premium costs; housing values are imputed as the difference between Fair Market Rent and actual rent paid.&lt;/p&gt;
&lt;p&gt;In the 2013–2016 benchmark period, approximately 35% of working-age households receive some means-tested transfer in a given year, and, conditional on receipt, the average household receives about $17,000 (in 2016 dollars), exceeding one-fourth of average household income. Unconditional total transfers decline steeply with income but in a non-monotone way: households with zero non-transfer income receive $7,500 in non-medical and $13,700 in Medicaid transfers ($21,000 total, or 26% of mean household income). Transfers dip for households with small positive incomes (creating a hump shape), then rise slightly before declining again. At the bottom income decile (0–10%), households receive on average $4,125 in non-medical transfers and $14,141 total. At the median income decile (50–60%), households receive $425 non-medical and $3,006 total. In the top decile, non-medical transfers are negligible ($169) and total transfers are $1,200. The decline in unconditional transfers with income is driven primarily by reduced coverage: conditional on receipt, transfer amounts are relatively stable across income levels, remaining above 15% of mean household income throughout the distribution. The extensive margin of coverage is 82% for zero-income households, 70% for the bottom decile, 29% at the median, and still 5% (non-medical) to 11% (including Medicaid) in the top decile.&lt;/p&gt;
&lt;p&gt;Medicaid is the dominant program throughout. For zero-income households, Medicaid transfers are more than six times larger than the next-largest program (SNAP). Medicaid&amp;rsquo;s share of total transfers rises with income. As a single program, Medicaid reaches 31% of working-age households with an average conditional benefit of about $15,000 per recipient. SNAP covers 18% of households with conditional benefits of about $3,000.&lt;/p&gt;
&lt;p&gt;Transfers substantially compress inequality. The pre-transfer Gini coefficient is 0.48 and falls to 0.42 when all transfers (including Medicaid) are included, and to 0.46 with non-medical transfers only. The pre-transfer 50-10 income ratio of 10.2 drops to 3.0 with all transfers and to 5.6 with non-medical transfers only. The variance of log income falls by nearly 36% (47 log points) with all transfers and by 21% with non-medical transfers. These equalizing effects are concentrated at the bottom of the distribution; for households at 10% of average pre-transfer income, total transfers more than double disposable income.&lt;/p&gt;
&lt;p&gt;Between 1998–1999 and 2013–2016, total unconditional transfers per household quadrupled from approximately 2% to 7.3% of mean household income (from about $1,535 to $6,000). Household coverage rose from 19% to 35%. The expansion is driven almost entirely by Medicaid; non-medical transfers rose only marginally in magnitude (from about 1.3% to 1.8% of mean income), though their coverage increased from 16% to 24% of households. Notably, over this period the concentration of non-medical transfers shifted upward in the income distribution: households with zero income received a smaller relative share in 2013–2016 than in 1998–1999, while shares for households in the second, third, and fourth deciles increased. Pre-transfer income inequality rose substantially over the period, with the Gini increasing from 0.40 to 0.48; the post-transfer Gini rose more moderately, from 0.38 to 0.42, indicating that transfer growth largely offset rising market-income inequality at the bottom.&lt;/p&gt;
&lt;p&gt;For the parametric section, the paper estimates a flexible four-parameter Ricker-style function T(I) = exp(alpha) * exp(beta_0 * I) * I^beta_1 for positive income I (normalized by mean income), with a separate level parameter gamma at I = 0. This captures the hump-shaped pattern at low incomes and the rapid decline thereafter. Implicit benefit reduction rates derived from these estimates are large: earning one additional dollar when starting from zero income reduces total transfers by more than $11,000, as crossing from zero into positive income sharply reduces program eligibility. A more realistic $10,000 income increase reduces total transfers by more than $5,000 — an implicit marginal tax penalty exceeding 50%. Non-medical transfer penalties are somewhat smaller: the first dollar earned reduces non-medical transfers by more than $4,500, and a $10,000 income increase reduces them by about $3,300.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-identification-strategy-and-what-are-the-main-threats-to-it"&gt;Q1. What is the identification strategy and what are the main threats to it?&lt;/h3&gt;
&lt;p&gt;The paper is descriptive, not causal — there is no causal identification strategy in the traditional sense. The authors document reduced-form facts about transfer receipt by income level and demographic group using SIPP microdata. The main methodological choices and data limitations are: (1) Medicaid and housing assistance values are imputed rather than directly observed — Medicaid is valued at regional HMO premiums, which may not accurately reflect the value recipients place on coverage; housing benefits are valued at the difference between state Fair Market Rent and actual rent paid, which can produce negative values (2.7% of cases, set to zero). (2) SIPP is known to under-report income at the top of the distribution relative to the CPS; the paper documents that income shares of the top quintile differ by about five percentage points between SIPP and CPS, largely due to SIPP&amp;rsquo;s poor measurement of asset income. This means the effective transfer schedule at the top of the income distribution may be somewhat distorted. (3) The SIPP was overhauled after 2016, precluding analysis of more recent waves and meaning the trends analysis ends in 2013–2016. (4) Self-employed households are excluded (~7% of households) as their income measurement is noisier.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-paper-handle-the-non-linear-hump-shaped-pattern-in-transfers-at-low-income-levels"&gt;Q2. How does the paper handle the non-linear hump-shaped pattern in transfers at low income levels?&lt;/h3&gt;
&lt;p&gt;The paper documents a hump-shaped pattern: transfers are positive at zero income, fall sharply at very low positive income (around the bottom 1% of the distribution), then increase modestly before declining monotonically. This arises because crossing from zero income to any positive income can reduce eligibility for several programs simultaneously. The parametric functional form — the Ricker function from fisheries biology — is specifically chosen to capture this pattern: for I &amp;gt; 0, T(I) = exp(alpha) * exp(beta_0 * I) * I^beta_1, where the beta_0 term governs the initial decline/rise and beta_1 allows further curvature. The zero-income level gamma is estimated separately as a discontinuity. The tight confidence intervals around observed income-percentile averages confirm that the fitted function closely tracks the data.&lt;/p&gt;
&lt;h3 id="q3-what-heterogeneity-by-demographic-group-is-documented"&gt;Q3. What heterogeneity by demographic group is documented?&lt;/h3&gt;
&lt;p&gt;The paper documents heterogeneity along three dimensions — marital status, number of children, and age of children — in each case reporting both unconditional and conditional transfer amounts and coverage by income decile. Key findings: (a) Marital status: Single-woman households with zero income receive 12% of mean household income in non-medical transfers and about 31% in total transfers. Married households with zero income receive 27% total, and single men receive 17.9% total. At higher income levels, married households can receive more in total transfers than single women, because Medicaid coverage is broader for families. Single-woman households show the highest coverage at very low incomes (88% receive some transfer), but married households lead in coverage at middle income levels. Single men show surprisingly high coverage even at relatively high incomes. (b) Number of children: Transfers increase substantially with children. A first-decile married household without children receives about 1.7% of average income in non-medical transfers and 9% total; with two or more children, non-medical transfers rise nearly five-fold for single-woman households in the same decile. (c) Age of children: Transfers decline as children age, but the magnitude of the age gradient is smaller than the number-of-children gradient.&lt;/p&gt;
&lt;h3 id="q4-how-do-conditional-and-unconditional-transfers-compare-across-the-income-distribution"&gt;Q4. How do conditional and unconditional transfers compare across the income distribution?&lt;/h3&gt;
&lt;p&gt;Unconditional transfers (averaged over all households including non-recipients) decline steeply with income, driven primarily by falling coverage rates. Conditional transfers (among recipients only) are much more stable. For zero-income households, total conditional transfers average $26,500 (32% of mean income) versus $21,000 unconditionally. In the bottom decile, conditional total transfers are about $21,000 or 26% of mean income. After the third income decile, conditional transfer levels stabilize and remain above 15% of mean income throughout most of the distribution. This means that once a household is enrolled in the transfer system, the amounts received are relatively constant regardless of where in the distribution they fall; the intensive margin differences are largely accounted for by Medicaid, which has high conditional values even at middle income levels.&lt;/p&gt;
&lt;h3 id="q5-what-role-does-medicaid-play-relative-to-non-medical-programs"&gt;Q5. What role does Medicaid play relative to non-medical programs?&lt;/h3&gt;
&lt;p&gt;Medicaid dominates the transfer system for working-age households by every measure. It reaches 31% of households in the benchmark period (the next largest program, SNAP, covers 18%). For zero-income households, Medicaid transfers are more than six times larger than SNAP (the next largest non-medical program). Medicaid&amp;rsquo;s share of total transfers grows with income: for zero-income households, total transfers are less than three times non-medical transfers; for households in the 50–60th percentile, this ratio exceeds six. In terms of aggregate spending, Medicaid rose from below 1% of GDP in 1980 to more than 3% in 2022, while non-medical transfers declined from 1.6% to about 1% of GDP over the same period. Almost the entire growth in household transfers between 1998 and 2016 is attributable to Medicaid expansion. Medicaid is also the most important single contributor to measured inequality reduction.&lt;/p&gt;
&lt;h3 id="q6-how-do-transfers-affect-income-inequality-and-how-has-this-changed-over-time"&gt;Q6. How do transfers affect income inequality and how has this changed over time?&lt;/h3&gt;
&lt;p&gt;In the 2013–2016 benchmark, total transfers reduce the Gini coefficient by 6 points (from 0.48 to 0.42) and the variance of log income by nearly 36%. The 50-10 income ratio falls from 10.2 to 3.0. Non-medical transfers alone reduce the Gini by 2 points (to 0.46) and the 50-10 ratio to 5.6. The impact is concentrated at the bottom of the distribution: transfers more than double total income of households with pre-transfer income around 10% of the mean. Over time, pre-transfer inequality rose sharply, with the Gini going from 0.40 (1998–1999) to 0.48 (2013–2016) and the 50-10 ratio doubling from 4.19 to 10.2. Post-transfer inequality rose more mildly: the Gini increased from 0.38 to 0.42 (all transfers), and the 50-10 ratio remained stable at around 3 throughout. Excluding Medicaid, the moderating effect is weaker; the Gini rose from 0.39 to 0.46 on a post-non-medical-transfer basis.&lt;/p&gt;
&lt;h3 id="q7-how-has-the-concentration-of-transfers-across-income-groups-evolved-over-time"&gt;Q7. How has the concentration of transfers across income groups evolved over time?&lt;/h3&gt;
&lt;p&gt;A notable distributional shift occurred between 1998–1999 and 2013–2016. For non-medical transfers, the share accruing to households with zero income declined substantially — from receiving about $9 per $100 of total transfers distributed in 1998–1999 to about $4 in 2013–2016. Similarly, the relative share for the bottom decile declined. In contrast, the share going to households in the second, third, and fourth income deciles increased. For total transfers including Medicaid, the pattern is similar but the shift is less pronounced, partly because Medicaid expansion was broad and reached middle-income working families. The authors interpret this as reflecting the design changes in the transfer system: TANF (which targeted the very bottom) declined sharply while Medicaid expansion (which reaches further up the distribution) grew.&lt;/p&gt;
&lt;h3 id="q8-what-are-the-implicit-benefit-reduction-rates-and-why-do-they-matter"&gt;Q8. What are the implicit benefit reduction rates and why do they matter?&lt;/h3&gt;
&lt;p&gt;The paper derives implicit benefit reduction rates from the estimated parametric transfer functions. At zero income, earning the first dollar of income triggers a very large decline in transfers because eligibility for several programs is lost simultaneously. Specifically, earning $1 reduces non-medical transfers by more than $4,500 and total transfers by more than $11,000. This enormous implicit marginal tax reflects the discontinuity at zero income. For more realistic income increments, earning an additional $10,000 when starting from zero income reduces total transfers by more than $5,000 (over 50% implicit tax rate) and non-medical transfers by about $3,300. These findings are directly relevant for quantitative macroeconomic models that study labor supply and welfare, since the effective marginal tax on low-income workers entering employment is substantially higher than the statutory rate.&lt;/p&gt;
&lt;h3 id="q9-how-does-the-paper-differ-from-prior-work-on-parametric-tax-and-transfer-functions"&gt;Q9. How does the paper differ from prior work on parametric tax and transfer functions?&lt;/h3&gt;
&lt;p&gt;The closest antecedents are Gouveia and Strauss (1994), Heathcote, Storesletten, and Violante (2017) (who use the Benabou log-linear tax function), and Guner, Kaygusuz, and Ventura (2014) (who provide effective income tax estimates). Prior work either focused on taxes only or combined taxes and transfers into a single progressivity measure. This paper is the first to estimate effective transfer functions separately from the tax system, decomposed by program, by marital status, and by number of children. Relative to Guner et al. (2023), which assumed transfers decline linearly with income, this paper estimates a more flexible non-linear function that captures the hump at very low incomes. Relative to Ferriere et al. (2023), who propose a transfer function that increases then decreases with income, the current paper provides empirical estimates rather than a theoretical prescription. The functional form (a Ricker-style function with a separate parameter at zero income) is also more flexible than prior approximations.&lt;/p&gt;
&lt;h3 id="q10-what-data-limitations-are-noted-and-how-do-they-affect-comparability-with-other-sources"&gt;Q10. What data limitations are noted and how do they affect comparability with other sources?&lt;/h3&gt;
&lt;p&gt;The paper compares SIPP income distributions with the CPS. Both surveys yield similar Gini coefficients and variance of log income, but SIPP shows higher income shares for the bottom quantiles and lower shares for the top quintile (a discrepancy of about five percentage points). This reflects SIPP&amp;rsquo;s weaker measurement of asset income, which is a larger component of total income as one moves up the distribution. The analysis excludes self-employed households (~7%) because their income is harder to measure. The SIPP was overhauled after 2016, making cross-wave comparisons infeasible for later years; this means the paper cannot characterize the effects of post-2016 Medicaid expansion, the COVID-19 pandemic transfer surge, or recent SNAP reforms. For Medicaid, the imputation using regional HMO costs does not capture the insurance value as households themselves perceive it, a standard limitation in this literature also noted by Ben-Shalom et al. (2012) and Scholz et al. (2009) whose methods the paper follows.&lt;/p&gt;
&lt;h3 id="q11-what-are-the-policy-implications-of-the-findings"&gt;Q11. What are the policy implications of the findings?&lt;/h3&gt;
&lt;p&gt;Several implications follow with scope conditions: (1) The transfer system substantially reduces income inequality, but the lion&amp;rsquo;s share of the reduction comes from Medicaid. Policies that reduce Medicaid coverage would substantially raise measured inequality, particularly at the bottom of the distribution. (2) The implicit benefit reduction rates documented — above 50% for a $10,000 income gain at the bottom — generate large effective marginal taxes on low-income households entering employment, relevant for evaluating welfare-to-work policies and for calibrating labor supply elasticities in quantitative models. (3) Despite the large size of the system, the decline in TANF spending (from above 1% of GDP to 0.1%) means that unrestricted cash assistance to the very poorest has fallen sharply; the system has shifted toward in-kind and medical programs that provide less flexibility to recipients. (4) The shift in transfer concentration away from zero-income households toward the second through fourth deciles suggests that the system increasingly supports the working poor rather than the non-working poor — a structural change in the composition of welfare that quantitative models should incorporate. These implications pertain to households headed by working-age adults (25–54), are based on pre-2016 data, and exclude the institutionalized population and self-employed households.&lt;/p&gt;
&lt;h3 id="q12-what-are-the-key-features-of-the-parametric-function-and-how-well-does-it-fit-the-data"&gt;Q12. What are the key features of the parametric function and how well does it fit the data?&lt;/h3&gt;
&lt;p&gt;The estimated function has the form T(I) = exp(alpha) * exp(beta_0 * I) * I^beta_1 for I &amp;gt; 0 and T(0) = gamma, estimated by non-linear least squares on income-percentile averaged data. The function is flexible enough to capture: (a) a strictly positive level at zero income; (b) an initial increase then decrease at very low positive incomes (the hump); (c) a decay toward zero at high incomes that can be faster or slower depending on beta_1. The fit is shown to be close — Figure 7 documents tight confidence intervals around mean transfers by percentile, confirming that a smooth function well approximates the data. Parameter estimates are provided for each individual program, for non-medical aggregates, for total transfers, and separately for married and single households and by number of children (in appendix tables C10–C12). The zero-income gamma parameter is notably small for TANF (0.00) and large for Medicaid (0.24) and total transfers (0.26), consistent with the descriptive findings on coverage.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Means-tested transfer&lt;/strong&gt;: In this paper, a government transfer program for which eligibility and benefit amounts are conditioned on household income and assets, targeting the non-retired working-age population. The six programs studied are TANF, SNAP, WIC, SSI, housing assistance, and Medicaid.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intensive margin of coverage&lt;/strong&gt;: The fraction of months in a given calendar year during which a household receives a positive transfer amount, as distinct from the extensive margin (whether the household receives any transfer at all during the year). The paper documents both margins separately.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Implicit benefit reduction rate (implicit penalty)&lt;/strong&gt;: The reduction in transfer payments associated with a marginal increase in non-transfer income, expressed as the derivative of the estimated transfer function with respect to income. In this paper the implicit penalty at zero income is very large because moving from zero to any positive income simultaneously triggers loss of eligibility in multiple programs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Unconditional vs. conditional transfer&lt;/strong&gt;: Unconditional transfers are averages computed over all households at a given income level, including non-recipients. Conditional transfers are averages computed only among households that actually receive a positive amount. The paper shows that the steep decline in unconditional transfers with income is almost entirely a coverage effect; conditional amounts remain relatively stable across the distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ricker transfer function&lt;/strong&gt;: The parametric functional form T(I) = exp(alpha) * exp(beta_0 * I) * I^beta_1 adopted by the paper to fit the non-linear relationship between normalized household income and normalized transfer receipt for I &amp;gt; 0, with a separate parameter gamma for I = 0. Borrowed from the Ricker (1954) stock-recruitment model in fisheries biology and chosen for its flexibility in capturing the hump-shaped pattern at very low incomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Non-medical transfers&lt;/strong&gt;: The aggregate of TANF, SNAP, WIC, SSI, and housing assistance — the programs that provide cash or in-kind support excluding health insurance. The paper distinguishes these from total transfers throughout to separate the role of Medicaid, which dominates all other programs in magnitude.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Medicaid imputation&lt;/strong&gt;: The procedure used to assign a monetary value to Medicaid enrollment, following Scholz et al. (2009) and Ben-Shalom et al. (2012). Each enrolled household member is assigned the cost of a single HMO policy in their Census region (from the Kaiser Foundation Employer Health Benefits survey), with family policies or sums of individual policies used for multi-member households, and a 2.5× multiplier for elderly or disabled individuals to reflect higher medical needs.&lt;/p&gt;</description></item><item><title>Medical innovation and health disparities</title><link>https://macropaperwarehouse.com/papers/medical-innovation-and-health-disparities/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/medical-innovation-and-health-disparities/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper asks why medical innovation can widen health disparities even when it unambiguously improves health for everyone who takes it. The authors argue that the standard access-versus-preferences dichotomy is a false one: disadvantaged patients can rationally forgo effective medications because treatment side effects interfere with work, and the income cost of not working is particularly severe for low-education workers who hold physically demanding, inflexible jobs. Health-maximizing and welfare-maximizing behavior are therefore not the same thing, and the gap between the two is systematically larger for lower-education individuals.&lt;/p&gt;
&lt;p&gt;The empirical setting is the introduction of Highly Active Antiretroviral Therapy (HAART) for HIV in the mid-1990s. HAART was substantially more effective than prior mono- and combo-therapy at preventing AIDS progression and death, but it produced harsh physical side effects (fatigue, diarrhea, headache, fever). Data come from the Multi-Center AIDS Cohort Study (MACS), a semi-annual panel of men who have sex with men in Baltimore, Chicago, Pittsburgh, and Los Angeles, covering 1991–2003. After sample restrictions, the analysis uses 11,290 person-visit observations for 1,201 HIV-positive individuals aged 30–64, approximately 63% of whom hold a college degree or more. The study dichotomizes education into less-than-college versus college-or-more and tracks treatment choices, labor supply, immune-system health (CD4 count, with AIDS threshold at 250), physical ailments, income, insurance, and out-of-pocket medical expenditures.&lt;/p&gt;
&lt;p&gt;The structural model is a lifecycle discrete-choice dynamic programming framework in which forward-looking individuals simultaneously choose treatment (no treatment, monotherapy, combotherapy, and post-1995 HAART) and full-time work or non-work each half-year period to maximize expected lifetime utility. Health and survival evolve stochastically as functions of prior health, treatment, and age. Utility is a function of consumption (income minus out-of-pocket expenses), ailments, and labor supply, with utility parameters allowed to differ by education. The model is estimated via maximum likelihood using nested backwards induction; the quasi-experimental introduction of HAART as an unanticipated shock helps identify utility parameters.&lt;/p&gt;
&lt;p&gt;Key quantitative results: (1) HAART drastically reduced mortality for both groups—six-month mortality fell from 9% to 2% for less-educated men and from 6% to 1% for college graduates—and raised the probability of maintaining a high CD4 count from 62% to 78% (less-educated) and 68% to 83% (college+). (2) Despite equivalent access (both groups face roughly 91-95% insurance coverage and similarly low out-of-pocket costs), lower-educated men adopted HAART at a lower rate (58% of post-HAART visits versus 66% for college graduates) and approximately five months later. (3) The structural utility parameters confirm that while the direct disutility of ailments is not significantly different across education groups, the disutility of working while experiencing ailments is substantially larger in magnitude for less-educated men (estimated parameter -2.73) than for college graduates (-1.97). (4) Measured as expected lifetime utility, HAART&amp;rsquo;s introduction increased value for low-CD4 men by 236.1% (less-educated) versus 176.6% (college+), but in absolute utility units the gains were larger for college graduates—establishing that HAART increased welfare inequality. (5) Decompositions show the largest single driver of the education gap in HAART value is the differential survival process; income differences also matter but financial access variables (insurance, out-of-pocket costs) explain little. (6) A simulated six-month HAART mandate improves health—by 1.7 percentage points more for less-educated men—but reduces expected lifetime value by 2.8% for the less-educated versus 1.4% for college graduates, and reduces employment by 4.1% versus 1.6%, as mandated HAART forces men into ailment-producing treatment whose side effects they cannot manage alongside work. (7) A counterfactual $10,000-per-six-months non-labor income subsidy (similar to COVID-19 transfer policies) reduces work by 31–49% for less-educated men and by 25–39% for college graduates, while inducing an 81.2% increase in HAART take-up among less-educated men in good health who were not previously on treatment (from 5% to 9% baseline probability), and a 44.5% increase for similar college graduates (8% to 11%). For men with AIDS-level CD4 counts not on treatment, the policy raises the probability of being healthy next period by 12.6% for less-educated men and 5.3% for college graduates.&lt;/p&gt;
&lt;p&gt;The central mechanism is a wedge between health and welfare that is steeper for disadvantaged workers: occupational conditions make it harder to work while experiencing side effects, so the opportunity cost of HAART compliance is higher. This means effective medical innovation—precisely by creating more severe side effects than older regimens—can widen welfare inequality even as it compresses mortality gaps. Clinical trials that randomize assignment to treatment and measure health outcomes will register the innovation as a success while masking the distributional welfare costs. Policy interventions that reduce the cost of not working (income transfers, labor market restructuring) can simultaneously increase HAART take-up and improve health, with effects concentrated among the disadvantaged.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-main-identification-strategy-and-what-are-the-key-threats-to-identification"&gt;Q1. What is the main identification strategy and what are the key threats to identification?&lt;/h3&gt;
&lt;p&gt;The model is estimated by maximum likelihood using nested backwards induction over observable state variables. A key identifying variation is the quasi-experimental, unanticipated introduction of HAART in 1995, which shifts the choice set mid-panel and allows the authors to trace behavioral responses to an exogenous change in treatment efficacy and side-effect profiles. Disutility of ailments and work parameters are identified by conditional choice probabilities given state variables (health, ailment status, prior treatment) and by comparing behavior before and after HAART availability. The authors follow Magnac and Thesmar (2002) to establish that under the distributional assumptions (Type I EV shocks, fixed discount factor β=0.95) and the normalization imposed, the likelihood has a unique maximum. The main threats are: (a) the assumption that individuals were surprised by HAART (no forward-looking anticipation), which simplifies the model but is explicitly noted—Hamilton et al. (2021) show that incorporating individual expectations substantially complicates the framework; (b) the exclusion of unobserved heterogeneity in the utility function, though specifications including it produce very small probabilities of a second type (below 5%); (c) the absence of borrowing and saving, which could allow more educated individuals to smooth consumption across treatment cycles—the authors note this would bias downward the disutility of working with ailments for higher-educated individuals, meaning the estimated cross-education difference in that parameter is a lower bound; (d) the sample is restricted to white men in four cities, limiting external validity; and (e) the education dichotomy collapses heterogeneity within education groups.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-main-mechanisms-through-which-education-moderates-the-health-welfare-tradeoff-and-how-are-they-distinguished-empirically"&gt;Q2. What are the main mechanisms through which education moderates the health-welfare tradeoff, and how are they distinguished empirically?&lt;/h3&gt;
&lt;p&gt;The paper identifies two nested channels. First, the estimated structural utility parameter for working while experiencing ailments is larger in magnitude for less-educated men (θ = -2.73) than for college graduates (θ = -1.97), indicating greater disutility from combining work and side effects. The paper argues this reflects occupational sorting: lower-education men are significantly more likely to hold manual occupations (occupation score 5.12 versus 4.49 for college graduates, where higher scores indicate more manual tasks per Autor et al. 2003), making physical side effects especially incompatible with job performance. Second, lower-educated men have lower incomes ($15,373 versus $22,290 per half-year for less-educated versus college-educated, pre-HAART), so the income cost of not working is larger in relative terms, creating stronger incentives to maintain employment even at the cost of forgoing treatment. The authors decompose the relative contribution of these mechanisms in the non-labor income subsidy simulation: when they give lower-educated men the income process of higher-educated men (Appendix Figure A1), the gap in behavioral response narrows but does not close; when they give lower-educated men the disutility parameters of higher-educated men (Figure A2), similarly the gap narrows but remains. Both mechanisms are jointly operative.&lt;/p&gt;
&lt;h3 id="q3-what-heterogeneity-in-haart-take-up-and-welfare-value-is-documented"&gt;Q3. What heterogeneity in HAART take-up and welfare value is documented?&lt;/h3&gt;
&lt;p&gt;Education is the primary heterogeneity dimension examined. Post-HAART, lower-educated men used HAART in 58% of observations versus 66% for college graduates, were slower to start (5 months later on average), and less likely to ever use it (67% versus 81%). Health status interacts with education: low-CD4 men gain more in percentage terms from HAART because they are more in need of its health-improving effects (236.1% gain for less-educated low-CD4 versus 176.6% for college-educated low-CD4; 85.7% versus 76.3% for high-CD4 men, with college graduates gaining more in absolute utility units throughout). The welfare cost of a treatment mandate is higher for less-educated men (2.8% lifetime value decline versus 1.4%), and the employment reduction induced by the mandate is also larger for them (4.1% versus 1.6%). In the income subsidy simulation, low-CD4 men not on any medication show the largest health response. The paper does not examine race/ethnicity heterogeneity, having excluded non-white individuals from the analysis due to sampling methodology concerns.&lt;/p&gt;
&lt;h3 id="q4-what-does-the-value-decomposition-reveal-about-why-haart-benefited-more-educated-men-more"&gt;Q4. What does the value decomposition reveal about why HAART benefited more-educated men more?&lt;/h3&gt;
&lt;p&gt;Table A17 sequentially replaces the processes and parameters of lower-educated agents with those of higher-educated agents. Giving lower-educated men the income process of college graduates narrows but does not close the gap—income is not the primary driver. Replacing the insurance and medical expenditure processes slightly reduces value for less-educated men relative to giving them only the income process, because more-educated individuals actually have somewhat higher out-of-pocket costs. Changing the health and ailments processes has modest positive effects. The largest single contributor to closing the education gap is the survival process: less-educated men face much higher baseline mortality, which depresses the expected present value of all future flows including the gains from HAART. This suggests that policies targeting survival differentials (e.g., access to other health services) could partially close the HAART welfare gap. Finally, replacing the utility parameters mechanically closes the remaining gap, but preferences are less amenable to direct policy intervention than the survival process.&lt;/p&gt;
&lt;h3 id="q5-what-do-the-treatment-mandate-simulations-show-and-why-do-they-matter-for-evaluating-clinical-trials"&gt;Q5. What do the treatment mandate simulations show, and why do they matter for evaluating clinical trials?&lt;/h3&gt;
&lt;p&gt;A six-month HAART mandate mimics randomized assignment to treatment in a clinical trial. It improves health—the probability of high CD4 rises by 1.7 percentage points more for less-educated men than baseline (reflecting a larger baseline gap in HAART use)—which would appear a policy success from a health-only perspective. However, expected lifetime utility falls by 2.8% for less-educated men and 1.4% for college graduates, because mandated HAART forces individuals into ailment-inducing treatment they would not have chosen, inhibiting labor supply. Employment falls by 4.1% for less-educated men versus 1.6% for college graduates. Appendix analyses removing the ailment-producing properties of treatment largely eliminate both the welfare cost and the employment effect, confirming that ailments are the mediating channel. This shows that clinical trials—which typically report health endpoints and do not measure welfare or distributional consequences—can mask the costs that effective but side-effect-heavy treatments impose, and that those costs fall disproportionately on less-advantaged patients.&lt;/p&gt;
&lt;h3 id="q6-what-does-the-non-labor-income-subsidy-simulation-show-and-which-groups-respond-most"&gt;Q6. What does the non-labor income subsidy simulation show, and which groups respond most?&lt;/h3&gt;
&lt;p&gt;A permanent $10,000-per-six-months increase in non-employment income (approximately 50% of median income, calibrated to COVID-era transfer policies) induces labor force exit across all groups but concentrates its health-promoting effects among disadvantaged men who were not already on HAART. Among relatively healthy (high-CD4) less-educated men not using any medication, HAART take-up rises by 81.2% (from 5% to 9%); the corresponding figure for college graduates is 44.5% (from 8% to 11%). Among men with AIDS-level (low) CD4 not on treatment, the probability of being healthy next period increases by 12.6% for less-educated men and 5.3% for college graduates. Men already on HAART—who are unlikely to change treatment regardless—show little response. The policy has small but positive health externalities beyond the immediate recipients, since people on antiretrovirals have lower viral loads and lower transmission risk. Decomposition simulations (Appendix Figures A1–A2) show that both the income-level channel and the disutility-of-work-with-ailments channel independently contribute to the larger lower-education response, with neither alone sufficient to fully explain the differential.&lt;/p&gt;
&lt;h3 id="q7-how-does-this-paper-relate-to-and-differ-from-closely-related-prior-work"&gt;Q7. How does this paper relate to and differ from closely related prior work?&lt;/h3&gt;
&lt;p&gt;The paper is most closely related to Papageorge (2016, Quantitative Economics), which uses the same MACS data and setting to link non-uptake of HAART to labor supply and side effects. The key difference is scope: Papageorge (2016) focuses on individual-level mechanisms; the present paper&amp;rsquo;s goal is to characterize distributional differences in the health-welfare tradeoff across education groups and to show that innovation can exacerbate existing inequality. Chan, Hamilton, and Papageorge (2016, Review of Economic Studies) also use the MACS setting to study the value of medical innovation, and Hamilton, Hincapié, Miller, and Papageorge (2021, International Economic Review) examine the diffusion of HAART. Relative to the sociological fundamental cause theory literature (Link and Phelan 1995; Phelan et al. 2010), which documents that medical innovations tend to widen health disparities, the present paper provides a structural quantification of the specific mechanisms and their relative magnitude. Relative to papers attributing health disparities primarily to access barriers (insurance, cost), the paper provides evidence that for this sample—where insurance coverage exceeds 91% even for less-educated men and HIV drugs are inexpensive—access explains little of the educational disparity in HAART use or health outcomes.&lt;/p&gt;
&lt;h3 id="q8-what-are-the-policy-implications-and-their-scope-conditions"&gt;Q8. What are the policy implications and their scope conditions?&lt;/h3&gt;
&lt;p&gt;The core implication is that policies reducing the cost of not working—income transfers, disability benefits, worker protections—can raise HAART adoption and improve health among disadvantaged patients, precisely the group for whom standard health-access policies have limited traction. The non-labor income subsidy simulation suggests that the health improvements are modest in absolute magnitude (a 0.2% rise in probability of being healthy next period for the best-responding group among high-CD4 non-HAART users, and 13% for low-CD4 non-HAART users), but there are unmodeled positive externalities through reduced transmission risk that would multiply the social return. Scope conditions: (1) The sample is white men who have sex with men in four U.S. cities during 1991–2003, enrolled in a prospective cohort study; generalizability to other populations (women, racial minorities, other diseases) is uncertain. (2) The income subsidy that triggers HAART take-up must be large enough to induce labor force exit; a $10,000 per-six-months transfer is needed to generate the simulated behavioral response, larger for higher-income workers. (3) The paper explicitly notes that drug costs and insurance are not binding constraints in this sample, and the policy conclusions may differ in settings with weaker drug coverage. (4) Mental health is excluded from the model; the paper shows depression variables have smaller effects on treatment choice than the physical mechanisms included, but mental health could independently affect some populations&amp;rsquo; response. The paper&amp;rsquo;s conclusions extend to other conditions where effective treatment has disabling side effects and disadvantaged patients hold inflexible physical jobs—the authors invoke COVID-19 as a contemporary analog.&lt;/p&gt;
&lt;h3 id="q9-what-robustness-checks-are-conducted"&gt;Q9. What robustness checks are conducted?&lt;/h3&gt;
&lt;p&gt;The authors report several robustness exercises. Treatment transition results are shown to be robust to defining the HAART introduction period as survey visit 23 or 25 rather than 24. Ailment specifications are noted to be robust to varying the type or frequency of ailments counted (citing Papageorge 2016 for this). Specifications including unobserved heterogeneity in the utility function produce very small second-type probabilities (below 5%), arguing against its inclusion. The treatment mandate simulations are run under three alternative shock-assignment methods (2 draws, 8 draws, and the preferred 2-draw approach), with results consistent across methods on the main welfare-versus-health asymmetry. Appendix Tables A19 and A20 remove ailments from all medications and from HAART only, respectively, confirming that the welfare cost of mandates is driven by treatment-induced ailments. Appendix Figures A1 and A2 mechanically decompose the education-differential response to the income subsidy by replacing income processes and disutility parameters separately, confirming that both channels are active. The model fit (Table A9) shows overall employment (66% model, 66% data) and HAART use (33% model, 36% data) closely matching, though the model slightly over-predicts medication use among low-CD4 individuals.&lt;/p&gt;
&lt;h3 id="q10-why-does-the-paper-focus-on-white-men-only-and-what-does-this-imply-for-interpretation"&gt;Q10. Why does the paper focus on white men only, and what does this imply for interpretation?&lt;/h3&gt;
&lt;p&gt;The authors drop 1,098 observations from 390 non-white individuals because of concerns about the sampling methodology used to recruit the refresher sample for those individuals—specifically, non-white participants entered the panel via a different selection process that could confound estimates. The paper does not investigate racial disparities in HAART take-up, which are also well-documented in the literature. This is a significant limitation because HIV/AIDS has disproportionately affected Black men in the United States, and the mechanisms the paper identifies—occupational sorting, income constraints, disutility of working with ailments—may operate differently or more intensely along racial lines. The authors acknowledge this limitation and note that the structural framework could in principle be applied to other groups if appropriate data were available.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Health-welfare tradeoff&lt;/strong&gt;: In this paper, the wedge between the action that maximizes health (taking effective medication despite side effects) and the action that maximizes lifetime utility (avoiding medication to remain employed and maintain income). The tradeoff is not a bias or error but a rational response to economic constraints, and it is wider for less-educated individuals whose occupational conditions make working with side effects especially costly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;HAART (Highly Active Antiretroviral Therapy)&lt;/strong&gt;: A combination antiretroviral HIV treatment introduced in the mid-1990s, far more effective than prior mono- or combo-therapy at improving CD4 count and preventing AIDS-level immune decline and death. In this paper&amp;rsquo;s model, HAART serves as the innovation whose adoption the authors study: it is more efficacious but produces harsher side effects than earlier treatments, and its introduction is treated as an unanticipated aggregate shock.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Disutility of working with ailments&lt;/strong&gt;: A structural utility parameter (θ_2,f=0) capturing how much worse-off an agent feels from working while experiencing physical ailments (fatigue, diarrhea, headache, fever). Estimated at -2.73 for less-educated men and -1.97 for college graduates, this parameter is the primary driver of the differential health-welfare tradeoff across education groups and explains why side-effect-bearing treatments like HAART are disproportionately avoided by lower-education workers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Treatment mandate simulation&lt;/strong&gt;: A counterfactual in which all agents are assigned to HAART for six months (eliminating choice among other treatment options), used to mimic randomized assignment in a clinical trial. The simulation is designed specifically to illustrate that health improvements observable in a clinical trial coexist with welfare reductions and employment disruptions that would not be captured in standard trial endpoints.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fundamental cause theory&lt;/strong&gt;: A sociological framework (Link and Phelan 1995) arguing that socioeconomic status is a &amp;lsquo;fundamental cause&amp;rsquo; of health disparities that persists despite or is even amplified by medical innovation, because more advantaged individuals are better positioned to adopt and benefit from new treatments. The paper provides structural economic microfoundations for this theory by quantifying the mechanisms through which HAART&amp;rsquo;s introduction widened the welfare gap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Non-labor income subsidy&lt;/strong&gt;: A counterfactual policy simulation in which non-employment income is raised by $10,000 per six months (approximately 50% of the median person&amp;rsquo;s income), modeled after COVID-19 transfer policies. In the paper&amp;rsquo;s model this policy reduces employment but increases HAART take-up and health improvements particularly for less-educated HIV-positive men who were previously forgoing treatment to maintain income from work.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Source text origin&lt;/strong&gt;: Not a paper-specific concept but denoted here: the full working paper text was obtained from the NBER Working Paper (No. 28864), not from abstract-only, satisfying the GUARD requirement.&lt;/p&gt;</description></item><item><title>Mortgage securitization and information frictions in general equilibrium</title><link>https://macropaperwarehouse.com/papers/mortgage-securitization-and-information-frictions-in-general-equilibrium/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/mortgage-securitization-and-information-frictions-in-general-equilibrium/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper develops a quantitative general equilibrium model of the U.S. housing finance system that jointly determines mortgage credit and mortgage-backed security (MBS) issuance, with the aim of measuring how information frictions in the securitization market amplify aggregate credit cycles. The central motivation is the tight co-movement of mortgage credit and MBS issuance documented in HMDA data from 1990 to 2016: from 2000 to 2019, originators sold or securitized roughly 70 percent of all residential mortgages within the first year of origination, making securitization the dominant source of funding for new lending. When this source of liquidity collapsed during the Great Financial Crisis (GFC), aggregate residential mortgage credit contracted by roughly 41 percent and RMBS issuance contracted by roughly 37 percent on average from 2008 to 2013.&lt;/p&gt;
&lt;p&gt;The model is a discrete-time, infinite-horizon DSGE framework with three types of agents: an impatient representative borrower household, a unit-mass continuum of heterogeneous lenders, and a government. Borrower households consume non-durables and housing services, take on long-term fixed-rate mortgages modeled as perpetuities with geometrically declining payments, and can endogenously default when idiosyncratic housing valuation shocks erode their equity. Lenders face stochastic loan origination costs drawn i.i.d. from a continuous distribution, can privately identify the quality of loans in their portfolios, and access a securitization market modeled after the to-be-announced (TBA) forward market for agency MBS — the largest liquid MBS market in the U.S. The TBA market features anonymous, non-exclusive trades at a single pooling price, and the &amp;ldquo;cheapest-to-deliver&amp;rdquo; convention gives sellers the incentive to offload their lowest-value loans, giving rise to a classic Akerlof-style adverse selection problem. The government captures GSE credit guarantees through a state-contingent subsidy to MBS buyers, financed by a distortionary fee on originators and lump-sum taxes on households. The model is calibrated to match key cross-sectional moments of the HMDA dataset for 1990 to 2006, including the distribution of lending: the top 1 percent of originators accounted for 62 percent of lending and the top 10 percent for 89 percent. These moments of market concentration are central to quantifying the amplification channel.&lt;/p&gt;
&lt;p&gt;Two novel theoretical features distinguish this framework. First, the mortgage interest rate and the security price are jointly determined in equilibrium — a &amp;ldquo;joint price determination&amp;rdquo; property. Second, the severity of information frictions is itself an endogenous function of equilibrium prices, the household default rate, and lenders&amp;rsquo; trading decisions. When household credit risk rises, more loans become low-quality, deteriorating the average quality of the pool offered by sellers. MBS buyers, aware of sellers&amp;rsquo; incentives, demand a larger adverse selection discount; security prices fall; fewer lenders find it profitable to securitize; an endogenous liquidity shortage follows in the credit market; and tighter lending conditions further weaken household balance sheets. This feedback constitutes the adverse selection multiplier.&lt;/p&gt;
&lt;p&gt;Quantitatively, when the calibrated model is fed the sequence of income and housing-valuation shocks observed from 2006 to 2016, it replicates two-thirds of the observed 41 percent contraction in mortgage lending and the full 37 percent contraction in MBS issuance from 2008 to 2013. A shock decomposition (Table 7) shows that, on average over 2008–2013, information frictions account for 40 percent of the model&amp;rsquo;s predicted decline in mortgage lending (52 percentage points from housing valuation shocks and 5 percentage points from income shocks make up the remainder; comparable shares hold in the securitization market). There is a 1.5 adverse selection multiplier: absent information frictions, credit would have contracted by 27 percent rather than 41 percent. Housing valuation shocks account for roughly half the total dynamics; income shocks account for about 5 percent.&lt;/p&gt;
&lt;p&gt;Regarding the post-GFC structural changes, the paper evaluates the effect of GSEs expanding their market share to 100 percent (up from 69 percent in 1990–2006) and the threefold increase in the guarantee fee (from 20 to 60 basis points after 2012). These changes reduce the volatility of the mortgage spread from 6.3 to 4.7 percentage points and lower the unconditional probability of a securitization market collapse from 6.5 to near zero. However, the policy generates inefficiently high levels of liquidity, produces only small welfare gains for borrowers (0.06 percent in consumption-equivalent units), and distributes gains unequally — lenders gain approximately 1.3 percent. Households face higher interest rates (lenders pass through the guarantee fee) and higher taxes. The model corroborates other GE studies in finding that credit guarantees were underpriced before the GFC; the actuarially fair price is closer to the post-2012 fee.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-papers-identification-strategy-and-what-is-the-nature-of-the-quantitative-exercise"&gt;Q1. What is the paper&amp;rsquo;s identification strategy and what is the nature of the quantitative exercise?&lt;/h3&gt;
&lt;p&gt;The paper does not use a reduced-form empirical identification strategy; it is a structural DSGE model. The quantitative exercise feeds the calibrated model the observed sequences of aggregate household income shocks and housing valuation shocks from 2006 to 2016, with the model calibrated to match pre-GFC (1990–2006) moments of the U.S. mortgage market. The decomposition of information frictions is accomplished by simulating a complete-information counterfactual for the same shock sequence: the difference between the benchmark model and the complete-information economy quantifies the contribution of private information.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-securitization-liquidity-channel-and-how-does-it-operate-mechanically-in-the-model"&gt;Q2. What is the securitization liquidity channel, and how does it operate mechanically in the model?&lt;/h3&gt;
&lt;p&gt;The securitization liquidity channel is the transmission mechanism from the securitization market to mortgage credit supply. In normal times, lenders with low origination costs (sellers) securitize their loan portfolios, freeing up funds to originate new loans, while high-cost lenders purchase securities rather than originate, effectively specializing their roles through the market. A shock that increases household default risk worsens pool quality. Buyers face a larger adverse selection discount, security prices fall, and the wedge between the market price and a seller&amp;rsquo;s valuation of high-quality loans widens. Many lenders switch from selling to holding, reducing the supply of liquidity in the securitization market. Constrained by limited access to debt markets, lenders cut new mortgage origination. The resulting tightening in credit further deteriorates household balance sheets, creating an amplification loop.&lt;/p&gt;
&lt;h3 id="q3-what-are-the-three-types-of-lenders-in-the-model-and-what-determines-their-trading-decisions"&gt;Q3. What are the three types of lenders in the model, and what determines their trading decisions?&lt;/h3&gt;
&lt;p&gt;Lenders endogenously sort into three groups based on their idiosyncratic origination cost draw z relative to two equilibrium cutoffs. Sellers (low-cost lenders, z below the first cutoff) find origination sufficiently profitable to sell their inventory of loans into the securitization market and originate new ones. Buyers (high-cost lenders, z above the second cutoff) find origination too costly and instead buy securities from sellers. Holders (lenders with z between the two cutoffs) neither sell at the prevailing adverse-selection-discounted price nor buy at the effective cost grossed up by the information wedge; they retain their illiquid loan portfolios and originate fewer new loans. The information wedge — the distance between the two cutoffs — is a decreasing function of the subsidy coverage and an increasing function of the adverse selection discount.&lt;/p&gt;
&lt;h3 id="q4-how-is-the-adverse-selection-discount-endogenously-determined-and-why-does-it-amplify-shocks"&gt;Q4. How is the adverse selection discount endogenously determined, and why does it amplify shocks?&lt;/h3&gt;
&lt;p&gt;The per-unit adverse selection discount mu_t is defined as the aggregate fraction of low-quality loans traded in the securitization market: mu_t = S_B_t / S_t, where S_B_t is the aggregate supply of low-quality loans and S_t is total loans traded. This fraction is endogenous: it depends on which lenders sort into the seller category and what quality distribution their portfolios have, which in turn depends on the household default rate and the equilibrium price. When household credit risk rises, the default rate increases, more loans become low-quality, and sellers selectively offload bad loans while retaining good ones. The endogenous deterioration in mu_t raises buyers&amp;rsquo; required discount, further reducing the security price, which causes additional holders to switch away from selling, compounding the adverse selection problem. This self-reinforcing dynamic is the multiplier.&lt;/p&gt;
&lt;h3 id="q5-under-what-conditions-can-the-securitization-market-shut-down-entirely-and-what-happens-to-credit-in-that-case"&gt;Q5. Under what conditions can the securitization market shut down entirely, and what happens to credit in that case?&lt;/h3&gt;
&lt;p&gt;Proposition 2 establishes that a sufficient condition for market shutdown in the steady state is that the market effective cost of buying securities exceeds the origination cost of the highest-cost lender in the economy. When this condition holds: (1) the securitization market does not operate; (2) every lender originates using only her own technology; and (3) the mortgage rate is higher than when the market operates. Critically, even when the securitization market collapses, the credit market continues to function, but with higher interest rates and lower intermediation volumes. The economy can transition between states with and without an active securitization market.&lt;/p&gt;
&lt;h3 id="q6-what-role-does-market-concentration-of-mortgage-originators-play-in-the-quantitative-results"&gt;Q6. What role does market concentration of mortgage originators play in the quantitative results?&lt;/h3&gt;
&lt;p&gt;Market concentration is crucial for the magnitude of amplification. From 1990 to 2016, the top 1 percent of originators accounted for 62 percent of lending and the top 10 percent for 89 percent (from HMDA data). The model is calibrated to match these moments. Because large originators specialize as securitization sellers, their decision to switch from selling to holding — triggered by rising adverse selection discounts — produces very large contractions in aggregate credit supply. The calibrated lending-cost distribution shows a large discontinuity: the last marginal securitization seller originates a volume four times larger than the next marginal holder. When the most efficient, high-volume lenders exit the securitization market, the aggregate effect is disproportionately large.&lt;/p&gt;
&lt;h3 id="q7-how-does-the-government-subsidy-policy-interact-with-adverse-selection-and-what-are-its-theoretical-properties"&gt;Q7. How does the government subsidy policy interact with adverse selection, and what are its theoretical properties?&lt;/h3&gt;
&lt;p&gt;The GSE credit guarantee is modeled as a state-contingent subsidy tau_t = alpha_G * mu_t, where alpha_G in [0,1] represents the degree of insurance provided. Any positive subsidy reduces the adverse selection wedge by moving the second cutoff leftward, expanding the mass of security buyers. A full subsidy (alpha_G = 1) completely offsets buyers&amp;rsquo; losses from default risk, stabilizing security demand regardless of household credit risk and minimizing the probability of market collapse. However, Proposition 3 establishes that a full subsidy generates inefficiently high levels of liquidity compared to the complete information benchmark: it expands the volume of MBS at lower average quality relative to an economy where low-quality loans are screened out. A full subsidy also fails to replicate complete-information allocations because the guarantee fee distorts lenders&amp;rsquo; origination decisions and raises borrowers&amp;rsquo; mortgage rates.&lt;/p&gt;
&lt;h3 id="q8-what-are-the-welfare-implications-of-the-post-gfc-policy-changes"&gt;Q8. What are the welfare implications of the post-GFC policy changes?&lt;/h3&gt;
&lt;p&gt;The welfare analysis (Table 9) finds small positive but unequal welfare gains. The overall post-GFC policy changes (full subsidy plus higher guarantee fee) yield borrower welfare gains of 0.06 percent and lender welfare gains of 1.3 percent in consumption-equivalent units. Decomposing the changes: the increase in the subsidy (alpha_G from 69 to 100 percent) generates borrower welfare losses of -0.16 percent (due to higher taxes and interest rates, offset partially by lower volatility) and lender gains of 3.01 percent (from improved lending efficiency). The increase in the guarantee fee reverses some of this by generating borrower gains of 0.18 percent and lender losses of -1.53 percent. The paper characterizes these as upper bounds because the full subsidy may generate moral hazard by weakening originators&amp;rsquo; incentives to screen loan quality.&lt;/p&gt;
&lt;h3 id="q9-how-does-this-paper-relate-to-and-extend-justiniano-et-al-2015-2019-and-landvoigt-2016"&gt;Q9. How does this paper relate to and extend Justiniano et al. (2015, 2019) and Landvoigt (2016)?&lt;/h3&gt;
&lt;p&gt;Justiniano et al. (2015, 2019) argue that credit supply constraints — limits on the funds available to lenders — are quantitatively more important than credit demand forces in explaining mortgage credit fluctuations. This paper provides a microfoundation for those constraints by modeling securitization as the dominant source of liquidity for lenders and deriving endogenously how adverse selection limits that liquidity. Landvoigt (2016) introduces securitization in a DSGE housing model in reduced form. This paper goes further by modeling an endogenous securitization market where lenders optimally trade off liquidity benefits against information friction costs, so security prices and mortgage rates are jointly determined rather than imposed exogenously.&lt;/p&gt;
&lt;h3 id="q10-how-does-this-paper-relate-to-the-kurlat-2013-and-bigio-2015-models-of-adverse-selection-in-asset-markets"&gt;Q10. How does this paper relate to the Kurlat (2013) and Bigio (2015) models of adverse selection in asset markets?&lt;/h3&gt;
&lt;p&gt;The securitization design combines Kurlat (2013)&amp;rsquo;s framework of asset creation and reallocation with two additional features specific to the TBA market: (1) the cheapest-to-deliver convention, which means sellers can select the lowest-value loans in their inventory satisfying trade terms; and (2) the non-exclusive, anonymous nature of TBA trades, which ensures a pooling price. Bigio (2015) models endogenous liquidity and the business cycle through information frictions in interbank markets. This paper extends the adverse selection approach to the mortgage market specifically and provides an equilibrium linkage between the securitization market and the credit market rather than modeling them as a single market.&lt;/p&gt;
&lt;h3 id="q11-what-are-the-non-targeted-moments-and-how-well-does-the-model-fit-the-data"&gt;Q11. What are the non-targeted moments and how well does the model fit the data?&lt;/h3&gt;
&lt;p&gt;Three non-targeted moments are reported (Table 5). The model generates a fraction of loan sales of 73.9 percent (data: 61.8 percent from HMDA), a correlation between loan sales and new lending of 0.86 (data: 0.90), and a mortgage spread of 178 basis points (data: 330 basis points). The loan sales fraction is somewhat above data and the spread is substantially below. For targeted cross-sectional moments (Table 6), the model closely matches the distribution of lending by quartile, with Q4 market shares of 0.957 in the model versus 0.959 in the data. For the dynamic GFC episode, the model replicates two-thirds of the 41 percent contraction in mortgage lending and the full 37 percent contraction in MBS issuance.&lt;/p&gt;
&lt;h3 id="q12-what-are-the-sources-of-aggregate-shocks-and-how-are-they-calibrated"&gt;Q12. What are the sources of aggregate shocks and how are they calibrated?&lt;/h3&gt;
&lt;p&gt;The two exogenous aggregate state variables are household income Y_t and the variance of idiosyncratic housing valuation shocks sigma_omega_t (the proxy for mortgage credit risk). They follow a first-order joint Markov process. Income is identified using the cyclical component of disposable personal income from the flow-of-funds accounts. The variance of housing shocks is calibrated to match the national delinquency rate for loans 90+ days delinquent or in foreclosure from the National Mortgage Database (FHFA). The calibrated states produce default rates of 1.8 percent in the low-risk state and 7.9 percent in the high-risk state, with an unconditional default rate of 2.6 percent.&lt;/p&gt;
&lt;h3 id="q13-what-are-the-key-limitations-and-caveats-of-the-analysis"&gt;Q13. What are the key limitations and caveats of the analysis?&lt;/h3&gt;
&lt;p&gt;Several limitations are noted. First, the welfare analysis of the full subsidy is characterized as an upper bound because moral hazard — the impact of guaranteed insurance on originators&amp;rsquo; incentives to screen loan quality — is not modeled. Second, the model abstracts from other consequences of default for borrowers, such as reputation concerns and long-term credit market exclusion. Third, the paper focuses on information frictions between lenders and investors (the securitization chain), not between borrowers and lenders. Fourth, the non-targeted mortgage spread (178 bps in model versus 330 bps in data) suggests some quantitative limitations in matching all features of the credit market simultaneously. Fifth, the exercise is a structural model exercise and not empirically identified through exogenous variation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Securitization liquidity channel&lt;/strong&gt;: The mechanism by which mortgage originator funding capacity depends on their ability to sell loan portfolios in the securitization market; when securitization demand falls, originators face an endogenous liquidity shortage and reduce new mortgage lending, transmitting shocks from the MBS market to the credit market.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Adverse selection multiplier&lt;/strong&gt;: The amplification factor arising from private information in the securitization market: as household credit risk rises, sellers&amp;rsquo; incentives to offload low-quality loans worsen pool quality, causing buyers to demand a larger discount, which causes more lenders to withdraw from selling, creating a feedback loop that magnifies the initial shock to credit supply. Quantified at 1.5 for the GFC episode.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;TBA (to-be-announced) forward market&lt;/strong&gt;: The dominant trading venue for agency MBS in the U.S., accounting for over 90 percent of MBS trading volume, where the specific securities to be delivered are not identified at the trade date and sellers can deliver the cheapest eligible pool (&amp;lsquo;cheapest-to-deliver&amp;rsquo;), institutionalizing adverse selection incentives.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cheapest-to-deliver convention&lt;/strong&gt;: A TBA market practice by which a seller selects and delivers the lowest-value mortgage pools in its inventory that satisfy the terms of trade, giving sellers a systematic informational advantage and incentivizing selective retention of high-quality loans.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Adverse selection discount (mu_t)&lt;/strong&gt;: In this paper, the per-unit discount arising from adverse selection, defined as the endogenous equilibrium fraction of low-quality loans in the aggregate supply of traded loans (S_B_t / S_t); this fraction is determined jointly with prices and lenders&amp;rsquo; trading decisions, and rises when household default risk increases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mortgage credit risk (sigma_omega_t)&lt;/strong&gt;: The standard deviation of idiosyncratic housing valuation shocks to household members, which is the exogenous aggregate state variable that drives default rates; when sigma_omega_t rises, more households fall below the default threshold, increasing the aggregate default rate and degrading the quality composition of lenders&amp;rsquo; portfolios.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Joint price determination&lt;/strong&gt;: A novel equilibrium property of the model in which the mortgage interest rate (in the credit market) and the price of securities (in the securitization market) are simultaneously determined; this interdependence means that adverse selection dynamics in the securitization market directly affect the cost of credit and vice versa.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;GSE credit guarantee (subsidy policy)&lt;/strong&gt;: A state-contingent subsidy tau_t = alpha_G * mu_t paid to MBS buyers, representing the credit guarantees of Fannie Mae and Freddie Mac; financed by a guarantee fee (distortionary tax on originators) and lump-sum taxes on households; alleviates adverse selection by stabilizing security demand but generates inefficiently high liquidity and fails to deliver meaningful household welfare gains.&lt;/p&gt;</description></item><item><title>Pricing-to-market in business cycle models</title><link>https://macropaperwarehouse.com/papers/pricing-to-market-in-business-cycle-models/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/pricing-to-market-in-business-cycle-models/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper evaluates five microfounded pricing-to-market (PTM) mechanisms and one reduced-form aggregator in a two-country DSGE model with volatile exchange rates driven by financial shocks (following Gabaix and Maggiori 2015) and real productivity shocks. The central question is whether existing open-economy theories can jointly achieve three empirically mandated targets — low exchange-rate pass-through to import prices, muted expenditure switching (low short-run trade elasticity), and plausible producer markups — when exchange rates are volatile and act as a major independent source of fluctuations. The paper&amp;rsquo;s main contribution is to show analytically and quantitatively that no existing microfounded PTM model fully escapes a structural tension among these three targets, which the authors call the parameterization trilemma.&lt;/p&gt;
&lt;p&gt;The models evaluated are: (i) the Kimball Aggregator (KA; reduced-form, Itskhoki-Mukhin application); (ii) the Distribution Cost model (CD; Corsetti-Dedola 2005); (iii) the Price Dispersion model (PD; Alessandria 2009); (iv) the Nested CES/Cournot model (NCES; Atkeson-Burstein 2008); (v) the Deep Habits model (DH; Ravn-Schmitt-Grohe-Uribe 2007); and (vi) the Customer Capital model (CC; Drozd-Nosal 2012). The encompassing framework uses the Backus-Kehoe-Kydland (1995) two-country structure augmented with a financial sector that generates UIP deviations via a capacity-constrained arbitrageur segment and exogenous noise-trader positions. The model is estimated/calibrated to quarterly U.S. data (1981Q1–2009Q4 for prices, 1980Q1–2004Q1 for quantities), HP-filtered with lambda = 1,600.&lt;/p&gt;
&lt;p&gt;The baseline markup target is 50%, consistent with BEA input-output tables for U.S. tradable sectors (ranging 45–50% across 2007, 2012, 2017); listed-firm SEC data imply higher values around 73–75%, which the authors treat as an upper bound. The empirical pass-through target is 0.4 (midpoint of a 0.2–0.6 range estimated by Campa-Goldberg 2005 and others; Gopinath-Itskhoki 2022 estimate 0.2–0.3). The short-run trade elasticity target is 0.7, measured using the volatility ratio of quantities to prices, which yields an upper-bound estimate. Real exchange rate volatility is targeted at 3.97 (standard deviations relative to GDP). Imports-to-GDP ratio is targeted at 12%.&lt;/p&gt;
&lt;p&gt;The central analytic finding — the parameterization trilemma — is characterized precisely for each model. For the KA model, the demand elasticity parameter gamma(1) simultaneously pins down both the markup and the trade elasticity, so matching 50% markups implies trade elasticity of approximately 1.5 (above the desired range of less than 1) and any value below TE = 1 is simply unattainable. For the CD model, pass-through of 0.4 requires a distribution cost markup wedge of 150% above the producer&amp;rsquo;s markup, which is inconsistent with the 50% markup target. For the PD model, the structural formula links PT and markups but less severely, so the trilemma is partially mitigated. For the NCES model, the trade elasticity equals the firm-level elasticity theta, which is also the main driver of pass-through, recreating a binding version of the KA trilemma on the quantity side. For the CC model, the market-expansion friction (captured by adjustment-cost parameter psi) provides an additional degree of freedom that allows trade elasticity to be set independently of pass-through and markups; at symmetric bargaining power eta = 0.5 and 50% markups, the model delivers PT = 0.33 analytically, close to the data target.&lt;/p&gt;
&lt;p&gt;Quantitative results confirm the analytic predictions. The KA model fails on quantity statistics because it implies trade elasticity far above target, generating counterfactually negative international comovement of consumption, investment, and employment. The CD model delivers only moderately incomplete pass-through (substantially above the 0.4 target), underperforming on price statistics, and implies a counterfactual correlation of net exports with the terms of trade. The PD model delivers pass-through of approximately 0.70 — better than CD but still above target — and performs well on quantities. The NCES model achieves pass-through of 0.63 (close to but above the 0.4 target) but at the cost of large, negative international comovement in general equilibrium, including a counterfactual positive correlation of net exports with output. The DH model generates more-than-complete pass-through in the presence of persistent exchange rates, failing on prices. The CC model delivers PT = 0.36, closest to the empirical target, achieves correct signs for international quantity comovement, and generates a positive terms-of-trade/net-exports correlation — but requires assumed productivity shock correlation of 0.75 to match measured TFP correlation of 0.3 due to endogenous marketing investment affecting measured TFP, and fails to deliver a positive correlation between terms of trade and the exchange rate.&lt;/p&gt;
&lt;p&gt;The paper concludes that further research is needed into frictions that simultaneously dampen the price and quantity responses to volatile exchange rates without violating markup discipline. The reduced-form KA model neither nests nor outperforms the microfounded alternatives. The CC and PD search-based models perform best overall but introduce frictions that are harder to identify and measure directly.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-parameterization-trilemma-and-how-is-it-characterized-analytically"&gt;Q1. What is the parameterization trilemma and how is it characterized analytically?&lt;/h3&gt;
&lt;p&gt;The trilemma is the structural impossibility of jointly satisfying three empirically necessary targets: (a) plausible steady-state producer markups (calibrated at 50%), (b) low short-run trade elasticity (targeted at 0.7 or below), and (c) low exchange-rate pass-through to import prices (targeted at 0.4). The authors derive closed-form expressions for pass-through (PT), trade elasticity (TE), and markups (mu) for each model and show that satisfying any two targets forces a violation of the third. For the KA model, the key parameter gamma(1) satisfies TE = gamma(1) and mu = (gamma(1) - 1)^{-1}, so targeting 50% markups forces TE = 3 and targeting TE = 1.5 forces markups of 200%. For the CD model, PT = 0.4 requires the distribution-cost wedge xi/(theta-1) = 1.5, implying markups more than 150% above the friction-free level, incompatible with a 50% target. For the PD model the formula is PT = 1 - mu/(1+mu), which is less restrictive. For the NCES model, TE = theta (the firm-level elasticity) and theta also drives pass-through, recreating the KA-type trilemma on the quantity side. For the CC model, the friction parameter psi in marketing capital accumulation independently controls TE, providing an extra degree of freedom that lets the model partially escape the trilemma.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-identification-strategy-for-pass-through-and-trade-elasticity-and-what-are-its-main-assumptions"&gt;Q2. What is the identification strategy for pass-through and trade elasticity, and what are its main assumptions?&lt;/h3&gt;
&lt;p&gt;The theoretical pass-through coefficient (PT) is defined as the partial equilibrium, on-impact elasticity of the import price with respect to the exchange rate, computed at the steady state while holding constant marginal costs (v, v*), the stochastic discount factor, and the domestic price of the home good. This mimics what regression-based pass-through estimates do (controlling for local costs). Trade elasticity (TE) is defined analogously as the PT-scaled elasticity of the import/domestic quantity ratio with respect to the exchange rate, under a one-time shock that reverts to the steady state next period (except for the DH model, where a permanent shock is considered). A key assumption is that importers take aggregate price indices as consistent with all importers behaving the same way (a rational-expectations fixed point). General-equilibrium co-movements between exchange rates and marginal costs are abstracted from in the analytic section, consistent with the goal of isolating each model&amp;rsquo;s intrinsic PTM mechanism.&lt;/p&gt;
&lt;h3 id="q3-why-does-the-ka-model-fail-on-quantity-statistics-despite-being-able-to-match-any-degree-of-pass-through"&gt;Q3. Why does the KA model fail on quantity statistics despite being able to match any degree of pass-through?&lt;/h3&gt;
&lt;p&gt;The KA model can match pass-through of 0.4 by freely choosing the curvature of the demand aggregator g&amp;rsquo;&amp;rsquo;(1) (independently of gamma(1)). However, the steady-state demand elasticity gamma(1) simultaneously determines both the markup (mu = (gamma(1)-1)^{-1}) and the trade elasticity (TE = gamma(1)). Matching 50% markups forces gamma(1) = 3 and therefore TE = 3, far above the target of 0.7. This excessive trade elasticity generates counterfactually large expenditure switching in response to exchange-rate shocks, leading to counterfactual negative international comovement of consumption, investment, and employment. A modified Kimball aggregator with a convex adjustment cost (equation 62) does not resolve the problem because the convex cost parameter also enters the steady-state markup formula, so targeting 50% markups still forces high effective trade elasticity.&lt;/p&gt;
&lt;h3 id="q4-why-does-the-deep-habits-model-generate-more-than-complete-pass-through-when-exchange-rates-are-persistent"&gt;Q4. Why does the Deep Habits model generate more-than-complete pass-through when exchange rates are persistent?&lt;/h3&gt;
&lt;p&gt;In the DH model, producers internalize the law of motion for habits: by lowering prices today they accumulate more customer habits, which allows them to raise prices later. When the exchange rate appreciates persistently (from the foreign exporter&amp;rsquo;s perspective), exporters expect their foreign sales and thus foreign habit stocks to fall over time. This reduces the shadow value of habit (Delta_f), so producers let prices fall by more than the exchange rate movement, generating pass-through greater than one. The authors derive analytically that, for a permanent shock, PT &amp;gt; 1 because dlog(gh)/dlog(x) &amp;lt; 0 (habit falls upon appreciation), and this dominates the direct pricing effect. For a purely transitory shock, the sign reverses (PT &amp;lt; 1), but since exchange rates are highly persistent in the data, the first property dominates. The quantitative section confirms this: the DH model generates PT &amp;gt; 1, marked as 1.00 in Table 4, disqualifying it on prices.&lt;/p&gt;
&lt;h3 id="q5-how-does-the-customer-capital-cc-model-partially-escape-the-trilemma"&gt;Q5. How does the Customer Capital (CC) model partially escape the trilemma?&lt;/h3&gt;
&lt;p&gt;The CC model introduces two key elements absent from other frameworks: (1) Nash bargaining over prices within bilateral matches, which directly ties pass-through to the sharing of exchange-rate-driven surplus rather than to demand elasticity; and (2) a convex adjustment friction on marketing capital (psi) that controls the pace of trade-share adjustment, independently setting the short-run trade elasticity. Because prices are determined by bargaining (equation 53: pf = eta*P_d + (1-eta)*v), they depend on the retail marginal value of the foreign good (P_d) and the foreign marginal cost (v), but not on quantity within the match. This decouples PT from TE. Analytically, at static steady state, PT = (1-eta)(1 + mu - (TE/gamma)(eta+mu)*omega)^{-1}; for eta = 0.5 and 50% markups and TE/gamma approaching zero, PT approaches (1-eta)/(1+mu) = 1/3. The psi parameter then tunes TE separately from markups and PT. However, a high long-run elasticity gamma (= 7.9) is required to generate sufficient retail-price responsiveness.&lt;/p&gt;
&lt;h3 id="q6-what-does-the-nces-model-achieve-on-prices-and-why-does-it-fail-on-quantities"&gt;Q6. What does the NCES model achieve on prices and why does it fail on quantities?&lt;/h3&gt;
&lt;p&gt;The NCES (Nested CES with Cournot competition) model generates incomplete pass-through of 0.63, the second-best performance on prices after the CC model. The mechanism is that non-atomistic (Cournot) firms internalize the impact of their pricing on the sectoral price index; when the exchange rate moves, foreign exporters&amp;rsquo; market share changes, altering the endogenous demand elasticity they face and dampening their pass-through. To calibrate the model with only one exporting firm (NX=1 out of N=5), the authors maximize the Cournot effect. However, this calibration implies TE = theta (the firm-level elasticity, set at 7.9 in calibration), far exceeding the target of 0.7. A quantity adjustment cost cannot remedy this because it would simultaneously constrain import-share movements, which are the source of the endogenous demand elasticity variation that generates incomplete pass-through. Consequently, the model implies large negative international comovement of output, consumption, employment, and investment — a worse quantity performance than most other models.&lt;/p&gt;
&lt;h3 id="q7-how-does-the-paper-measure-markups-and-what-data-sources-does-it-use"&gt;Q7. How does the paper measure markups and what data sources does it use?&lt;/h3&gt;
&lt;p&gt;The paper equates markups with gross margins under the maintained assumptions of Cobb-Douglas production and static cost minimization (Hall 1988; De Loecker et al. 2020). Under Cobb-Douglas, marginal cost v = wl/y, so markup mu = P&lt;em&gt;y/(w&lt;/em&gt;l) - 1 = sales/(cost of goods sold) - 1. Three data sources are used, all for U.S. data 2007-2017: (1) BEA 402 Industry Input-Output Use Tables, which give gross margins of approximately 39-41% for all sectors and 45-50% for traded sectors (import share &amp;gt; 3%). (2) S&amp;amp;P 500 Compustat with BEA sector value-added adjustment, yielding approximately 73-74% for all non-FIRE/GOV/NGO firms. (3) Unadjusted Compustat, yielding 43-49%. The paper adopts 50% as the baseline calibration target, treating it as conservative given the data range, and noting that the BEA I-O measure is the broadest and likely most accurate. The paper explicitly holds that models must respect profit and margin accounting within their own structure.&lt;/p&gt;
&lt;h3 id="q8-how-does-the-papers-conclusion-differ-from-itskhoki-and-mukhin-2021-regarding-the-kimball-aggregator"&gt;Q8. How does the paper&amp;rsquo;s conclusion differ from Itskhoki and Mukhin (2021) regarding the Kimball Aggregator?&lt;/h3&gt;
&lt;p&gt;Itskhoki and Mukhin (2021) use indirect inference and treat producer margins/markups as a free parameter, implicitly allowing for a much higher markup value — substantially above 50%. Under their calibration approach, the KA model can reconcile low pass-through with better quantity performance. Drozd, Kolasa, and Nosal instead impose a markup discipline: models must match empirically observed gross margins of 50% (for tradable sectors from BEA I-O tables) in their steady state. Under this discipline, the KA model&amp;rsquo;s trilemma becomes binding, and the model fails on quantity statistics. The authors argue that higher markup assumptions change the effective structure of the model and should be treated as a separate research agenda rather than a free calibration choice.&lt;/p&gt;
&lt;h3 id="q9-what-is-the-role-of-financial-shocks-in-the-model-and-how-are-they-implemented"&gt;Q9. What is the role of financial shocks in the model and how are they implemented?&lt;/h3&gt;
&lt;p&gt;Financial shocks generate exchange-rate volatility that is largely decoupled from real fundamentals — mimicking the observed &amp;rsquo;exchange rate disconnect&amp;rsquo; from output and consumption. They are modeled following Gabaix and Maggiori (2015): a global financial sector with short-lived arbitrageurs and noise traders. Arbitrageurs face a capacity constraint (parameterized by Gamma) that prevents them from fully exploiting UIP violations, resulting in a distorted UIP condition where the interest rate differential includes a term proportional to the arbitrageur&amp;rsquo;s position. Noise traders take exogenous positions n(t) that follow an AR(1) process (persistence rho_n = 0.97 in calibration) with standard deviations ranging from 21.2 (CC model) to 114.9 (NCES model) across calibrations. These shocks generate real exchange rate volatility of 3.97% (standard deviations relative to GDP), matching the data target. The paper notes that the precise implementation (Gabaix-Maggiori vs. Itskhoki-Mukhin) has little impact on exchange-rate properties in a linearized setting.&lt;/p&gt;
&lt;h3 id="q10-what-robustness-checks-and-extensions-does-the-paper-consider"&gt;Q10. What robustness checks and extensions does the paper consider?&lt;/h3&gt;
&lt;p&gt;The paper considers a modified Kimball aggregator with a convex adjustment cost on the ratio of imported to domestic quantities (equation 62) as a potential fix for the KA model&amp;rsquo;s high trade elasticity. This is shown not to resolve the trilemma because the convex cost parameter also enters the steady-state markup formula, keeping the binding constraint in place. Results for this modified model are reported in the Online Appendix. The paper also notes that the DH model&amp;rsquo;s pass-through is analyzed under both permanent and transitory shocks, with the sign reversal for purely transitory shocks documented analytically. The paper abstracts from nominal rigidities throughout, justifying this by citing Gopinath-Itskhoki (2011) evidence that conditioning pass-through on price adjustments versus non-adjustments makes little difference in observed pass-through patterns, suggesting limited pass-through is largely a real phenomenon.&lt;/p&gt;
&lt;h3 id="q11-what-are-the-papers-main-implications-for-the-dsge-modeling-of-open-economies"&gt;Q11. What are the paper&amp;rsquo;s main implications for the DSGE modeling of open economies?&lt;/h3&gt;
&lt;p&gt;The paper implies that the standard toolkit for generating incomplete exchange-rate pass-through and muted expenditure switching is inadequate when exchange rates are volatile and act as a major shock. All models face tension among the three targets; the best performers (CC and PD) do so by introducing search frictions that are intrinsically difficult to identify and measure directly. The paper does not claim to provide a solution; rather, it performs a clean diagnostic showing that more research is needed into real frictions that simultaneously insulate import prices and trade quantities from exchange-rate volatility. The finding that the Kimball reduced-form aggregator neither nests nor outperforms microfounded alternatives has implications for monetary-policy DSGE models that frequently use the KA for tractability, suggesting that researchers should be aware of the high implicit markup that is required for the KA to work well in open-economy settings with volatile exchange rates.&lt;/p&gt;
&lt;h3 id="q12-what-moments-from-the-data-are-targeted-in-calibration-and-what-is-the-quantitative-approach"&gt;Q12. What moments from the data are targeted in calibration and what is the quantitative approach?&lt;/h3&gt;
&lt;p&gt;The model is calibrated quarterly and HP-filtered (lambda = 1,600). Common targets include: imports/GDP = 12%; 50% producer markups; 30% work hours relative to time endowment; investment volatility relative to GDP = 2.79; short-run trade elasticity (volatility ratio) = 0.7; cross-country TFP correlation = 0.3; TFP volatility = 0.8% and autocorrelation = 0.72; real exchange rate volatility = 3.97%. The pass-through target of 0.4 is used only as an additional degree of freedom for the KA model; for all others, pass-through is an outcome of the structural parameterization. The financial shock persistence is set arbitrarily at rho_n = 0.97 for lack of a target. When a model cannot satisfy all targets (as with KA and NCES on trade elasticity), that target is dropped in favor of best performance on prices. Pass-through is measured in the quantitative section by running regressions analogous to Campa-Goldberg (2005) on model-generated data, rather than using the analytic partial-equilibrium formula.&lt;/p&gt;
&lt;h3 id="q13-what-is-the-sign-of-the-terms-of-trade-and-exchange-rate-correlation-and-what-does-it-imply-for-model-evaluation"&gt;Q13. What is the sign of the terms-of-trade and exchange-rate correlation, and what does it imply for model evaluation?&lt;/h3&gt;
&lt;p&gt;In model-generated data (without noise), the correlation of terms of trade (tot = pf/px) with the exchange rate (x) is either -1 (when PT &amp;lt; 0.5) or +1 (when PT &amp;gt; 0.5). The empirical target from U.S. data is approximately -1. This means matching PT &amp;lt; 0.5 and a negative tot-x correlation are equivalent predictions. In the quantitative results, only the KA and CC models achieve PT &amp;lt; 0.5 and thus generate the correct negative correlation; all other models (CD, PD, NCES, DH) generate PT &amp;gt; 0.5 and thus positive tot-x correlation. The authors note that the strict 0.4 target may be too aggressive for aggregate data — PT slightly above 0.5 would be consistent with a positive (near zero) correlation — pointing to Gopinath et al. (2020) who find small, statistically insignificant tot-x coefficients ranging from positive to negative.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Parameterization Trilemma&lt;/strong&gt;: The structural impossibility of jointly achieving three empirically necessary targets in standard PTM models: (1) plausible producer gross margins (~50%), (2) low short-run trade elasticity (~0.7 or below), and (3) low exchange-rate pass-through to import prices (~0.4). Each PTM model can satisfy at most two of the three targets simultaneously under quantitative discipline; the third is either infeasible or inconsistent given the model&amp;rsquo;s internal constraints.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pricing-to-Market (PTM)&lt;/strong&gt;: The practice by which internationally active firms set different prices in home and foreign markets as a function of the bilateral exchange rate, rather than uniformly passing exchange-rate changes through to import prices. In this paper, PTM is measured by the degree of incomplete pass-through (PT &amp;lt; 1) and is generated by specific microfounded frictions (distribution costs, search, habits, market power, customer capital) rather than by nominal rigidities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exchange-Rate Pass-Through (PT)&lt;/strong&gt;: The elasticity of the import price (in the importing country&amp;rsquo;s currency) with respect to the bilateral real exchange rate, computed in partial equilibrium at the steady state, controlling for local costs. Values used in calibration: empirical short-run range 0.2–0.6; paper target 0.4. Models in which PT = 1 satisfy the law of one price; models with PT &amp;lt; 1 exhibit pricing-to-market.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Short-Run Trade Elasticity (TE)&lt;/strong&gt;: The elasticity of import quantities relative to domestic quantities with respect to the exchange rate (equivalently, the expenditure-switching response to import price changes), measured at business-cycle frequencies. The paper measures this using the volatility ratio of trade-flow quantities to prices (an upper-bound estimate abstracting from correlations), targeting a value of 0.7. Long-run elasticity estimates based on trade liberalization episodes are much higher (typically 6 and above) and are used as the long-run elasticity parameter gamma in search-based models.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Customer Capital (CC) Model&lt;/strong&gt;: A PTM model (Drozd-Nosal 2012) in which firms build market-specific customer relationships through costly, time-consuming investment in marketing capital, and within-match prices are set by Nash bargaining. The combination of a capacity constraint on quantities traded within each match and bargaining-determined prices decouples the short-run trade elasticity from pass-through, allowing the model to partially escape the parameterization trilemma via the adjustment-cost parameter psi.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Kimball Aggregator (KA)&lt;/strong&gt;: A reduced-form, implicitly defined demand aggregator (Kimball 1995) that generates variable demand elasticity through the curvature of the function g(·) around the steady state. In the open-economy application of Itskhoki-Mukhin (2021), two curvature parameters (g&amp;rsquo;(1) and g&amp;rsquo;&amp;rsquo;(1)) can independently control markup and pass-through — but not trade elasticity simultaneously, which is bound to the steady-state demand elasticity gamma(1) and hence to the markup. The paper shows this model neither nests nor outperforms microfounded alternatives under markup discipline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Financial Shock&lt;/strong&gt;: An exogenous disturbance to the position of noise traders in the international bond market (following Gabaix-Maggiori 2015), which drives deviations from Uncovered Interest Parity via the capacity constraint on arbitrageurs. These shocks generate exchange-rate volatility that is largely disconnected from real fundamentals (productivity), calibrated with persistence rho_n = 0.97 to match U.S. real exchange rate volatility of 3.97% relative to GDP.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Gross Margin / Producer Markup&lt;/strong&gt;: In this paper, defined as (price - marginal cost) / marginal cost = (sales - cost of goods sold) / cost of goods sold, where under Cobb-Douglas production and static cost minimization, the markup equals the gross margin. The paper targets 50% for U.S. tradable-sector firms based on BEA 402 Industry I-O Use Tables (which yield 45–50% for tradable sectors across 2007–2017), treating this as a hard empirical constraint that models must satisfy in the steady state.&lt;/p&gt;</description></item><item><title>Resource Misallocation in European Firms: The Role of Constraints, Firm Characteristics and Managerial Decisions</title><link>https://macropaperwarehouse.com/papers/resource-misallocation-in-european-firms-the-role-of-constraints-firm-characteristics-and-managerial-decisions/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/resource-misallocation-in-european-firms-the-role-of-constraints-firm-characteristics-and-managerial-decisions/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper investigates why firms in the European Union exhibit wide dispersion in marginal revenue products (MRP) of capital and labor — a direct indicator of resource misallocation — and asks how much aggregate productivity the EU forfeits as a result. The research question is motivated by the persistent productivity gap between the EU and the United States, by evidence that within-country MRP dispersion in Europe has been trending upward since the mid-1990s, and by an institutional context in which the EU single market (launched in 1993) has not eliminated cross-country factor market frictions even three decades later.&lt;/p&gt;
&lt;p&gt;The primary data source is the EIB Investment Survey (EIBIS), a stratified random survey of non-financial enterprises conducted annually since 2016 across all 28 EU member states, covering manufacturing, services, utilities, and construction (NACE categories C–J). The analysis uses three waves (2016–2018), with approximately 12,500 firms per wave and a panel component of roughly 2,000 firms appearing in all three waves. Survey responses are matched to Orbis administrative data; the correlation between log employment in EIBIS and Orbis is 0.91, confirming data quality. MRP of capital (MRPK) is measured as the capital cost share times revenue divided by fixed assets; MRP of labor (MRPL) is the labor cost share times revenue divided by employment. Cost shares are calibrated from OECD STAN and Eurostat national accounts at the country–year–industry level.&lt;/p&gt;
&lt;p&gt;The theoretical framework is a dynamic model of a profit-maximizing firm with Cobb-Douglas production, isoelastic demand, and quadratic adjustment costs. Under the assumption that pure economic profits are small and that the labor output distortion is negligible (following Hsieh-Klenow 2009), the model implies that log MRPK and log MRPL can be approximated by observable average revenue products. The empirical strategy is a Mincerian regression of log MRPK (and log MRPL) on a rich vector of firm-level characteristics — firm demographics, input quality, capacity utilization, investment constraints, dynamic adjustment variables, and financing sources — plus country, industry, and year fixed effects (and their interactions). Because regressors are endogenous, the R² from OLS is interpreted as an upper bound on the share of MRP variance attributable to each factor (formally shown to dominate the IV R²). Marginal R² increments when a variable block is added identify the contribution of that block to the variance in MRP, which is then mapped into productivity gains via the Hsieh-Klenow formula.&lt;/p&gt;
&lt;p&gt;The main quantitative findings are as follows. Raw dispersion is large: the standard deviation of log MRPK is 1.43 and of log MRPL is 1.19 (and 1.63 for log MRPL minus log MRPK), all substantially exceeding comparable US figures (0.98 for capital and 0.58 for labor from Asker et al. 2014 and Bartelsman et al. 2013). The R² in the full regression is 0.14 (without fixed effects) and 0.49 (with country × industry × year fixed effects) for MRPK, and 0.29 and 0.74 respectively for MRPL. Among firm-characteristic blocks, the &amp;ldquo;adjustment&amp;rdquo; (dynamic investment and employment growth) and &amp;ldquo;demographics&amp;rdquo; (firm size, age, subsidiary and exporter status) blocks carry the largest marginal R² contributions; the &amp;ldquo;obstacles to investment&amp;rdquo; block (direct reports of constraints) contributes modestly by comparison. Country fixed effects alone explain R² = 0.052 for MRPK and R² = 0.445 for MRPL, while industry fixed effects alone explain R² = 0.239 for MRPK and R² = 0.268 for MRPL. The combined country–industry–year fixed-effects R² reaches 0.275 for MRPK and 0.611 for MRPL; adding the full interaction yields 0.492 and 0.736 respectively.&lt;/p&gt;
&lt;p&gt;Treating the &amp;ldquo;distortions&amp;rdquo; block of variables as genuine frictions, removing them would raise EU aggregate productivity by more than 40 percent (computed as 1.5 × 1.42 × 0.186 + 0.13 × 2.66 × 0.134 = 0.442). If all variables in X are treated as distortions, the implied gain is approximately 72 percent (0.715 in log points). Removing cross-country inequality in average MRPs (equalizing country fixed effects) would imply a 102 percentage log-point gain in productivity under the Hsieh-Klenow formula; removing barriers between industries and countries could raise productivity by at least 143 percentage log points.&lt;/p&gt;
&lt;p&gt;A Machado-Mata distributional decomposition comparing Germany (σ(log MRPK) = 0.92, σ(log MRPL) = 0.61) and Greece (σ(log MRPK) = 1.64, σ(log MRPL) = 0.91) reveals that the primary driver of Greece&amp;rsquo;s higher dispersion is the &amp;ldquo;prices&amp;rdquo; (regression coefficients reflecting institutional and policy environment), not the &amp;ldquo;endowments&amp;rdquo; (firm characteristics). Giving Greece German institutional &amp;ldquo;prices&amp;rdquo; reduces the counterfactual standard deviation of Greek MRPK from 1.66 to 0.94. This pattern generalizes across EU countries: German b (coefficients) tends to reduce MRPK dispersion for most countries, while German X (firm characteristics) tends to increase it, because Germany has more heterogeneous firms but an environment that prices those characteristics in a way that equalizes returns. This finding constitutes large-scale microeconomic evidence that institutions matter — cross-country differences in MRP dispersion reflect how business, institutional, and policy environments translate firm heterogeneity into outcomes, more than they reflect differences in firm characteristics per se.&lt;/p&gt;
&lt;p&gt;The policy implication is that deep institutional reform — not merely changes in firm composition — is required to narrow EU resource misallocation. The scope condition is that these estimates are upper bounds, and some observed MRP dispersion likely reflects compensating differentials (e.g., higher-quality capital commanding a higher MRPK) rather than pure distortions.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-identification-strategy-and-what-are-the-main-threats-to-it"&gt;Q1. What is the identification strategy, and what are the main threats to it?&lt;/h3&gt;
&lt;p&gt;The paper does not attempt causal identification. Instead, it uses OLS to estimate equilibrium (Mincerian-type) regressions of log MRPK and log MRPL on firm characteristics plus fixed effects. The key insight is that OLS R² provides an upper bound on the share of MRP variance causally attributable to each regressor, because simultaneity or omitted variables can only inflate OLS R² above the true IV R². The main threats are: (1) endogeneity of regressors — a growing firm facing red tape will have high MRPK and a binding constraint simultaneously, inflating the R² attributed to constraints; (2) classical measurement error in survey responses, which attenuates R² toward zero (so OLS actually understates causal effects in this direction); (3) omitted variable bias via unobserved firm quality (managerial talent, etc.); (4) use of same variables (employment, fixed assets) on both left and right sides, addressed by cross-checking with Orbis data as instruments. The authors argue these threats are mostly conservative — they overstate, not understate, the upper bound.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-theoretical-justification-for-using-average-revenue-products-to-measure-marginal-revenue-products"&gt;Q2. What is the theoretical justification for using average revenue products to measure marginal revenue products?&lt;/h3&gt;
&lt;p&gt;Under the assumption that the share of pure economic profits is small (following Basu and Fernald 1997), the optimality conditions of the dynamic model imply that MRPK ≈ (capital cost share) × (revenue / capital) and MRPL ≈ (labor cost share) × (revenue / employment). These are average revenue products scaled by factor cost shares, matching Hsieh and Klenow (2009). The distortion framework further implies that the variance of log MRPK and log MRPL, when distortions are log-normally distributed and uncorrelated, maps directly into the Hsieh-Klenow productivity-loss formula, linking the regression R² to quantitative welfare calculations.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-role-of-compensating-differentials-versus-true-distortions-in-interpreting-the-results"&gt;Q3. What is the role of compensating differentials versus true distortions in interpreting the results?&lt;/h3&gt;
&lt;p&gt;The paper emphasizes that not all dispersion in MRPs reflects inefficient distortions. Some dispersion — particularly from &amp;lsquo;quality of capital,&amp;rsquo; &amp;lsquo;capacity utilization,&amp;rsquo; and &amp;lsquo;dynamic adjustment&amp;rsquo; — may reflect compensating differentials: firms that invest in higher-quality capital rationally face higher costs, demanding a higher MRPK in equilibrium, analogous to how more educated workers earn higher wages in a Mincerian framework. If these variables reflect compensating differentials rather than frictions, using &amp;lsquo;raw&amp;rsquo; MRP dispersion overstates misallocation. Conversely, if all variables proxy for distortions, the productivity gains from reform are even larger (72 percent versus 40 percent). The paper presents both interpretations explicitly, making the framework &amp;lsquo;highly portable&amp;rsquo; for different views of what drives observed dispersion.&lt;/p&gt;
&lt;h3 id="q4-what-heterogeneity-in-mrp-dispersion-is-documented-across-eu-countries-and-industries"&gt;Q4. What heterogeneity in MRP dispersion is documented across EU countries and industries?&lt;/h3&gt;
&lt;p&gt;Dispersion is notably lower in Germany (σ(log MRPK) = 0.92, σ(log MRPL) = 0.61) than in Greece (1.64 and 0.91) or smaller countries such as Malta, Luxembourg, and Cyprus. Country fixed effects explain R² = 0.445 of MRPL variation but only R² = 0.052 of MRPK variation, meaning labor is more segmented across countries than capital. Industry fixed effects explain R² = 0.239 for MRPK versus R² = 0.268 for MRPL, indicating capital is more segmented across industries than across countries. Core EU countries (France, Denmark) are relatively insensitive to counterfactual substitution of German coefficients, while periphery countries (Portugal, Ireland) show large movements. Romania, which resembles Slovenia in raw MRPK dispersion, looks much more like the Netherlands after controlling for firm characteristics — illustrating that observed dispersion rankings can be misleading without adjustment.&lt;/p&gt;
&lt;h3 id="q5-what-does-the-machado-mata-decomposition-reveal-and-how-is-it-implemented"&gt;Q5. What does the Machado-Mata decomposition reveal, and how is it implemented?&lt;/h3&gt;
&lt;p&gt;The Machado-Mata (2005) decomposition separates the distribution of MRP into an &amp;rsquo;endowments&amp;rsquo; component (due to the values of firm characteristics X) and a &amp;lsquo;prices&amp;rsquo; component (due to the regression coefficients b, which capture how the institutional and policy environment translates X into outcomes). The decomposition draws B = 10,000 bootstrap samples from the empirical distribution of X for each country, combines them with quantile regression coefficients estimated separately for each country, and constructs counterfactual distributions. Applying Greek X with German b reduces Greece&amp;rsquo;s counterfactual σ(log MRPK) from 1.66 to 0.94 — close to Germany&amp;rsquo;s actual 0.92 — while applying German X with Greek b increases dispersion. The main finding is that differences in &amp;lsquo;prices&amp;rsquo; (institutional environment) dominate differences in &amp;rsquo;endowments&amp;rsquo; (firm characteristics) in explaining cross-country variation in within-country MRP dispersion. This pattern holds generally across EU countries: gains from &amp;lsquo;importing&amp;rsquo; German institutions are correlated with poor World Bank Governance Indicators and International Country Risk Guide scores.&lt;/p&gt;
&lt;h3 id="q6-how-do-the-papers-estimates-of-eu-misallocation-compare-to-us-benchmarks"&gt;Q6. How do the paper&amp;rsquo;s estimates of EU misallocation compare to US benchmarks?&lt;/h3&gt;
&lt;p&gt;The EU standard deviations of log MRPK (1.43) and log MRPL (1.19) substantially exceed comparable US figures of 0.98 for capital (Asker et al. 2014) and 0.58 for labor (Bartelsman et al. 2013). The paper discusses three caveats for this comparison: (1) EIBIS uses revenue rather than value added, which affects dispersion (approximately +0.16 log points for MRPL, -0.21 for MRPK) — insufficient to explain the full gap; (2) survey measurement error is present but small — averaging over multiple waves reduces the standard deviation of log MRPK by only 8–12 percent; (3) EIBIS measures firms (not plants), and since about two-thirds of within-firm MRPK variance occurs across plants within firms (Kehrig and Vincent 2017), the EU–US comparison likely understates the true difference. Qualitatively, the greater EU dispersion is consistent with lower EU aggregate TFP relative to the US.&lt;/p&gt;
&lt;h3 id="q7-what-specific-regression-results-are-reported-for-individual-variable-blocks"&gt;Q7. What specific regression results are reported for individual variable blocks?&lt;/h3&gt;
&lt;p&gt;The full R² (without / with country × industry × year fixed effects) is 0.14 / 0.49 for MRPK and 0.29 / 0.74 for MRPL. Among variable blocks, the &amp;lsquo;adjustment&amp;rsquo; (investment, employment growth, past and planned investment) and &amp;lsquo;demographics&amp;rsquo; (size, age, subsidiary, exporter) blocks have the largest marginal R². The &amp;lsquo;obstacles to investment&amp;rsquo; (direct constraint reports) block contributes modestly, with some coefficients not statistically significant. Within regression coefficients (from Table A.4): older, exporting, high-utilization firms have higher MRPK and MRPL; investment is strongly negatively associated with MRPK (movement down the MRPK curve as capital rises) and positively with MRPL (labor becomes relatively scarcer); employment growth is positively associated with MRPK and negatively with MRPL (symmetric logic); credit-constrained status is negatively correlated with both MRPK and MRPL.&lt;/p&gt;
&lt;h3 id="q8-what-robustness-checks-are-run"&gt;Q8. What robustness checks are run?&lt;/h3&gt;
&lt;p&gt;The paper reports: (1) &amp;lsquo;between&amp;rsquo; regressions on multi-year firm averages to reduce transitory variation and measurement error — results are qualitatively similar with slightly larger productivity gains; (2) restricting the sample to firms appearing in all three survey waves (Appendix Table A.5) — qualitatively similar results; (3) estimating equation (4) for each wave separately — similar results; (4) using Orbis employment and investment as regressors instead of EIBIS responses to address mechanical measurement-error correlation — nearly identical results (Appendix Table A.17); (5) replacing log(1+investment) with an indicator for positive investment (Appendix Table A.7) — similar results; (6) using industry-specific rather than country–year–industry cost shares — similar results; (7) confirming that measurement error can account for only a portion of the EU–US dispersion difference (8–12 percent reduction in standard deviation when averaging over waves). The paper also reports separate coefficient estimates for three blocs of EU countries (North/West, South, Center/East) in Appendix Tables A.10–A.16.&lt;/p&gt;
&lt;h3 id="q9-how-does-the-paper-relate-to-and-differ-from-hsieh-and-klenow-2009-and-related-prior-work"&gt;Q9. How does the paper relate to and differ from Hsieh and Klenow (2009) and related prior work?&lt;/h3&gt;
&lt;p&gt;The paper extends Hsieh and Klenow (2009) in several directions. First, while Hsieh-Klenow use administrative census-type data for India and China restricted to manufacturing, this paper uses a consistent cross-country survey covering all sectors in 28 EU countries, enabling direct cross-country comparison. Second, Hsieh-Klenow implicitly assume all MRP dispersion reflects distortions; this paper explicitly distinguishes distortions from compensating differentials and shows the distinction matters quantitatively. Third, this paper develops the Mincerian regression approach to apportion the variance in MRPs across observable factors — analogous to labor economists decomposing wage dispersion — and shows OLS R² provides a valid upper bound without requiring exogenous variation. Fourth, unlike country-level distortion measures (Gamberoni et al. 2016), tight theoretical restrictions (David and Venkateswaran 2017), or specific reforms (Rotemberg 2019), this paper draws on firm-level survey data with minimal restrictions and maintains high external validity. Fifth, the Machado-Mata distributional decomposition adds a new dimension absent from Hsieh-Klenow: decomposing cross-country differences into endowments vs. institutional &amp;lsquo;prices.&amp;rsquo;&lt;/p&gt;
&lt;h3 id="q10-what-are-the-policy-implications-and-their-scope-conditions"&gt;Q10. What are the policy implications and their scope conditions?&lt;/h3&gt;
&lt;p&gt;The primary policy implication is that EU productivity could rise by more than 40 percent if distortions to resource allocation were removed — and up to 72 percent if all observed MRP variation is attributed to distortions. A more modest goal of equalizing within-industry MRP dispersion across countries (i.e., making Germany and Greece similar within industries) implies gains of approximately 31–53 percent depending on interpretation. The decomposition evidence implies that institutional reform (changing how environments price firm characteristics) is more important than directly changing firm composition. The scope conditions are: (1) these are upper bounds derived from OLS; (2) some dispersion reflects compensating differentials that should not be counted as losses; (3) the EIBIS covers firms with at least 5 employees, so very small firms are excluded; (4) the framework assumes log-normal, uncorrelated distortions and constant returns to scale — relaxing these can increase estimated losses further (Jones 2011); (5) the estimates do not account for firm-level markup heterogeneity, which could overstate or understate other channels.&lt;/p&gt;
&lt;h3 id="q11-what-does-the-paper-contribute-to-the-literature-on-measurement-error-in-mrp-studies"&gt;Q11. What does the paper contribute to the literature on measurement error in MRP studies?&lt;/h3&gt;
&lt;p&gt;The paper shows formally (Appendix D) that classical measurement error in regressors attenuates OLS R² toward zero, so OLS provides a conservative upper bound from this direction. It also shows that averaging across multiple survey waves reduces measurement error while also attenuating transitory adjustment-cost variation, so multi-year averages likely overstate the role of measurement error. Crucially, the paper validates EIBIS against Orbis administrative data, finding a 0.91 correlation for log employment, similar standard deviations of log MRPK (1.44 in Orbis vs. 1.37 in EIBIS) and log MRPL (1.07 in Orbis vs. 1.30 in EIBIS) for matched firms, and a mean absolute log difference in standard deviations of approximately 2 percent across countries. This contributes to the debate initiated by Bils et al. (2017) on whether measured MRP dispersion reflects mismeasurement, and corroborates that surveys can be reliable substitutes for census-type administrative data in cross-country analysis.&lt;/p&gt;
&lt;h3 id="q12-what-does-the-paper-find-about-the-role-of-credit-constraints-specifically"&gt;Q12. What does the paper find about the role of credit constraints specifically?&lt;/h3&gt;
&lt;p&gt;Credit constraint status (defined as loan rejection, discouragement from applying, or receiving a loan that was too small or too expensive) is negatively correlated with both MRPK and MRPL in the full regression. This is consistent with credit-constrained firms being unable to invest to the point where MRPK is equalized with the cost of capital, but the negative sign also raises the interpretive caveat noted by the authors: cross-sectional equilibrium relationships can have signs inconsistent with causal priors because constraints may be more binding for firms that are already performing poorly. The &amp;lsquo;source of funds&amp;rsquo; block (share of investment from internal vs. external sources, and credit constraint) is grouped with &amp;lsquo;distortions&amp;rsquo; in the paper&amp;rsquo;s preferred decomposition.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Marginal Revenue Product (MRPK/MRPL)&lt;/strong&gt;: In this paper, the marginal revenue product of capital (MRPK) and labor (MRPL) are measured as observable average revenue products — the capital or labor cost share times revenue divided by the stock of capital or employment. Under the paper&amp;rsquo;s model assumptions, these approximate the shadow cost of inputs and serve as the primary measure of firm-level resource allocation efficiency. A firm with a high MRPK relative to its cost of capital is under-capitalized; dispersion of MRPK across firms signals misallocation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Compensating differentials (in the MRP context)&lt;/strong&gt;: The paper adapts the Mincerian concept of compensating differentials from labor markets to the firm side: some observed dispersion in MRPK and MRPL may reflect optimal responses to heterogeneity in input quality, capital utilization, or adjustment dynamics — not inefficient distortions. For example, a firm with state-of-the-art machinery may face a higher MRPK reflecting the quality premium, not a barrier to investment. Because such dispersion is rational, it should be subtracted from productivity-loss calculations rather than counted as welfare-reducing misallocation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Machado-Mata decomposition&lt;/strong&gt;: A distributional decomposition technique (Machado and Mata 2005) applied here to attribute cross-country differences in the dispersion of MRPK and MRPL to two components: &amp;rsquo;endowments&amp;rsquo; (the empirical distribution of firm characteristics X in a given country) and &amp;lsquo;prices&amp;rsquo; (the regression coefficients b, which capture how the country&amp;rsquo;s business, institutional, and policy environment translates those characteristics into marginal revenue products). The decomposition constructs counterfactual MRP distributions by combining one country&amp;rsquo;s X with another country&amp;rsquo;s b.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mincerian productivity regression&lt;/strong&gt;: The paper&amp;rsquo;s core empirical framework, modeled explicitly on Mincer&amp;rsquo;s (1958) wage regression: just as wages are regressed on worker characteristics (education, experience) to decompose earnings dispersion, log MRPK and log MRPL are regressed on firm characteristics (demographics, quality, utilization, adjustment, constraints, financing) to decompose MRP dispersion. OLS R² in this regression is an upper bound on the share of MRP variance attributable to each regressor.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;EIB Investment Survey (EIBIS)&lt;/strong&gt;: An annual firm-level survey administered by Ipsos MORI on behalf of the European Investment Bank since 2016, covering all 28 EU member states with a stratified random sample of approximately 12,500 non-financial enterprises per wave (minimum 5 employees, NACE C–J). Unique features include consistent cross-country design, merger with Orbis administrative data, and questions on investment plans, capital quality, capacity utilization, perceived obstacles, and financing sources — all directly informative about sources of MRP variation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Institutional &amp;lsquo;prices&amp;rsquo; on firm characteristics&lt;/strong&gt;: In the Machado-Mata framework as applied here, &amp;lsquo;prices&amp;rsquo; refer to the country-specific regression coefficients b in the MRP regression — how steeply a country&amp;rsquo;s environment (regulations, institutions, policies) translates a given unit of firm heterogeneity in X into a difference in marginal revenue products. Countries with smaller b magnitudes (like Germany) achieve more equalization of MRPs across heterogeneous firms, reflecting an efficient institutional environment; countries with large b (like Greece) amplify firm-level heterogeneity into large MRP dispersion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Upper-bound R² approach to productivity gains&lt;/strong&gt;: The paper&amp;rsquo;s portable method for quantifying productivity gains from removing a friction: the marginal R² increment in an OLS regression of log MRPK (or log MRPL) when a friction variable is added is an upper bound on the share of MRP variance attributable to that friction. This bound, multiplied by the variance of log MRP and the Hsieh-Klenow productivity-loss formula parameters, gives an upper-bound estimate of the aggregate TFP gain from eliminating that friction. The method does not require exogenous variation or tight structural assumptions.&lt;/p&gt;</description></item><item><title>Self-Fulfilling Prophecies in the Transition to Clean Technology</title><link>https://macropaperwarehouse.com/papers/self-fulfilling-prophecies-in-the-transition-to-clean-technology/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/self-fulfilling-prophecies-in-the-transition-to-clean-technology/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper by Smulders and Zhou challenges the standard lock-in narrative for the slow green transition. The conventional explanation — path dependency in directed technical change (DTC) — is hard to reconcile with forward-looking investors who anticipate an eventual move to clean technology. The authors propose an alternative: strategic investment complementarities among innovators can produce self-fulfilling prophecies that delay the low-carbon transition even when all agents foresee it will ultimately occur.&lt;/p&gt;
&lt;p&gt;The framework is a continuous-time general equilibrium DTC model in the tradition of Acemoglu et al. (2012), modified in two key ways: patents last forever (rather than one period), and labor is mobile between production and R&amp;amp;D. The economy has a clean and a dirty final-goods sector with substitution elasticity σ between them. A continuum of monopolistic intermediate goods suppliers in each sector invest in R&amp;amp;D to improve product quality. The key mechanism is a demand externality: when goods are gross substitutes (σ &amp;gt; 1), innovation in a sector reduces the relative price of that sector&amp;rsquo;s output, shifting consumer expenditure toward it. This raises the return to all innovation in the sector. For σ &amp;gt; 2, this demand externality outweighs the intra-sector business-stealing effect, making within-sector innovations strategic complements — each firm&amp;rsquo;s R&amp;amp;D raises the payoff to R&amp;amp;D for all others in the same sector. The threshold σ &amp;gt; 2 is necessary and sufficient for a coordination problem to arise in the unregulated economy.&lt;/p&gt;
&lt;p&gt;The paper establishes three steady states: two saddlepath-stable corner steady states (one with innovation only in the clean sector, one only in the dirty sector) and an unstable interior steady state with simultaneous R&amp;amp;D. When σ &amp;gt; 2, there exists a range of initial clean market shares θc,0 (the &amp;ldquo;overlap&amp;rdquo;) from which both corner steady states are reachable under rational expectations. The overlap grows with σ and shrinks with impatience ρ (Proposition 3). Furthermore, for any initial condition within the overlap, multiple transition paths to the same corner steady state exist: a &amp;ldquo;fast&amp;rdquo; path with immediate concentration of R&amp;amp;D in one sector, and &amp;ldquo;delayed&amp;rdquo; paths in which firms temporarily innovate in the competing sector before finally converging. For higher σ values, these delays may involve regime switches between the clean-only and dirty-only innovation regimes (σ ∈ [σ-bar, σ-bar-bar)) or even stagnation periods with zero R&amp;amp;D (σ &amp;gt; σ-bar-bar), producing non-monotonic patterns of clean innovation — rises followed by falls before eventual clean dominance (Proposition 4).&lt;/p&gt;
&lt;p&gt;The welfare-maximizing path always leads to the clean steady state: a dirty steady state violates the transversality condition on the carbon stock because unbounded climate damages accumulate. The paper calibrates to 2019 data: initial clean sector share θc,0 = 0.177 (matching the 17.7% renewable energy share in global final energy consumption), world GDP per capita of $11,019 (constant 2015 USD), per capita carbon emissions of 1.22 metric tons, emission intensity ad = 0.198 tonnes per thousand USD, and σ = 1.5. Under this calibration, three distinct equilibrium paths coexist under an optimal Pigouvian carbon tax — one with clean-only innovation from the start and two involving temporary dirty R&amp;amp;D — all converging to the clean steady state but at different speeds and with different amounts of stranded dirty assets.&lt;/p&gt;
&lt;p&gt;The central policy finding (Proposition 7) is that a Pigouvian carbon tax set equal to the social cost of carbon at all times eliminates the dirty steady state but does not pin down a unique transition path. Multiple equilibria with different durations of dirty innovation persist under the first-best carbon tax. Effective coordination requires a second instrument that directly controls relative innovator profitability: a minimum clean revenue guarantee, an emission cap, a dirty R&amp;amp;D tax, or a contingent super-Pigouvian carbon tax all qualify. A clean R&amp;amp;D subsidy works but is an inferior device because it distorts labor allocation between production and research. Crucially, commitment is required: unless the government commits to maintaining the coordination instrument until the economy exits the multiple-equilibria region, delayed transitions remain possible.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-mechanism-generating-multiple-equilibria-and-why-does-it-require-σ--2"&gt;Q1. What is the core mechanism generating multiple equilibria, and why does it require σ &amp;gt; 2?&lt;/h3&gt;
&lt;p&gt;Intermediate good monopolists in each sector earn profits proportional to their sector&amp;rsquo;s expenditure share, which rises with relative quality when σ &amp;gt; 1 (demand shift effect). But a firm&amp;rsquo;s share of sector profits falls as rivals innovate (business-stealing effect). From equation (24), the relative marginal profit of clean versus dirty innovation scales as (Qc/Qd)^(σ-2). The demand shift effect dominates the business-stealing effect if and only if σ &amp;gt; 2. When σ &amp;gt; 2, innovations within a sector are strategic complements: any firm&amp;rsquo;s R&amp;amp;D raises all other firms&amp;rsquo; marginal return to R&amp;amp;D in the same sector. This complementarity means beliefs about which sector will be large in the future become self-reinforcing: if investors expect the clean sector to grow, clean innovation is profitable, and the expectation is validated.&lt;/p&gt;
&lt;h3 id="q2-how-do-the-two-modifications-from-acemoglu-et-al-2012-affect-the-results"&gt;Q2. How do the two modifications from Acemoglu et al. (2012) affect the results?&lt;/h3&gt;
&lt;p&gt;First, infinite (rather than one-period) patents allow future expected profits to influence innovation decisions, giving expectations a more direct role. Second, labor mobility between production and R&amp;amp;D makes the speed of innovation endogenous alongside its direction. However, the paper shows (OA3.2 and Section 3.3) that neither modification is necessary for the qualitative result: the overlap and strategic complementarity arise even with finite patent length and segmented labor markets. Longer patent length has an effect similar to lower impatience — it increases the overlap. OA4 shows that a segmented labor market model has essentially identical dynamics but requires a third state variable (an effective savings-rate proxy), so it is no simpler than the baseline.&lt;/p&gt;
&lt;h3 id="q3-what-types-of-transition-delays-are-possible-and-how-do-they-depend-on-σ"&gt;Q3. What types of transition delays are possible and how do they depend on σ?&lt;/h3&gt;
&lt;p&gt;Proposition 4 identifies three regimes of delay: (a) for 2 &amp;lt; σ &amp;lt; σ-bar, only temporary simultaneous R&amp;amp;D is possible as a delay; (b) for σ ∈ [σ-bar, σ-bar-bar), delay must include temporary regime switches between the clean-only and dirty-only innovation regimes; (c) for σ &amp;gt; σ-bar-bar, delay must include a stagnation period with no R&amp;amp;D at all. The numerical example shows that for σ = 2.5 and σ = 3, delayed paths involve a flat simultaneous-research segment (mc = 1/2). For σ = 5 and σ = 7, equilibrium paths involve switches between clean-only and dirty-only regimes. For σ = 8 and σ = 9, paths contain vertical stagnation sections and multiple regime switches, with clean innovation peaking, falling, then rising again before converging to the clean steady state.&lt;/p&gt;
&lt;h3 id="q4-what-does-the-welfare-analysis-reveal-about-the-costs-of-delayed-transition"&gt;Q4. What does the welfare analysis reveal about the costs of delayed transition?&lt;/h3&gt;
&lt;p&gt;Under the calibrated model (σ = 1.5, θc,0 = 0.177), three equilibrium paths coexist under the Pigouvian carbon tax, corresponding to no delay, short delay, and long delay in clean innovation. Paths with delay accumulate more dirty capital (Qd,∞ &amp;gt; Qd,0), creating more stranded assets in the long run. Figure 4 shows that, at calibrated emission intensity (ad = 0.198), the clean-only path dominates in welfare whenever multiple equilibria arise. However, at a counterfactually low pollution intensity (ad = 0.0198, one-tenth of calibrated), the planner may prefer some temporary dirty innovation when the clean sector starts small, because investment complementarities in the (larger) dirty sector generate higher short-run consumption growth that outweighs the smaller pollution cost.&lt;/p&gt;
&lt;h3 id="q5-why-does-a-pigouvian-carbon-tax-fail-to-coordinate-the-transition-and-what-instruments-can-succeed"&gt;Q5. Why does a Pigouvian carbon tax fail to coordinate the transition, and what instruments can succeed?&lt;/h3&gt;
&lt;p&gt;A Pigouvian tax changes the marginal cost of emissions and affects relative profitability, but it does not fully control relative innovation profitability because strategic complementarities within a sector persist: total innovation in a sector still raises marginal returns for all firms in it, and the complementarity can dominate the tax effect. An emission cap, by contrast, fixes the quantity of dirty output (given the Leontief emissions-to-output structure), which mutes the complementarity: expanding dirty productivity no longer pays if the quantity cap is binding. A minimum clean revenue guarantee sets a floor on clean firms&amp;rsquo; profits that controls relative profitability directly without taxing the dirty sector. A dirty R&amp;amp;D tax raises the marginal cost of dirty research, shifting the innovation regime border and eliminating dirty equilibrium paths. A contingent super-Pigouvian carbon tax (above the social cost of carbon) that activates only when the economy innovates in the dirty sector also works. All of these require policy commitment over the duration of the multiple-equilibria region; without commitment they fail.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-paper-relate-to-and-differ-from-acemoglu-et-al-2012"&gt;Q6. How does the paper relate to and differ from Acemoglu et al. (2012)?&lt;/h3&gt;
&lt;p&gt;The model starts from Acemoglu et al. (2012) but reaches a qualitatively different policy conclusion. Acemoglu et al. (2012) acknowledge the multiplicity of equilibria in their appendix but restrict their analysis to initial conditions and policies that make equilibrium unique, concluding that a Pigouvian tax combined with an R&amp;amp;D subsidy is sufficient for the optimal transition. This paper shows that when forward-looking expectations and investment complementarities are fully accounted for, the coordination failure is separate from the pollution and monopoly externalities, and a Pigouvian tax — even when optimal — does not resolve it. The paper also differs by using infinite patent length (vs. one-period) and an integrated labor market (vs. segmented), though Appendices OA3.2 and OA4 show the qualitative conclusions are robust to these modeling choices.&lt;/p&gt;
&lt;h3 id="q7-how-does-the-paper-relate-to-the-stranded-asset-literature"&gt;Q7. How does the paper relate to the stranded asset literature?&lt;/h3&gt;
&lt;p&gt;Van der Ploeg and Rezai (2020) and Kalkuhl et al. (2020) explain asset stranding through policy uncertainty, distributional effects, or disordered transition. This paper provides a complementary explanation: excess dirty investment and asset stranding can occur even under a committed, fully optimal Pigouvian tax — not because of uncertainty, but because of rational coordination failure. Firms continue investing in polluting technologies, knowing a clean steady state is inevitable, because strategic complementarities make the dirty sector temporarily attractive when the dirty sector is larger. The amount of stranded assets varies across equilibria: the longer the delay in clean innovation, the larger the accumulated stock of ultimately worthless dirty technology capital (Qd,∞ &amp;gt; Qd,0).&lt;/p&gt;
&lt;h3 id="q8-what-role-do-knowledge-spillovers-and-cross-sectoral-knowledge-externalities-play"&gt;Q8. What role do knowledge spillovers and cross-sectoral knowledge externalities play?&lt;/h3&gt;
&lt;p&gt;The baseline model assumes knowledge spillovers within sectors (quality in sector j benefits from sector-wide average quality Qj). The Online Appendix (OA3) shows that inter-sectoral knowledge spillovers (parameter χ) do not affect complementarities at all, because knowledge stock is predetermined and current rival innovation cannot affect one&amp;rsquo;s own value through the knowledge channel. Learning-by-doing production spillovers (parameter ε) strengthen complementarities. The general condition for self-fulfilling prophecies in the extended model is ψ &amp;gt; max{0, -η}, where ψ = (1+ε)(σ-1)(1-α)/(1-ωα) - 1 and η measures own-sector knowledge advantage in innovation productivity. The baseline model (ε=0, ω=1) gives ψ = σ-2, recovering the σ &amp;gt; 2 condition.&lt;/p&gt;
&lt;h3 id="q9-what-are-the-policy-implications-and-their-scope-conditions"&gt;Q9. What are the policy implications and their scope conditions?&lt;/h3&gt;
&lt;p&gt;The main policy implication is that a single Pigouvian carbon tax is insufficient for the optimal green transition even if credibly committed to; a coordination device is necessary as a second instrument. Scope conditions: (1) This conclusion holds whenever σ &amp;gt; 1 under optimal industry policy (which internalizes monopoly and spillover externalities) — the threshold is lower than σ &amp;gt; 2 in the unregulated economy. (2) The preferred coordination device (revenue guarantee, emission cap, dirty R&amp;amp;D tax, or contingent super-Pigouvian tax) depends on institutional constraints. (3) All coordination devices require policy commitment for the duration of the multiple-equilibria region. (4) The conclusion that the clean-only path is welfare-superior when multiple equilibria arise holds at calibrated emission intensity; at very low pollution intensity the planner might prefer some temporary dirty innovation. (5) The analysis abstracts from uncertainty, heterogeneous beliefs, large players, multiple abatement options, and physical capital — directions for future quantitative work.&lt;/p&gt;
&lt;h3 id="q10-what-is-the-role-of-impatience-ρ-and-patent-length-in-the-size-of-the-coordination-problem"&gt;Q10. What is the role of impatience (ρ) and patent length in the size of the coordination problem?&lt;/h3&gt;
&lt;p&gt;Proposition 3 shows that the overlap (the range of initial conditions admitting multiple equilibria) decreases with impatience ρ. When ρ is large, investors discount future profits heavily, limiting how far ahead expectations can drive current investment choices. In the limit of infinite impatience, only current profit matters and the game collapses to a static one-period coordination problem (Section 3.3). Shorter patent length, modeled as a Poisson patent infringement risk ι (OA3.2), acts identically to higher ρ in the equilibrium dynamics: the dynamics of the model with infringement risk ι are identical to the baseline with ρ replaced by ρ + ι. Hence shorter patents shrink the overlap, and policy must subsidize R&amp;amp;D to compensate for the excessively short investment horizon.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Strategic investment complementarity&lt;/strong&gt;: Within-sector R&amp;amp;D is a strategic complement when σ &amp;gt; 2: one firm&amp;rsquo;s innovation raises the return to other firms&amp;rsquo; innovation in the same sector, because the demand shift effect (innovation increases sector expenditure share) outweighs the business-stealing effect (innovation dilutes rivals&amp;rsquo; profit share). This is not a knowledge spillover but a demand externality operating through the market size of the innovating sector.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Overlap&lt;/strong&gt;: The range of initial clean market shares θc,0 from which both the clean and dirty corner steady states can be reached in a rational expectations equilibrium. The overlap exists if and only if σ &amp;gt; 2 in the unregulated economy (σ &amp;gt; 1 under optimal industry policy), grows with the substitution elasticity σ, and shrinks with impatience ρ or shorter patent length.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Market valuation share (mc)&lt;/strong&gt;: The share of the clean sector in the total marginal value of innovation across sectors, defined as mc = Qcλc / (Qcλc + Qdλd). When mc &amp;gt; 1/2, the economy is in the clean-only innovation regime; when mc &amp;lt; 1/2, in the dirty-only regime; when mc = 1/2, simultaneous research is active. Because mc is a forward-looking, continuous variable, it captures investors&amp;rsquo; collective expectation about future market conditions and directly determines the direction of technical change.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Self-fulfilling prophecy (in innovation)&lt;/strong&gt;: An equilibrium in which investors&amp;rsquo; shared belief about the future direction of innovation is rational precisely because all investors, acting on that belief, make it come true. If all investors expect the dirty sector to remain large, they concentrate R&amp;amp;D there, the dirty sector grows, and the belief is confirmed. The same logic applies to clean beliefs. In the paper&amp;rsquo;s context, self-fulfilling prophecies extend to the speed of transition: even if firms agree the economy will eventually go clean, pessimistic beliefs about timing can rationally support periods of dirty innovation before the switch.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Delayed transition&lt;/strong&gt;: An equilibrium path in which the economy ultimately converges to the clean steady state but investors temporarily concentrate R&amp;amp;D in the dirty sector before switching permanently to clean. The delay generates more stranded dirty assets (a higher terminal dirty technology stock Qd,∞) and higher short-run growth (via dirty-sector complementarities) relative to the fast-transition path. Multiple delayed paths may coexist, distinguished by the length of the dirty innovation period and the amount of accumulated dirty capital.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Coordination device&lt;/strong&gt;: A policy instrument that directly controls the relative profitability of clean versus dirty innovation, thereby eliminating the undesired equilibrium paths without relying solely on price incentives. The paper identifies four classes: (1) minimum clean revenue guarantee, (2) emission cap (quantity-based), (3) dirty R&amp;amp;D tax or clean R&amp;amp;D subsidy, and (4) contingent super-Pigouvian carbon tax. All require government commitment for the duration of the multiple-equilibria region. A clean R&amp;amp;D subsidy is inferior because it distorts labor allocation toward innovation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stranded assets&lt;/strong&gt;: In this paper, the dirty technology capital that becomes economically worthless in the clean steady state. The amount of stranding is determined by the dirty technology stock at the moment the economy permanently switches to clean innovation (Qd,∞). Different equilibrium paths — fast vs. delayed transitions — imply different terminal dirty stocks and hence different quantities of stranded assets. Excess stranding relative to the social optimum is a welfare cost of coordination failure.&lt;/p&gt;</description></item><item><title>Unconventional Monetary Policies and Inequality</title><link>https://macropaperwarehouse.com/papers/unconventional-monetary-policies-and-inequality/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/unconventional-monetary-policies-and-inequality/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper asks whether the Federal Reserve&amp;rsquo;s unconventional monetary policies (UMP) — specifically quantitative easing (QE) and forward guidance — exacerbated income and welfare inequality in the United States during the effective lower bound (ELB) episode following the Great Recession (2009–2015). The question is empirically and theoretically contested: QE raises profits and equity prices, benefiting wealthy households who hold most equity, while simultaneously reducing unemployment, which benefits poorer households who rely almost entirely on labor income. Resolving the net effect requires a unified framework that captures both channels simultaneously, with empirically realistic responses of profits, wages, and unemployment to monetary policy.&lt;/p&gt;
&lt;p&gt;The paper builds a medium-scale Heterogeneous Agent New Keynesian (HANK) model that incorporates: (i) a two-asset structure (liquid deposits and illiquid equity) with portfolio adjustment costs; (ii) three working statuses — employed, unemployed, and business owner — with endogenous job-finding rates determined by a search-and-matching labor market; (iii) a banking sector modeled after Gertler and Karadi (2011), with a moral-hazard leverage constraint; (iv) a substantial fixed cost in production that, combined with wage rigidity, generates procyclical profit responses to monetary policy shocks — a feature absent from standard New Keynesian models and critical for capturing benefits to wealthy households; and (v) an occasionally binding ELB constraint with QE modeled as central bank asset purchases and forward guidance modeled as exogenous expected ELB durations following Jones (2017). The model is calibrated to match the 2007 Survey of Consumer Finances (SCF), targeting the top decile&amp;rsquo;s share of wealth (~70%), income composition across wealth groups, and standard labor market and financial sector moments. Remaining parameters are estimated using Bayesian methods on U.S. quarterly data from 1992 Q1 to 2018 Q4, using ten observables (output, consumption, investment, inflation, nominal interest rate, real wage, unemployment, lump-sum transfers, profits, and Federal Reserve assets), with the ELB regime handled via an inversion filter and the Kulish-Jones method for exogenous ELB durations.&lt;/p&gt;
&lt;p&gt;At the posterior mode, the model attributes the Great Recession primarily to a series of large negative risk premium shocks around 2008–2009, causing investment to fall by more than 20% relative to the pre-crisis level. The central counterfactual compares the actual ELB episode (with UMP) against a scenario where the central bank held its balance sheet constant and allowed ELB durations to be determined endogenously by fundamentals. Between 2009 and 2015, UMP on average produced: a 3.3% increase in profits, a 0.9% increase in equity prices, a 1.5 percentage-point reduction in the unemployment rate, and only a 0.1% increase in real wages (reflecting high estimated wage rigidity). Output and investment were higher by approximately 1% and 3% respectively on average, with profits rising as much as 8% during the ELB episode.&lt;/p&gt;
&lt;p&gt;These aggregate effects translated into non-linear distributional outcomes. For the Gini index, lower unemployment reduced the income Gini by up to 0.6 percentage points, but this was offset by about 80% by the increase in profits and equity prices — leaving only a marginal net Gini reduction of 0.04 percentage points on average. When computed for the bottom 90% alone, the Gini reduction was more pronounced because that group relies overwhelmingly on labor income. However, the income share of the top 10% rose by an average of 0.17 percentage points, driven mainly by higher profits and equity prices. Thus the answer to whether UMP raised inequality is measure-dependent: UMP reduced within-bottom-90% inequality while widening the top-decile income gap.&lt;/p&gt;
&lt;p&gt;Welfare gains (consumption equivalents over the ELB episode) were U-shaped across the wealth distribution: the average gain was 0.27% of lifetime consumption, but households at both extremes gained more than the middle. The bottom 10% benefited from higher job-finding rates (gaining ~0.3%), the top 10% from profits and equity prices (also ~0.3%), and the top 1% gained ~0.33%. The middle 60% gained only ~0.26%. By working status, business owners gained the most (0.82%), followed by the unemployed (0.35%) and the employed (0.27%).&lt;/p&gt;
&lt;p&gt;Decomposing UMP into QE and forward guidance, the paper finds that forward guidance accounted for approximately 55% of total UMP stimulus. Forward guidance amplified both the aggregate and distributional effects of asset purchases: QE alone raised the top 10% income share by about 0.1 percentage point, and forward guidance added a further 0.09 percentage point increase. Forward guidance lowered the overall Gini by about 0.05 percentage points more than QE alone around 2013, and reduced the bottom-90% Gini by an additional 0.2 percentage points during the same period. The interaction intensified what the paper calls a &amp;ldquo;hollowing out&amp;rdquo; of the middle class: forward guidance further reduced middle-60% income shares while leaving bottom-10% shares nearly unchanged, because the additional stimulus disproportionately raised profits and equity prices (by about 2% and 1%, respectively, between 2011 and 2014).&lt;/p&gt;
&lt;p&gt;Comparing QE with a hypothetical conventional monetary policy (CMP) that would have allowed the nominal rate to drop to approximately -1%, the paper finds that CMP would have produced larger aggregate stimulus than QE but more adverse distributional effects. Under CMP, lower financing costs disproportionately boosted bank net worth, indirectly raising profits and benefiting wealthy households even more than QE did. Under QE, central bank asset purchases crowded out private bank investment by reducing expected equity returns even as they raised equity prices, partially dampening the profitability gains to the financial sector. Consequently, CMP would have delivered above-average welfare gains only to the bottom 1% (debtors benefiting from lower real rates) and the top 10% (through larger bank profit effects), while the broad middle class would have fared no better and in some dimensions worse.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s key methodological contribution is the first Bayesian estimation of a HANK model with an occasionally binding ELB constraint. Its key substantive finding is that standard NK models, which generate countercyclical profits, systematically understate the benefits that expansionary monetary policy delivers to wealthy households, producing a misleading or incomplete picture of the distributional effects of monetary policy.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-models-identification-strategy-and-how-is-the-elb-period-handled-in-estimation"&gt;Q1. What is the model&amp;rsquo;s identification strategy and how is the ELB period handled in estimation?&lt;/h3&gt;
&lt;p&gt;The model is estimated with Bayesian methods using an inversion filter (following Guerrieri and Iacoviello 2017 and Cuba-Borda et al. 2019) on ten quarterly observables from 1992 Q1 to 2018 Q4. The key identification challenge is the occasionally binding ELB constraint. The paper follows Kulish et al. (2014) and Jones (2017), treating the ELB as a temporary alternative regime with exogenous expected durations. These expected durations are themselves estimated as latent variables, with priors informed by the New York Fed&amp;rsquo;s primary dealer survey. The Metropolis-Hastings algorithm is used for structural parameters (treating ELB durations as fixed in each draw), while ELB durations are drawn separately using a discrete uniform proposal density. To make estimation computationally feasible given the large idiosyncratic state space, the paper follows Bayer and Luetticke (2020) and updates only the subset of the model Jacobian corresponding to &amp;lsquo;aggregate&amp;rsquo; and &amp;lsquo;summary&amp;rsquo; equations during each iteration, leaving the &amp;lsquo;idiosyncratic&amp;rsquo; blocks fixed across estimated parameters.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-main-mechanisms-by-which-ump-affects-inequality-and-how-does-the-model-distinguish-them-empirically"&gt;Q2. What are the main mechanisms by which UMP affects inequality and how does the model distinguish them empirically?&lt;/h3&gt;
&lt;p&gt;The paper identifies four main channels: (1) Profit and equity price channel — QE raises equity prices and reduces financing costs, increasing profits and the dividend rate on illiquid assets. Because the top decile holds ~70% of total wealth overwhelmingly in the form of equity, with capital and business income accounting for ~50% of their income, this channel benefits the wealthy disproportionately. (2) Unemployment channel — lower interest rates stimulate demand and raise the job-finding rate. Because households at the bottom of the wealth distribution are more likely to be unemployed at the onset of the ELB episode (8.75% of the bottom decile vs. 6.54% in the middle quintile in 2009 Q1), this channel is progressive. (3) Wage channel — nominal and real wage rigidity (only one-fifth of the real wage adjusts to labor productivity changes) means that the wage channel is very weak; average real wages rose by only 0.1% due to UMP. (4) Inflation/redistribution channel — forward guidance generates inflationary expectations that compress real rates, redistributing from savers to debtors. The empirical decomposition is performed by first isolating QE alone (endogenizing ELB durations) and then comparing to the full UMP scenario (exogenous ELB durations), attributing the residual effect to forward guidance.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-key-modeling-innovation-regarding-profits-and-why-does-it-matter-for-inequality"&gt;Q3. What is the key modeling innovation regarding profits, and why does it matter for inequality?&lt;/h3&gt;
&lt;p&gt;Standard New Keynesian models generate countercyclical profit responses to monetary policy shocks: when demand rises, price rigidity keeps prices sticky while factor prices (wages) adjust upward, squeezing markups and reducing profits. This contradicts empirical evidence from structural VARs, which show procyclical profits. The paper introduces three interacting features that resolve this: (a) a substantial fixed cost of production calibrated to roughly 20% of steady-state output, so that average production cost falls even as marginal cost rises, boosting net profits; (b) wage rigidity with search-and-matching frictions, so that real wages respond very weakly to monetary shocks; and (c) a banking sector with a financial accelerator, so that rising equity prices boost banks&amp;rsquo; net worth and their investment demand, further amplifying profits. Without procyclical profits, the model would understate the benefits wealthy households (whose income depends heavily on profits and equity returns) gain from expansionary monetary policy, producing an incomplete picture of distributional effects.&lt;/p&gt;
&lt;h3 id="q4-what-heterogeneity-in-households-balance-sheets-and-income-composition-is-documented-and-how-does-it-shape-distributional-results"&gt;Q4. What heterogeneity in households&amp;rsquo; balance sheets and income composition is documented, and how does it shape distributional results?&lt;/h3&gt;
&lt;p&gt;Using the 2007 SCF, the paper documents stark composition differences. The bottom 80% of the wealth distribution derives ~80% of income from labor, with transfer income making up most of the rest. The top 10% derives about 50% from labor and 50% from capital (equity and business income). For the top 0.1%, labor income is only 16% and capital/business income is about 83–85%. In the model, the top 10% hold about 70% of total wealth, overwhelmingly in illiquid equity. These composition differences mean that any policy raising profits and equity prices is strongly progressive at the top and neutral-to-mild at the bottom, while any policy reducing unemployment is strongly progressive at the bottom. The interplay of these two forces explains why UMP simultaneously reduces bottom-90% inequality (through the unemployment channel) and widens the top-vs.-rest gap (through the profit and equity channel), and why welfare gains are U-shaped rather than monotone.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-welfare-accounting-methodology-and-what-are-the-key-welfare-findings"&gt;Q5. What is the welfare accounting methodology and what are the key welfare findings?&lt;/h3&gt;
&lt;p&gt;Welfare gains are measured as consumption equivalents — the fraction of lifetime consumption that a household in the counterfactual (no UMP) scenario would be willing to forgo to enjoy the UMP outcome. Households are sorted into wealth groups based on their 2009 Q1 wealth position (so group composition is not affected by UMP), and the same households are followed throughout the episode. Beyond the sample end (2018 Q4), no further shocks are assumed. The average welfare gain at the posterior mode is 0.27% of lifetime consumption. Bottom 10%: ~0.3% (driven by higher job-finding rates). Top 10%: ~0.3% (driven by profits and equity gains). Top 1%: ~0.33%. Middle 60%: ~0.26%. Business owners: 0.82%. The unemployed: 0.35%. The employed: 0.27%. Critically, the welfare gaps between extremes and middle are smaller than the income gaps, because anticipated tapering after the sample implies lower future profits and equity prices for wealthy households, narrowing their long-term advantage.&lt;/p&gt;
&lt;h3 id="q6-how-do-the-contributions-of-qe-and-forward-guidance-compare-in-aggregate-and-distributional-terms"&gt;Q6. How do the contributions of QE and forward guidance compare in aggregate and distributional terms?&lt;/h3&gt;
&lt;p&gt;Forward guidance accounted for approximately 55% of the total UMP stimulus at the posterior mode. Exogenous expected ELB durations exceeded endogenous (fundamentals-based) durations by 1–2 quarters on average, and sometimes by up to 8 quarters, with the divergence widening from 2011 onward. In distributional terms, QE alone initially reduced the bottom-90% Gini and raised the top 10% income share by about 0.1 percentage point. Forward guidance amplified both effects: it lowered the overall Gini by an additional ~0.05 pp and the bottom-90% Gini by an additional 0.2 pp around 2013, but also added a further ~0.09 pp to the top 10% income share between 2011 and 2014. The amplification occurred because forward guidance raised profits and equity prices by about 2% and 1% respectively during that window, intensifying the income concentration at the top while also stimulating job creation at the bottom. The middle class saw its income share further compressed.&lt;/p&gt;
&lt;h3 id="q7-how-does-qe-compare-with-conventional-monetary-policy-in-terms-of-aggregate-and-distributional-effects"&gt;Q7. How does QE compare with conventional monetary policy in terms of aggregate and distributional effects?&lt;/h3&gt;
&lt;p&gt;In the counterfactual CMP scenario, the nominal policy rate drops to approximately -1% and remains negative for an extended period. CMP produces larger aggregate stimulus than QE: the stimulus effects of QE were partly crowded out by general equilibrium effects, specifically QE reduced banks&amp;rsquo; expected return on equity even as it raised equity prices, discouraging private bank investment. Under CMP, lower nominal rates instead benefit banks through lower financing costs, boosting bank net worth via an accelerator mechanism more strongly than under QE. This difference has distributional consequences: CMP would have delivered higher welfare gains only to the bottom 1% (low-wealth debtors benefiting from lower real rates on their liabilities) and the top 10% (benefiting from larger bank profits). Households in the broad middle — already employed, holding limited equity, neither heavy borrowers nor large business income recipients — would have been no better off and in some dimensions worse off under CMP. The paper thus concludes that QE had less adverse distributional effects than CMP would have had, absent the ELB constraint.&lt;/p&gt;
&lt;h3 id="q8-what-robustness-checks-and-sensitivity-analyses-are-conducted"&gt;Q8. What robustness checks and sensitivity analyses are conducted?&lt;/h3&gt;
&lt;p&gt;The paper checks results against: (a) the full 10th–90th percentile range of the posterior distribution for all key findings on aggregate effects, income inequality, welfare gains, and QE vs. CMP comparisons, showing that qualitative findings are robust to parameter uncertainty; (b) a comparison between rigid-wage and flexible-wage model variants (Table A1), showing that the flexible-wage version generates countercyclical profits, a weak unemployment response, and a strong real wage response — inconsistent with empirical SVAR evidence — validating the modeling choice of high wage rigidity; (c) a structural VAR analysis on U.S. data confirming procyclical profits, weak real wage responses, and significant unemployment responses to monetary policy shocks; (d) a comparison of the OccBin method (endogenous ELB durations, Guerrieri and Iacoviello 2015) vs. the Kulish-Jones method (exogenous durations) for solving the occasionally binding constraint; (e) a check that wages implied by the calibrated wage function always remain in the bargaining set, validating the equilibrium wage assumption.&lt;/p&gt;
&lt;h3 id="q9-what-are-the-key-differences-between-this-paper-and-the-closest-prior-work"&gt;Q9. What are the key differences between this paper and the closest prior work?&lt;/h3&gt;
&lt;p&gt;Kaplan, Moll, and Violante (2018) and Bayer et al. (2020) have two-asset HANK models but omit frictional labor markets, so they cannot capture how monetary policy affects employment and thus the progressive unemployment channel. Gornemann et al. (2016) include search-and-matching labor markets but only one asset, so they cannot capture the capital income benefits to wealthy households. Broer et al. (2019) and Auclert et al. (2023) identify the countercyclical profit problem but their solutions (wage rigidity alone) produce procyclical profits that are too weak quantitatively. This paper combines fixed costs, wage rigidity, and a banking sector to produce procyclical profits quantitatively consistent with SVAR evidence. On unconventional policy specifically, Lenza and Slacalek (2018) and Casiraghi et al. (2018) study ECB QE with partial equilibrium methods and find inequality-reducing effects; Bivens (2015) and Montecino and Epstein (2015) reach opposite conclusions for U.S. QE. This paper is the first to study both QE and forward guidance jointly in a Bayesian-estimated HANK model with an explicitly binding ELB, and is to the author&amp;rsquo;s knowledge the first to estimate a HANK model with an occasionally binding ELB constraint.&lt;/p&gt;
&lt;h3 id="q10-what-are-the-main-policy-implications-and-their-scope-conditions"&gt;Q10. What are the main policy implications and their scope conditions?&lt;/h3&gt;
&lt;p&gt;First, UMP&amp;rsquo;s inequality effects are measure-dependent: policies that simultaneously stimulate employment and profits can reduce within-bottom-90% inequality while widening the top-vs.-rest gap. Policymakers who cite Gini reductions and those who cite rising top-income shares are both correct, pointing to different parts of the distribution. Second, forward guidance amplifies inequality effects as much as it amplifies aggregate effects, so its use carries a distributional cost concentrated at the top of the distribution. Third, QE had less adverse distributional effects than conventional monetary policy would have had, suggesting that concerns about QE&amp;rsquo;s inequality effects should be placed in context of the ELB constraint — the relevant comparison is not QE vs. no policy but QE vs. CMP with the ELB absent. Fourth, models that generate countercyclical profits will systematically understate benefits to the wealthy and potentially reach qualitatively different conclusions about whether monetary policy raises or reduces inequality. These findings are scoped to the U.S. Great Recession ELB episode, estimated with the specific HANK model structure and Bayesian posterior; findings may differ for different financial structures, more generous unemployment insurance, or different asset price dynamics.&lt;/p&gt;
&lt;h3 id="q11-what-drives-the-great-recession-in-the-model-and-how-is-ump-modeled-mechanically"&gt;Q11. What drives the Great Recession in the model and how is UMP modeled mechanically?&lt;/h3&gt;
&lt;p&gt;At the posterior mode, the Great Recession is primarily attributed to a series of large negative risk premium shocks (shocks to banks&amp;rsquo; discount factor) around 2008–2009, which caused banks to sharply contract their investment, leading to the investment collapse (&amp;gt;20% below pre-crisis). QE is modeled following Gertler and Karadi (2011): the central bank issues bonds (sold to the private sector) and uses proceeds to purchase equity directly, converting non-productive asset demand into productive capital demand and raising equity prices and investment. Forward guidance is modeled as setting exogenous expected ELB durations longer than would be implied endogenously by the Taylor rule fundamentals, effectively mimicking future negative interest rate shocks and inducing inflationary pressure via intertemporal substitution. The expected ELB durations at the posterior mode range from 6 to 8 quarters through 2013, falling sharply to 1–2 quarters by late 2014–2015.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneous Agent New Keynesian (HANK) model&lt;/strong&gt;: As used in this paper, a DSGE model where households differ ex-post in idiosyncratic productivity, asset holdings (liquid deposits and illiquid equity), and employment status; combined with search-and-matching labor markets, a banking sector with leverage constraints, and a zero lower bound on the policy rate. The heterogeneity in wealth composition and income sources determines how aggregate policy shocks translate into distributional outcomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Procyclical profits&lt;/strong&gt;: The property, established empirically via SVAR and reproduced in the model, that firm profits rise in response to expansionary monetary policy shocks. Standard New Keynesian models generate the opposite (countercyclical profits) because price rigidity compresses markups when demand rises. In this paper, the combination of large fixed costs in production, wage rigidity, and a banking sector financial accelerator is required to generate quantitatively realistic procyclical profit responses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effective lower bound (ELB) episode&lt;/strong&gt;: The period from 2009 Q1 to 2015 Q4 during which the Federal Reserve&amp;rsquo;s policy rate was constrained at zero. In the model, this is treated as a temporary alternative regime with exogenous expected durations; when the policy rate hits the ELB, the central bank can only affect the economy through asset purchases (QE) and forward guidance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Forward guidance (as exogenous expected ELB durations)&lt;/strong&gt;: In this paper&amp;rsquo;s framework, forward guidance is operationalized as the central bank committing to maintain the policy rate at zero for a longer period than the endogenous (fundamentals-based) Taylor rule would prescribe. This is parameterized as an exogenous expected ELB duration that exceeds the endogenous one, creating anticipations of future negative interest rate shocks and thus stimulating activity through intertemporal substitution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption equivalent welfare gain&lt;/strong&gt;: The fraction of lifetime consumption that a household in the counterfactual scenario (no UMP) would be willing to forgo in order to instead experience the outcomes under UMP. Used to compare welfare across heterogeneous households in a cardinal, utility-based metric rather than income alone.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Business owner working status&lt;/strong&gt;: A third working status (alongside employed and unemployed), following Bayer et al. (2019), in which households receive a fixed fraction of aggregate profits as income without supplying labor. Business owners transition into and out of this status exogenously and are the highest-income group in the model, calibrated to match the top-decile&amp;rsquo;s share of liquid assets and the income composition data showing that capital and business income dominate the very top of the wealth distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Inversion filter&lt;/strong&gt;: The likelihood evaluation method used in this paper for Bayesian estimation, following Guerrieri and Iacoviello (2017). Rather than running a Kalman filter, structural shocks are backed out directly by inverting the linear solution of the model given the observed data and a given set of expected ELB durations. This avoids continuously updating the large state-transition matrix and makes estimation computationally feasible.&lt;/p&gt;</description></item><item><title>University Research and the Market for Higher Education</title><link>https://macropaperwarehouse.com/papers/university-research-and-the-market-for-higher-education/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/university-research-and-the-market-for-higher-education/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper proposes that university R&amp;amp;D is determined endogenously by competition for tuition and talented students in the market for higher education, and asks why universities fund research internally with tuition despite negligible returns to patenting. Motivation: between 2000 and 2018 U.S. universities accounted for 13% of aggregate R&amp;amp;D spending and 53% of all basic-research spending, yet in 2018 over 25% of university research was internally funded (25.54% in 2018; federal government 52.97%) while between 1991 and 2018 the median university earned patent licensing revenue totaling less than 2% of its R&amp;amp;D expenditure. Internal funds therefore come essentially from tuition.&lt;/p&gt;
&lt;p&gt;Approach: (1) four stylized facts from administrative microdata (IPEDS, NSF HERD survey covering 916 universities / 99.1% of sector R&amp;amp;D, AUTM patent-licensing survey, Web of Science / Leiden bibliometrics); (2) a causal natural experiment; (3) a general-equilibrium model of the higher-education sector with heterogeneous universities choosing teaching and research, calibrated to U.S. data; and (4) policy counterfactuals.&lt;/p&gt;
&lt;p&gt;Causal evidence: the authors exploit the 1998-2003 doubling of the NIH budget (from $13.6bn to $27.1bn) using a Bartik shift-share instrument built from each university&amp;rsquo;s pre-period (1993-1997) share of federal life-science grants, regressing the change in net tuition (1993-1997 to 2004-2008) on the instrumented change in R&amp;amp;D per student, with state-clustered standard errors and state-specific trends. The benchmark estimate is that a $1.00 increase in R&amp;amp;D spending per student raises tuition by $0.15 (s.e. 0.05) — universities recoup up to 15% of R&amp;amp;D through higher tuition. Across specifications the effect ranges $0.10-$0.15; it is driven by research universities (non-liberal-arts), is statistically insignificant for liberal arts colleges, and a placebo using student-amenities spending shows no significant effect. The point estimate is about 60% larger at private non-profits than publics, but that difference is not statistically significant.&lt;/p&gt;
&lt;p&gt;Model and mechanism: education quality q = k^ωk * z̄^ωz * eT^ωe depends on intangible knowledge capital k (accumulated via research, k&amp;rsquo; = k^γk * eR^γe), peer ability z̄, and teaching spending. Universities maximize discounted education quality, funding research from tuition. Equilibrium features an endogenous college hierarchy with two-dimensional sorting by ability and family income. The research share sR rises with the steepness of the college quality-ladder Σq/Σk; when students are highly stratified or tuition rises sharply with rank, universities invest in research even if the direct contribution to teaching (ωk) is small — research persists even as ωk→0 (acting as a pure signal). Incentives fall when intangible capital is highly dispersed across colleges.&lt;/p&gt;
&lt;p&gt;Calibration matches the joint distribution of research, tuition, and student ability, plus untargeted R&amp;amp;D dispersion; simulated NIH expansion yields $0.18 per $1 in steady state and $0.11 along the transition, bracketing the empirical $0.10-$0.15.&lt;/p&gt;
&lt;p&gt;Policy findings (long-run, vs baseline): removing all need-based federal tuition subsidies cuts university research by 8.1% (replacing progressive with revenue-neutral flat tuition subsidy: -2.2%); progressive aid compresses revenue dispersion, steepens the quality-ladder, and raises the research share (+0.8 pp). Removing all federal research grants cuts research by 69.1% — only 6.9 pp below the government&amp;rsquo;s 76% funding share, implying crowding-out: the meritocratic grant structure concentrates funds at top schools, flattening the ladder and cutting the research share by 16.4 pp. A revenue-neutral flat research subsidy would instead raise research by 14.8%, human capital by 9.6%, and output by 11.1%.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-identification-strategy-and-what-are-the-main-threats-to-it"&gt;Q1. What is the identification strategy and what are the main threats to it?&lt;/h3&gt;
&lt;p&gt;A Bartik/shift-share IV exploiting the 1998-2003 NIH budget doubling. Each university&amp;rsquo;s change in R&amp;amp;D is instrumented by its pre-period (1993-1997) share of all federal life-science research grants. Relevance: NIH was the bulk of federal life-science funding before the shock and did not substantially change award criteria, so high-share schools received mechanically larger funding increases. Exogeneity requires that universities did not systematically invest in life-science research in the pre-period in anticipation of the expansion. The estimation is in long-differences comparing steady states; standard errors are clustered at the state level with state-specific tuition trends. Threats: the NIH expansion occurs at a common point in time, so it may correlate with other contemporaneous market changes; initially larger or higher-quality research universities might have raised tuition for reasons unrelated to R&amp;amp;D. The authors address this with group-specific time trends (public/private, pre-existing life-science status, school size, initial quality via faculty-student ratio) and pre-trend controls (1987-1992 faculty-student ratio, FTE size, life-science status). A limitation the authors acknowledge: they cannot test the effect on subsequent student ability because ability proxies are only available after the intervention.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-main-mechanisms-and-how-are-they-distinguished"&gt;Q2. What are the main mechanisms and how are they distinguished?&lt;/h3&gt;
&lt;p&gt;The college quality-ladder Σq/Σk (the cross-sectional elasticity of education quality with respect to intangible capital) is the sufficient statistic for research incentives. Equation (14) decomposes it into three channels: (i) the direct teaching contribution of research ωk; (ii) attracting better students, ωz × Σz̄/Σk; and (iii) charging higher tuition, ωe × ΣR/Σk. Channels (ii) and (iii) flow from competition for talented students and tuition and can dominate even when ωk is tiny. Empirically, Σz̄/Σk maps to the cross-sectional elasticity of student ability w.r.t. research (Figure 3) and ΣR/Σk to the elasticity of tuition w.r.t. research (Figure 4), so the calibration disciplines these channels with observable cross-sectional relationships.&lt;/p&gt;
&lt;h3 id="q3-what-heterogeneity-is-documented"&gt;Q3. What heterogeneity is documented?&lt;/h3&gt;
&lt;p&gt;The tuition effect is concentrated in research universities (non-liberal-arts), with a larger, highly significant point estimate; for liberal arts colleges the NIH shock has no statistically significant effect on tuition (the authors caution the LAC sample is smaller — ~32% of institutions, ~24% of FTE — and more heterogeneous, so power may be insufficient). The effect appears ~60% stronger at private non-profits than publics, but the difference is not statistically significant. Across the model, top schools and bottom schools both invest less in research when intangible capital is highly dispersed (top schools face weak incentives to improve already-secure rank; bottom schools find climbing too costly).&lt;/p&gt;
&lt;h3 id="q4-what-robustness-checks-are-run"&gt;Q4. What robustness checks are run?&lt;/h3&gt;
&lt;p&gt;Empirically: adding pre-trend controls (column 3) leaves estimates intact; splitting by NLA vs LAC; and a placebo replacing R&amp;amp;D with student-services (amenities) spending, which yields no significant effect, rejecting spurious cross-category correlation. In the model: (1) the limiting case ωk→0 where research is a pure signal — the research share falls from 8.8% to 2.4% of tuition but stays strictly positive, and policy effects retain 50% (tuition-subsidy removal: -0.4 pp vs -0.8) and 66% (research-subsidy removal: +10.8 vs +16.4 pp) of their magnitude; (2) allowing some teaching expenditure to also enter intangible-capital production (γT&amp;gt;0), where the research share falls from 8.8% to 4.7% and policy effects moderate (-0.4 pp and +7.1 pp). In both, existing tuition policies still boost research and federal research grants still crowd it out.&lt;/p&gt;
&lt;h3 id="q5-how-does-this-relate-to-and-differ-from-prior-work"&gt;Q5. How does this relate to and differ from prior work?&lt;/h3&gt;
&lt;p&gt;It builds on equilibrium higher-education models — Epple, Romano &amp;amp; Sieg (2006) (quality maximization, exogenous endowment hierarchy, finite universities with market power) and Cai &amp;amp; Heathcote (2022) (competitive, constant-returns technology) — but endogenizes university R&amp;amp;D alongside teaching. A theoretical contribution is proving existence of a unique dynamic equilibrium with quality maximization and an endogenous college-quality hierarchy with a continuum of colleges; Cai &amp;amp; Heathcote argued no quality-maximization equilibrium exists when colleges are ex-ante identical (all want to be at the top), which this paper resolves via the endogenous knowledge hierarchy. It contributes to the economics of science / university-R&amp;amp;D literature by adding market-driven incentives, and to the basic-research-subsidy literature (Akcigit et al.) by showing universities have private incentives to do basic research, implying the need for government subsidy may be smaller than the standard Nelson/Arrow/Rosenberg view holds.&lt;/p&gt;
&lt;h3 id="q6-what-are-the-policy-implications-and-their-scope-conditions"&gt;Q6. What are the policy implications and their scope conditions?&lt;/h3&gt;
&lt;p&gt;Two main implications. First, a novel complementarity between equity and innovation: progressive need-based tuition aid compresses revenue dispersion across colleges, makes them more similar, steepens the quality-ladder, and raises research (+8.1% relative to a no-subsidy world; flat subsidy gives only ~one-quarter of that, +2.2%). Second, current meritocratic federal research grants partially crowd out internal research and raise educational inequality by concentrating resources at top schools; removing them cuts research by 69.1% (only 6.9 pp below the 76% federal share, the gap being the crowding-out). A revenue-neutral flat research subsidy would raise research by 14.8%, human capital 9.6%, and output 11.1%, eliminating the equity-innovation trade-off because it lowers research cost without altering market structure. Scope conditions: these are long-run steady-state comparisons in a calibrated model of 4-year public and private non-profit U.S. institutions; magnitudes depend on the hard-to-measure ωk and on the research-technology specification, as the robustness exercises show.&lt;/p&gt;
&lt;h3 id="q7-why-do-universities-fund-research-from-tuition-rather-than-patents-and-does-the-model-rationalize-it"&gt;Q7. Why do universities fund research from tuition rather than patents, and does the model rationalize it?&lt;/h3&gt;
&lt;p&gt;Because patent licensing is too small (median &amp;lt;2% of R&amp;amp;D, 1991-2018) to fund the &amp;gt;25% of R&amp;amp;D that is internal, and unrestricted operating funds are composed almost entirely of tuition (much of it from unrecovered facilities-and-administration costs on sponsored projects — roughly $7bn in 2018). The model rationalizes diverting tuition to research because research raises education quality and thus students&amp;rsquo; willingness to pay, so in a competitive sector students accept it. The model also replicates the joint pattern that higher-R&amp;amp;D universities are higher-ranked, attract wealthier and abler students, and charge higher tuition.&lt;/p&gt;
&lt;h3 id="q8-what-are-the-sources-of-inefficiency-in-the-model"&gt;Q8. What are the sources of inefficiency in the model?&lt;/h3&gt;
&lt;p&gt;Two. First, borrowing constraints prevent efficient sorting of students by ability (a social planner would send the ablest to the best colleges, but students are limited by parental capacity to pay). Second, university knowledge has positive spillovers to the real economy (calibrated ιk = 0.1) that colleges do not internalize, causing under-investment; however, quality-maximizing colleges face extra competitive incentives to do research, so net under- or over-investment is ambiguous and depends on stratification relative to spillover strength.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;College quality-ladder (Σq/Σk)&lt;/strong&gt;: The equilibrium cross-sectional elasticity of education quality with respect to a university&amp;rsquo;s intangible knowledge capital — a sufficient statistic for a university&amp;rsquo;s private incentive to invest in research. Steeper ladder (more stratification, tuition rising more with rank) means stronger research incentives.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intangible (knowledge) capital k&lt;/strong&gt;: Institution-specific intangible capital accumulated by investing in research (k&amp;rsquo; = k^γk eR^γe). It is primarily frontier knowledge and ideas exposed to students, but also networks, recruiting, labs, and methods; it can act purely as a reputation signal in the limiting case ωk→0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Research share (sR)&lt;/strong&gt;: The share of a university&amp;rsquo;s tuition revenue allocated to research in equilibrium (≈8.8% under existing policies). It increases with college forward-lookingness (βc) and the steepness of the quality-ladder, and decreases with the dispersion of intangible capital across colleges.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Crowding-out of internal research&lt;/strong&gt;: In the paper&amp;rsquo;s sense, the phenomenon whereby federal grants, by concentrating funds at top schools, raise the dispersion of research (Σk), flatten the quality-ladder (Σq/Σk), lower the research share, and thereby reduce universities&amp;rsquo; internal research spending — so total research rises less than the government&amp;rsquo;s funding share (69.1% decline vs 76% share on removal).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Equity-innovation complementarity&lt;/strong&gt;: The model&amp;rsquo;s finding that progressive need-based tuition aid, by compressing revenue dispersion and making colleges more similar, steepens competition and raises university research — so equity-promoting policy also boosts basic research, rather than trading off against it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Education-innovation gap (ωk calibration)&lt;/strong&gt;: Biasi &amp;amp; Ma&amp;rsquo;s (2021) measure of how frontier-current a university&amp;rsquo;s curriculum is, interpreted in the model as log(k). A one-unit decrease is associated with a 0.011% rise in graduate income; normalized by its school-level standard deviation of 0.85, it is used to pin down ωk via ωk·α = .011/.85·Σk.&lt;/p&gt;</description></item><item><title>(Not) Thinking About the Future: Financial Information and Maternal Labor Supply</title><link>https://macropaperwarehouse.com/papers/not-thinking-about-the-future-financial-information-and-maternal-labor-supply/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/not-thinking-about-the-future-financial-information-and-maternal-labor-supply/</guid><description>&lt;p&gt;This paper investigates whether information constraints — rather than fully forward-looking choices — contribute to mothers&amp;rsquo; reduced labor supply after childbirth, a key driver of gender inequality. The authors deploy two complementary methods in Switzerland: a representative descriptive survey of Swiss mothers aged 25–50, and a large-scale randomized controlled trial (RCT) among approximately 2,400 female public school teachers with children who work part-time.&lt;/p&gt;
&lt;p&gt;The descriptive survey first establishes that long-term financial factors are not top of mind for mothers making labor supply decisions: only about 11% of mothers spontaneously mention pensions or long-term career considerations when asked about their post-childbirth employment choices, compared to roughly half who mention child or own well-being. Beyond salience, the survey documents substantial misperceptions: 62% of women over-estimate pension receipt under part-time work by more than 10%, and a similar share believes wage growth under low part-time hours (40% FTE) is at least as high as under 80% employment. The authors label mothers with overly optimistic beliefs on both dimensions &amp;ldquo;cost-unaware&amp;rdquo;; 42% of the sample qualifies. Cost-unawareness is more prevalent among less-educated mothers and correlates with less financial interest and more gender-conservative attitudes.&lt;/p&gt;
&lt;p&gt;The RCT tests whether providing objective, individualized information shifts financial planning and labor supply. Teachers in treatment schools (two-thirds of all schools) were individually randomized into a treatment group viewing an informational video about the long-run earnings, pension, and life-event consequences of sustained part-time employment, plus access to a Future Calculator tool, or a placebo video on unrelated financial topics. The two-stage randomization (school-level first, then individual within treated schools) allows identification of both direct treatment effects and spillovers. Outcomes are measured in a Wave 1 post-video survey, a follow-up survey two months later, and linked administrative personnel records from the Department of Education one year post-intervention.&lt;/p&gt;
&lt;p&gt;Main findings: treated teachers are 31.26 percentage points (58% over the pure control mean) more likely to correctly rank the relative magnitude of long- versus short-term financial factors. Demand for financial planning tools rises by 0.39 standard deviations (SD) overall and by 0.31 SD among cost-unaware women specifically. In terms of stated labor supply plans, the treatment raises planned employment for the next academic year by 1.69 percentage points (ppt) in the full sample and by 4.95 ppt (9% over the pure control mean) among cost-unaware women. These plan effects persist two months later for cost-unaware women but fade for the full sample.&lt;/p&gt;
&lt;p&gt;Critically, stated plans translate into verified behavior: linked administrative data one year post-intervention show that cost-unaware teachers increase their contracted employment level by 3.87 ppt, or 7% over the pure control mean of 53.30% FTE. Cost-aware and overly pessimistic women do not reduce their labor supply upon learning they are better off than feared, an asymmetry consistent with agents responding more to perceived losses than gains. If the 3.87 ppt increase were sustained from age 40 onward, cost-unaware teachers would accumulate an additional 130,000 CHF in lifetime income and 40,000 CHF in pension wealth, shrinking the gender gap in lifetime income and pension receipt among teachers by approximately 18% each.&lt;/p&gt;
&lt;p&gt;The paper is scoped to Swiss female public school teachers — a population with linear pay scales, no part-time promotion penalty, and relatively low adjustment barriers — meaning the measured lifetime earnings and pension losses likely represent a lower bound relative to other occupations. Short-term RCT findings replicate among a sample of pregnant women in the general Swiss population, and the paper argues that similar labor supply adjustment magnitudes are feasible for a broader segment of part-time working mothers.&lt;/p&gt;
&lt;p&gt;Q: What is the central research question and why does it matter?
A: The paper asks whether mothers&amp;rsquo; post-childbirth reduction in labor supply is partly driven by information constraints — specifically, whether mothers fail to account for the full long-term financial consequences of working reduced hours. This matters because if the child penalty partly reflects uninformed choices rather than deliberate tradeoffs, standard policy tools (parental leave, childcare subsidies) may underperform precisely because their long-term financial benefits are not internalized.&lt;/p&gt;
&lt;p&gt;Q: How prevalent is cost-unawareness among Swiss mothers?
A: 62% of mothers in the descriptive survey over-estimate pension receipt under part-time work by more than 10%, a similar share believes wage growth under low part-time (40% FTE) is at least as high as under 80% employment, and 42% are overly optimistic on both dimensions simultaneously. Cost-unawareness follows an education gradient: 77% of low-education women over-estimate pension receipt versus 51% of high-education women.&lt;/p&gt;
&lt;p&gt;Q: What share of mothers spontaneously considers long-term financial factors when deciding on their labor supply?
A: Only about 11% of mothers mention any long-term financial factor (pensions, financial independence, long-term career considerations) in open-ended responses; the share is similarly low across education groups (6% low, 12% mid, 13% high). About 50% mention child or own well-being; roughly 30% raise short-term financial factors such as current childcare costs.&lt;/p&gt;
&lt;p&gt;Q: What are the actual long-term financial stakes of the average female teacher&amp;rsquo;s part-time employment pattern in Switzerland?
A: Compared to full-time employment, the average female teacher&amp;rsquo;s employment trajectory produces a 35% reduction in potential lifetime earnings (approximately 3.34 million CHF versus 5.12 million CHF). Monthly pension receipt under the part-time scenario is 31% lower overall and 43% lower from the occupational second-pillar scheme specifically — a gap comparable to the average 47.5% gender pension gap observed in the second pillar in Switzerland in 2024.&lt;/p&gt;
&lt;p&gt;Q: How was the RCT designed and what populations were included?
A: The study recruited 2,359 part-time working mothers employed as public school teachers in a German-speaking Swiss canton. A two-stage randomization assigned two-thirds of schools to treatment schools (within which teachers were individually randomized 50/50 to treatment or spillover control) and one-third to pure control schools. This design allows estimation of direct treatment effects and spillover effects. The intervention was timed to precede December–January, the period when teachers communicate their preferred employment levels for the next school year.&lt;/p&gt;
&lt;p&gt;Q: What was the treatment intervention?
A: Treated teachers watched an informational video following a representative female teacher considering an employment-level increase, covering the impact of part-time work on lifetime earnings, monthly pension receipt, and financial exposure after adverse events such as divorce; it also benchmarked these magnitudes against childcare costs. Treated teachers additionally received individualized access to the Future Calculator, an online projection tool developed with a Swiss bank, calibrated to teachers&amp;rsquo; deterministic salary and pension schedules.&lt;/p&gt;
&lt;p&gt;Q: Did treated teachers understand and retain the treatment information?
A: Yes. Treated teachers were 31.26 ppt (58% over the pure control mean) more likely immediately after the intervention to correctly rank long- versus short-term financial factors in a vignette. Two months later, the treatment group remained significantly more likely to apply the information correctly (22.63 ppt higher), indicating the knowledge was not short-lived.&lt;/p&gt;
&lt;p&gt;Q: How did demand for financial planning tools respond to the treatment?
A: The treatment raised a financial information/tools index by 0.39 SD overall. For cost-unaware women specifically, demand for financial tools rose by 0.31 SD; cost-aware and pessimistic women showed no significant change. There was no significant average treatment effect on sign-up for an incentivized financial consultation.&lt;/p&gt;
&lt;p&gt;Q: How large were the labor supply plan effects in the survey, and did they persist?
A: For the full sample, treated teachers planned a 1.69 ppt higher employment level for the next school year immediately after the treatment, and 3.13 ppt higher in 10 years. For cost-unaware women, the short-run planned increase was 4.95 ppt (9% over the pure control mean of about 55%), and plans for 5 and 10 years into the future rose by approximately 4 ppt (6–7% over the mean). The short-run effects for cost-unaware women persisted to the two-month follow-up, while full-sample short-run effects faded.&lt;/p&gt;
&lt;p&gt;Q: What do the linked administrative data show about actual labor supply one year post-intervention?
A: Cost-unaware women in the treatment group increased their contracted employment level by 3.87 ppt relative to the pure control group (7% over the pure control mean of 53.30% FTE), closely matching the planned increase stated immediately after the treatment. Cost-aware women and the full sample showed no statistically significant shift in actual hours.&lt;/p&gt;
&lt;p&gt;Q: What asymmetry did the authors observe between cost-unaware and cost-aware women?
A: Cost-unaware (overly optimistic) women increased their labor supply upon learning the true financial costs; cost-aware and overly pessimistic women did not reduce their labor supply upon learning they were better off than expected. The authors interpret this as consistent with agents responding more to perceived losses (bad news for cost-unaware women) than to gains (good news for pessimistic women), and with cost-aware women already having incorporated the financial logic into their decisions even without precise estimates.&lt;/p&gt;
&lt;p&gt;Q: What is the estimated lifetime impact of the observed labor supply adjustment?
A: If cost-unaware teachers maintain the 3.87 ppt employment increase from age 40 to retirement, they accumulate an additional 130,000 CHF in lifetime income and 40,000 CHF in pension wealth on average. This would reduce the gender gap in both lifetime income and pension receipt among teachers by approximately 18% each.&lt;/p&gt;
&lt;p&gt;Q: What emotional and social mechanisms did the paper document?
A: The treatment initially produced significantly negative emotional responses (−0.41 SD on an emotions index overall; −0.68 SD for cost-unaware women), consistent with cognitive dissonance from information conflicting with prior beliefs. Two months later, the treatment group reported feeling more in control and less stressed, and cost-unaware women returned to a neutral emotional baseline. Treated women were also 19.61 ppt more likely to have discussed the topic with anyone, with the largest effect on conversations with partners or family.&lt;/p&gt;
&lt;p&gt;Q: Did the treatment affect household-level labor supply — specifically, did partners reduce their hours?
A: No. The authors found no evidence that partners of cost-unaware women planned to work less in response to the treatment, and women did not plan to adjust future fertility. This suggests the observed hours increase by treated cost-unaware women was not offset by partner adjustments within the household.&lt;/p&gt;
&lt;p&gt;Q: Were there social spillover effects within schools?
A: Treated teachers were 11.59 ppt more likely to report having discussed the video with colleagues. Two months later, cost-unaware control teachers in treated schools (the spillover group) showed some evidence of absorbing the general treatment message and adjusting short-term labor supply plans upward, and a noisy increase in actual employment of roughly one-third the magnitude of the direct treatment effect, though these estimates were imprecise.&lt;/p&gt;
&lt;p&gt;Q: Why might cost-unaware women be uninformed in the first place?
A: In both the descriptive survey and the RCT sample, cost-unaware women lean more gender-conservative in their attitudes and report less interest in financial topics. The authors interpret this as suggesting a lack of information (rather than mere salience or forgetting) drives cost-unawareness, implying that passive information delivery through employers or pension funds could be effective.&lt;/p&gt;
&lt;p&gt;Q: What constraints to labor supply adjustment did the authors explore?
A: In a hypothetical scenario exercise, the scenario producing the largest desired employment increase for both treatment and control groups was if the partner were more engaged (roughly double the adjustment relative to a scenario of higher pay for additional hours). The treatment group adjusted their desired employment level by an additional 0.62–2.03 ppt relative to pure control across all scenarios except relaxing conservative gender norms.&lt;/p&gt;
&lt;p&gt;Q: How generalizable are the findings beyond the teacher sample?
A: The short-term RCT findings replicated among a sample of pregnant women in the general Swiss population. The authors also document that potential net gains from increasing labor supply — net of additional childcare costs — are large for the broader population of part-time working Swiss mothers, supporting feasibility of similar-magnitude adjustments outside teaching. The teaching context likely represents a lower bound for lifetime earnings and pension losses in other professions due to the absence of a part-time promotion penalty in teaching.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications?
A: The findings suggest that default exposure to individualized financial information about the long-term costs of part-time work — delivered by employers, pension funds, or the state — could improve decision quality and labor supply. More broadly, the results imply that policies designed to increase female labor supply (parental leave reforms, childcare subsidies) may underperform if mothers do not fully internalize the financial benefits of additional hours; ensuring that families solve the correct optimization problem is a precondition for unlocking the full potential of such policies.&lt;/p&gt;
&lt;p&gt;Child Penalty: The large and persistent reduction in women&amp;rsquo;s labor force participation and income following the birth of a first child, identified in the paper as the key driver of remaining gender inequality in the labor market in industrialized countries and a source of profound life-cycle financial consequences including reduced lifetime earnings and pension savings.&lt;/p&gt;
&lt;p&gt;Cost-Unaware: The authors&amp;rsquo; term for women who hold overly optimistic expectations about the financial consequences of part-time work — specifically, who over-estimate pension receipt under low part-time employment by more than 10% and who believe wage growth under low part-time is at least as high as under higher employment levels. In the descriptive survey 42% of mothers qualify on both dimensions.&lt;/p&gt;
&lt;p&gt;Future Calculator: An online individualized projection tool developed by the authors in cooperation with a Swiss bank, calibrated to teachers&amp;rsquo; deterministic salary and pension schedules, allowing users to estimate the long-term financial implications of different employment levels. Used both in the descriptive survey vignette and as part of the RCT treatment.&lt;/p&gt;
&lt;p&gt;Second Pillar (Occupational Pension Scheme, PP): Switzerland&amp;rsquo;s occupational pension scheme, the pillar most heavily affected by part-time work because contributions are directly proportional to earnings above a minimum annual earnings threshold. The paper documents an average gender pension gap of 47.5% in this pillar in 2024 and a 43% lower monthly pension receipt for the average female teacher&amp;rsquo;s part-time trajectory relative to full-time employment.&lt;/p&gt;
&lt;p&gt;Two-Stage Randomization: The experimental design used to separate direct treatment effects from spillover effects within schools. One-third of schools are assigned to a pure control group; in the remaining two-thirds, teachers are individually randomized into treatment or spillover control (untreated teachers in treated schools), enabling identification of both causal treatment impacts and social learning channels.&lt;/p&gt;
&lt;p&gt;Information Constraint: The paper&amp;rsquo;s central mechanism — mothers&amp;rsquo; failure to spontaneously account for the full long-term financial implications of reduced labor supply when making employment decisions, distinct from deliberate forward-looking tradeoffs. The authors document this both through the absence of long-term financial factors in open-ended decision narratives (only 11% of mothers mention them) and through systematic misperceptions of pension and wage outcomes.&lt;/p&gt;
&lt;p&gt;Cognitive Dissonance (as used in the paper): The authors use this term to describe the initial negative emotional response (−0.41 SD overall, −0.68 SD for cost-unaware women) when treated women learn that the true financial costs of part-time work are higher than they expected — information that conflicts with prior beliefs and prior choices, producing unpleasant emotions that subsequently reverse into lower stress levels two months later.&lt;/p&gt;</description></item><item><title>A choice-based approach to the measurement of inflation expectations</title><link>https://macropaperwarehouse.com/papers/a-choice-based-approach-to-the-measurement-of-inflation-expectations/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-choice-based-approach-to-the-measurement-of-inflation-expectations/</guid><description>&lt;p&gt;Standard survey-based measurement of inflation expectations relies on density forecasts in which respondents assign probabilities to pre-specified inflation bins; this method has been found to induce biases through its bin structure (suggesting that values near zero are more likely), to impose cognitive demands that raise dropout rates, and to become uninformative during high-inflation episodes when responses cluster in open-ended extreme bins—making cross-time and cross-country comparisons unreliable. This paper proposes a new choice-based elicitation method rooted in decision theory that uses a bisection process: respondents first state a minimum and maximum inflation level for which they see almost no chance of actual inflation falling outside the range, avoiding external anchors, and then answer a series of binary choices from which the relevant percentiles of their subjective distribution can be inferred. Two large surveys (UK and US) and a laboratory experiment demonstrate that the method leads to well-defined expectations that fulfil both subjective and objective quality criteria, that it is neither perceived as more difficult nor more time-consuming than the density forecast standard, and that—unlike density forecasts—it is robust to differences in the state of the economy, enabling comparisons across time and countries. The method is portable and can be applied to elicit distributions over other macroeconomic variables beyond inflation.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-specific-failures-of-density-forecasts-motivate-the-new-method"&gt;Q1. What specific failures of density forecasts motivate the new method?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper identifies four problems with the standard density forecast format: (i) bin-structure bias—narrower bandwidths around zero may lead respondents to infer that near-zero inflation is more likely by design, biasing responses toward zero; (ii) cognitive demands that raise dropout rates and may introduce selection bias; (iii) sensitivity to question wording and response-scale changes; and (iv) loss of informativeness during high-inflation episodes when responses bunch in extreme open-ended bins, compounded by the incompatibility of adjusted bin structures across survey waves.&lt;/strong&gt; During the recent surge in inflation these problems became especially visible, motivating a method more robust to the state of the inflation environment.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-choice-based-bisection-method-work"&gt;Q2. How does the choice-based bisection method work?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The method, building on Baillon (2008), elicits respondents&amp;rsquo; subjective inflation distribution via a series of binary choices structured as a bisection algorithm that partitions the state space into equally likely subevents, allowing the relevant percentiles of the distribution to be recovered without imposing external anchors.&lt;/strong&gt; The procedure begins by asking respondents for a minimum and maximum inflation level for which they believe there is &amp;ldquo;almost no chance&amp;rdquo; actual inflation falls outside the interval—avoiding the bin-structure bias of the density forecast by letting respondents define their own relevant range. Subsequent binary choices then narrow down the median, quartiles, and further quantiles according to a strict algorithm.&lt;/p&gt;
&lt;h3 id="q3-what-do-the-field-surveys-and-laboratory-experiment-show"&gt;Q3. What do the field surveys and laboratory experiment show?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Two large surveys—one in the UK testing feasibility across multiple protocol variants, one in the US—and a laboratory experiment demonstrate that the choice-based method produces well-defined expectations fulfilling both subjective and objective quality criteria, and is neither perceived as harder nor more time-consuming than the standard density forecast.&lt;/strong&gt; The UK survey compared two variants of the proposed &amp;ldquo;Midpoint method&amp;rdquo; against existing density forecast formats. The convergence on quality criteria across different samples and settings supports the method&amp;rsquo;s potential for adoption in large-scale central bank surveys.&lt;/p&gt;
&lt;h3 id="q4-what-makes-the-method-robust-to-the-state-of-the-economy-and-why-does-that-matter-for-monetary-policy"&gt;Q4. What makes the method robust to the state of the economy, and why does that matter for monetary policy?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In contrast to density forecasts, the choice-based method is robust to differences in the level and volatility of inflation because respondents define their own relevant range rather than choosing among fixed pre-specified bins, so the method does not become uninformative when actual inflation is far from the bins&amp;rsquo; central mass.&lt;/strong&gt; This robustness allows comparisons of inflation expectations distributions across time (including across high- and low-inflation regimes) and across countries—a feature density forecasts cannot deliver without adjusting bin structures in ways that compromise comparability. For monetary authorities that use survey expectations as both an indicator and a policy tool, this portability is a key advantage.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;density forecast&lt;/strong&gt; : the standard survey format in which respondents assign subjective probabilities to pre-specified inflation intervals (bins); the format used by the Federal Reserve Bank of New York&amp;rsquo;s Survey of Consumer Expectations and widely adopted by central banks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;choice-based elicitation (Midpoint method)&lt;/strong&gt; : the paper&amp;rsquo;s proposed alternative; a bisection procedure in which respondents first report a subjective min/max range and then answer binary choices, yielding quantiles of the subjective inflation distribution without imposing an external bin structure or anchors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;bisection process&lt;/strong&gt; : the algorithmic structure in which each binary choice partitions the remaining probability mass so that successive responses identify the median, quartiles, and further quantiles of the respondent&amp;rsquo;s subjective distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;bin-structure bias&lt;/strong&gt; : the distortion introduced by the density forecast&amp;rsquo;s pre-specified bins when narrower intervals near zero suggest to respondents that near-zero inflation is considered more likely by the survey designers, biasing their reported probabilities toward zero.&lt;/p&gt;</description></item><item><title>A Goldilocks Theory of Fiscal Deficits</title><link>https://macropaperwarehouse.com/papers/a-goldilocks-theory-of-fiscal-deficits/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-goldilocks-theory-of-fiscal-deficits/</guid><description>&lt;p&gt;This paper develops a tractable continuous-time model to study the fiscal sustainability of government deficits and the joint dynamics of public debt, with two main ingredients: an endogenous interest rate R that rises with the debt level through a convenience yield mechanism (savers value holding government bonds), and a potentially binding zero lower bound (ZLB) on the nominal interest rate. The paper&amp;rsquo;s central theoretical contribution is deriving the correct free-lunch condition: not the commonly cited $R &amp;lt; G$, but the stricter condition $R &amp;lt; G - \varphi$, where $\varphi$ captures the sensitivity of $R - G$ to debt. Even when $R &amp;lt; G$, accumulating more debt raises R through reduced convenience yields, and this endogenous feedback tightens fiscal sustainability. The paper maps the full deficit-debt space with a hump-shaped locus, analyzes ZLB dynamics where the deficit-debt relationship can invert, and studies the role of income inequality and tax policy. Calibrating to U.S. and Japan as of December 2019, the paper finds little room for free-lunch policies in the U.S. — a maximum permanent deficit of just over 2% of GDP at a stable debt-to-GDP ratio of 110% — while Japan is in the &amp;ldquo;inverted&amp;rdquo; ZLB regime where deficit increases can reduce debt through higher nominal growth.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a published paper based on the NBER working paper full text (w29707), AI-assisted, pending human review. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Mian, Straub, and Sufi construct a tractable deterministic continuous-time model with savers who derive convenience utility from holding government bonds, hand-to-mouth spenders, and a monetary authority that targets inflation (except at the ZLB), to systematically analyze when deficits can be &amp;ldquo;free lunches.&amp;rdquo; The core insight is that the standard r &amp;lt; g analysis treats interest rates as exogenous to the debt level, but if R rises as debt accumulates — through the declining marginal convenience yield of bonds — then the condition for a free-lunch policy is not R &amp;lt; G but R &amp;lt; G − φ. This matters empirically: the paper estimates φ (the debt-to-interest-rate sensitivity) from empirical estimates of the convenience yield elasticity, and calibrates the model to U.S. and Japan December 2019 conditions. The U.S. calibration finds a maximum free-lunch deficit of just over 2% of GDP at a stable debt ratio of 110%, implying the U.S. was barely inside the free-lunch region pre-Covid. By contrast, the paper finds ample free-lunch space for Japan and an &amp;ldquo;inverted&amp;rdquo; ZLB regime in which higher deficits can reduce the debt-to-GDP ratio by stimulating nominal growth. The analysis is extended to incorporate aggregate risk, capital, debt maturity structure, and inequality — each with distinct implications for the size and location of fiscal space.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-deficit-debt-diagram-and-what-is-the-free-lunch-condition"&gt;Q1. What is the deficit-debt diagram, and what is the free-lunch condition?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;The deficit-debt diagram is the locus of steady-state combinations of the primary deficit z and the debt-to-GDP ratio b, derived from the government budget constraint $\dot{b} = -(G^&lt;/em&gt; - R^&lt;em&gt;(b))b + z$ at steady state; this locus is hump-shaped, with the maximum sustainable permanent deficit z&lt;/em&gt; occurring at the debt level b&lt;/em&gt; where $R^&lt;em&gt;(b^&lt;/em&gt;) = G^* - \varphi(b^&lt;em&gt;)$.** The hump shape arises because at low debt levels the convenience yield is high (R is low relative to G, allowing large deficits), while at high debt levels the convenience yield is saturated (R rises toward G, leaving little deficit room). The left branch of the locus — where debt levels are below b&lt;/em&gt; — is the free-lunch region: any permanent increase in the deficit to a value below z* raises the steady-state debt level but requires no future tax increases. The right branch — debt above b* — is the conventional region: any deficit increase must eventually be accompanied by higher taxes. The key departure from the standard r &amp;lt; g analysis is that R is endogenous; Proposition 1 and Corollary 1 formally establish that the correct free-lunch threshold is $R^&lt;em&gt;(b_0) &amp;lt; G^&lt;/em&gt; - \varphi(b_0)$, not simply R &amp;lt; G.&lt;/p&gt;
&lt;h3 id="q2-why-is-r--g-insufficient-as-a-free-lunch-condition-and-what-does-φ-capture"&gt;Q2. Why is R &amp;lt; G insufficient as a free-lunch condition, and what does φ capture?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The condition R &amp;lt; G fails as a free-lunch criterion because, when the government borrows an additional dollar and rolls it over forever, it faces two opposing budget effects: a positive cash flow of G − R from rolling over the existing debt, and a tightening of the budget constraint from the endogenous rise in R on all infra-marginal outstanding debt; the parameter φ measures the magnitude of this second effect as the semi-elasticity of R − G with respect to the log of debt.&lt;/strong&gt; When φ is positive — as it is empirically because convenience yields are declining in debt supply — the net fiscal benefit of rolling over additional debt is G − R − φ, not G − R. An economy can exhibit R &amp;lt; G yet be in the conventional debt region if φ is sufficiently large that R &amp;gt; G − φ at the current debt level. The U.S. calibration illustrates this: the traditional R &amp;lt; G condition holds up to a debt ratio of 220% of GDP, but the stricter R &amp;lt; G − φ condition breaks down already at 110%, which is the actual boundary of the free-lunch region for the U.S.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-analysis-change-at-the-zero-lower-bound-and-what-is-the-inverted-fiscal-regime"&gt;Q3. How does the analysis change at the zero lower bound, and what is the &amp;ldquo;inverted&amp;rdquo; fiscal regime?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;At the ZLB, the direction of causality reverses: instead of the debt level determining the interest rate, the debt level determines the nominal growth rate G (via aggregate demand and the Phillips curve), creating an &amp;ldquo;inverted&amp;rdquo; regime in which higher deficits can reduce rather than increase debt by stimulating nominal growth and inflating away the debt.&lt;/strong&gt; The mechanism is: when the nominal rate is constrained at zero, fiscal expansion raises aggregate demand, which via the Phillips curve (slope κ) raises inflation, which raises nominal growth G, which accelerates the inflation of the debt ratio. If the fiscal multiplier times κ times the debt level exceeds one — a sufficient statistic condition — then higher deficits reduce the debt ratio. The paper finds this condition plausible for Japan (debt ratio ~225%, estimated κ = 0.1-0.3, multipliers of 1.5-2) but not for the U.S. in 2019. The deficit-debt locus in this regime is &amp;ldquo;backward-bending&amp;rdquo;: as the ZLB binds more tightly (lower debt), the locus can curve back and eventually allow the inverted relationship between deficits and debt.&lt;/p&gt;
&lt;h3 id="q4-how-does-income-inequality-affect-fiscal-space-and-why-does-the-zlb-reverse-the-sign"&gt;Q4. How does income inequality affect fiscal space, and why does the ZLB reverse the sign?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Outside the ZLB, greater income inequality (a larger income share of savers relative to hand-to-mouth spenders) expands fiscal space, because savers have a higher propensity to save, which reduces the natural interest rate R&lt;/em&gt; and thus raises G − R and allows larger sustainable deficits; at the ZLB, greater inequality shrinks fiscal space, because it reduces aggregate demand and hence nominal growth G rather than R.&lt;/em&gt;* Formally, outside the ZLB: $z(b) = (v&amp;rsquo;(b)(1-x-\mu) - \rho)b$, which increases as the spender share μ falls (Corollary 3). At the ZLB, nominal growth G becomes demand-determined via equation (20), and lower μ reduces demand, lowering G and hence z(b). The policy implication is a potential conflict between redistributive policies and deficit finance: redistribution (raising μ) reduces fiscal space outside the ZLB but expands it at the ZLB. The paper notes that roughly 69% of U.S. government debt held by households is directly or indirectly held by the top 10% of the wealth distribution, making savers&amp;rsquo; saving propensity the primary driver of the convenience yield.&lt;/p&gt;
&lt;h3 id="q5-what-are-the-us-and-japan-calibration-results-for-fiscal-space-as-of-december-2019"&gt;Q5. What are the U.S. and Japan calibration results for fiscal space as of December 2019?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;For the U.S. in December 2019, the model calibrates a maximum permanent primary deficit z&lt;/em&gt; of just over 2% of GDP at a stable debt-to-GDP ratio of b&lt;/em&gt; ≈ 110%, implying the U.S. was just inside the free-lunch region; for Japan, the model finds the economy in the inverted ZLB regime where higher deficits reduce debt by raising nominal growth.** The calibration uses empirical estimates of φ from the literature on convenience yield demand elasticities (Krishnamurthy and Vissing-Jorgensen 2012, Laubach 2009, Presbitero and Wiriadinata 2020). For the U.S., the standard r &amp;lt; g condition holds up to a debt ratio of 220% (the upper bound), but the binding free-lunch condition R &amp;lt; G − φ limits fiscal space to 110%. Deficits beyond the 2%-of-GDP limit must be financed by future tax increases or spending cuts, even though R &amp;lt; G throughout the range. The Japan calibration illustrates the ZLB regime: with a debt ratio already above 200%, the fiscal multiplier effect on inflation is large enough that the backward-bending locus applies, and Japan&amp;rsquo;s economy lies in the inverted region.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-analysis-extend-to-aggregate-risk-capital-crowding-out-and-debt-maturity"&gt;Q6. How does the analysis extend to aggregate risk, capital crowding-out, and debt maturity?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;With aggregate risk, the free-lunch condition R &amp;lt; G − φ remains informative: when the condition holds on average, free-lunch policies can be designed with probability approaching one; when it fails on average, no free lunch is possible.&lt;/strong&gt; The risk extension follows Mehrotra and Sergeyev (2020) and confirms numerically that the deterministic condition provides a valid signal for the stochastic case. Adding capital and crowding-out (Section 7.2) yields a counterintuitive finding: greater crowding-out of capital actually increases fiscal space by reducing the sensitivity of interest rates to debt (lower φ), because each additional unit of government debt displaces private capital rather than reducing convenience yields as sharply. Regarding debt maturity: issuing long-term debt reduces fiscal space at low debt levels (locking in higher interest costs), but increases it at high debt levels; this suggests that QE-style maturity shortening may constrain fiscal space as debt rises. These extensions confirm that the φ parameter — and the R &amp;lt; G − φ condition — is robust to a range of model ingredients, making it a practically useful criterion beyond the baseline model.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;free-lunch fiscal policy&lt;/strong&gt; : a permanent increase in the primary deficit that raises steady-state debt to a new higher level without requiring any future tax increases or spending cuts; feasible only when $R^&lt;em&gt;(b_0) &amp;lt; G^&lt;/em&gt; - \varphi(b_0)$, which is strictly tighter than the standard r &amp;lt; g condition when φ &amp;gt; 0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;debt-rate sensitivity (φ)&lt;/strong&gt; : the semi-elasticity of R − G with respect to the log of debt, capturing how much the endogenous convenience yield on government bonds falls (and hence interest rates rise) as the debt supply increases; the paper&amp;rsquo;s addition to the standard r &amp;lt; g framework that tightens the sustainability condition from R &amp;lt; G to R &amp;lt; G − φ; estimated empirically from convenience yield demand curves.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;deficit-debt diagram&lt;/strong&gt; : the hump-shaped locus of sustainable steady-state combinations of the primary deficit and the debt-to-GDP ratio; the left (increasing) branch is the free-lunch region where fiscal expansion is self-sustaining, and the right (decreasing) branch is the conventional region where fiscal expansion requires future tax increases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;inverted ZLB fiscal regime&lt;/strong&gt; : the case where the nominal interest rate is zero and the deficit-debt locus bends backward, so that higher deficits reduce rather than increase the debt ratio; occurs when the fiscal multiplier is large enough that deficit-induced nominal growth more than offsets the direct debt accumulation effect; found to apply to Japan as of December 2019 but not the U.S.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;convenience yield&lt;/strong&gt; : the non-pecuniary benefit savers derive from holding government bonds (capturing liquidity, safety, and regulatory premia), modeled as the utility function v(b) for savers; the mechanism making R endogenous to debt: as debt supply rises, the marginal convenience yield v&amp;rsquo;(b) falls, pushing R toward G and shrinking fiscal space.&lt;/p&gt;</description></item><item><title>A model of expenditure shocks</title><link>https://macropaperwarehouse.com/papers/a-model-of-expenditure-shocks/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-model-of-expenditure-shocks/</guid><description>&lt;p&gt;A common observation from account-level bank data is that low-income, low-liquidity households often use additional income to repay debt rather than consume, and that household-level consumption is extremely volatile even though aggregate consumption is smooth. This paper formalizes these patterns using four new facts from the PSID: household consumption is as volatile as income (contradicting PIH); the correlation between household consumption and income growth is only about 0.2 (low); consumption growth is negatively autocorrelated (contradicting both PIH and habit models); and—a finding new to the literature—the cross-sectional correlation between consumption and income growth is far smaller among households experiencing high consumption episodes than in the full sample. The paper proposes an explanation based on stochastic consumption thresholds: unanticipated shocks such as medical expenses or vehicle repairs create time-varying minimum-consumption floors whose violation incurs large utility costs, inducing households to prioritize expenditures on these needs over income-responsive consumption and to rebuild savings after the shock. This mechanism increases the welfare cost of income fluctuations by an order of magnitude relative to standard models.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-are-the-four-empirical-facts-and-why-do-they-challenge-standard-models"&gt;Q1. What are the four empirical facts and why do they challenge standard models?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Fact 1: for the average PSID household, consumption is as volatile as income; Fact 2: the correlation between consumption growth and income growth is about 0.2; Fact 3: household consumption growth is negatively autocorrelated; Fact 4 (new): the cross-sectional correlation between consumption and income growth is far smaller among households with high consumption than in the full sample.&lt;/strong&gt; Fact 1 contradicts the permanent income hypothesis (PIH), under which consumption should be smoother than income. Facts 1 and 2 together cannot both be explained by liquidity constraints (which would tie consumption to current income, producing a high correlation) or by very persistent income shocks (same problem). Fact 3 contradicts habit models (which generate positive autocorrelation) and is inconsistent with PIH (which implies zero autocorrelation). Fact 4 is novel: in standard models the level of consumption barely affects the income-consumption growth relationship, so this fact requires a new explanation.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-expenditure-shock-mechanism-and-how-does-it-rationalize-the-four-facts"&gt;Q2. What is the expenditure shock mechanism, and how does it rationalize the four facts?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The model introduces stochastic, time-varying consumption thresholds—representing unavoidable expenditures such as medical emergencies, vehicle breakdowns, or appliance repairs—that, if violated, incur large utility costs; this forces households to prioritize meeting these minimum needs over income-proportional consumption.&lt;/strong&gt; When a threshold shock hits, consumption jumps to meet it regardless of current income (explaining volatile, income-disconnected consumption). After the shock the household rebuilds savings, reducing consumption below its long-run level (generating negative autocorrelation). During high-consumption episodes (threshold shocks), income and consumption growth are decoupled (explaining Fact 4). Meanwhile, without a threshold shock, households are saving to self-insure against future shocks (explaining why low-income households save rather than consume when income rises).&lt;/p&gt;
&lt;h3 id="q3-what-does-the-model-imply-for-the-welfare-cost-of-income-fluctuations"&gt;Q3. What does the model imply for the welfare cost of income fluctuations?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The stochastic thresholds increase the welfare cost of income fluctuations by an order of magnitude relative to standard consumption models, because households must maintain precautionary buffers against the risk of hitting a threshold and being unable to meet it.&lt;/strong&gt; The large welfare cost arises from two sources: the direct cost of violating a threshold (large utility penalty), and the precautionary motive it creates, which forces households to save at the expense of current consumption utility even when no threshold shock is present.&lt;/p&gt;
&lt;h3 id="q4-what-empirical-evidence-does-the-paper-use-and-what-is-the-scope-of-the-findings"&gt;Q4. What empirical evidence does the paper use and what is the scope of the findings?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The PSID (post-1999 comprehensive consumption module) provides panel data on total household consumption and income; the authors use this to document all four facts, including the novel Fact 4.&lt;/strong&gt; The negative autocorrelation of consumption growth (Fact 3) is documented in the prior literature (Blundell et al. 2008) as indicative of preference shocks or measurement error, but the paper&amp;rsquo;s model gives it a structural interpretation as evidence of expenditure shocks. The finding that consumption is volatile yet disconnected from income (Facts 1 and 2) is robust to restricting attention to nondurable consumption, ruling out durable goods as the driver. The results hold at the household level; aggregate consumption is smooth because household threshold shocks are largely idiosyncratic and average out.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;stochastic consumption threshold&lt;/strong&gt; : a time-varying, unanticipated minimum consumption level (representing unavoidable expenditures like medical emergencies or vehicle repairs) whose violation incurs large utility costs; the paper&amp;rsquo;s key modeling innovation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;expenditure shock&lt;/strong&gt; : an unanticipated increase in the required minimum consumption level, representing events that force households to spend on necessities regardless of current income or savings; the proposed explanation for the four empirical facts about household consumption dynamics.&lt;/p&gt;</description></item><item><title>A Monetary-Fiscal Theory of Sudden Inflations</title><link>https://macropaperwarehouse.com/papers/a-monetary-fiscal-theory-of-sudden-inflations/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-monetary-fiscal-theory-of-sudden-inflations/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Why do sudden inflations and currency crises occur, while symmetric sudden deflations never do? The paper asks whether treating nominal government bonds as analogous to ordinary corporate bonds — with an asymmetric payoff structure capped at face value on the upside but exposed to real losses when fiscal surpluses are insufficient — can generate a unified theory of these crises endogenously from a single model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intellectual Lineage and Approach.&lt;/strong&gt; The paper sits at the intersection of two literatures. The first is the Fiscal Theory of the Price Level (FTPL), originating with Leeper (1991), Sims (1994), and Sargent and Wallace (1985), which links the real value of nominal government debt to expected future surpluses. The second is the safe-asset literature, where Holmstrom (2015) and Gorton (2017) explain that assets can circulate as safe stores of value precisely because their backing is costly to investigate and consumers rationally remain uninformed. The paper applies this information-economics logic to nominal government bonds, so that consumers normally hold bonds without investigating the government&amp;rsquo;s true fiscal capacity, and only pay the cost to investigate when real repayment doubts become sufficiently severe.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model Structure.&lt;/strong&gt; The model is a two-period reduced-form general equilibrium. In period 1, a representative consumer buys nominal government bonds at an interest rate set by the monetary authority. In period 2, the government must repay those bonds. The fiscal authority attempts to hit a price-level target P* by raising tax revenue, but faces a hard ceiling τ_max on the surplus it can collect — arising from Laffer limits on taxation, political constraints on austerity, or the need to fund financial-sector bailouts. The consumer has prior beliefs that τ_max is low (L) with probability π and high (H) with probability 1−π, and can pay a fixed utility cost γ to learn τ_max before deciding how many bonds to purchase.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bond Payoff Structure and Asymmetry.&lt;/strong&gt; The key mechanism is the asymmetric, bond-like real payoff of nominal government debt. If τ_max ≥ B1/P*, the government raises enough surplus to repay bonds fully in real terms at the price-level target; the real payoff is flat at face value (the &amp;ldquo;in-the-money&amp;rdquo; region). If τ_max &amp;lt; B1/P*, the government sets taxes to the ceiling τ_max and the price level rises above P* to balance the budget constraint, reducing the real payoff proportionally (the &amp;ldquo;default&amp;rdquo; region). Critically, because the nominal payoff is capped at face value, there is no upside region: governments will not run surpluses large enough to deliver a windfall to bondholders, so sudden deflations — analogous to a corporate bond being worth more than face value — cannot occur. This asymmetry is the direct source of the one-sided nature of crises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Two Illustrative Mechanisms for Sudden Inflations.&lt;/strong&gt; The paper numerically and analytically characterizes two triggering scenarios:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Lower surplus expectations (fiscal stress narrative, corresponding to Burnside et al. 2001 on the 1997 Asian crisis)&lt;/em&gt;: As the probability π of a low future surplus (e.g., from a prospective banking-sector bailout) rises, the value of information about τ_max increases. In the numerical example (i = 0.05, γ = 0.13, L = 0.1), the value of information equals the cost γ at π = 0.15. For π above 0.15, consumers pay to investigate, learn τ_max = L, and refuse to purchase bonds beyond what will be repaid in real terms (B1 = τ_max = L = 0.1). The price level in period 1 rises discontinuously as a function of π at this threshold.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Interest rate increases (speculative attack narrative)&lt;/em&gt;: As the monetary authority raises the interest rate to defend a currency, consumers demand more bonds. Larger bond quantities increase the risk that surpluses will be insufficient, raising the value of fiscal information. In the numerical example (π = 0.5, γ = 0.24, 1+i ∈ [1, 1.2]), the value of information equals γ at 1+i = 1.1 (i.e., i = 10%). For interest rates above this threshold, consumers learn τ_max = L, restrict bond purchases to what will be repaid, and the price level in period 1 jumps discontinuously. Further interest rate increases above the threshold produce only upward drift in the price level, not additional monetary tightening effects — illustrating the limits of monetary policy in fiscally stressed environments.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Theoretical Results.&lt;/strong&gt; Two formal theorems establish generality. Theorem 1 shows that, given bond demand B1(π) such that L &amp;lt; B1 for all π ∈ (0,1), there exist thresholds k and γ &amp;gt; 0 such that the period-1 price level P1 is discontinuous as a function of π on (0, k]. Theorem 2 establishes an analogous discontinuity in P1 as a function of the interest rate i, given that B1(i) &amp;gt; L for all i in the relevant range.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; The model is a two-period reduced form that abstracts from dynamics, multiple maturities, and secondary market trading. The informational friction is a fixed binary cost γ, not a richer signal structure. The results depend on the existence of a binding surplus ceiling τ_max; when the government is far from this ceiling (i.e., consumers&amp;rsquo; beliefs are far from the &amp;ldquo;default boundary&amp;rdquo;), shocks produce only small, smooth price-level changes. Large discontinuous price-level jumps require the economy to be near the kink point of the bond payoff curve.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-fundamental-analogy-that-drives-the-papers-theory-and-what-economic-literature-does-it-build-on"&gt;Q1. What is the fundamental analogy that drives the paper&amp;rsquo;s theory, and what economic literature does it build on?&lt;/h3&gt;
&lt;p&gt;The paper analogizes nominal government bonds to corporate bonds (following Sargent 1982&amp;rsquo;s advice that &amp;ldquo;government debt is valued according to the same economic considerations that give private debt value&amp;rdquo;). Like a corporate bond, the nominal government bond pays its face value if the underlying project (government fiscal capacity) delivers a surplus at least equal to the face value, but pays only a share of the realized surplus if the surplus falls short. This bond-like payoff — flat on the upside, proportional to outcomes on the downside — is the direct source of asymmetric crisis dynamics. The paper combines this with Holmstrom (2015) and Gorton (2017)&amp;rsquo;s framework in which safe assets function because their backing is costly to investigate, so consumers rationally remain uninformed in normal times.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-key-information-friction-and-how-does-it-generate-the-switch-between-normal-times-and-crisis"&gt;Q2. What is the key information friction, and how does it generate the switch between &amp;ldquo;normal times&amp;rdquo; and crisis?&lt;/h3&gt;
&lt;p&gt;In normal times, consumers are confident that the government&amp;rsquo;s future maximum surplus τ_max is sufficient to repay bonds in real terms. The fixed utility cost γ of investigating the true surplus exceeds the benefit, so consumers remain uninformed and bonds trade at a price reflecting only uninformed prior beliefs. A crisis arises when the value of information V(.) rises above γ — either because the probability of a low surplus state rises (fiscal stress) or because the interest rate rises and consumers demand more bonds, bringing them closer to the repayment boundary. Once V &amp;gt; γ, consumers investigate and, upon learning τ_max = L (low surplus), refuse to hold bonds that will not be repaid in real terms, triggering a discrete upward jump in the price level.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-bond-payoff-structure-explain-the-absence-of-sudden-deflations"&gt;Q3. How does the bond payoff structure explain the absence of sudden deflations?&lt;/h3&gt;
&lt;p&gt;The real payoff of a nominal government bond cannot exceed its face value: the bond is capped at face value on the upside because the government will not voluntarily raise tax surpluses to deliver a windfall to bondholders. In the event that surpluses turn out to be higher than needed (τ_max ≥ B1/P*), the government simply sets taxes to exactly repay the bonds at P* and returns no additional real value to bondholders. This is the flat portion of the payoff curve. Because there is no upside kink — no region where learning that τ_max is unexpectedly large causes the price level to fall sharply — there is no mechanism for sudden deflations symmetric to sudden inflations. The 1933 U.S. episode (Jacobson et al. 2019) is cited: when deﬂation from leaving gold would have required fiscal austerity for full real repayment, Roosevelt chose to exit the gold standard rather than allow deflation.&lt;/p&gt;
&lt;h3 id="q4-how-does-the-first-numerical-example-lower-surplus-expectations-work-quantitatively"&gt;Q4. How does the first numerical example (lower surplus expectations) work quantitatively?&lt;/h3&gt;
&lt;p&gt;The baseline parameters are: i = 0.05, γ = 0.13, L = 0.1, H ≈ ∞, P* = 1, e1 = e2 = 1, B0 = 1, τ1 = 0.8, β = 1. The analysis is restricted to π ∈ (0, 0.3]. As π (probability that τ_max = L) rises, the value of information V(.) rises. At π = 0.15, V equals the cost γ = 0.13. For π &amp;gt; 0.15, consumers pay to investigate and, upon learning τ_max = L, purchase only B1 = L = 0.1 in bonds — the amount that will be repaid — causing the period-1 price level P1 to jump discontinuously from approximately 0.95 to approximately 1.13. For π ≤ 0.15, consumers remain uninformed and P1 rises only smoothly from below 1 as π increases (fewer bonds demanded as repayment risk rises, even without investigation).&lt;/p&gt;
&lt;h3 id="q5-how-does-the-second-numerical-example-interest-rate-increase-work-quantitatively-and-what-does-it-imply-for-monetary-policy"&gt;Q5. How does the second numerical example (interest rate increase) work quantitatively, and what does it imply for monetary policy?&lt;/h3&gt;
&lt;p&gt;With π = 0.5, γ = 0.24, and 1+i ∈ [1, 1.2], as the monetary authority raises the interest rate, consumers demand more bonds, increasing real repayment risk and the value of information. At 1+i = 1.1 (i.e., i = 10%), V equals γ. For 1+i &amp;gt; 1.1, consumers investigate and learn τ_max = L; they then only purchase bonds up to the repayment limit, causing P1 to jump discontinuously to approximately 1.15. For interest rates above the threshold, further increases yield only a smooth upward slope in P1 (bond purchases are fixed in real amount but nominal revenue falls). This illustrates that the monetary authority&amp;rsquo;s ability to use higher interest rates to lower the price level is limited by the surplus constraint: once the interest rate is high enough to trigger consumer investigation and a fiscal crisis, raising rates further is inflationary rather than deflationary.&lt;/p&gt;
&lt;h3 id="q6-what-are-the-two-regions-of-the-deterministic-model-and-how-do-they-differ-in-fiscal-and-price-level-dynamics"&gt;Q6. What are the two regions of the deterministic model and how do they differ in fiscal and price-level dynamics?&lt;/h3&gt;
&lt;p&gt;In the deterministic version (1-π = 0, so τ_max = L with certainty, and there is no uncertainty), the model produces two distinct regions. In the &amp;ldquo;insufficient surplus&amp;rdquo; region where τ_max &amp;lt; B1/P*, the fiscal authority sets taxes to their maximum τ_max, the real payoff of bonds is τ_max/B1 &amp;lt; 1, the period-1 price level P1 = B0/(βτ_max), and real bond revenue Π = βτ_max (constant in τ_max). Selling additional bonds does not raise additional real revenue because any extra bonds lead to a proportional rise in P2 and a fall in Q. In the &amp;ldquo;sufficient surplus&amp;rdquo; region where τ_max ≥ B1/P*, the government meets its fiscal target (τ2 = B1/P*), P2 = P* is hit, P1 = βB1/(B0P*), and Π = βB1/P* (increasing in B1). In this region, selling additional bonds does raise real revenue and lowers P1 as the government absorbs more money.&lt;/p&gt;
&lt;h3 id="q7-what-are-the-two-interest-rate-regions-in-the-deterministic-model-and-what-is-their-implication-for-monetary-policy-effectiveness"&gt;Q7. What are the two interest rate regions in the deterministic model, and what is their implication for monetary policy effectiveness?&lt;/h3&gt;
&lt;p&gt;Using B1 = B0(1+i) (debt rolled over at the chosen rate), the monetary authority has two interest-rate regions. In the &amp;ldquo;constrained&amp;rdquo; region where 1+i &amp;gt; τ_max P*/B0 (the surplus ceiling binds), raising i does not change the period-2 surplus (τ2 = τ_max), does not change real revenue (Π = βτ_max), and does not affect P1 — but raises P2 above the target P*. In the &amp;ldquo;unconstrained&amp;rdquo; region where 1+i ≤ τ_max P*/B0, raising i increases bond demand, increases real surplus backing, raises real revenue, and lowers P1 while P2 = P* is maintained. The boundary between these regions determines the limit of monetary policy: the monetary authority can reduce P1 by raising i only up to the point where the surplus ceiling would be hit.&lt;/p&gt;
&lt;h3 id="q8-how-does-the-paper-relate-to-and-extend-prior-ftpl-literature"&gt;Q8. How does the paper relate to and extend prior FTPL literature?&lt;/h3&gt;
&lt;p&gt;The paper is grounded in the FTPL of Leeper (1991), Sims (1994), and Cochrane (2005, 2020), in which the price level is determined by the requirement that real government liabilities equal the present value of future surpluses. The paper&amp;rsquo;s contribution is to make the information structure endogenous: consumers&amp;rsquo; beliefs and their decision to acquire fiscal information determine whether or not the FTPL logic is operative. In normal times (consumers uninformed), the price level does not respond to changes in the maximum surplus — a result that resembles the &amp;ldquo;Ricardian&amp;rdquo; or non-FTPL regime. When consumers investigate and learn the surplus is insufficient, the connection between the surplus and the price level is restored, reproducing FTPL-type dynamics. This provides an endogenous, single-model rationale for the regime-switching behavior between FTPL and non-FTPL environments documented empirically in Bianchi and Melosi (2013, 2017) and Davig and Leeper (2006).&lt;/p&gt;
&lt;h3 id="q9-what-is-the-welfare-role-of-consumer-ignorance-in-this-framework"&gt;Q9. What is the welfare role of consumer ignorance in this framework?&lt;/h3&gt;
&lt;p&gt;Consumer ignorance of the government&amp;rsquo;s true surplus plays a dual role. On one hand, ignorance is individually rational in normal times because the cost γ of investigating exceeds the benefit V (.) when beliefs are comfortably away from the default boundary. On the other hand, following Dang et al. (2017), informed knowledge of the safe asset&amp;rsquo;s backing destroys the symmetric ignorance that supports the asset&amp;rsquo;s role as a safe store of value, reducing welfare. In this model the concern is repayment risk rather than adverse selection: the consumer fears not being repaid in real terms and chooses to investigate when that risk is sufficiently high, potentially triggering the very crisis they feared.&lt;/p&gt;
&lt;h3 id="q10-what-are-the-scope-conditions-and-limitations-of-the-model"&gt;Q10. What are the scope conditions and limitations of the model?&lt;/h3&gt;
&lt;p&gt;The model is explicitly a two-period reduced form designed to illustrate the bond-payoff mechanism in the simplest possible setting. It abstracts from: multi-period bond maturities and secondary market trading; rich heterogeneity among consumers; endogenous monetary and fiscal policy responses beyond the simple rules specified; and the general equilibrium interactions between inflation, output, and labor markets. The information cost γ is modeled as a fixed binary cost rather than a continuous or richer signal structure. The results on discontinuous price-level jumps hold when bond demand is sufficiently large relative to L (i.e., L &amp;lt; B1), ensuring genuine repayment risk; when surpluses are very large relative to bond liabilities, no crisis dynamics arise.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Maximum Surplus (τ_max).&lt;/strong&gt; The paper&amp;rsquo;s name for the hard ceiling on the net tax revenue (taxes minus money transfers) the government can collect in the second period. This ceiling can arise from a Laffer limit on taxable income, political-economy constraints on austerity, or from a banking crisis requiring government transfers to bail out the financial sector. It is the paper&amp;rsquo;s analogue of a project&amp;rsquo;s liquidation value: the maximum the &amp;ldquo;project&amp;rdquo; (the government) can deliver to bondholders.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bond-Like Payoff of Nominal Government Debt.&lt;/strong&gt; The paper&amp;rsquo;s central structural claim: the real payoff to holding a nominal government bond is capped at face value on the upside (the government will not raise surpluses beyond what is needed to repay bonds at the price-level target) but falls proportionally below face value when τ_max is insufficient for full real repayment. This is precisely the payoff structure of a standard corporate bond — flat on the upside, proportional to recovery on the downside — and it is the source of the asymmetry between sudden inflations and the absence of sudden deflations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Value of Information (V(.)).&lt;/strong&gt; Defined as the difference in expected utility between a consumer who learns the true τ_max before making bond-purchase decisions and one who remains uninformed and acts only on prior beliefs π, 1−π. The consumer investigates if and only if V(.) &amp;gt; γ. V is zero when beliefs are certain (limπ→0 and limπ→1), can be hump-shaped in π, and is increasing in the interest rate i (through its effect on bond demand). The threshold condition V = γ defines the boundary between &amp;ldquo;normal times&amp;rdquo; (no investigation) and crisis (investigation and possible sudden inflation).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous Information Structure.&lt;/strong&gt; The paper&amp;rsquo;s term for the property that whether consumers choose to learn the government&amp;rsquo;s fiscal capacity is itself determined within the model by the parameters of the economy (the interest rate, prior beliefs, the cost of investigation). This contrasts with models that exogenously specify whether agents are informed or not. The endogenous information structure is the mechanism by which the paper generates the two apparent regimes (FTPL-active vs. FTPL-dormant) from a single unified model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Default Boundary.&lt;/strong&gt; The kink point in the bond payoff curve at τ_max = B1/P*: the level of the maximum surplus at which the government exactly repays bonds in real terms at the price-level target. When beliefs or bond quantities place the economy near the default boundary, small changes in π or i can push the economy across it, triggering large price-level responses. When the economy is far from the boundary (τ_max comfortably above B1/P*), small shocks have only small smooth effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sudden Inflation / Currency Crisis (as defined in this paper).&lt;/strong&gt; A discrete, discontinuous jump in the period-1 price level P1 that occurs when consumers pass the threshold V(.) = γ and investigate the government&amp;rsquo;s fiscal capacity, finding surpluses to be insufficient. The mechanism is: informed consumers refuse to hold bonds they know will not be repaid in real terms at P*, forcing the price level to jump to clear the government&amp;rsquo;s budget constraint with fewer bonds outstanding. The paper treats sudden inflations and currency crises as the same mechanism in different institutional contexts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Repayment Risk Premium.&lt;/strong&gt; The markup above the risk-free rate that consumers require on government bonds to compensate for the probability that the government&amp;rsquo;s surplus will be insufficient for full real repayment (i.e., the probability that the economy is in the τ_max &amp;lt; B1/P* region). This premium is present even when consumers are uninformed (i.e., do not know which state of τ_max will occur), and is reflected in the consumer&amp;rsquo;s first-order condition for bond demand.&lt;/p&gt;</description></item><item><title>A Preferred-Habitat Model of Term Premia, Exchange Rates, and Monetary Policy Spillovers</title><link>https://macropaperwarehouse.com/papers/a-preferred-habitat-model-of-term-premia-exchange-rates-and-monetary-policy-spillovers/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-preferred-habitat-model-of-term-premia-exchange-rates-and-monetary-policy-spillovers/</guid><description>&lt;h2 id="layer-1--core-argument"&gt;Layer 1 — Core Argument&lt;/h2&gt;
&lt;p&gt;The paper develops a two-country preferred-habitat model in which currency and bond markets are populated by different investor clienteles — currency traders with price-elastic demand for foreign assets, and bond investors whose preferences are habitat-specific by country and maturity — with segmentation partly overcome by global arbitrageurs who have limited capital and bear mean-variance risk. Risk premia in the model are time-varying, connected across markets, and consistent with the empirical violations of Uncovered Interest Parity (UIP) and the Expectations Hypothesis (EH): in particular, currency carry trade (CCT) and bond carry trade (BCT) strategies earn abnormally high expected returns in ways that co-vary across the two markets in a manner the standard frictionless model cannot generate. Through these time-varying, connected risk premia, large-scale bond purchases (QE) lower domestic bond yields, lower foreign bond yields, and depreciate the purchasing country&amp;rsquo;s currency; short-rate cuts also lower foreign yields, but with smaller effects than bond purchases. A key structural finding, quantified in the estimated model calibrated to US and Eurozone data, is that currency returns are nearly uncorrelated with long-maturity bond returns — an exchange-rate disconnect — yet the currency market is instrumental in transmitting bond demand shocks across countries, because arbitrageurs hedge their cross-currency positions in bond markets and vice versa. Sterilized foreign-exchange interventions have strong effects on the exchange rate but weak effects on bond yields, while QE/QT has weak effects on the exchange rate but sizeable effects on foreign bond yields — a sharp asymmetry that follows directly from the disconnect.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-why-do-uip-and-eh-fail-in-the-standard-model-and-what-changes-in-this-model"&gt;Q1. Why do UIP and EH fail in the standard model, and what changes in this model?&lt;/h3&gt;
&lt;p&gt;In the standard model with perfect capital mobility, risk premia are constant, so the yield curve depends only on expectations of the domestic short rate and the exchange rate absorbs short-rate differentials exactly. In this model, arbitrageurs bear the residual risk when currency traders and bond clienteles are unwilling to absorb excess supply or demand at prevailing prices. Because arbitrageurs have limited capital (captured by a risk-aversion parameter &lt;em&gt;a&lt;/em&gt; ≥ 0 that can also represent capital or Value-at-Risk constraints in reduced form), they demand compensation — time-varying risk premia — for holding currency and maturity risk. When &lt;em&gt;a&lt;/em&gt; = 0, arbitrageurs are risk-neutral, UIP and EH both hold, and the model collapses to the standard frictionless benchmark.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-three-types-of-agents-and-what-does-each-do"&gt;Q2. What are the three types of agents and what does each do?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;Currency traders&lt;/em&gt; hold foreign assets and have a demand that is downward-sloping (price-elastic, with slope coefficient αe ≥ 0) in the log exchange rate; their demand also shifts with a stochastic currency demand factor γt. They can be interpreted as households engaged in expenditure switching or central banks managing reserve levels. &lt;em&gt;Bond investors&lt;/em&gt; form clienteles, each with a preferred-habitat demand for bonds of a specific country and maturity that is downward-sloping in the log bond price (slope αj(τ)) and shifts with a country-specific bond demand factor βjt; examples are pension funds and insurance companies whose liabilities are long-dated and denominated in their home currency. &lt;em&gt;Global arbitrageurs&lt;/em&gt; trade the currency and all bonds of both countries, maximizing mean-variance utility over instantaneous wealth changes; they bridge the segmented markets and their positions pin down equilibrium risk premia.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-equilibrium-structure-and-which-factors-drive-prices"&gt;Q3. What is the equilibrium structure and which factors drive prices?&lt;/h3&gt;
&lt;p&gt;The equilibrium exchange rate and bond prices are log-affine functions of five stochastic factors: the home short rate iHt, the foreign short rate iFt, the currency demand factor γt, and the two bond demand factors βHt and βFt. These factors follow a mean-reverting (Ornstein-Uhlenbeck) system. The equilibrium is characterized by a scalar nonlinear system (25 equations in the general case) whose solution pins down the loadings of prices on each factor. This affine structure means each asset&amp;rsquo;s risk premium is the product of the arbitrageur&amp;rsquo;s risk-aversion coefficient, the factor covariance matrix, and arbitrageur net positions, which are themselves determined by market-clearing.&lt;/p&gt;
&lt;h3 id="q4-how-does-a-conventional-short-rate-cut-transmit-domestically-and-internationally-in-the-model"&gt;Q4. How does a conventional short-rate cut transmit domestically and internationally in the model?&lt;/h3&gt;
&lt;p&gt;Following a home short-rate cut, arbitrageurs find it attractive to enter the CCT — borrow home currency, invest in foreign currency. If currency traders&amp;rsquo; demand is price-elastic (αe &amp;gt; 0), arbitrageurs&amp;rsquo; equilibrium foreign-currency holdings rise, and the expected return on the CCT rises too (arbitrageurs must be compensated for the increased risk). This &lt;em&gt;attenuation effect&lt;/em&gt; means the foreign currency appreciates less than implied by UIP: the exchange rate response is dampened. Simultaneously, arbitrageurs enter the home BCT (borrow at the home short rate, invest in long home bonds); if home bond investors&amp;rsquo; demand is price-elastic (αH(τ) &amp;gt; 0), arbitrageurs&amp;rsquo; long-bond holdings rise and the BCT&amp;rsquo;s expected return rises, attenuating the transmission to domestic long-maturity yields (which fall less than EH would imply). A &lt;em&gt;propagation effect&lt;/em&gt; to foreign bond yields arises through arbitrageur hedging: by taking long positions in foreign currency (CCT), arbitrageurs become exposed to the risk that the foreign short rate drops and the foreign currency depreciates; long-maturity foreign bonds provide a natural hedge (their price rises when the foreign short rate drops), so arbitrageurs increase foreign bond demand, depressing foreign yields. This international transmission of conventional policy is absent from the standard model.&lt;/p&gt;
&lt;h3 id="q5-how-does-unconventional-policy-qeqt-transmit-domestically-and-to-the-exchange-rate-and-foreign-yields"&gt;Q5. How does unconventional policy (QE/QT) transmit domestically and to the exchange rate and foreign yields?&lt;/h3&gt;
&lt;p&gt;Following QE purchases of home bonds, their prices rise; arbitrageurs accommodate by holding fewer home bonds, which reduces their exposure to home short-rate risk. With less home-rate risk, arbitrageurs become more willing to hold foreign currency (which depreciates when the home short rate rises, offering a natural hedge against the home rate risk they have shed). The increased foreign-currency position in turn makes arbitrageurs more willing to hold foreign bonds (which hedge the foreign-currency position against foreign rate changes). The net result in the model is: QE lowers domestic bond yields, lowers foreign bond yields, and depreciates the home currency. The quantitative finding from the estimated model is that QE/QT effects on foreign bond yields are sizeable and stronger than those of conventional short-rate policy.&lt;/p&gt;
&lt;h3 id="q6-what-explains-the-exchange-rate-disconnect-and-how-can-the-currency-market-still-transmit-bond-demand-shocks"&gt;Q6. What explains the exchange-rate disconnect, and how can the currency market still transmit bond demand shocks?&lt;/h3&gt;
&lt;p&gt;In the estimated model, variance decompositions reveal that long-maturity bond yields in each country are driven primarily by bond demand factors (βHt and βFt), while the exchange rate is driven primarily by the currency demand factor (γt); short rates account for a small fraction of movements in both, and each factor type accounts for negligible variation in the other asset class&amp;rsquo;s price. The disconnect between bond yields and the exchange rate arises because bond demand shocks in the two countries move the exchange rate in &lt;em&gt;opposite&lt;/em&gt; directions — a home bond demand shock that lowers home yields also raises the exchange rate via arbitrageur hedging, while a foreign bond demand shock moves the exchange rate in the opposite direction. These offsetting effects make the exchange rate nearly uncorrelated with long-maturity bond yields. However, bond demand shocks in one country are transmitted to bond yields in the &lt;em&gt;other&lt;/em&gt; country through the currency market: arbitrageurs hedge their bond positions using the currency, so a shock to home bond demand moves arbitrageurs&amp;rsquo; currency positions, which in turn affects their willingness to hold foreign bonds. Cross-country bond yield comovement is therefore positive and sizeable, despite the exchange-rate disconnect.&lt;/p&gt;
&lt;h3 id="q7-what-are-the-models-implications-for-foreign-exchange-intervention"&gt;Q7. What are the model&amp;rsquo;s implications for foreign exchange intervention?&lt;/h3&gt;
&lt;p&gt;A sterilized purchase of foreign currency by the home or foreign central bank — which shifts the currency demand factor — has strong effects on the exchange rate but weak effects on bond yields. This follows directly from the variance decomposition: the exchange rate loads heavily on the currency demand factor and bond yields load lightly on it. The asymmetry mirrors the QE result in reverse: QE shifts bond demand factors, which load heavily onto bond yields and lightly onto the exchange rate; FX intervention shifts the currency demand factor, which loads heavily onto the exchange rate and lightly onto bond yields. The model thus delivers a sharp policy instrument separation between QE/QT (primarily a bond yield tool) and FX intervention (primarily an exchange-rate tool), with each having spillovers in the other dimension that are quantitatively weaker.&lt;/p&gt;
&lt;h3 id="q8-how-is-the-relationship-between-currency-risk-premia-and-bond-risk-premia-captured-and-what-empirical-regularities-does-the-model-match"&gt;Q8. How is the relationship between currency risk premia and bond risk premia captured, and what empirical regularities does the model match?&lt;/h3&gt;
&lt;p&gt;The model&amp;rsquo;s risk premia are linked through the shared arbitrageur portfolio: the price of each risk factor is proportional to the covariance between that factor and the arbitrageur&amp;rsquo;s overall portfolio return, so a shock that changes arbitrageurs&amp;rsquo; currency positions also changes the compensation required for bond positions, and vice versa. The estimated model is reported to match closely the violations of UIP (CCT profitability) and EH (BCT profitability) documented in the literature, and the ways in which these violations are connected — including findings that yield-curve slope differentials predict CCT profitability, and that CCT profitability declines when carried out with long-maturity rather than short-maturity bonds. These matches are described as consistent with the empirical regularities, not structural identification of the underlying causes.&lt;/p&gt;
&lt;h3 id="q9-what-is-the-role-of-segmented-versus-global-arbitrage-and-why-does-the-distinction-matter"&gt;Q9. What is the role of segmented versus global arbitrage, and why does the distinction matter?&lt;/h3&gt;
&lt;p&gt;The paper considers both cases. Under &lt;em&gt;segmented arbitrage&lt;/em&gt;, separate arbitrageur pools operate in the currency market (risk aversion ae), home bond market (aH), and foreign bond market (aF); first-order conditions for each pool reflect only their own portfolio risk, so the prices of risk factors differ across markets. Under &lt;em&gt;global arbitrage&lt;/em&gt;, a single pool of arbitrageurs trades all assets, and their shared portfolio means the price of each risk factor is the same across currency and bond markets — this is the mechanism through which bond demand shocks in one country propagate through the currency market to bond yields in the other. Global arbitrage is the primary specification; segmented arbitrage serves as a benchmark to isolate the hedging-based transmission channel that requires global positions.&lt;/p&gt;
&lt;h3 id="q10-how-does-the-model-relate-to-and-extend-predecessor-frameworks"&gt;Q10. How does the model relate to and extend predecessor frameworks?&lt;/h3&gt;
&lt;p&gt;The model extends Vayanos and Vila (2021) — a closed-economy preferred-habitat yield curve model — to two countries by adding a currency market and a second country&amp;rsquo;s bond market, with arbitrageurs who are global rather than country-specific. In the currency dimension, the attenuation of UIP deviations parallels Gabaix and Maggiori (2015), which models exchange-rate dynamics with financially constrained intermediaries but without a yield curve. The two-country structure allows the paper to simultaneously study term premia (EH violations), exchange rate dynamics (UIP violations), and their connection, and to quantify the effects of QE, conventional monetary policy, and FX intervention within a single internally consistent framework estimated on US-Eurozone data.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Preferred-habitat demand:&lt;/strong&gt; A bond investor&amp;rsquo;s demand for bonds of a specific country and maturity that does not arise from portfolio optimization over the full menu of available assets, but rather from institutional constraints or liability-matching motives (e.g., pension funds matching long-dated domestic liabilities). In the model, preferred-habitat demand is price-elastic with slope αj(τ) and shifts with a country-specific bond demand factor βjt; the elastic component means that as bond prices rise, clientele demand falls, so arbitrageurs must absorb the residual supply and require a risk premium to do so.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Global arbitrageur:&lt;/strong&gt; An investor who trades the currency and bonds of both countries simultaneously, bridging the segmented currency and bond markets. In the model, global arbitrageurs maximize mean-variance utility over instantaneous wealth changes; their shared portfolio across all asset classes is the mechanism through which shocks in one market create hedging-driven demand in other markets, generating the cross-market linkages in risk premia and monetary policy transmission.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Currency carry trade (CCT):&lt;/strong&gt; A strategy that borrows at the home short rate and invests at the foreign short rate, profiting when the foreign currency does not depreciate enough to offset the interest rate differential. Under UIP, the CCT earns zero expected return; the model generates a positive expected CCT return — a currency risk premium — when arbitrageurs are risk-averse and currency traders&amp;rsquo; demand is price-elastic. In the paper&amp;rsquo;s notation, the CCT return is det/et + (iFt − iHt)dt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bond carry trade (BCT):&lt;/strong&gt; A strategy that borrows at the short rate and invests in long-maturity bonds of the same country, profiting when long yields fall or when expected short rates are below current long yields. Under EH, the BCT earns zero expected return; the model generates a positive expected BCT return — a term premium — when arbitrageurs are risk-averse and bond clientele demand is price-elastic.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exchange-rate disconnect:&lt;/strong&gt; The empirical and model finding that movements in the exchange rate are nearly uncorrelated with movements in long-maturity bond yields, even though both are endogenously determined in the same model. The disconnect arises in the estimated model because long bond yields are driven primarily by bond demand factors, while the exchange rate is driven primarily by the currency demand factor, and the two sets of factors move the exchange rate in offsetting directions so that their net effect on bond yield-exchange rate covariance is approximately zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Attenuation effect:&lt;/strong&gt; The dampening of monetary policy transmission to asset prices caused by the need to compensate risk-averse arbitrageurs for the increased risk they bear when accommodating the policy-induced excess demand. In the currency market, a home short-rate cut causes the CCT&amp;rsquo;s expected return to rise (arbitrageurs must be paid more to hold foreign currency), which means the foreign currency appreciates less than UIP predicts. In the bond market, a short-rate cut causes the BCT&amp;rsquo;s expected return to rise (term premia increase), so long yields fall less than EH predicts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Propagation effect:&lt;/strong&gt; The international transmission of a domestic monetary policy shock to foreign asset prices through arbitrageur hedging. A home short-rate cut causes arbitrageurs to increase their foreign-currency position (CCT); this exposes them to the risk of foreign short-rate declines (which depreciate the foreign currency), and long-maturity foreign bonds hedge this risk; so arbitrageurs increase foreign bond demand, depressing foreign yields. This channel is absent from the standard model where risk premia are constant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Log-affine equilibrium:&lt;/strong&gt; The conjectured and verified form of the equilibrium in which the log exchange rate and log bond prices are affine (linear plus constant) functions of the five state factors (iHt, iFt, γt, βHt, βFt). This structure allows the model to be solved as a system of ordinary differential equations and scalar equations, and enables closed-form or numerically tractable characterization of risk premia, variance decompositions, and policy effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bond demand factor (βjt):&lt;/strong&gt; A stochastic variable that shifts the intercept of bond clientele demand in country j, independent of maturity τ. A positive shock to βjt increases desired bond holdings of country-j clienteles at any given price, forcing arbitrageurs to shed country-j bonds, which lowers bond yields. The factor follows a mean-reverting process and in the estimated model is found to be the primary driver of long-maturity yields in both countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Currency demand factor (γt):&lt;/strong&gt; A stochastic variable that shifts the intercept of currency traders&amp;rsquo; demand for foreign assets, independent of the exchange rate level. A positive shock to γt increases desired foreign asset holdings of currency traders, so arbitrageurs reduce their foreign-currency position, which affects their bond positions through hedging. In the estimated model, γt is the primary driver of exchange-rate movements.&lt;/p&gt;
&lt;hr&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on LSE Research Online accepted version (accepted manuscript). AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>A Theory of How Workers Keep up with Inflation</title><link>https://macropaperwarehouse.com/papers/a-theory-of-how-workers-keep-up-with-inflation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-theory-of-how-workers-keep-up-with-inflation/</guid><description/></item><item><title>A Theory of Supply Function Choice and Aggregate Supply</title><link>https://macropaperwarehouse.com/papers/a-theory-of-supply-function-choice-and-aggregate-supply/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-theory-of-supply-function-choice-and-aggregate-supply/</guid><description>&lt;h2 id="research-question"&gt;Research Question&lt;/h2&gt;
&lt;p&gt;Modern macroeconomic models of aggregate supply universally restrict firms to price-setting — committing to a price and supplying whatever quantity the market demands. Flynn, Nikolakoudis, and Sastry ask: what happens if instead firms choose any supply function, a mapping that describes the price charged at each quantity of production? The paper develops the first general-equilibrium, macroeconomic theory of supply function choice and characterizes its implications for the slope of aggregate supply, monetary non-neutrality, and time-varying inflation-output tradeoffs.&lt;/p&gt;
&lt;h2 id="methodology"&gt;Methodology&lt;/h2&gt;
&lt;p&gt;The paper proceeds in two stages. In partial equilibrium, a single monopolistic firm with constant-returns-to-scale technology and constant-elasticity demand faces log-normal uncertainty about demand shifters, the aggregate price level, real marginal costs, and the stochastic discount factor. The firm chooses a non-parametric supply function — any implicit mapping f(p,q) = 0 — to maximize expected real profits. The paper shows that supply function choice is equivalent to conditioning price-quantity decisions on the realized nominal demand state z = ΨP^η. The authors prove (Theorem 1) that the optimal supply function is endogenously log-linear: log p = α₀ + α₁ log q, where the inverse supply elasticity α₁ is characterized in closed form.&lt;/p&gt;
&lt;p&gt;In general equilibrium, the authors embed supply function choice in an otherwise standard monetary business cycle model (in the tradition of Woodford 2003a and Hellwig and Venkateswaran 2009), featuring a representative household demanding differentiated goods, a money supply following a random walk with time-varying volatility, and idiosyncratic shocks to productivity, wages, and demand. They guess and verify a log-linear equilibrium and derive a scalar fixed-point equation for the equilibrium supply elasticity (Theorem 3).&lt;/p&gt;
&lt;p&gt;For quantification, the authors calibrate structural parameters (η = 8 from Hottman et al. 2016 scanner data; γ = 0.11 from Gagliardone et al. 2023 Belgian firm data; κ^M = 0.29 calibrated to match an average aggregate supply slope of 0.11 from Hazell et al. 2022) and estimate time-varying uncertainty via a GARCH model of quarterly US data on GDP growth, inflation, and real marginal cost growth from 1960 Q1 to 2024 Q4. Idiosyncratic demand uncertainty is set proportional to aggregate TFP uncertainty using the proportionality factor R = 6.5 from Bloom et al. (2018).&lt;/p&gt;
&lt;h2 id="main-findings"&gt;Main Findings&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Optimal supply function.&lt;/strong&gt; The optimal firm-level supply function is log-linear with inverse supply elasticity α₁ determined by the relative variances and covariances of demand, the price level, and real marginal costs. Three comparative statics drive the macroeconomic results: (1) higher idiosyncratic demand uncertainty (σ²_Ψ) flattens the supply function toward price-setting, because a fixed price insulates profit markups against demand variation; (2) higher price-level uncertainty (σ²_P) steepens the supply function toward quantity-setting, because setting a fixed quantity allows relative prices to adjust; (3) lower price elasticity of demand (less elastic demand, more market power) flattens the supply function, conditional on a sufficient condition that holds in US data whenever η &amp;gt; 2.5.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;From micro supply to aggregate supply.&lt;/strong&gt; With fixed log-linear supply functions, the economy has a unique log-linear equilibrium with an AD/AS representation (Theorem 2). The slope of aggregate supply ε^S_t depends on ω₁ (the transformed inverse supply elasticity), κ^M (firms&amp;rsquo; signal precision about the money supply), γ (income effects), and η (demand elasticity). Aggregate supply is maximally elastic — money is as non-neutral as possible — if and only if firms are pure price-setters (ω₁ = 0). Aggregate supply is perfectly inelastic — money is neutral — if and only if firms are quantity-setters (ω₁ = 1/η). A lower elasticity of demand flattens aggregate supply through general equilibrium strategic complementarities, a prediction opposite to the New Keynesian model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Equilibrium supply slope and its determinants.&lt;/strong&gt; The equilibrium ω₁ solves a fixed-point equation (Theorem 3) in which macroeconomic uncertainty shapes firms&amp;rsquo; optimal supply functions, which in turn shape macroeconomic dynamics. Under the special case of balanced strategic interactions (ηγ = 1), the slope of aggregate supply has a clean closed form depending only on the ratio ρ_t = σ_{ϑ,t}/σ^M_{t|s} (idiosyncratic demand uncertainty relative to posterior monetary uncertainty). Critically, the equilibrium supply slope is invariant to the overall level of uncertainty — only the composition of uncertainty matters (Proposition 3). Even vanishingly small uncertainty can generate any level of monetary non-neutrality depending on uncertainty composition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative results — United States over time.&lt;/strong&gt; The model&amp;rsquo;s estimated slope of aggregate supply shows sharp variation since 1960. The slope is relatively flat and stable during the 1960s, the Great Moderation (1991–2007), the Great Recession (2008–2019), and the recovery from the Great Recession. It spikes dramatically during the 1970s oil crisis and the post-Covid inflation of the 2020s. Compared to Ball and Mazumder (2011), the model qualitatively matches the steepening during 1973–1984 (+58% in the model) vs. the data&amp;rsquo;s +175%, and a subsequent flattening of −25% vs. −32% in the data during 1985–2007. Compared to Cerrato and Gitti (2022), the model accounts for approximately 4/5 of the steepening between the pre-Covid and post-Covid periods (+112% model vs. +145% data). For the Hazell et al. (2022) comparison, the model accounts for approximately 1/2 of the estimated flattening from 1978–1990 to 1991–2018.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative results — Cross-country.&lt;/strong&gt; Using OECD annual data from 1960–2019, the model&amp;rsquo;s predicted slope of aggregate supply is not positively correlated with the average level of inflation across countries. For countries with the highest inflation rates, the model predicts a negative slope of aggregate supply, driven by very high correlation between price-level uncertainty and real marginal cost uncertainty. The model-predicted slope correlates positively with the reduced-form regression coefficient of inflation on real output growth across countries, even after instrumenting for demand. This predictive power is over and above what can be explained by the level or volatility of inflation alone.&lt;/p&gt;
&lt;h2 id="scope-conditions"&gt;Scope Conditions&lt;/h2&gt;
&lt;p&gt;All results are derived under log-normality of uncertainty, which ensures the log-linear structure of optimal supply functions. The quantification relies on GARCH-estimated uncertainty and treats idiosyncratic demand uncertainty as proportional to aggregate TFP uncertainty. The model abstracts from microeconomic nominal price stickiness (though the authors show in Appendix B that Calvo-style sticky prices can be incorporated). The baseline model requires the equilibrium condition on firm beliefs to be consistent (rational expectations). Multiple equilibria of the scalar fixed-point are possible in principle, bounded by at most five log-linear equilibria (Proposition 2).&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-wrong-with-assuming-price-setting-or-quantity-setting-as-a-primitive-restriction-on-firm-behavior"&gt;Q1. What is wrong with assuming price-setting or quantity-setting as a primitive restriction on firm behavior?&lt;/h3&gt;
&lt;p&gt;A: Price-setting and quantity-setting are two isolated, generically non-optimal points in the larger space of supply functions. Corollary 2 establishes that price-setting is optimal only in the limit as idiosyncratic demand uncertainty becomes unboundedly large (σ²_Ψ → ∞), while quantity-setting is optimal only in the limit as price-level uncertainty becomes unboundedly large (σ²_P → ∞). In a macroeconomic environment where both sources of uncertainty are present in comparable magnitudes, both extreme policies perform poorly and the analyst who imposes either inadvertently restricts firms&amp;rsquo; strategies in ways that have large macroeconomic consequences — for example, making money neutral under quantity-setting even when information frictions are present, or making the slope of aggregate supply invariant to demand elasticity under price-setting.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-formal-equivalence-between-supply-function-choice-and-conditioning-on-realized-demand"&gt;Q2. What is the formal equivalence between supply function choice and conditioning on realized demand?&lt;/h3&gt;
&lt;p&gt;A: The firm&amp;rsquo;s problem of choosing a supply function f(p,q) = 0 ex ante is mathematically equivalent to choosing a price-quantity plan (p(z), q(z)) indexed by the nominal demand state z = ΨP^η (Equation 4 in the paper). After the supply function is set, the firm produces where the supply function intersects the demand curve, which pins down the market-clearing outcome as a function of z. Choosing the supply function ex ante is therefore the same as choosing z-contingent prices and quantities without any parametric constraint. This links the model to rational expectations equilibrium in the spirit of Lucas (1972): firms use the demand for their product as a noisy signal to update beliefs and set their optimal price and quantity in response to realized demand conditions.&lt;/p&gt;
&lt;h3 id="q3-how-is-the-optimal-inverse-supply-elasticity-α-derived-and-what-is-the-2sls-interpretation"&gt;Q3. How is the optimal inverse supply elasticity α₁ derived, and what is the 2SLS interpretation?&lt;/h3&gt;
&lt;p&gt;A: Because the optimal supply function allows the firm to set a z-contingent price, the first-order condition at each realized demand state z = t equates expected marginal revenue and expected marginal cost (Equation 7). Under log-normality, this yields a log-linear relationship log p = α₀ + α₁ log q. The elasticity α₁ equals the ratio (d log p / d log z) / (d log q / d log z) = Cov[log z, log p**] / Cov[log z, log q**], where p** and q** are the full-information optimal price and quantity (Equation 9). This is formally equivalent to a 2SLS regression: the firm estimates how its optimal price should change with its optimal quantity, using the nominal demand state z as an instrument for the optimal quantity. The supply function is steep if nominal demand strongly predicts movements in the full-information optimal price (large reduced-form coefficient); it is flat if nominal demand primarily predicts movements in the full-information optimal quantity (large first-stage coefficient).&lt;/p&gt;
&lt;h3 id="q4-how-do-uncertainty-and-demand-elasticity-shape-the-firms-optimal-supply-function-in-partial-equilibrium"&gt;Q4. How do uncertainty and demand elasticity shape the firm&amp;rsquo;s optimal supply function in partial equilibrium?&lt;/h3&gt;
&lt;p&gt;A: Three key comparative statics apply when the supply function is upward-sloping. (1) Greater price-level uncertainty (σ²_P increases) steepens α₁ toward quantity-setting: not knowing competitors&amp;rsquo; prices makes aggressive dynamic pricing attractive because it allows the firm&amp;rsquo;s relative price to adjust ex post. (2) Greater idiosyncratic demand uncertainty (σ²_Ψ increases) flattens α₁ toward price-setting: demand uncertainty favors a fixed price to keep the markup over real marginal costs constant, accommodating demand with quantity variation. (3) A lower price elasticity of demand (more market power, lower η) flattens α₁: more market power reduces the cost of setting the &amp;ldquo;wrong&amp;rdquo; price, reducing the benefit of dynamic pricing. Corollary 1 provides a sufficient condition — σ_{M,P} ≥ 0, 2ησ_{M,P} + σ_{M,Ψ} ≥ σ_{P,Ψ}, and α₁ ≥ 0 — under which ∂α₁/∂η &amp;gt; 0, implying greater market power flattens supply; the paper verifies this condition holds in US data whenever η &amp;gt; 2.5.&lt;/p&gt;
&lt;h3 id="q5-how-does-the-model-generate-an-aggregate-supply-and-demand-representation-from-supply-function-choices"&gt;Q5. How does the model generate an aggregate supply and demand representation from supply function choices?&lt;/h3&gt;
&lt;p&gt;A: Theorem 2 establishes that, given any fixed log-linear supply functions with slope ω₁,t, there is a unique log-linear equilibrium. In this equilibrium, the price level and real output are jointly determined by an aggregate demand curve — shifting with the money supply but not productivity — and an aggregate supply curve — shifting with productivity but not the money supply. The inverse elasticity of aggregate supply is ε^S_t = γ(κ^M_t + ω₁,t(η − 1/γ)(1 − κ^M_t)) / ((1 − ω₁,t η)(1 − κ^M_t)), derived from aggregating firm-level pricing decisions. The slope depends on ω₁,t (micro supply), κ^M_t (signal precision about money), γ (income effects), and η (demand elasticity). An aggregate demand shock of ∆ log M raises the price level by ε^S_t ∆ log M / (ε^D_t + ε^S_t) and raises real output by ∆ log M / (ε^D_t + ε^S_t), where ε^D_t = γ is the inverse elasticity of aggregate demand.&lt;/p&gt;
&lt;h3 id="q6-what-is-the-equilibrium-fixed-point-equation-and-why-can-there-be-multiple-equilibria"&gt;Q6. What is the equilibrium fixed-point equation and why can there be multiple equilibria?&lt;/h3&gt;
&lt;p&gt;A: Theorem 3 shows that the equilibrium transformed inverse supply elasticity ω₁,t solves a quintic polynomial fixed-point equation (Equation 29) that depends on the variances of idiosyncratic demand shocks (σ²_ϑ,t), posterior uncertainty about productivity (σ^A_{t|s}), and posterior uncertainty about money (σ^M_{t|s}). Multiple equilibria can arise because of a self-reinforcing feedback: if firms set steep supply functions, prices respond more to demand, which raises price-level volatility, which in turn makes quantity-setting more attractive, further steepening supply functions. Proposition 2 establishes existence of at least one log-linear equilibrium and at most five. Idiosyncratic productivity and factor price uncertainty do not enter the fixed-point equation because the variance of real marginal costs per se does not affect optimal supply function choice — only the covariance of marginal costs with demand and the price level matters.&lt;/p&gt;
&lt;h3 id="q7-what-determines-the-slope-of-aggregate-supply-in-the-special-case-of-balanced-strategic-interactions-ηγ--1"&gt;Q7. What determines the slope of aggregate supply in the special case of balanced strategic interactions (ηγ = 1)?&lt;/h3&gt;
&lt;p&gt;A: Under ηγ = 1 — where strategic complementarities from relative price effects exactly offset strategic substitutabilities from aggregate consumption effects — the slope of aggregate supply has the closed-form expression ε^S_t = γ(κ^M_t / (1 − κ^M_t))(1 + 1/(γ²ρ²_t κ^M_t)) where ρ_t = σ_{ϑ,t}/σ^M_{t|s} is the ratio of idiosyncratic demand uncertainty to posterior monetary uncertainty (Corollary 5). Aggregate productivity uncertainty drops out entirely because firms do not use the demand state to infer aggregate productivity when strategic interactions are balanced. As ρ_t → ∞ (idiosyncratic demand dominates), the slope converges to the price-setting value γκ^M_t/(1 − κ^M_t). As ρ_t → 0 (monetary uncertainty dominates), the slope goes to infinity, corresponding to quantity-setting and monetary neutrality.&lt;/p&gt;
&lt;h3 id="q8-what-is-the-role-of-total-uncertainty-versus-the-composition-of-uncertainty"&gt;Q8. What is the role of total uncertainty versus the composition of uncertainty?&lt;/h3&gt;
&lt;p&gt;A: Proposition 3 establishes a striking invariance result: if all standard deviations in the economy are scaled by a common factor λ &amp;gt; 0, the equilibrium supply elasticity and slope of aggregate supply are unchanged. The equilibrium outcomes depend only on the ratios of different sources of uncertainty, not their absolute magnitudes. This sharply distinguishes the model from menu-cost models, in which any increase in uncertainty unambiguously raises the benefit of price adjustment and steepens aggregate supply. A corollary is that idiosyncratic productivity uncertainty has no effect on the slope of aggregate supply in the supply function model, whereas it would steepen aggregate supply in Golosov-Lucas menu-cost models. Moreover, even a vanishingly small level of uncertainty can generate any level of monetary non-neutrality, because the equilibrium supply elasticity is discontinuous at zero uncertainty (ε^S_t (0) = {∞} while ε^S_t (λ) is bounded for any λ &amp;gt; 0).&lt;/p&gt;
&lt;h3 id="q9-how-does-market-power-demand-elasticity-affect-the-slope-of-aggregate-supply-and-why-does-this-differ-from-the-new-keynesian-prediction"&gt;Q9. How does market power (demand elasticity) affect the slope of aggregate supply, and why does this differ from the New Keynesian prediction?&lt;/h3&gt;
&lt;p&gt;A: In the supply function model, a lower elasticity of demand (more market power, lower η) flattens aggregate supply by reducing general-equilibrium strategic complementarities. When other firms raise their prices following a demand shock, a given firm faces higher relative demand; the strength of this effect is parameterized by η. With supply functions (ω₁,t ≠ 0), this relative demand increase generates an additional price response, so higher η steepens aggregate supply. Crucially, this effect is exactly zero if and only if firms are pure price-setters (ω₁,t = 0) — meaning the prediction that market power affects aggregate supply is absent from price-setting models. This is the opposite of the New Keynesian prediction: in Woodford (2003b) with decreasing returns to scale, a higher elasticity of demand (less market power) steepens the Phillips curve, because more elastic demand amplifies the quantity response to price changes and thereby the marginal cost response to nominal cost shocks.&lt;/p&gt;
&lt;h3 id="q10-how-does-the-model-rationalize-the-steepening-of-aggregate-supply-in-the-1970s-and-2020s"&gt;Q10. How does the model rationalize the steepening of aggregate supply in the 1970s and 2020s?&lt;/h3&gt;
&lt;p&gt;A: The GARCH estimates of macroeconomic uncertainty show abrupt increases in inflation uncertainty during the 1970s oil crisis period and after the Covid-19 shock in the 2020s. In the model, a spike in aggregate price-level uncertainty (σ²_P increases) causes firms to choose steeper supply functions — closer to quantity-setting — endogenously. This steepens the aggregate supply curve so that demand shocks have larger nominal effects and smaller real effects. Quantitatively, relative to the base period, the model predicts a steepening of +58% during 1973–1984 and +112% during 2021–2023. The empirical comparisons are +175% (Ball and Mazumder 2011, 1973–1984) and +145% (Cerrato and Gitti 2022, 2021–2023). The model thus accounts for the direction and rough order of magnitude of both episodes but not their full extent. The quarterly time series of model-implied ε^S_t has a correlation of 0.93 with one-quarter-ahead inflation uncertainty and 0.62 with the quarterly level of inflation.&lt;/p&gt;
&lt;h3 id="q11-how-does-the-cross-country-evidence-help-distinguish-the-model-from-alternatives-based-on-the-level-of-inflation"&gt;Q11. How does the cross-country evidence help distinguish the model from alternatives based on the level of inflation?&lt;/h3&gt;
&lt;p&gt;A: The cross-country analysis uses OECD data from 1960–2019 to construct country-level model-implied slopes of aggregate supply using the same structural parameters (η = 8, γ = 0.11, κ^M = 0.29) and country-specific GARCH uncertainty estimates from a one-lag VAR. The key finding is that the model-implied slope is not positively predicted by average inflation across countries (Panel A of Figure 5) — in fact, for the highest-inflation countries such as Chile, Israel, and Mexico, the model predicts a negative slope of aggregate supply, reflecting high correlation between price-level uncertainty and real marginal cost uncertainty. By contrast, the model-implied slope correlates positively with the reduced-form regression coefficient of inflation on real output growth (Panel B), and this positive correlation is also found using a model-derived instrument isolating exogenous monetary variation. This implies that relative uncertainties, not the mean or volatility of inflation per se, help account for cross-country heterogeneity in inflation-output tradeoffs beyond the predictions of Ball et al. (1988).&lt;/p&gt;
&lt;h3 id="q12-how-can-supply-functions-be-integrated-into-larger-linearized-macroeconomic-models"&gt;Q12. How can supply functions be integrated into larger linearized macroeconomic models?&lt;/h3&gt;
&lt;p&gt;A: Section 4.5 provides a general framework. For any model in which firms face a demand function q_it = d(p_it, z^D_it) and a value function V(p_it, q_it, z^V_it), log-linearization around a deterministic steady state yields an optimal pricing rule ˆp_it = ω₁,it ˆz^D_it (Equation 35) for some scalar ω₁,it determined by the covariance structure of the linearized model. The coefficients ω₁,it enter the standard representation of aggregate dynamics (McKay and Wolf 2023) through the ideal price index ˆP_t = ∫₀¹ ˆp_it di. The additional &amp;ldquo;rational expectations&amp;rdquo; restriction is that ω₁,it must be consistent with the equilibrium law of motion for prices. The paper argues that supply functions can thereby be embedded in the broad class of linearized DSGE models used for quantitative work, including models with decreasing returns, monopsony, endogenous markups, sticky prices, investment, and quality choice.&lt;/p&gt;
&lt;h3 id="q13-what-are-the-implications-of-supply-function-choice-for-monetary-policy-discretion"&gt;Q13. What are the implications of supply function choice for monetary policy discretion?&lt;/h3&gt;
&lt;p&gt;A: The model implies a thorny tradeoff for monetary policymakers. If a central bank wishes to maintain discretion — the ability to surprise private agents — this increases firms&amp;rsquo; uncertainty about the money supply (higher σ²_M). Under balanced strategic interactions (ηγ = 1), greater posterior monetary uncertainty (σ^M_{t|s}) lowers the ratio ρ_t = σ_{ϑ,t}/σ^M_{t|s}, which flattens the aggregate supply curve (reduces ε^S_t) and thereby increases the real effect of monetary surprises. However, this also endogenously induces firms to set steeper supply functions — closer to quantity-setting — so that the aggregate supply curve steepens in response to the greater price-level uncertainty generated by such an environment. The paper therefore concludes that maintaining monetary policy discretion may be, at least partially, self-defeating.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Inverse supply elasticity (α₁):&lt;/strong&gt; The percentage by which a firm increases its price in response to a one percent increase in production, characterizing the slope of the firm&amp;rsquo;s optimal supply function. It is endogenously log-linear and determined by the ratio of covariances relating the nominal demand state to the firm&amp;rsquo;s optimal price vs. optimal quantity under full information — formally equivalent to a 2SLS coefficient using nominal demand as an instrument.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Supply function:&lt;/strong&gt; A mapping f(p, q) = 0 describing the locus of prices and quantities a firm commits to, as an implicit function over price-quantity pairs. Unlike price-setting (f depends only on p) or quantity-setting (f depends only on q), the general supply function allows prices to vary with realized demand, nesting both polar cases as limits of extreme uncertainty.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nominal demand state (z):&lt;/strong&gt; The composite variable z = ΨP^η that indexes the demand curve. Firms observing their own output market clearing can use z as a noisy signal for inference about the aggregate price level, real marginal costs, and monetary conditions. The supply function is formally equivalent to conditioning price-quantity choices on z.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Slope of aggregate supply (ε^S):&lt;/strong&gt; The inverse elasticity of the aggregate supply curve in the AD/AS representation, measuring the relative within-period response of the price level versus real output to an aggregate demand shock. It depends on the slope of firm-level supply functions (ω₁) interacted with the information precision about the money supply (κ^M) and income effects (γ).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Transformed inverse supply elasticity (ω₁):&lt;/strong&gt; The reparameterization ω₁ = α₁/(1 + ηα₁), where α₁ is the firm-level inverse supply elasticity and η is the price elasticity of demand. ω₁ = 0 corresponds to price-setting; ω₁ = 1/η corresponds to quantity-setting. The equilibrium value of ω₁ solves a fixed-point equation that maps macroeconomic uncertainty back into firms&amp;rsquo; optimal supply function choices.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Balanced strategic interactions (ηγ = 1):&lt;/strong&gt; A parametric special case in which strategic complementarities from aggregate demand externalities (parameterized by η) exactly offset strategic substitutabilities from wage pressure (parameterized by 1/γ). Under this condition, the slope of aggregate supply has a closed-form solution that depends only on the relative uncertainty about idiosyncratic demand vs. the money supply.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Relative uncertainty sufficient statistic (ρ_t):&lt;/strong&gt; The ratio σ_{ϑ,t} / σ^M_{t|s}, measuring firms&amp;rsquo; uncertainty about idiosyncratic demand shocks relative to posterior uncertainty about the money supply. Under balanced strategic interactions (ηγ = 1), ρ_t is the single sufficient statistic determining the equilibrium slope of aggregate supply. As ρ_t → ∞ (idiosyncratic demand uncertainty dominates), firms converge to price-setting and aggregate supply flattens; as ρ_t → 0 (monetary uncertainty dominates), firms converge to quantity-setting and aggregate supply becomes vertical.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Invariance to total uncertainty:&lt;/strong&gt; A key property of the model: the equilibrium slope of aggregate supply is invariant to the overall scale of uncertainty (Proposition 3). Only the composition of uncertainty across idiosyncratic vs. aggregate sources and demand vs. productivity shocks matters. This distinguishes the model from menu-cost models, in which any increase in uncertainty raises the benefit of price flexibility and steepens aggregate supply regardless of uncertainty composition.&lt;/p&gt;</description></item><item><title>A traffic-jam theory of growth</title><link>https://macropaperwarehouse.com/papers/a-traffic-jam-theory-of-growth/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-traffic-jam-theory-of-growth/</guid><description>&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Finocchiaro and Weil ask whether financial development necessarily promotes long-run economic growth, or whether congestion externalities in R&amp;amp;D markets can offset — and even reverse — the growth benefits of easier credit access. The paper proposes that the empirical coexistence of expanding financial sectors and roughly constant per-capita GDP growth rates (approximately 2% annually in the United States over the last century) can be explained by the interplay of search frictions in two sequential markets: credit and innovation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology.&lt;/strong&gt; The authors build a continuous-time endogenous growth model in which all growth is innovation-led. Firms must pass through four sequential stages — creation, fund-raising (Stage 0–1), R&amp;amp;D search (Stage 1–2), and high-productivity production (Stage 2–3) — before being exogenously destroyed. Both the credit market (firms searching for banks/venture capitalists) and the innovation market (firms searching for innovators after securing finance) are characterized by constant-returns-to-scale matching functions with endogenous market tightness. Nash bargaining determines the loan repayment, and free entry drives profits to zero in both markets. The model is then calibrated to annual U.S. data, with the risk-free rate r = 3.5%, separation rate s = 4%, symmetric bargaining power ω = 0.5, a productivity jump γ = 0.023 targeting a baseline growth rate of 2%, credit market duration for creditors just below one month and for firms slightly above one year (consistent with Wasmer and Weil, 2004), a two-year average patent approval time (USPTO 2020), 6% employment in finance (BLS 2020), and 0.5% employment in scientific R&amp;amp;D (BLS 2020).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Core Mechanism.&lt;/strong&gt; The paper derives a &amp;ldquo;spillover function&amp;rdquo; Q(p,g) that links the equilibrium probability of finding an innovator (q) to the probability of finding a bank (p) and the growth rate (g). Because free entry holds profits at zero, easier credit — a higher p — forces q downward: if a firm spends less time raising funds, the innovation market becomes more congested (Qp &amp;lt; 0). This negative spillover between the two markets is the paper&amp;rsquo;s central traffic-jam analogy: relieving one bottleneck shifts congestion downstream.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt; The GG curve — the locus of (p, g) pairs consistent with equilibrium — is hump-shaped under the symmetric cost condition c = ωn (flow search cost for firms in credit markets equals the firm&amp;rsquo;s share of search costs in innovation markets). Growth is maximized when expected credit search time equals expected innovation search time (1/p = 1/q). Beyond that interior optimum, further financial deepening lowers the growth rate. The calibrated economy sits to the right of the hump in a flat region (p &amp;gt; q), so that reducing credit frictions alone has a marginally negative effect on growth: eliminating credit frictions lowers g from 2.000% to 1.997%, a reduction of 0.003 percentage points. Reducing innovation frictions alone raises g modestly to 2.071% (+0.071 pp). Only a simultaneous reduction of frictions in both markets raises g meaningfully, to 2.122% (+0.122 pp). The quantitative effects are deliberately small, consistent with the near-constancy of long-run growth despite financial deepening.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; The non-monotonicity requires both markets to carry search frictions; when only one friction is present, financial development is unambiguously good for growth (Section 4.3). The hump-shape is established analytically in the symmetric case c = ωn; more generally, the paper shows (via back-of-envelope approximation) that the sign of the finance–growth link depends on whether c/ω is less than or greater than n. The quantitative insensitivity of growth to finance is amplified when the real interest rate is close to the growth rate and when potential growth γ is close to actual growth g: the elasticity of growth with respect to finance is proportional to (γ − g)/γ. Extensions to fixed bank entry costs (introducing a growth-to-finance feedback), endogenous innovator wages (Section 4.2), and frictionless innovation (Section 4.3) all confirm the benchmark conclusions under stated parameter conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the paper&amp;rsquo;s central theoretical claim about the finance–growth nexus?&lt;/strong&gt;
The paper claims that the finance–growth relationship is non-monotonic: financial development raises growth when credit is scarce (left of the hump on the GG curve) but lowers it when credit is readily available (right of the hump), because easier financing draws more firms into the innovation market, tightening it and reducing the probability of finding an innovator. This congestion spillover from the credit market to the innovation market is the &amp;ldquo;traffic-jam&amp;rdquo; mechanism. The non-monotonicity vanishes if either market lacks search frictions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the &amp;ldquo;spillover function&amp;rdquo; and why is it central to the model?&lt;/strong&gt;
The spillover function Q(p, g) is derived from the free-entry zero-profit condition for firms and expresses the innovation-matching probability q consistent with equilibrium for given credit-matching probability p and growth rate g. It has Qp &amp;lt; 0 (easier credit reduces q) and Qg &amp;lt; 0 (faster growth reduces q), capturing the two-way negative interaction between the markets. It is central because all equilibrium and comparative-statics results flow through it: the GG curve is defined by substituting Q into the growth equation g = γ/(1 + s/p + s/Q(p,g)).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: Under what condition is the GG curve hump-shaped, and what is the intuition?&lt;/strong&gt;
The GG curve is hump-shaped when the flow search cost for firms in the credit market c equals the firm&amp;rsquo;s share of innovation search costs ωn (Proposition 4). The intuition mirrors equalizing travel times across two congested roads: growth is maximized when expected credit search time (1/p) equals expected innovation search time (1/q). When credit is very tight (p small), a marginal increase in p raises the share of innovating firms faster than it tightens the innovation market, so growth rises. Once credit is abundant (p large), the congestion effect on innovation dominates and growth falls.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What does the benchmark calibration predict about the quantitative effect of financial development on growth?&lt;/strong&gt;
The benchmark calibration, targeting 2% annual U.S. growth, places the economy to the right of the hump in a flat region of the GG curve (p &amp;gt; q). Eliminating credit market frictions alone reduces the annual growth rate by 0.003 percentage points (from 2.000% to 1.997%) while lengthening expected innovation search time from 2 years to 3.4 years. This marginally negative effect arises because the economy is already well to the right of the optimum. The results are deliberately small and consistent with the empirical near-constancy of growth alongside financial deepening.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What combination of policies does the model recommend for raising growth?&lt;/strong&gt;
Only a simultaneous reduction of frictions in both the credit and the innovation market raises the growth rate meaningfully, to 2.122% in the calibration (+0.122 pp relative to the 2.000% benchmark). Isolated improvements in credit markets have a marginally negative effect; isolated improvements in innovation markets have a marginally positive effect (+0.071 pp). The authors interpret this as supporting the OECD view that growth-stimulating policies should be designed as a system rather than as isolated pro-growth measures.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does the elasticity of growth to finance depend on the gap between potential and actual growth?&lt;/strong&gt;
The authors show (referenced as available on request) that the elasticity of the growth rate with respect to financial factors is proportional to (γ − g)/γ, where γ is the potential growth rate (the productivity jump per innovation) and g is the actual equilibrium growth rate. When actual growth is close to potential — as in the benchmark calibration with γ = 0.023 and g = 2.000% — this factor is near zero, making growth nearly insensitive to changes in financial conditions. This provides a structural rationale for why empirically measured finance–growth effects are often small or nil in advanced economies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does introducing fixed bank entry costs (Section 4.1) change the results?&lt;/strong&gt;
When banks bear a fixed licensing cost K (paid each time they enter the credit market), credit market tightness φ becomes an increasing function of (r − g)K: the annuity value of the fixed cost falls as growth rises, inducing more bank entry and reducing credit tightness. This introduces an upward-sloping PP curve (rather than a vertical one) and creates a direct positive feedback from growth to financial deepening. The qualitative conclusions on non-monotonicity are preserved: lower licensing costs shift the PP curve right and steepen it, with the equilibrium effect on growth remaining ambiguous due to the congestion spillover into the innovation market.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What happens to the spillover function when innovators are paid (Section 4.2)?&lt;/strong&gt;
When innovators receive a Nash-bargained wage, the equilibrium wage (Equation 30) is increasing in innovator productivity (πγ), innovation market tightness (θn), and the growth rate, and decreasing in total credit market search costs K(φ). Easier credit raises both expected revenues and innovator wages for the firm. For innovator bargaining power α sufficiently small (and always for α &amp;lt; 1, as shown in the Appendix), the revenue effect dominates so that Qp &amp;lt; 0 is preserved: finance still creates bottlenecks in the innovation market, and the core non-monotonicity result carries through.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What does the model predict when only one market has search frictions?&lt;/strong&gt;
When only the credit market is frictional and innovators are found instantly after financing is secured, improving credit market efficiency unambiguously raises growth (Section 4.3, Figure 4). The GG curve becomes g = γ/(s/p + 1), which is strictly increasing in p, and the PP curve shifts in a way that unambiguously raises equilibrium growth. The paper uses this case to isolate the source of non-monotonicity: the negative spillover from credit ease to innovation congestion requires frictions in both markets to operate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the paper relate to the empirical &amp;ldquo;too much finance&amp;rdquo; literature?&lt;/strong&gt;
The paper offers a distinct theoretical mechanism for the inverted-U relationship between credit and productivity growth documented by Arcand et al. (2015), Aghion et al. (2019), and Popov (2018), among others. While Aghion et al. (2019) explain the inverted-U through less-efficient incumbents surviving longer with better credit access, and Malamud and Zucchi (2019) emphasize how financing frictions differentially affect entrant and incumbent composition, Finocchiaro and Weil&amp;rsquo;s mechanism operates through congestion externalities in sequential search markets — a channel not previously formalized in the innovation-led growth literature.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Search frictions in credit markets:&lt;/strong&gt; Firms searching for financiers (banks or venture capitalists) and banks searching for firms face a matching technology with constant returns to scale; credit market tightness φ is the ratio of firms searching for banks to banks searching for firms, and the matching probability p(φ) is strictly decreasing in φ. Free entry drives bank profits to zero, pinning equilibrium tightness.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Search frictions in innovation markets:&lt;/strong&gt; After securing financing, firms search for innovators who can upgrade their productivity by factor γ; innovation market tightness θ is the ratio of firms searching for innovators to innovators, and the matching probability q(θ) is strictly decreasing in θ. The number of innovators is held fixed (analogously to fixed labor supply in Mortensen-Pissarides).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Spillover function Q(p, g):&lt;/strong&gt; Derived from the free-entry zero-profit condition for firms, Q expresses the equilibrium innovation-matching probability q as a function of the credit-matching probability p and the growth rate g. It has Qp &amp;lt; 0 and Qg &amp;lt; 0, meaning easier credit and faster growth both reduce q by tightening the innovation market. It is the formal embodiment of the traffic-jam mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;GG curve:&lt;/strong&gt; The locus of (p, g) pairs consistent with the equilibrium growth equation g = γ/(1 + s/p + s/Q(p,g)). Under the symmetric cost condition c = ωn, the GG curve is hump-shaped: it rises from the origin, reaches a maximum interior growth rate, then declines toward an asymptote g∞ &amp;lt; γ. Its shape encodes the non-monotonic relationship between finance and growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;PP curve:&lt;/strong&gt; The locus of equilibrium credit-matching probabilities consistent with free entry in the credit market. In the benchmark model it is a vertical line at p* = p(ω/(1−ω) · k/c), independent of q and g. When banks bear a fixed entry cost K, the PP curve becomes upward-sloping, introducing a direct positive feedback from growth to financial deepening.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Potential growth rate γ:&lt;/strong&gt; The productivity jump per successful innovation; in a frictionless world (p = q = ∞) the economy grows at γ. Actual growth g falls below γ to the extent that search frictions delay the delivery of credit and innovation. The elasticity of g to financial factors is proportional to (γ − g)/γ, so when actual and potential growth are close, financial factors matter little for growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Congestion externality in R&amp;amp;D:&lt;/strong&gt; The mechanism by which financial deepening — raising p — drives more firms to seek innovators, tightening the innovation market and reducing q. This negative spillover (Qp &amp;lt; 0) is the paper&amp;rsquo;s central departure from models with only a single friction, where finance is always growth-enhancing.&lt;/p&gt;</description></item><item><title>Abundance from Abroad: Migrant Income and Long-Run Economic Development</title><link>https://macropaperwarehouse.com/papers/abundance-from-abroad-migrant-income-and-long-run-economic-development/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/abundance-from-abroad-migrant-income-and-long-run-economic-development/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks how persistent increases in international migrant income prospects affect long-run economic development in migrant-origin areas. The central question is whether Philippine provinces with persistent access to higher-income migration opportunities develop faster than provinces with less attractive migration opportunities, and through which channels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Natural Experiment and Identification Strategy&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors exploit the 1997 Asian Financial Crisis as a large-scale natural experiment. The crisis triggered sharp, heterogeneous, and persistent exchange rate changes across Philippine migrants&amp;rsquo; destination countries — ranging from a 4% depreciation against the Philippine peso (Korea) to a 57% appreciation (Libya), with Japan and Saudi Arabia in between (appreciations of 32% and 52%, respectively). Because Philippine provinces differed in the pre-crisis distribution of migrant income across destinations (measured using unusual POEA/OWWA administrative contract data covering all overseas worker contracts, including migrant incomes, origins, and destinations), these exchange rate shocks generated exogenous, province-level variation in a shift-share instrument: the predicted change in province migrant income per capita due to the 1997 shocks. Identification follows the &amp;ldquo;exogenous shares&amp;rdquo; framework of Goldsmith-Pinkham et al. (2020). Pre-trend tests across up to 12 years of pre-shock panel data find no evidence of differential trends across provinces. The five destinations with the highest Rotemberg weights — Saudi Arabia, Japan, United States, Taiwan, and Hong Kong — collectively account for 75% of the identifying variation. The exchange rate shocks and the exposure weights both exhibit strong persistence over two decades post-1997.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Philippine government administrative data (POEA/OWWA) on all overseas worker contracts, 1992–2015, matched at 95% rate, providing province-of-origin and destination-specific migrant income.&lt;/li&gt;
&lt;li&gt;Philippine Family Income and Expenditure Survey (FIES), up to twelve triennial rounds from 1985–2018 (74 provinces, ~40,000 households per round), for domestic income and expenditure.&lt;/li&gt;
&lt;li&gt;Six rounds of the Philippine Census of Population (1990–2015) for education, migration rates, and sectoral employment shares.&lt;/li&gt;
&lt;li&gt;Province-level consumer price index data (1994–2017) and firm-level export survey data for robustness checks.&lt;/li&gt;
&lt;li&gt;Unit of analysis: 74 Philippine provinces (consistent 1990 borders).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Six-fold magnification of migrant income&lt;/strong&gt;: Each unit of initial short-run shock (1997–1998) to migrant income per capita is magnified more than six-fold by 2009–2015. A one-standard-deviation shock (0.093) raises long-run migrant income per capita by 14.7% of the baseline mean (PhP 601 per capita, 0.2 standard deviations).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Domestic income gains predominate&lt;/strong&gt;: A one-standard-deviation shock raises domestic income per capita (excluding migrant income and remittances) by 6.4% of the baseline mean (PhP 1,676, 0.18 standard deviations). Remarkably, 73.6% of the long-run global income increase comes from domestic income and only 26.4% from migrant income.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Global income and expenditure&lt;/strong&gt;: A one-standard-deviation shock raises global income per capita by PhP 2,277 (0.2 standard deviations, or 7.5% of the baseline mean) in 2009–2015. Expenditure per capita rises by PhP 1,159 (0.13 standard deviations). Effects emerge gradually over two decades.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Education&lt;/strong&gt;: A one-standard-deviation shock increases the college-educated share of the population by 0.46–0.51 percentage points (0.11–0.12 standard deviations) and secondary completion by 0.63 percentage points. There is no significant effect on primary completion.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Migration rates and skill composition&lt;/strong&gt;: A one-standard-deviation shock increases the migration rate by 0.19 percentage points (0.22 standard deviations), raises the share of skilled migrants by 1.84 percentage points (0.19 standard deviations), and increases average migrant annual salary by PhP 23,703 (0.16 standard deviations). New migration concentrates in higher-education-quartile occupations.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Structural change&lt;/strong&gt;: The shock reduces primary sector employment shares by 1.2 percentage points per standard deviation (0.06 standard deviations), with over 70% of that shift absorbed by non-tradable goods and services sectors. Domestic income gains are driven almost entirely by non-agricultural income, and roughly 55% of the increase in entrepreneurial income is from service sectors.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Education&amp;rsquo;s contribution to income&lt;/strong&gt;: Model-based calculations assign 19.6% of the global income gain, 17.8% of the migrant income gain, and 20.2% of the domestic income gain to educational investments. Exchange rate persistence plus altered migration flows explain an additional 64.6% of the migrant income increase, so together these mechanisms account for 82.3% of the six-fold magnification. A demand multiplier (assuming 64% of migrant income returns to origin economies and a multiplier of 2.9, consistent with estimates from the literature) accounts for approximately 83.3% of the non-education-related portion of the domestic income increase.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Threats to Identification Ruled Out&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Import and export shift-share controls (constructed analogously using bilateral trade data and province-level industry employment shares) are uncorrelated with the migrant income shock and leave coefficient estimates unchanged. Province-level manufactured exports, agricultural income, the CPI, and national-level FDI inflows show no statistically significant response to the shock. Internal migration rates are unaffected. Geographic spillover controls and tourism controls do not alter results. Placebo regressions in the pre-period yield small, statistically insignificant coefficients.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper studies formal, government-regulated temporary labor migration from the Philippines, where migrants sign contracts through POEA-licensed agencies and typically expect to return after one or more contracts. The findings apply specifically to settings where persistent (not transitory) migrant income shocks occur. Approximately 60% of contract migrants are female. The study period spans 1985–2018, with main long-run outcome analyses comparing 1994 (pre-shock) with 2009–2015 (post-shock).&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-makes-the-1997-asian-financial-crisis-useful-as-a-natural-experiment-for-this-papers-purposes"&gt;Q1. What makes the 1997 Asian Financial Crisis useful as a natural experiment for this paper&amp;rsquo;s purposes?&lt;/h3&gt;
&lt;p&gt;A1: The crisis was largely unanticipated by policymakers, international organizations, and financial markets, making it implausible that pre-1997 migration destination choices reflected anticipation of the shocks. Exchange rate changes were heterogeneous across destinations (ranging from a 4% depreciation to a 57% appreciation), and crucially, these changes proved highly persistent over two decades — regression coefficients of long-run exchange rate changes on the initial 1997–1998 shock are close to and statistically indistinguishable from 1 in nearly all post-shock periods. Combined with the province-specific variation in migrant destination exposure, this generates persistent, exogenous, and heterogeneous shocks to migrant income prospects across provinces.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-shift-share-variable-and-how-does-it-combine-shifts-and-shares"&gt;Q2. What is the shift-share variable, and how does it combine &amp;ldquo;shifts&amp;rdquo; and &amp;ldquo;shares&amp;rdquo;?&lt;/h3&gt;
&lt;p&gt;A2: The shift-share variable Shiftshareo equals the sum over destinations d of (ωdo0 × ΔRd), where ωdo0 is province o&amp;rsquo;s pre-shock migrant income per capita from destination d (the &amp;ldquo;exposure weight&amp;rdquo; or &amp;ldquo;share&amp;rdquo;), and ΔRd is the fractional change in destination d&amp;rsquo;s exchange rate from before to after the crisis (the &amp;ldquo;shift&amp;rdquo;). It captures the predicted change in province-level migrant income per capita due to the 1997 exchange rate shocks, and is derived directly from a theoretical model of migration. Identification relies on the &amp;ldquo;exogenous shares&amp;rdquo; approach of Goldsmith-Pinkham et al. (2020): the pre-1997 exposure weights are treated as as-good-as-randomly assigned conditional on controls, because they reflect historical migration networks formed well before the crisis.&lt;/p&gt;
&lt;h3 id="q3-why-is-the-six-fold-magnification-of-the-initial-migrant-income-shock-so-striking-and-what-does-the-structural-model-say-about-its-sources"&gt;Q3. Why is the six-fold magnification of the initial migrant income shock so striking, and what does the structural model say about its sources?&lt;/h3&gt;
&lt;p&gt;A3: The coefficient on migrant income per capita (6.463 in Panel D of Table 1) implies that for each unit of initial short-run migrant income shock, migrant income per capita is more than six units higher in 2009–2015 — a far larger response than a one-for-one pass-through would predict. The structural model, which augments a Fréchet-based gravity model of migration with endogenous education investments, accounts for 82.3% of this magnification. Education investments explain 17.8% of the migrant income increase; persistent favorable exchange rates and resulting shifts in migration flows across destinations explain an additional 64.6%. The Fréchet elasticity of migration flows with respect to destination wages is estimated at θ = 3.42 via PPML, implying that even partial reorientation of migrants toward now-higher-wage destinations substantially raises aggregate migrant income.&lt;/p&gt;
&lt;h3 id="q4-what-evidence-supports-the-parallel-trends-assumption-in-the-pre-shock-period"&gt;Q4. What evidence supports the parallel trends assumption in the pre-shock period?&lt;/h3&gt;
&lt;p&gt;A4: The authors present event study diagrams (Figure 2) showing no differential positive pre-trends in either expenditure per capita or domestic income per capita prior to 1997 — for domestic income, there is a statistically insignificant negative trend from 1985–1991 and no trend in 1991–1994. Placebo regressions estimated on the pre-period only (1985, 1988, 1991 as &amp;ldquo;pre,&amp;rdquo; 1994 and 1997 as &amp;ldquo;post&amp;rdquo;) yield small, statistically insignificant coefficients on both domestic income and expenditure. Balance tests focusing on the five high-Rotemberg-weight destination shares (Saudi Arabia, Japan, US, Taiwan, Hong Kong) — which collectively account for 75% of the identifying variation — also show no significant pre-trends in key outcomes across provinces with varying levels of exposure.&lt;/p&gt;
&lt;h3 id="q5-how-do-the-authors-rule-out-trade-flows-as-an-alternative-mechanism-for-the-estimated-income-effects"&gt;Q5. How do the authors rule out trade flows as an alternative mechanism for the estimated income effects?&lt;/h3&gt;
&lt;p&gt;A5: They construct separate import and export shift-share variables, analogous to the &amp;ldquo;China shock&amp;rdquo; of Autor et al. (2013), using baseline bilateral trade values (from COMTRADE, disaggregated to 36 ISIC industries), province-level employment shares in import and export industries (from the 1990 Census), and the same destination exchange rate shocks. These trade shift-share variables are uncorrelated with the migrant income shock after conditioning on baseline controls (Appendix Table A5). Including them as additional controls in Panel D of all main regression tables leaves the migrant income coefficient stable. Further, province-level manufactured exports per capita show no large or statistically significant response to the migrant income shock, agricultural income similarly shows no significant response, and consumer price indices are unresponsive — ruling out import price changes as a confound. FDI inflows at the national level also show no significant relationship with destination-country exchange rate shocks.&lt;/p&gt;
&lt;h3 id="q6-what-is-the-composition-of-the-domestic-income-gains--where-do-they-come-from"&gt;Q6. What is the composition of the domestic income gains — where do they come from?&lt;/h3&gt;
&lt;p&gt;A6: Both wage income and entrepreneurial/rental income rise significantly and in similar magnitude, while &amp;ldquo;other income&amp;rdquo; (pensions, interest, dividends) shows no robust increase (Table 4). Non-agricultural income drives virtually the entire domestic income gain; agricultural income per capita is statistically insignificant (Table 5, columns 1–2). Within entrepreneurial income, approximately 55% of the increase is from service sectors, with manufacturing and primary sector entrepreneurial income showing insignificant effects at the 10% level (Table 5, columns 3–5). These patterns are consistent with the structural change finding: the shock shifts labor from primary sectors toward non-tradable goods and services rather than toward tradable manufacturing.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-global-income-concept-and-what-share-does-each-component-contribute"&gt;Q7. What is the &amp;ldquo;global income&amp;rdquo; concept and what share does each component contribute?&lt;/h3&gt;
&lt;p&gt;A7: Global income per capita is defined as the sum of domestic income per capita (earned within the Philippine economy, excluding all international transfers) and migrant income per capita (the full income earned abroad by a province&amp;rsquo;s international migrants, calculated from contract data). Of the long-run global income increase, 73.6% comes from domestic income and 26.4% from migrant income. A one-standard-deviation shock raises global income by PhP 2,277 per capita in 2009–2015 (0.2 standard deviations, or 7.5% of the baseline mean).&lt;/p&gt;
&lt;h3 id="q8-how-do-education-effects-translate-into-more-and-higher-skilled-migration"&gt;Q8. How do education effects translate into more and higher-skilled migration?&lt;/h3&gt;
&lt;p&gt;A8: A one-standard-deviation migrant income shock increases college completion by 0.46 percentage points and secondary completion by 0.63 percentage points (with no significant effect on primary completion), consistent with the shock raising the return to higher education in the broader population. These better-educated workers then migrate at higher rates: the share of migrants who are skilled (college-educated) rises by 1.84 percentage points per standard deviation. Migration increases are concentrated in the two highest-education quartiles of occupations (engineers, medical professionals, teachers in the 4th quartile; caregivers, restaurant workers, performing artists in the 3rd quartile), with no significant effect in the two lowest quartiles. Average annual migrant salary rises by PhP 23,703 per standard deviation (0.16 standard deviations).&lt;/p&gt;
&lt;h3 id="q9-what-mechanisms-does-the-structural-model-invoke-to-explain-the-domestic-income-gains"&gt;Q9. What mechanisms does the structural model invoke to explain the domestic income gains?&lt;/h3&gt;
&lt;p&gt;A9: The model treats domestic income changes as arising through at least two channels: (1) the education channel, which the model assigns 20.2% of the domestic income increase (using the estimated college completion response of 0.046 per unit shock, baseline skill-migration probabilities, and baseline skill premia for domestic income); and (2) a demand multiplier operating on the portion of migrant income remitted to origin provinces, combined with capital accumulation from sustained migrant income flows. Assuming 64% of migrant income returns to origin economies (estimated indirectly from KNOMAD/ILO and Survey on Overseas Filipinos data) and a multiplier of 2.9 (consistent with estimates from Kenya and India), this demand-plus-investment channel can explain approximately 83.3% of the remaining (non-education-related) domestic income increase of PhP 14.4 per unit shock. Under baseline assumptions (α = 0.64), the stylized dynamic model generates PhP 18.88 of domestic income by 2015 from a PhP 1 initial shock — close to the empirical estimate of PhP 18.02.&lt;/p&gt;
&lt;h3 id="q10-how-do-the-authors-assess-sutva-and-internal-migration"&gt;Q10. How do the authors assess SUTVA and internal migration?&lt;/h3&gt;
&lt;p&gt;A10: They test whether the migrant income shock affects net internal migration rates at the provincial level (Appendix Table A6) and find no large or statistically significant impact. There is a small negative effect on outmigration of young adults (aged 16–24) that the authors judge cannot account for the documented income impacts. The Philippines&amp;rsquo; archipelago geography (over 7,000 islands) is noted as likely limiting inter-provincial economic spillovers; to the extent spillovers occur, they would be positive (demand spillovers from provinces experiencing income gains to neighboring provinces), making estimates conservative lower bounds. Direct tests controlling for the inverse-distance-weighted migrant income shock in neighboring provinces leave main estimates unchanged.&lt;/p&gt;
&lt;h3 id="q11-are-the-exposure-weights-migration-shares-persistent-and-does-this-support-interpreting-the-shock-as-persistent"&gt;Q11. Are the exposure weights (migration shares) persistent, and does this support interpreting the shock as persistent?&lt;/h3&gt;
&lt;p&gt;A11: Yes. Regressions of dyadic migrant income per capita in post-shock years (2009, 2012, 2015) on dyadic migrant income per capita in 1995 yield coefficients ranging from 0.4 to 0.6, each statistically significantly different from zero (and from 1, indicating partial but substantial persistence). The exchange rate shocks ΔRd are even more persistent: regression coefficients on the initial 1997–1998 shock are close to 1 and statistically indistinguishable from 1 in nearly all post-shock periods (with the only exceptions in 2009–2012 during the Great Recession). Both components of the shift-share variable thus show persistence over two decades, supporting interpretation of the long-run effects as responses to a persistent (not transitory) income shock.&lt;/p&gt;
&lt;h3 id="q12-what-are-the-policy-implications-and-how-do-the-authors-connect-findings-to-migration-policy"&gt;Q12. What are the policy implications and how do the authors connect findings to migration policy?&lt;/h3&gt;
&lt;p&gt;A12: The findings suggest migration policy should be an important part of the development policy toolkit. The results are directly relevant to origin-country policies facilitating formal, contract-based labor migration (e.g., regulation of recruitment agencies, educational investments to raise worker skills and competitiveness for overseas employment) and destination-country policies governing legal immigration opportunities. The authors also note implications for overseas development assistance: development agencies could consider supplementing traditional foreign aid with programs that facilitate international labor migration. The paper&amp;rsquo;s context — formal, government-regulated migration through POEA and OWWA — is described as highly policy-relevant, with 94% of developing countries with populations exceeding 1 million having a dedicated government migration agency and 78% having policies promoting migrant remittances.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Shift-share variable (Shiftshareo):&lt;/strong&gt; The paper&amp;rsquo;s primary independent variable, equal to the sum over all overseas destinations d of (ωdo0 × ΔRd) — the province&amp;rsquo;s pre-shock migrant income per capita from each destination (the exposure weight or &amp;ldquo;share&amp;rdquo;) multiplied by that destination&amp;rsquo;s exchange rate shock (the &amp;ldquo;shift&amp;rdquo;). It is the predicted change in province migrant income per capita due to the 1997 Asian Financial Crisis exchange rate shocks, and is derived directly from the theoretical model of migration (Equation A9). Identification treats the exposure weights as exogenous following the &amp;ldquo;exogenous shares&amp;rdquo; approach of Goldsmith-Pinkham et al. (2020).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exposure weights (ωdo0):&lt;/strong&gt; Province o&amp;rsquo;s pre-shock aggregate migrant income per capita earned in destination d, calculated from administrative POEA/OWWA contract data for 1995. These serve as the &amp;ldquo;shares&amp;rdquo; in the shift-share and capture the extent to which a province&amp;rsquo;s residents are exposed to a given destination&amp;rsquo;s exchange rate shock. They reflect historically-formed migration networks rather than anticipation of future shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Global income per capita:&lt;/strong&gt; The sum of domestic income per capita and migrant income per capita. Domestic income is household income earned within the Philippine economy (wages, entrepreneurial, and other sources), explicitly excluding all income from international sources including remittances. Migrant income is the full income earned abroad by all international migrants from the province, calculated from contract data (not remittances sent home). Global income thus captures the full resource gain available to a province from the combination of domestic production and international migration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Magnification (of migrant income shock):&lt;/strong&gt; The empirical finding that the long-run coefficient on migrant income per capita (6.463 in Panel D, Table 1) far exceeds 1 — meaning each unit of initial short-run shock becomes more than six units of migrant income per capita in 2009–2015. The paper decomposes this magnification into contributions from persistent exchange rates, educational investments raising skill levels and migration, and shifts in migration flows toward now-higher-wage destinations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Brain gain:&lt;/strong&gt; The paper&amp;rsquo;s term for the process by which improved migrant income prospects raise educational investments among the broader population (not just among migrants), leading to higher skill levels among non-migrants as well. The paper distinguishes this from &amp;ldquo;brain drain&amp;rdquo; (where migration of skilled workers reduces origin-area human capital) and provides evidence of a &amp;ldquo;virtuous cycle&amp;rdquo;: education raises migration rates and migrant skill levels, which in turn raises migrant and domestic incomes, potentially funding further education.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Rotemberg weights:&lt;/strong&gt; Province-destination-level weights (following Goldsmith-Pinkham et al. 2020) characterizing which destination-specific exchange rate shocks drive the estimates most. Saudi Arabia (0.20), Japan (0.19), United States (0.18), Taiwan (0.10), and Hong Kong (0.08) together account for 75% of the total Rotemberg weight. These weights guide which destination-specific exposure shares receive the most scrutiny in pre-trend and balance tests.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fréchet elasticity (θ):&lt;/strong&gt; The elasticity of migration flows from an origin province to a destination with respect to destination wages (in Philippine pesos), estimated at 3.42 via PPML using the exchange rate shocks. This parameter governs how much migration flows — and thereby migrant income — respond to the persistent exchange rate changes, and is central to the model&amp;rsquo;s decomposition of the six-fold magnification of migrant income effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Domestic income multiplier:&lt;/strong&gt; The ratio of long-run domestic income increase to the portion of the migrant income shock that returns to origin provinces. Assuming 64% of migrant income returns to origin economies (estimated from multiple administrative data sources), the implicit demand multiplier in the paper&amp;rsquo;s context ranges from about 2.9 to 3.4, consistent with multipliers found in related literature on cash transfers and credit supply shocks in low-income settings.&lt;/p&gt;</description></item><item><title>Across-Country Wage Compression in Multinationals</title><link>https://macropaperwarehouse.com/papers/across-country-wage-compression-in-multinationals/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/across-country-wage-compression-in-multinationals/</guid><description>&lt;h2 id="layer-1--summary"&gt;Layer 1 — Summary&lt;/h2&gt;
&lt;p&gt;Many multinationals do not fully adjust wages to the local context of their foreign establishments; instead, they partially link the wages of foreign workers in a given position to the wages paid in the same position at headquarters — a practice the authors call &amp;ldquo;wage anchoring.&amp;rdquo; Using yearly establishment-level compensation data on roughly 1,200 multinationals operating across 174 cities worldwide (2000–2015) and matched employer-employee administrative data (RAIS) from Brazil, Hjort, Li, and Sarsons document that a 10 percent higher headquarters wage is associated with 1.63–2.8 percent higher wages for workers in the same occupation at foreign establishments, with the within-firm across-country correlation substantially exceeding the correlation between a given establishment&amp;rsquo;s wages and the local average paid by other multinationals for the same position. To establish a causal link between externally imposed headquarters wage changes and subsequent foreign establishment wage responses, the paper exploits two identification strategies: minimum wage shocks in the headquarters country or U.S. state and exchange rate fluctuations, both of which generate plausibly exogenous variation in headquarters wages that is then partially transmitted to foreign workers in the same position. Wage change transmission appears to be direct and to operate through firm-wide wage-setting procedures rather than through associated changes in technology or employment at foreign establishments, a conclusion the Brazil RAIS data support because total employment at multinationals&amp;rsquo; Brazilian establishments shows little change following positive external shocks to headquarters wages. Wage anchoring is strongest for low-skill occupations (cleaners, drivers, security guards), where a 10 percent higher headquarters wage is associated with a 2.8 percent higher foreign establishment wage, versus roughly 1.2 percent for middle- and high-skill occupations; the resulting spatial compression of wages is in line with how many multinationals themselves report setting pay across locations.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-central-phenomenon-documented-in-this-paper-and-what-are-the-two-broad-empirical-components-of-the-analysis"&gt;Q1. What is the central phenomenon documented in this paper, and what are the two broad empirical components of the analysis?&lt;/h3&gt;
&lt;p&gt;The central phenomenon is &amp;ldquo;wage anchoring&amp;rdquo;: multinationals link wages at their foreign establishments to the wage level at headquarters for the same narrowly-defined occupation, so that the within-firm across-country wage distribution is more compressed than what local labor-market conditions alone would imply. The first empirical component is descriptive — documenting the high cross-sectional correlation between headquarters and foreign establishment wages within a firm×occupation cell, controlling for city×year effects and local wage benchmarks. The second component is causal — using minimum wage shocks in the headquarters country or U.S. state and exchange rate shocks to generate externally imposed changes in headquarters wages, and tracing whether and how quickly those changes are partially transmitted to foreign establishments.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-primary-dataset-what-does-it-cover-and-what-are-its-key-limitations"&gt;Q2. What is the primary dataset, what does it cover, and what are its key limitations?&lt;/h3&gt;
&lt;p&gt;The primary dataset was compiled by an unidentified consulting company that gathers compensation information from client employers and harmonizes positions globally into 309 occupations across 16 skill levels and 26 occupational categories. It covers roughly 1,200 multinationals (private-sector firms and multinational public-sector employers such as NGOs and multilateral organizations), operating in more than 170 cities, with yearly observations spanning 2000–2015. The data report average nominal gross total monthly wages for domestic (non-expat) workers in each establishment-occupation-year cell. Key limitations: the panel is unbalanced because multinationals choose which establishments report each year and often rotate establishments in and out; matching between the headquarters and any given foreign establishment requires observing the same occupation in the same year at both, which reduces the headquarters-matched sample to 80 employers and 611 foreign establishments (Sample 3, the most comparable subsample). The publicly listed U.S. firms in the data account for about one-third of total revenue of all publicly listed U.S. firms, so the sample is skewed toward unusually large employers.&lt;/p&gt;
&lt;h3 id="q3-how-do-the-authors-define-and-measure-wage-anchoring-in-the-descriptive-section"&gt;Q3. How do the authors define and measure &amp;ldquo;wage anchoring&amp;rdquo; in the descriptive section?&lt;/h3&gt;
&lt;p&gt;The authors regress log average wages of workers in occupation j at a firm f&amp;rsquo;s foreign establishment in city c in year t (wjfct) on log average wages for the same occupation at the firm&amp;rsquo;s headquarters (HQwjft), controlling for firm×occupation fixed effects, city×year fixed effects, and a local market wage benchmark measured either as the average paid by other multinationals in the same city-occupation-year cell or as a city×occupation×year fixed effect. The estimated coefficient on the headquarters wage — around 0.163 using the benchmark-wage control and about 0.09 using the more restrictive city×occupation×year fixed effect — measures how much of a headquarters wage difference is &amp;ldquo;passed through&amp;rdquo; to foreign establishment wages within the same firm and occupation. They further document that the within-firm wage slope (the difference between wages in consecutive skill levels within an occupational category) at foreign establishments is similarly anchored to the corresponding slope at headquarters, with a 10 percent greater consecutive-skill wage gap at headquarters associated with about a 1.4 percent greater gap at the foreign establishment.&lt;/p&gt;
&lt;h3 id="q4-what-exactly-do-the-minimum-wage-and-exchange-rate-identification-strategies-exploit-and-what-do-they-identify"&gt;Q4. What exactly do the minimum wage and exchange rate identification strategies exploit, and what do they identify?&lt;/h3&gt;
&lt;p&gt;The minimum wage strategy compares multinationals whose headquarters are located in a country or U.S. state that experiences a minimum wage increase (&amp;ldquo;treated&amp;rdquo;) against multinationals whose headquarters are not exposed (&amp;ldquo;control&amp;rdquo;), conditioning on establishments being in the same foreign city. Within the treated group, it also exploits cross-occupation variation: within a given foreign establishment, workers in positions whose headquarters counterparts are more exposed to the minimum wage increase (because their wages are closer to the new minimum) experience larger foreign wage gains. The exchange rate strategy exploits appreciation of a non-U.S. headquarter country&amp;rsquo;s currency against the dollar: when the USD-measured headquarters wage of such a multinational increases following an appreciation, this tests whether foreign establishment wages in USD also rise. Because exchange rates increase and decrease, are less stable than minimum wages, and have different underlying drivers, the exchange rate design provides an independent corroboration of the minimum wage findings. Both strategies identify the effect of externally imposed headquarters wage changes on wages at the same firm&amp;rsquo;s foreign establishments in the same narrowly defined occupation.&lt;/p&gt;
&lt;h3 id="q5-what-evidence-is-marshaled-against-indirect-pathways-technology-changes-employment-changes-offshoring-as-the-driver-of-foreign-wage-transmission"&gt;Q5. What evidence is marshaled against indirect pathways (technology changes, employment changes, offshoring) as the driver of foreign wage transmission?&lt;/h3&gt;
&lt;p&gt;The paper presents three types of evidence against indirect pathways. First, including headquarters country×year fixed effects in the descriptive wage regressions — which absorbs any technology shocks originating in the headquarters country that affect all occupations uniformly — leaves the estimated wage anchoring coefficient essentially unchanged. Second, event study and panel regressions using the Brazil RAIS data show little change in total employment at multinationals&amp;rsquo; Brazilian establishments following positive external shocks to headquarters wages, which is hard to reconcile with employment-driven or offshoring-driven wage adjustment. Third, a causal forest analysis of the conditional average treatment effect of minimum wage shocks on foreign wages — estimated allowing responses to vary with a wide range of job, employer, sector, and location characteristics — finds that occupation characteristics and sector have little explanatory power for which establishments transmit more, while differences in transmission are more closely related to characteristics of the headquarter-establishment country pair (proximity, similarity, shared language), which are more naturally associated with administrative coordination than with technology or production-style linkages.&lt;/p&gt;
&lt;h3 id="q6-how-does-occupation-skill-level-moderate-wage-anchoring-and-what-does-this-heterogeneity-imply"&gt;Q6. How does occupation skill level moderate wage anchoring, and what does this heterogeneity imply?&lt;/h3&gt;
&lt;p&gt;Wage anchoring is strongest for low-skill occupations. In the descriptive correlations, a 10 percent higher headquarters wage is associated with 2.8 percent higher foreign wages in low-skill jobs (cleaners, drivers, data entry clerks, security guards) but only about 1.2 percent higher foreign wages in both middle-skill and high-skill jobs. The occupation heterogeneity is visible graphically (Figure 1 Panel C) and holds in regressions interacting the headquarters wage with skill-level indicators. A natural interpretation, consistent with the firm-wide wage-setting procedure explanation, is that firms are most likely to apply standardized pay rules to lower-level positions where local market customization may be seen as less important; higher-skill workers may be more likely to have individually negotiated contracts responsive to local conditions. The heterogeneity also implies that the spatial compression effect — wages in foreign establishments being pulled toward headquarters levels — is particularly pronounced at the lower end of the within-firm wage distribution, affecting positions like cleaners and guards in ways that can result in wages that are, relative to GDP per capita, an order of magnitude higher than what headquarters workers in the same position receive.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-spatial-compression-implication-and-how-does-it-relate-to-within-firm-wage-inequality"&gt;Q7. What is the &amp;ldquo;spatial compression&amp;rdquo; implication and how does it relate to within-firm wage inequality?&lt;/h3&gt;
&lt;p&gt;Wage anchoring implies that workers in the same occupation at foreign establishments located in lower-income countries receive wages that are compressed toward headquarters levels rather than fully adjusted to local wages. The paper shows that nominal wages at foreign establishments average about 89 percent of headquarters wages in the same occupation and year — and about 78 percent for establishments in countries poorer than the headquarter country — a ratio that is roughly stable across the within-firm headquarters wage distribution. This partial equalization is what the authors call &amp;ldquo;across-country wage compression&amp;rdquo;: it reduces the within-multinational cross-country wage dispersion relative to what would arise from purely market-based, locally responsive wage-setting. The spatial compression is consistent with how many firms self-report setting wages: a survey of primarily North American employers (Culpepper &amp;amp; Associates, 2011) found 29 percent report paying the same nominal wages across locations, and several large employers (Amazon, IKEA, Walmart) have self-imposed country-wide wage floors.&lt;/p&gt;
&lt;h3 id="q8-what-role-do-headquarter-establishment-country-pair-characteristics-play-in-predicting-which-establishments-exhibit-stronger-wage-transmission"&gt;Q8. What role do headquarter-establishment country-pair characteristics play in predicting which establishments exhibit stronger wage transmission?&lt;/h3&gt;
&lt;p&gt;Using a causal forest algorithm to estimate the conditional average treatment effect of a minimum wage shock at headquarters and then constructing above- versus below-median predicted treatment groups, the paper finds that differences in transmission are &amp;ldquo;generally not large&amp;rdquo; but that higher transmission is somewhat associated with characteristics of the headquarter-establishment country pair: pairs that are more closely connected and share more similarities (e.g., common language, closer geographic distance) transmit more. Some foreign-establishment-country characteristics such as inequality and urbanization also appear related. In contrast, occupation characteristics (such as offshorability), the sector the multinational operates in, and characteristics of the headquarter country alone have little explanatory power. The paper notes these findings do not conclusively rule out alternative explanations but are more consistent with administrative coordination channels than with technology- or employment-based ones.&lt;/p&gt;
&lt;h3 id="q9-what-role-do-potential-fairness-preferences-and-firm-wide-wage-norms-play-in-the-papers-interpretation"&gt;Q9. What role do potential fairness preferences and firm-wide wage norms play in the paper&amp;rsquo;s interpretation?&lt;/h3&gt;
&lt;p&gt;The authors suggest several possible mechanisms through which firm-wide wage-setting procedures could operate. Firms may adopt uniform wage-setting to reduce the menu and information costs of localized wage-setting (Lemieux et al., 2012); to increase foreign worker morale, particularly if workers are averse to pay inequality relative to headquarters peers (Card et al., 2012; Dube et al., 2019); or to respond to fairness preferences from headquarters workers or consumers (Harrison &amp;amp; Scorse, 2010). Survey evidence from Alfaro-Urena et al. (2019) explicitly records that multinationals pay high wages abroad in part to &amp;ldquo;ensure cross-country pay fairness within the MNC.&amp;rdquo; Alternatively, the authors note that firm-wide wage-setting may represent a form of firm inertia or mistakes — an inability or unwillingness to fully adapt pricing and compensation to local contexts — consistent with DellaVigna &amp;amp; Gentzkow (2019). The paper presents this as an open question for future research rather than definitively adjudicating among the explanations.&lt;/p&gt;
&lt;h3 id="q10-how-does-the-brazil-rais-data-corroborate-and-extend-the-global-multinationals-findings"&gt;Q10. How does the Brazil RAIS data corroborate and extend the global multinationals findings?&lt;/h3&gt;
&lt;p&gt;The RAIS matched employer-employee administrative data cover all employees at each Brazilian establishment of the 44 multinationals in the global dataset that operate in Brazil, with individual-level information on wages, education, race, gender, age, and tenure. Because RAIS is an administrative census of formal-sector employment rather than a consulting firm&amp;rsquo;s client dataset, it provides independent corroboration of the main findings. The paper confirms using RAIS that wages of individual workers at multinationals&amp;rsquo; Brazilian establishments rise abruptly when their foreign headquarters experience positive external shocks. The RAIS data then enable the additional step of examining employment responses, where event study and panel regressions find little change in total employment at multinationals&amp;rsquo; Brazilian establishments following such shocks — evidence against employment- or technology-driven indirect pathways as the primary explanation for wage transmission.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Wage anchoring:&lt;/strong&gt; The practice by which a multinational ties wages at its foreign establishments, for workers in a given occupation, to the wage level at its headquarters for the same occupation. In this paper&amp;rsquo;s usage, anchoring does not mean wages are set identically across locations but that they are partially linked — externally imposed changes in headquarters wages are partially transmitted to foreign establishment wages — rather than being independently set based on local labor-market conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Across-country wage compression:&lt;/strong&gt; The reduction in the cross-country dispersion of wages within a multinational that results from wage anchoring. Because foreign establishment wages are partially pulled toward headquarters levels rather than fully adjusting to local wages, the multinational&amp;rsquo;s within-firm wage distribution is more compressed across countries than it would be under purely localized wage-setting. In the paper&amp;rsquo;s data, this compression is particularly pronounced for low-skill occupations in lower-income host countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Firm-wide wage-setting procedures:&lt;/strong&gt; Administrative practices, such as applying a single pay scale or a fixed wage ratio across all of a firm&amp;rsquo;s establishments regardless of location, that mechanically link foreign establishment wages to headquarters wages. The paper argues these procedures — rather than correlated technology shocks or employment adjustments — are the proximate driver of wage anchoring, on the basis of the employment non-response in Brazil, the persistence of anchoring after controlling for headquarters-country technology shocks, and the pattern of heterogeneity across country pairs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Partial transmission:&lt;/strong&gt; A load-bearing qualifier in this paper describing the magnitude of wage anchoring: headquarters wage changes arising from external shocks are not fully extended to foreign workers, but a fraction of the change is passed through. The estimated pass-through in descriptive regressions ranges from about 0.09 to 0.31 depending on specification and sample, and is highest (around 0.28) for low-skill occupations. The partial nature of transmission means that the spatial compression is real but incomplete.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage slope:&lt;/strong&gt; The difference between log average wages paid by an employer to workers in jobs of consecutive skill levels within an occupational category, at a given establishment. The paper documents that the wage slope at foreign establishments is correlated with the wage slope at headquarters — a 10 percent greater consecutive-skill wage gap at headquarters is associated with a roughly 1.4 percent greater gap at the foreign establishment — suggesting that the anchoring extends beyond the level of wages to the internal wage structure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;External shocks to headquarter wages:&lt;/strong&gt; Minimum wage increases in the headquarters country or U.S. state, and exchange rate fluctuations that change the USD value of wages set in local currency. These shocks serve as instruments or quasi-experimental sources of variation in headquarters wages that are plausibly exogenous to conditions at foreign establishments, enabling causal identification of the effect of headquarter wage changes on foreign establishment wages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Causal forest (heterogeneous treatment effect estimation):&lt;/strong&gt; A machine learning algorithm used in the paper to estimate the conditional average treatment effect of a minimum wage shock at headquarters, allowing the size of the foreign wage response to vary flexibly with a large set of characteristics (job, employer, sector, headquarter country, establishment country, headquarter-establishment country pair). The resulting predicted treatment effect scores are used to construct above- and below-median transmission groups, which are then compared across observable characteristics to identify what predicts stronger wage anchoring.&lt;/p&gt;
&lt;hr&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on NBER Working Paper 26788 (February 2020, Revised April 2025). Source text was truncated after the beginning of Section 4.1 (minimum wage event study analysis); all causal evidence descriptions draw on the introduction and Section 3–4 framing rather than the full Section 4 tables and Section 5 heterogeneity analysis. AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>Aggregate demand externality and self-fulfilling default cycles</title><link>https://macropaperwarehouse.com/papers/aggregate-demand-externality-and-self-fulfilling-default-cycles/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/aggregate-demand-externality-and-self-fulfilling-default-cycles/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Why do corporate defaults cluster in recurring episodes rather than occurring smoothly? The paper asks whether observable fundamental factors — firm characteristics and macroeconomic variables — are sufficient to account for the clustered default patterns documented in the data, and, if not, what theoretical mechanism can explain them.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Motivation.&lt;/strong&gt; Using Moody&amp;rsquo;s historical default rate data, the authors document that the long-run average corporate bond default rate during 1866–2008 was approximately 1.50%, yet defaults were highly episodic: the worst three-year period during the Great Depression totaled 12.88%, and the three-year period 1873–1875 after the railroad boom reached 35.80%. A Markov switching regression on post-war default rate data (1951–2017) strongly rejects a linear no-switch model in favor of a two-regime model across all information criteria (AIC, HQ, SC, and log-likelihood). The estimated high-default regime has a mean default rate of 1.93% (unconditional mean µ/(1−ρ)) — roughly eight times the 0.23% mean of the low-default regime — and a standard deviation nearly six times larger. The high-default regime persists on average 5.81 years (transition probability of staying ≈ 0.83), while the low-default regime lasts approximately 7.52 years (staying probability ≈ 0.87).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The authors build a continuous-time general equilibrium model with Dixit-Stiglitz monopolistic competition (CES aggregation with elasticity σ) and an endogenous entry/exit/default mechanism. Households are risk-neutral and also act as entrepreneurs. At each instant, δµ new project blueprints are invented; entrepreneurs borrow to invest, then face an idiosyncratic liquidity shock z drawn from a Pareto distribution G(z). Entrepreneurs continue if z ≤ Z*, a cutoff determined by the continuation value of the firm, and default otherwise. Continuing firms become monopolists for a new variety until that variety becomes obsolete at a Poisson rate δ. Each operating firm must borrow working capital constrained by its firm value Vt (collateral constraint wtnjt ≤ θVjt). The entire equilibrium reduces to a two-dimensional dynamical system in (Mt, Vt), where Mt is the number of operating firms (state variable) and Vt is the firm value (control variable).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key Mechanism — Demand Externality and Positive Feedback.&lt;/strong&gt; Under CES aggregation, each firm&amp;rsquo;s gross revenue is y_jt^(1–1/σ) · Y_t^(1/σ), making individual firm revenue increasing in aggregate output Yt. A decline in Yt lowers firm profits and firm value Vt, which raises the default threshold Z* and increases the fraction of projects that are abandoned. Fewer operating firms further depress Yt, closing a positive feedback loop. This static strategic complementarity (through CES) is combined with dynamic strategic complementarity through the borrowing constraint: higher expected future firm value relaxes current working capital constraints, raising current production.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Multiple Equilibria and Global Dynamics.&lt;/strong&gt; The two-locus phase diagram (˙Mt = 0 and ˙Vt = 0) yields multiple intersections — and hence multiple steady states — when productivity A lies in an intermediate range (A &amp;lt; A &amp;lt; Ā). When A &amp;gt; Ā, a single good saddle-point equilibrium exists. When A &amp;lt; A, no equilibrium can be sustained. In the intermediate range, a good steady state (low default rate, high firm value) coexists with a bad steady state (high default rate, low firm value). The good steady state is always a saddle; the bad steady state is a sink (locally indeterminate, κ &amp;lt; κ_Hopf) or a source (locally determinate but globally indeterminate, κ &amp;gt; κ_Hopf), depending on parameter κ = 1 + (θ + ρ)/δ.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bogdanov-Takens Bifurcation.&lt;/strong&gt; Using global dynamical methods, the paper demonstrates richer indeterminacy than local analysis permits. Near the Bogdanov-Takens point (κ, Ā), the system can exhibit: (a) infinite equilibrium trajectories converging to the bad steady state; (b) saddle-loop bifurcation at κ = κ_SL ≈ 14.25 (under the baseline calibration); (c) stable or unstable periodic orbits for κ ∈ (κ_Hopf, κ_SL) — endogenous business cycles in a perfect-foresight equilibrium; and (d) multiple trajectories from near the source that converge to the good saddle equilibrium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Simulation of Clustered Defaults.&lt;/strong&gt; With a two-state Markov process for productivity (Ah = 10, Al = 9.34) and pessimistic sentiment shifts (the &amp;ldquo;ugly&amp;rdquo; state), the model replicates the cluster pattern: in the good/high-productivity state, the default rate is near zero; when productivity falls to low and sentiment turns pessimistic, the default rate can spike to approximately 12%, consistent with the Great Depression observation. Critically, the paper shows that the cluster pattern is generated only under global dynamics — restricting to local dynamics produces substantially smaller fluctuations in the default rate, confirming that the ugly (sink) equilibrium is essential.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy.&lt;/strong&gt; A countercyclical subsidy to non-defaulting entrants — financed by a lump-sum tax, calibrated as tr(Vt) = τ(VG − Vt) — shifts the ˙Mt = 0 locus downward and can eliminate the bad steady state entirely, leaving only the good saddle-path equilibrium. The paper provides a closed-form sufficiency condition for τ (Proposition 7).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; Multiple equilibria require: (i) productivity in the intermediate range A &amp;lt; A &amp;lt; Ā; (ii) the elasticity of substitution σ not too large (below a threshold σ̄ that itself depends on µ); (iii) the borrowing constraint binding (δ &amp;gt; θσ/((σ–1)κ), which can always be ensured by choosing δ sufficiently large). Clustered defaults in the simulation require the joint occurrence of a negative fundamental shock (productivity falling from high to low) and a shift to pessimistic sentiment; either factor alone generates only limited default amplification.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-empirical-motivation-for-the-model-and-what-does-the-regime-switching-analysis-establish"&gt;Q1. What is the core empirical motivation for the model, and what does the regime-switching analysis establish?&lt;/h3&gt;
&lt;p&gt;The paper documents that the corporate bond default rate, drawn from Moody&amp;rsquo;s data covering 1866–2008, clusters sharply in episodes: the long-run average is 1.50%, yet the worst three-year period of the Great Depression totaled 12.88% and 1873–1875 reached 35.80%. A Markov switching regression on 1951–2017 data strongly rejects a linear no-regime-switch model across all four criteria (log-likelihood, AIC, HQ, SC). The two-regime model identifies a high-default regime with unconditional mean 1.93% and standard deviation roughly six times the low-default regime&amp;rsquo;s, a persistence probability of approximately 0.83 (duration ≈ 5.81 years), and a low-default regime with unconditional mean 0.23% and persistence approximately 0.87 (duration ≈ 7.52 years). The regime-switching result supports the prior literature&amp;rsquo;s claim (Das et al. 2007; Duffie et al. 2009; Azizpour et al. 2018) that observable fundamentals alone cannot account for clustered defaults.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-dixit-stiglitz-ces-structure-generate-a-demand-externality-that-links-aggregate-output-to-individual-firm-default-decisions"&gt;Q2. How does the Dixit-Stiglitz CES structure generate a demand externality that links aggregate output to individual firm default decisions?&lt;/h3&gt;
&lt;p&gt;Under CES aggregation with elasticity σ, each firm&amp;rsquo;s gross revenue equals y_jt^(1–1/σ) · Y_t^(1/σ) (equation 7), so aggregate output Yt directly enters individual firm revenue. Each firm takes Yt as given, yet the aggregation of all firms&amp;rsquo; output determines Yt. When aggregate output falls — because more firms have defaulted and exited production — each remaining firm&amp;rsquo;s revenue and profit fall, reducing the firm&amp;rsquo;s continuation value Vt. A lower Vt tightens the borrowing constraint (wtnjt ≤ θVjt), reduces working capital, and raises the probability that the firm&amp;rsquo;s idiosyncratic liquidity shock will exceed the default threshold Z*, producing further defaults. This positive feedback constitutes the demand externality: individual firms&amp;rsquo; decisions are strategic complements, both statically (through CES demand) and dynamically (through the borrowing constraint on working capital).&lt;/p&gt;
&lt;h3 id="q3-what-is-the-two-dimensional-dynamical-system-that-summarizes-the-equilibrium-and-what-do-the-two-loci-look-like-in-the-phase-diagram"&gt;Q3. What is the two-dimensional dynamical system that summarizes the equilibrium, and what do the two loci look like in the phase diagram?&lt;/h3&gt;
&lt;p&gt;The entire equilibrium reduces to two differential equations in (Mt, Vt): ˙Mt = –δ[Mt – µG(Z(Vt))] and ˙Vt = κδVt[1 – F(Vt, Mt)], where F captures the ratio of monopoly profit to firm value including the borrowing constraint. The ˙Mt = 0 locus slopes strictly upward because a higher firm value Vt raises the default cutoff Z* and lowers the fraction of entrants who default, so more firms survive and Mt rises until absorption equals entry. This locus has a minimum at Mm = µG(zm) because firm value must exceed the threshold that sustains the credit market. The ˙Vt = 0 locus is non-monotonic: it first slopes upward (more firms raise aggregate demand and profit through the scale/externality channel) and then slopes downward (more firms tighten the labor market, raising wages and lowering profits). The two opposing channels make the ˙Vt = 0 locus hump-shaped, creating the possibility of two intersections and hence two steady states.&lt;/p&gt;
&lt;h3 id="q4-under-what-conditions-do-multiple-steady-states-exist-and-what-does-each-look-like"&gt;Q4. Under what conditions do multiple steady states exist, and what does each look like?&lt;/h3&gt;
&lt;p&gt;Multiple steady states exist when productivity A satisfies A &amp;lt; A &amp;lt; Ā, where A and Ā are closed-form thresholds given by Equations (A.3) and (A.4), and the elasticity of substitution σ is below a threshold σ̄ (Equation A.5). When A &amp;lt; A, neither locus intersects and no equilibrium is sustainable. When A &amp;gt; Ā, a single good saddle-point equilibrium exists. In the multiple-equilibria range, the good steady state has a higher firm value and a smaller fraction of firms defaulting; the bad steady state has a lower firm value and a higher default rate. Under the paper&amp;rsquo;s numerical calibration (A = 10, η = 6.5, Zmin = 0.88), the low default rate at the good steady state is approximately 1.5% and the high default rate at the bad steady state is between 12% and 13%.&lt;/p&gt;
&lt;h3 id="q5-what-are-the-local-dynamics-around-each-steady-state-and-how-does-parameter-κ-determine-whether-the-bad-steady-state-is-a-sink-or-a-source"&gt;Q5. What are the local dynamics around each steady state, and how does parameter κ determine whether the bad steady state is a sink or a source?&lt;/h3&gt;
&lt;p&gt;Proposition 5 shows that the good steady state is always a saddle point, ensuring a unique convergent path for initial Mt near Mg_0. The bad steady state&amp;rsquo;s local nature depends on κ = 1 + (θ + ρ)/δ and the critical value κ_Hopf = 1 + ψ/(θMb_0Vb_0). When κ is between 1 and κ_Hopf, the Jacobian trace is negative and the bad steady state is a sink with one order of indeterminacy: given Mt close to Mb_0, infinitely many initial values of the control variable Vt satisfy all equilibrium conditions. When κ &amp;gt; κ_Hopf, the bad steady state is a source point; the economy diverges from it. Because κ does not affect the steady-state locations (Proposition 3), one can vary κ to change the dynamic character without moving the equilibria in the phase diagram.&lt;/p&gt;
&lt;h3 id="q6-what-does-the-global-dynamics-analysis-reveal-that-local-analysis-misses"&gt;Q6. What does the global dynamics analysis reveal that local analysis misses?&lt;/h3&gt;
&lt;p&gt;Global analysis via Bogdanov-Takens bifurcation (Proposition 6) reveals three classes of dynamics absent from local analysis. First, even in the saddle-source case (locally determinate), there exist multiple equilibrium trajectories diverging from near the bad (source) steady state and converging to the good (saddle) steady state; these paths satisfy all equilibrium conditions including transversality but are incorrectly ruled out by local methods. Second, at the critical value κ_SL ≈ 14.25 (under the baseline calibration), a homoclinic saddle-loop orbit connects the saddle point to itself — all trajectories interior to the loop converge to the bad steady state. Third, for κ between κ_Hopf and κ_SL, periodic orbits arise in a perfect-foresight equilibrium with no external shocks. For example, at κ = 14.9, the phase diagram displays a unique periodic orbit around the bad steady state, with two distinct initial values of Vt for any given Mt near the orbit — endogenous, perpetual oscillations without any exogenous driving force. Numerical experiments confirm that Mt = 0.23 admits two rational-expectations values of Vt (2.09 and 3.55) on the saddle path alone, illustrating abundant indeterminacy even at the endpoint.&lt;/p&gt;
&lt;h3 id="q7-how-does-the-paper-simulate-the-clustered-default-pattern-and-what-is-the-role-of-the-ugly-equilibrium"&gt;Q7. How does the paper simulate the clustered default pattern and what is the role of the &amp;ldquo;ugly&amp;rdquo; equilibrium?&lt;/h3&gt;
&lt;p&gt;The paper constructs a three-state Markov economy: &amp;ldquo;good&amp;rdquo; (high productivity Ah = 10, single saddle equilibrium, near-zero default rate), &amp;ldquo;bad&amp;rdquo; (low productivity Al = 9.34, saddle-path equilibrium, modestly elevated defaults), and &amp;ldquo;ugly&amp;rdquo; (low productivity, sink-path equilibrium, sharply elevated defaults). The ugly state is reached when, upon a productivity decline, firms adopt pessimistic expectations and the economy slides to the high-default sink instead of remaining on the low-default saddle path. Transition probabilities are set so that the average ugly-state duration is approximately 6 years and roughly 45% of periods are ugly, consistent with the regime-switching estimates. With Zmin = 0.2 and η = 15, the ugly-state default rate can reach approximately 12%, matching the Great Depression observation. The counterfactual experiment deletes the ugly state (pGU = 0) and resets pGB = 0.45: the resulting default rate stays close to zero with no cluster pattern, demonstrating that global dynamics (the ugly sink) rather than the fundamental shock alone generate the clustering.&lt;/p&gt;
&lt;h3 id="q8-can-purely-sentiment-driven-cycles-generate-the-clustered-default-pattern"&gt;Q8. Can purely sentiment-driven cycles generate the clustered default pattern?&lt;/h3&gt;
&lt;p&gt;Section 6.2 fixes productivity at a low level (A = 9.53) and drives switches between the bad (saddle path) and ugly (sink path) states by pure sentiment shocks alone (πBU and πUB). The simulated default rate does spike upward when sentiment turns pessimistic, but the rises are generally more modest than in the combined fundamental-plus-sentiment exercise, and the default rate can no longer be characterized as countercyclical. The authors conclude that the realistic observed default cluster is the result of a combination of negative fundamental shocks and pessimistic sentiment shifts; either ingredient alone is insufficient to replicate all features of the data.&lt;/p&gt;
&lt;h3 id="q9-how-does-the-collateral-constraint-on-working-capital-create-dynamic-strategic-complementarity"&gt;Q9. How does the collateral constraint on working capital create dynamic strategic complementarity?&lt;/h3&gt;
&lt;p&gt;Following Jermann and Quadrini (2012), Liu and Wang (2014), and Lian and Ma (2021), each operating firm must borrow to pay wages each period, subject to the constraint wtnjt ≤ θVjt. Since Vt is forward-looking (the discounted present value of the firm&amp;rsquo;s monopoly profit stream), optimistic expectations about future output raise Vt, relax the borrowing constraint, allow firms to hire more labor and produce more output today, and thereby validate optimism. This intertemporal complementarity means that the equilibrium is sensitive not only to current fundamentals but also to beliefs about the future, opening the channel for sentiment-driven multiple equilibria and self-fulfilling cycles.&lt;/p&gt;
&lt;h3 id="q10-what-is-the-policy-remedy-for-the-bad-equilibrium-and-how-does-it-work"&gt;Q10. What is the policy remedy for the bad equilibrium, and how does it work?&lt;/h3&gt;
&lt;p&gt;Proposition 7 establishes that a countercyclical lump-sum-tax-financed subsidy to non-defaulting entrants, tr(Vt) = τ(VG − Vt), with τ exceeding a computable threshold, eliminates the bad steady state. The subsidy works by effectively raising the value of continuing for a firm at any given Vt and Mt, shifting the ˙Mt = 0 locus downward until it lies below the ˙Vt = 0 locus everywhere in the relevant range, eliminating the second intersection and leaving only the good saddle-path equilibrium. The numerical illustration uses parameters from Section 6 with A = 9.67 and τ = 1/3 to demonstrate that the bad steady state vanishes and the phase diagram has a single equilibrium. The subsidy is self-limiting: in normal conditions when firm value is already high (Vt ≈ VG), the transfer is near zero.&lt;/p&gt;
&lt;h3 id="q11-how-does-this-paper-differ-from-cui-and-kaas-2021-the-most-closely-related-predecessor"&gt;Q11. How does this paper differ from Cui and Kaas (2021), the most closely related predecessor?&lt;/h3&gt;
&lt;p&gt;Cui and Kaas (2021) show default cycles from self-fulfilling beliefs in a fully competitive firm environment, focusing on intertemporal default coordination. The present paper differs in three respects. First, firms engage in monopolistic competition under CES preferences, and the main novel mechanism is cross-firm default contagion through the demand externality — which can produce multiple equilibria even in a static setting, without any intertemporal coordination. Second, the paper examines the joint role of fundamental shocks and aggregate-demand externalities together, showing that multiple equilibria arise only in the presence of sufficiently low productivity (A &amp;lt; A &amp;lt; Ā), making indeterminacy contingent on external fundamentals rather than structural parameters alone. Third, the continuous-time framework with full global analysis via Bogdanov-Takens bifurcation allows characterization of periodic orbits and the interaction of the ugly sink path with Markov productivity regimes — dynamics not covered in Cui and Kaas (2021).&lt;/p&gt;
&lt;h3 id="q12-what-is-the-markup-prediction-of-the-model-and-is-it-consistent-with-empirical-evidence"&gt;Q12. What is the markup prediction of the model, and is it consistent with empirical evidence?&lt;/h3&gt;
&lt;p&gt;Under Dixit-Stiglitz CES with elasticity σ, the equilibrium markup of each intermediate good equals σ/(σ–1) at the firm level. However, the measured gross markup — which includes the effective collateral constraint — is predicted to comove positively with the default rate in the model, and hence the markup is countercyclical. The paper notes this is consistent with the well-documented empirical regularity in Bils (1987) and Rotemberg and Woodford (1999). Additionally, the model replicates the finding in Gilchrist and Zakrajšek (2012) that a low default rate is associated with a high firm entry rate.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Demand Externality (Dixit-Stiglitz type).&lt;/strong&gt; In the paper&amp;rsquo;s sense, this is the mechanism by which individual firms&amp;rsquo; revenues depend on aggregate output Yt through the CES aggregator: each firm&amp;rsquo;s gross revenue is y_jt^(1–1/σ) · Y_t^(1/σ). Each firm takes Yt as given, but the aggregation of all firms&amp;rsquo; output determines Yt. This creates a positive spillover: more operating firms raise aggregate output, which raises each firm&amp;rsquo;s revenue, and vice versa. The paper uses this as the central transmission channel for self-fulfilling defaults, in contrast to prior literature that emphasized debt networks or asymmetric information contagion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Self-Fulfilling Default Cycle.&lt;/strong&gt; A dynamic equilibrium path in which pessimistic expectations about aggregate output are validated: if firms anticipate that more other firms will default (lowering Yt), their own continuation value Vt falls, raising the probability that their idiosyncratic liquidity shock will exceed the default threshold, increasing actual defaults, further lowering Yt, and so on. The paper distinguishes this from shock-amplifier stories by constructing a model with multiple rational-expectations equilibria in which the aggregate default rate is determined in part by initial beliefs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bogdanov-Takens Bifurcation.&lt;/strong&gt; A mathematical tool for global dynamics analysis applied to two-dimensional continuous-time systems. In the paper, it is used to characterize system behavior when the parameters (κ, A) are near the point (κ̄, Ā) at which the Jacobian has two zero eigenvalues. Near this point, the system can exhibit saddle-loop bifurcations, Hopf bifurcations, homoclinic orbits, and stable or unstable periodic orbits — all of which are invisible to local linearization analysis. The paper uses this to establish that indeterminacy is more pervasive than local analysis suggests.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Good / Bad / Ugly Steady States.&lt;/strong&gt; In the paper&amp;rsquo;s three-regime framework: the &amp;ldquo;good&amp;rdquo; state is the unique saddle-point equilibrium under high productivity Ah, with near-zero default rates; the &amp;ldquo;bad&amp;rdquo; state is the saddle-path equilibrium under low productivity Al, with modestly elevated defaults; the &amp;ldquo;ugly&amp;rdquo; state is the sink-path equilibrium under low productivity, characterized by self-fulfilling high default rates (up to ~12%). The ugly state is reached only when pessimistic sentiment coincides with the low-productivity regime, and it is the ugly state that generates the cluster pattern in simulation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Collateral Constraint on Working Capital.&lt;/strong&gt; The firm-level borrowing constraint wtnjt ≤ θVjt, where θ is the collateral ratio and Vjt is the firm&amp;rsquo;s continuation value. This constraint means that higher expected future profits — by raising Vt — relax the current borrowing limit, increase current labor demand and output, and create dynamic strategic complementarity between current and future production. It is this constraint, combined with the CES demand externality, that makes the dynamical system two-dimensional and generates the non-monotonic ˙Vt = 0 locus.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Global Indeterminacy.&lt;/strong&gt; The existence, given an initial state variable Mt, of multiple equilibrium trajectories — each satisfying all equilibrium conditions including transversality — that converge to different steady states or follow periodic paths. In the paper, global indeterminacy arises even when the system is locally determinate (e.g., in the saddle-source case): trajectories diverging from near the source steady state can converge to the saddle steady state along multiple paths, none of which is detectable by local linearization.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Periodic Orbit (Endogenous Cycle).&lt;/strong&gt; In the paper, a closed trajectory in the (Mt, Vt) phase plane that the economy follows indefinitely in perfect-foresight equilibrium without any exogenous shocks. Such orbits exist for κ ∈ (κ_Hopf, κ_SL), are stable if S &amp;lt; 0 and unstable if S &amp;gt; 0 (where S is a computable quantity defined in Equation A.13). Their existence demonstrates that business cycles can arise purely from internal forces — the demand externality and borrowing constraint — consistent with the view in Beaudry, Galizia, and Portier (2020).&lt;/p&gt;</description></item><item><title>Aggregation and the Estimation of Quality Change</title><link>https://macropaperwarehouse.com/papers/aggregation-and-the-estimation-of-quality-change/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/aggregation-and-the-estimation-of-quality-change/</guid><description>&lt;p&gt;Errico and Lashkari address two intertwined problems in the measurement of aggregate price indices: how to account for quality change and variety entry/exit when the demand system is not CES, and how to identify flexible demand systems from prices and market shares alone when supply and demand shocks are correlated. The paper makes a theoretical contribution and a methodological one, then applies both to the measurement of US import price inflation over 1989–2016.&lt;/p&gt;
&lt;p&gt;The theoretical contribution generalizes the unified CES price index of Redding and Weinstein (2020a) and the Feenstra (1994) variety correction to the full class of smooth, invertible demand systems. The key insight is that the contribution of quality change to the aggregate price index depends on heterogeneous cross-product elasticities of substitution, not a single scalar as in the CES case. For practical implementation, the paper specializes to the Homothetic with Aggregator (HA) family of demand systems — which includes Kimball (1995), CRESH (Hanoch, 1971), and HSA (Matsuyama and Ushchev, 2017) — showing that within this family cross-product elasticities collapse to product-level elasticities, dramatically reducing dimensionality. The resulting approximate price index (Proposition 2) weights each product by its love-of-variety index 1/(epsilon_it − 1), departing from the uniform CES weighting.&lt;/p&gt;
&lt;p&gt;The methodological contribution is a dynamic panel (DP) identification strategy that exploits the Markov structure of quality shocks. The paper assumes that innovations to product quality are mean-zero conditional on lagged prices. Under flexible pricing, firms maximize current-period profits without regard to future demand shocks, so lagged prices are valid instruments for current prices. This permits identification of rich demand systems without external cost instruments and without the conventional assumption of uncorrelated supply and demand shocks. The conventional Feenstra–Broda–Weinstein (FBW) approach imposes zero correlation between quality shocks and prices; the paper shows that when quality and marginal cost are positively correlated, FBW produces downward-biased elasticity estimates (endogeneity bias).&lt;/p&gt;
&lt;p&gt;The empirical application constructs a dataset covering 155 time-consistent 5-digit NAICS industries over 1989–2018, matching US customs import data with domestic production data and treating country-of-origin varieties as the unit of observation. The paper estimates both CES and Kimball demand systems using the DP approach and compares them to FBW estimates.&lt;/p&gt;
&lt;p&gt;Key quantitative findings: First, DP-estimated CES elasticities are larger on average than FBW estimates (weighted mean 5.99 vs. 4.62), confirming a downward endogeneity bias in conventional methods. Second, Kimball mean elasticities exceed CES estimates (weighted mean 3.11 for Kimball vs. 5.99 for CES at the industry level, but the Kimball distribution has a mean of 17.0 and median 4.70), reflecting a heterogeneity bias — CES understates the dispersion of elasticities and thereby understates the elasticity relevant for the base (domestic) product whose market share is declining. Third, quality improvements in imported goods reduced the US import price index by approximately 20.2 percentage points cumulatively (0.67 p.p. annually) under Kimball demand, and 15.9 percentage points cumulatively (0.53 p.p. annually) under CES demand, over 1989–2018. The headline figure cited in the abstract is approximately 0.7 p.p. annually. The aggregate import price index (price plus quality components combined) fell by 8.25 p.p. cumulatively under Kimball and 4.01 p.p. under CES, compared to a BEA PCE index increase of 57.8 p.p. over the same period. Sectorally, machinery and electrical equipment account for roughly 60% of total quality gains (~200 p.p. cumulative). By country, China accounts for approximately 35% of cumulative quality gains, with non-OECD countries collectively contributing ~59%, and China&amp;rsquo;s quality upgrading accelerating after WTO accession.&lt;/p&gt;
&lt;p&gt;Validation using US automobile market data (1980–2018) confirms the DP identification assumption: controlling for current product characteristics, future characteristics are uncorrelated with current prices. The DP approach produces elasticity estimates and quality change measures similar to those obtained using real exchange rate cost-shock instruments, and the Kimball demand closely matches mixed logit (BLP) estimates of both price elasticities and price indices. CES estimates exhibit a measurable downward heterogeneity bias in this validation setting, which the paper traces theoretically and empirically to a positive covariance between demand elasticities and price volatility across products.&lt;/p&gt;
&lt;p&gt;Scope conditions: results apply to homothetic (income-invariant) demand; nonhomothetic extensions are provided as a generalization (Proposition 4) but not the primary focus. The import price index measures the cost of imports conditional on given domestic consumption; it does not capture full consumption-side welfare effects including substitution away from domestic varieties.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the core theoretical result on price index measurement beyond CES?&lt;/strong&gt;
Proposition 1 shows that for any smooth, invertible demand system satisfying the connected substitute property, the change in the log aggregate price index can be approximated as a weighted sum of log price changes and log expenditure share changes, with the expenditure share changes premultiplied by the inverse of the matrix Psi_t capturing cross-product elasticities of substitution. In the CES special case this reduces to the scalar (1/(sigma−1)) weight of the Redding-Weinstein (2020a) CUPI. The key departure in general demand is that the weight applied to each product&amp;rsquo;s expenditure share change is heterogeneous and depends on the full matrix of cross-product substitutabilities, not a single constant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How does the HA (Homothetic with Aggregator) family simplify the theoretical results?&lt;/strong&gt;
For HA demand — which nests Kimball, CRESH, and HSA — Lemma 1 establishes that cross-product elasticities sigma_ij depend only on product-level elasticities epsilon_i through simple analytic formulas (e.g., epsilon_i * epsilon_j / epsilon-bar for HDIA), reducing the estimation problem from an N×N matrix to a vector of N scalars. Proposition 2 then gives an approximate price index in which each product&amp;rsquo;s expenditure share change is weighted by its love-of-variety index 1/(epsilon_it − 1), rather than a common CES scalar. This is the operative formula for the Kimball application.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the endogeneity bias in conventional elasticity estimation and how large is it?&lt;/strong&gt;
Conventional FBW methods assume supply and demand shocks are uncorrelated; when quality improvements are positively correlated with product prices (e.g., higher-quality goods command higher prices and also have higher marginal costs), FBW estimates are biased downward. The paper documents this: for CES demand, the DP-estimated weighted mean elasticity is 5.99 versus 4.62 under FBW, and for median estimates the DP value is 4.27 versus 2.58 under FBW, across 155 industries. The bias matters because underestimated elasticities imply underestimated quality changes and a smaller quality correction to the price index.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the heterogeneity bias and how does it differ from the endogeneity bias?&lt;/strong&gt;
Even after correcting for endogeneity, CES demand imposes a single elasticity per industry, ignoring the cross-product distribution. The paper shows that the CES estimate is an average that does not correctly capture the behavior of the base product (the domestic US variety) whose market share is declining. Because the domestic variety tends to have a lower elasticity than the import average, CES understates this product&amp;rsquo;s love-of-variety index and thereby understates the quality correction attributable to rising import shares. Theoretically and empirically (Appendix E.4), this bias is larger when demand elasticities covary positively with price volatility across products.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the dynamic panel identification assumption and why does it hold under flexible pricing?&lt;/strong&gt;
The paper assumes that quality shock innovations u_it are mean-zero conditional on lagged log prices: E[u_it | log p_it−1] = 0. Under flexible pricing, firms maximize current-period profits using current variables only; current prices are determined by current quality but are not chosen in anticipation of future quality shocks. Therefore lagged prices are uncorrelated with future quality innovations, making them valid instruments for current prices. This assumption is validated empirically in the automobile market: controlling for current product characteristics (horsepower, weight, fuel economy), future characteristics are not correlated with current prices.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What are the headline findings on quality change in US import prices?&lt;/strong&gt;
Under Kimball demand, quality improvements in imported goods reduced the US import price index by 20.2 percentage points cumulatively over 1989–2018, equivalent to 0.67 p.p. annually (the abstract rounds this to approximately 0.7 p.p. annually). Under CES demand, the quality contribution is 15.9 p.p. cumulatively (0.53 p.p. annually). The aggregate import price index combining price and quality changes fell by 8.25 p.p. under Kimball and 4.01 p.p. under CES over the same period. These figures imply that official import price statistics substantially overstate import price inflation by failing to account for quality improvements.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Which sectors and countries drive the quality gains?&lt;/strong&gt;
Machinery and electrical equipment account for approximately 60% of total cumulative quality gains, with roughly 200 p.p. cumulative quality improvement in that sector. Computer and peripheral equipment (NAICS 3341) is a notable contributor — the official import-to-producer price ratio shows a nearly five-fold increase between 1989 and 2018, but after quality adjustment this ratio reverses direction. By country of origin, China accounts for approximately 35% of cumulative quality gains; other non-OECD countries collectively contribute approximately 59%; OECD countries contribute approximately 7%. China&amp;rsquo;s quality upgrading is documented to accelerate following its WTO accession.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: Why does CES understate the quality correction relative to Kimball?&lt;/strong&gt;
The primary mechanism is that the US domestic variety — which serves as the numeraire for quality measurement — has a declining market share over the sample period. In Kimball demand, products with declining market shares are assigned lower elasticities (higher love-of-variety indices), amplifying the quality correction associated with import share gains. CES imposes a uniform elasticity, failing to capture this asymmetry. The paper shows that the key driver of the CES-Kimball gap in the import price index is CES underestimating the love-of-variety index of the base domestic product.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How is the identification approach validated in the automobile market?&lt;/strong&gt;
Using the Berry-Levinsohn-Pakes dataset extended by Grieco et al. (2024) for 1980–2018, the paper first verifies empirically that future product characteristics (horsepower, weight, fuel efficiency) are uncorrelated with current prices after controlling for current characteristics. It then compares DP estimates for both CES and Kimball demand against estimates obtained using real exchange rate (RER) variation as a cost-shock instrument, finding similar results in both cases. Finally, it compares Kimball and CES estimates against mixed logit (BLP) demand: Kimball closely matches BLP price elasticities and implied quality changes, while CES shows a downward heterogeneity bias.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What does the automobile market validation imply for the import price index methodology?&lt;/strong&gt;
Since Kimball demand matches the richer mixed logit demand in the auto setting — where product characteristics are observed — the validation provides evidence that Kimball demand serves as a good approximation to rich heterogeneous-elasticity models when characteristics are unavailable. The paper constructs price indices for the US auto industry based on mixed logit, mixed CES, Kimball, and standard CES, and shows that the Kimball index is closer to the mixed logit and mixed CES indices than is the standard CES index.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the paper handle product entry and exit?&lt;/strong&gt;
Proposition 3 generalizes Proposition 1 to accommodate product entry and exit. The expression includes a variety correction analogous to Feenstra (1994) but generalized to non-CES settings via the mean love-of-variety index of entering and exiting products. In the CES special case this reduces exactly to the Feenstra (1994) correction. In the empirical application to US imports, entry and exit of country-of-origin varieties within industries is a relevant margin given the expansion of trading partners over the sample.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How does the paper relate to Redding and Weinstein (2020a)?&lt;/strong&gt;
Redding and Weinstein (2020a) derive a price index formula under CES demand that accounts for taste shocks, applied to US retail scanner data where quality is constant at the barcode level. The present paper generalizes their CUPI formula beyond CES to general and HA demand systems, and extends their identification strategy to settings where demand changes partly reflect quality changes rather than pure taste shocks. The paper also shows that the CES assumption used in Redding-Weinstein may overstate the contribution of taste shocks to cost-of-living indices, since part of the expenditure share variation attributed to taste shocks under CES would be reassigned under heterogeneous-elasticity demand.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: Does the paper address welfare implications beyond the import price index?&lt;/strong&gt;
The paper explicitly notes that the import price index does not capture the full consumption-side welfare effects of rising imports, since gains from lower import prices may be partly offset by substitution away from domestic varieties. The paper also notes that it abstracts from nonhomotheticity (income effects), pointing to Jaravel and Lashkari (2021) for that extension. The primary welfare-relevant quantity reported is the quality-adjusted change in the cost of the imported goods basket, which is the import price index in the conventional sense.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Love-of-variety index&lt;/strong&gt;: For a product i, defined as 1/(epsilon_it − 1) where epsilon_it is the product-level demand elasticity in an HA demand system. It measures the welfare value of having access to that variety and serves as the weight applied to expenditure share changes in the generalized price index formula (Proposition 2). In the CES special case all products share the same love-of-variety index 1/(sigma−1).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Homothetic with Aggregator (HA) demand&lt;/strong&gt;: A family of income-invariant (homothetic) demand systems — including Kimball (1995), CRESH (Hanoch, 1971), and HSA (Matsuyama and Ushchev, 2017) — in which preferences are represented by a utility function with a specific aggregator structure. The key property exploited in the paper is that cross-product elasticities of substitution sigma_ij depend only on product-level elasticities epsilon_i through simple analytic formulas, reducing the dimensionality of the estimation problem from an N×N matrix to N scalars.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogeneity bias (in elasticity estimation)&lt;/strong&gt;: Downward bias in estimated elasticities of substitution arising from a positive correlation between product quality shocks and prices. When higher-quality products command higher prices and also have higher marginal costs, conventional methods (FBW) that assume zero correlation between supply and demand shocks will attribute part of the price variation to supply, underestimating how much demand responds to price. The paper documents this bias as the gap between DP and FBW estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity bias (in elasticity estimation)&lt;/strong&gt;: Additional downward bias in CES elasticity estimates relative to the mean of Kimball elasticities, arising from CES imposing a single elasticity per industry when the true elasticities are heterogeneous across products. The bias is stronger for differentiated products and is theoretically traced to a positive covariance between demand elasticities and price volatility across products.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dynamic panel (DP) identification&lt;/strong&gt;: The paper&amp;rsquo;s proposed identification strategy, which exploits the Markov structure of quality shocks. The key moment condition is that quality shock innovations are mean-zero conditional on lagged prices, which holds under flexible pricing. Lagged prices (and higher-order lags and nonlinear transformations) serve as instruments for current prices, permitting identification of demand parameters without external cost instruments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quality shock (phi_it)&lt;/strong&gt;: An unobserved product characteristic that shifts demand for product i at time t, defined through the utility function as a scalar multiplying the quantity consumed. Quality is identified from residual demand — the component of demand not explained by price — following the approach of Khandelwal (2010) and Hallak and Schott (2011). The paper models quality shocks as following a stationary AR(1) process with product-specific means.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Unified CES price index (CUPI)&lt;/strong&gt;: The price index formula of Redding and Weinstein (2020a) for CES demand, which decomposes the aggregate price change into a price component (expenditure-share-weighted price changes) and a quality/taste component proportional to (1/(sigma−1)) times expenditure share changes. The present paper&amp;rsquo;s Proposition 2 generalizes CUPI to HA demand by replacing the scalar 1/(sigma−1) with product-specific love-of-variety indices.&lt;/p&gt;</description></item><item><title>AI and task efficiency</title><link>https://macropaperwarehouse.com/papers/ai-and-task-efficiency/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/ai-and-task-efficiency/</guid><description>&lt;p&gt;AI can improve decisions, raise firm productivity, and accelerate human capital growth through its effect on signal quality in problem-solving tasks, but the consequences are heterogeneous across the skill distribution and depend on how AI changes the hierarchy within firms. This paper proposes a framework in which AI improves the accuracy of the signals that guide human decisions—individually and in groups—and derives implications for firm organization, wages, and productivity. It also examines preliminary evidence: a cross-sectional regression of changes in TFP growth (2024 versus 2022) on sectoral AI exposure (Eisfeldt et al. 2024) for Compustat firms yields a positive relationship, statistically significant at the 10% level at the 3-digit NAICS sector level and at the 5% level at the firm level, with a slope coefficient of 0.206 for the firm-level regression. The paper compares AI to earlier general purpose technologies (GPTs)—electricity and information technology—finding that if there is a productivity delay for AI it appears shorter than the five- and eight-year delays documented for electrification and IT.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-theoretical-framework-linking-ai-to-decisions-and-productivity"&gt;Q1. What is the theoretical framework linking AI to decisions and productivity?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper models several mechanisms through which AI may improve outcomes by raising the accuracy of signals that guide problem-solving: when signal accuracy rises, individuals and groups make better decisions, potentially enabling lower-level workers to handle more complex tasks and reducing the need for expensive higher-level solutions.&lt;/strong&gt; For example, if AI allows managers to understand problems faster, they can handle more problems at a given time, potentially reducing demand for specialized expert judgment at lower hierarchy levels. Alternatively, if AI allows lower-level workers (clerks, nurses) to handle tasks previously requiring specialists (partners, doctors), the demand for specialists may fall and the wage premium for top-tier workers may narrow. The direction of the effect depends on whether AI is a better complement to high-skill or to low-skill tasks.&lt;/p&gt;
&lt;h3 id="q2-what-does-the-empirical-evidence-show-about-ais-current-productivity-effects"&gt;Q2. What does the empirical evidence show about AI&amp;rsquo;s current productivity effects?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A cross-sectional regression using 5,009 Compustat firm-level observations for 66 three-digit NAICS sectors finds a positive and statistically significant relationship between sectoral AI exposure in 2022 (from Eisfeldt et al. 2024) and the change in annual TFP growth between 2024 and 2022, with a sector-level slope coefficient that is statistically significant at the 10% level.&lt;/strong&gt; The firm-level regression (including 3-digit NAICS fixed effects) yields a slope of 0.206 on AI exposure, significant at the 5% level (t-statistic 2.08), with R² = 0.20 and 1,996 observations. The relationship is absent when examining TFP growth levels in any individual year between 2019 and 2022, consistent with AI&amp;rsquo;s macroeconomic effects only becoming measurable after the release of GPT-4 in March 2023.&lt;/p&gt;
&lt;h3 id="q3-how-does-ai-compare-with-prior-general-purpose-technologies"&gt;Q3. How does AI compare with prior general purpose technologies?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper relates AI to the earlier GPT literature, noting that productivity growth tended to be lower at the start of both the electrification and IT eras—with delays of approximately five and eight years respectively before productivity gains became measurable—and that if there is a similar delay for AI it appears shorter based on the preliminary 2024 data.&lt;/strong&gt; This comparison suggests that AI may be a GPT with unusually rapid diffusion or a shorter learning curve, though the authors caution that the evidence is still preliminary and depends on the dating of AI&amp;rsquo;s &amp;ldquo;arrival.&amp;rdquo;&lt;/p&gt;
&lt;h3 id="q4-why-might-ai-effects-differ-across-the-hierarchy-within-firms"&gt;Q4. Why might AI effects differ across the hierarchy within firms?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;AI&amp;rsquo;s effect on a firm hierarchy depends on whether it complements or substitutes for skills at each level: if AI primarily helps managers (by speeding problem diagnosis), it may reduce demand for specialized lower-level workers; if it primarily helps clerks (by enabling them to handle more complex documents), it may reduce demand for partners while raising demand for lower-level staff.&lt;/strong&gt; The paper argues that the distributional consequences—whether AI raises or lowers wage dispersion—depend on this complementarity/substitutability pattern, which likely varies by industry, as illustrated by the contrasting cases of automotive assembly (AI may help managers but not line workers) and law firms (AI may help clerks handle more complex work).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;AI as signal accuracy improvement&lt;/strong&gt; : the paper&amp;rsquo;s framework for thinking about AI&amp;rsquo;s effect on decision quality: AI raises the precision of the signals that guide problem-solving, which leads to better individual and group decisions regardless of the specific mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;general purpose technology (GPT) delay&lt;/strong&gt; : the empirical phenomenon documented by Jovanovic and Rousseau (2005) in which productivity growth is lower at the start of a major GPT era before eventually accelerating; the paper examines whether AI exhibits the same pattern, finding that any delay appears shorter than for electrification (five years) or IT (eight years).&lt;/p&gt;</description></item><item><title>An endogenous gridpoint method for distributional dynamics</title><link>https://macropaperwarehouse.com/papers/an-endogenous-gridpoint-method-for-distributional-dynamics/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/an-endogenous-gridpoint-method-for-distributional-dynamics/</guid><description>&lt;p&gt;This paper introduces the Distributional Endogenous Gridpoint Method (DEGM), a novel numerical technique for solving the distributional dynamics that arise in heterogeneous agent macroeconomic models. The core problem is how to efficiently update the distribution of agents over the state space as the economy evolves. The dominant existing approach — the &amp;ldquo;lottery method&amp;rdquo; of Young (2010) — discretizes the state space and represents policy functions as lotteries over nearby gridpoints, producing a transition matrix that is linear in optimal policies. This linearity renders the lottery method incapable of capturing nonlinear effects in distributional dynamics, a limitation that becomes quantitatively significant for higher-order perturbation solutions.&lt;/p&gt;
&lt;p&gt;DEGM extends Carroll&amp;rsquo;s (2006) endogenous gridpoint method from individual optimization to the distributional level. Rather than discretizing the density and integrating forward, DEGM works directly on the cumulative distribution function (CDF). The key insight is that when the policy function is monotone — as savings functions typically are — the endogenous gridpoints generated by the policy function trace out exact points on the post-policy CDF without requiring integration. Specifically, if A*_{i,j} = a*(A_i, Y_j) are optimal asset choices from grid point A_i at income Y_j, then the CDF values at those endogenous points are known analytically as F_t(A_i | Y_j). An interpolant using shape-preserving splines constructed through these points allows evaluation of the updated CDF at any point without integration. The income transition step is handled separately via standard quadrature over the discretized income process.&lt;/p&gt;
&lt;p&gt;The paper demonstrates DEGM&amp;rsquo;s performance with two applications. First, in the Aiyagari (1994) economy, DEGM converges to the stationary equilibrium an order of magnitude faster than the lottery method in terms of gridpoints. At nk=40 gridpoints, the lottery method deviates from the benchmark capital stock by 1.72% and the wealth Gini by 2.24% (for nh=5), while DEGM deviates by only 0.09% and 0.12% respectively. Both methods converge to the same solution as the number of gridpoints increases, but DEGM reaches this limit far faster.&lt;/p&gt;
&lt;p&gt;Second, the authors introduce a Krusell-Smith style model with aggregate investment risk (capital depreciation shocks calibrated following Barro, 2006, as a 0.4% quarterly probability of 7.5% capital destruction causing a 10% annual GDP drop) as a new baseline for studying aggregate nonlinearities with household heterogeneity. This model overcomes the near-linearity of aggregate capital dynamics in the original Krusell-Smith specification. Using a third-order perturbation solution with DEGM, aggregate investment risk lowers the capital stock by 5 to 11 basis points and increases wealth inequality by up to 11 basis points relative to the non-stochastic steady state, depending on idiosyncratic income risk calibration. The lottery method systematically mispredicts these effects: it always predicts a decrease in wealth inequality in the presence of investment risk, while DEGM predicts an increase. At third order, the lottery method predicts wealth Gini changes of +2.0 bp (persistent calibration) and -149.7 bp (transitory calibration), while DEGM predicts +10.7 bp and +2.1 bp respectively.&lt;/p&gt;
&lt;p&gt;The mechanism for increased inequality under investment risk is heterogeneous: for less wealthy households the substitution effect dominates (they reduce saving more in response to risky returns), while for wealthy households the income effect is stronger and precautionary saving motives dominate. The lottery method, by making the distributional transition matrix linear in policies, zeros out the second derivative of the transition matrix with respect to the policy function, missing the term capturing how the density at the pre-image of each asset level is affected nonlinearly. DEGM&amp;rsquo;s cubic spline interpolant captures all nonlinearities up to third order, enabling economically meaningful results that qualitatively differ from lottery-method predictions on wealth inequality.&lt;/p&gt;
&lt;p&gt;Q: What is the fundamental numerical problem that DEGM solves?
A: Evolving the distribution of agents forward over time in heterogeneous agent models requires evaluating a Kolmogorov forward equation, which naively demands numerical integration. The lottery method avoids integration by discretizing the state space and expressing transitions as a linear matrix operation, but this forces the distributional dynamics to be linear in optimal policies. DEGM avoids integration by exploiting policy function monotonicity: the endogenous policy gridpoints are the interpolation nodes, so the CDF update requires only interpolation, not integration. This preserves nonlinear effects up to the order of the splines used.&lt;/p&gt;
&lt;p&gt;Q: How does DEGM handle the borrowing constraint and the resulting mass point?
A: Savings policy functions are typically weakly monotone: constant at the borrowing constraint for sufficiently poor households, then strictly monotone above a threshold. DEGM accommodates this by starting the endogenous grid at the EGM solution corresponding to the borrowing constraint (the threshold a_j above which the policy is strictly monotone), restoring strict monotonicity on the relevant domain. The mass point at the borrowing constraint is captured by evaluating F_t(a_j, Y_j). Echoes of the borrowing constraint diminish as the number of income states increases, and in practice 10 income gridpoints are sufficient to smooth them.&lt;/p&gt;
&lt;p&gt;Q: How much faster does DEGM converge relative to the lottery method for the stationary equilibrium?
A: In the Aiyagari economy with nk=40 asset gridpoints, the lottery method&amp;rsquo;s capital stock deviates from the benchmark by 1.72% and the wealth Gini by 2.24% (nh=5), while DEGM deviates by only 0.09% and 0.12% respectively — roughly a 20-fold improvement in accuracy for the same gridpoints. At nk=80, the lottery method still shows 0.56%/0.78% deviations while DEGM shows 0.03%/0.00%. Although for a fixed number of gridpoints the lottery method is faster in wall-clock time (0.35s vs 0.82s at nk=40, nh=20), DEGM is faster for a given level of accuracy because it requires far fewer gridpoints.&lt;/p&gt;
&lt;p&gt;Q: Why does the lottery method fail at higher-order perturbations?
A: The lottery method constructs its transition matrix as a piecewise linear function of the optimal policy a*, so its second derivative with respect to a* is zero. As a result, it misses the second term in the second-order derivative of the end-of-period CDF: the term involving the derivative of the density at the pre-image of each asset level times the squared linear policy effect. This missing nonlinearity becomes quantitatively important at second and third order. DEGM&amp;rsquo;s cubic hermitian spline interpolant captures all nonlinearities up to third order, allowing it to correctly represent how the distribution responds nonlinearly to aggregate shocks.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about the effect of aggregate investment risk on the capital stock and wealth inequality?
A: Using a third-order perturbation solution with DEGM, aggregate investment risk lowers the capital stock by 5 to 11 basis points from the non-stochastic steady state, depending on whether income risk is persistent or transitory (DEGM third-order: -4.7 bp persistent, -11.4 bp transitory). Wealth inequality increases by up to 11 basis points (DEGM third-order: +10.7 bp persistent, +2.1 bp transitory). The lottery method diverges dramatically at third order, predicting Gini changes of +2.0 bp and -149.7 bp for the persistent and transitory calibrations respectively, compared to DEGM&amp;rsquo;s +10.7 bp and +2.1 bp.&lt;/p&gt;
&lt;p&gt;Q: What is the mechanism through which aggregate investment risk increases wealth inequality?
A: The mechanism operates through heterogeneous saving responses across the wealth distribution. For less wealthy households, capital income is a small share of total income, so the substitution effect of risky returns dominates: higher investment risk reduces their incentive to save. For wealthy households, capital income is central, so the income effect is stronger and precautionary saving motives intensify. A capital depreciation shock upon realization compresses the wealth distribution, but the risk of such a shock increases inequality on average because it disproportionately reduces saving among poorer households.&lt;/p&gt;
&lt;p&gt;Q: How do the authors extend DEGM to handle aggregate risk and higher-order perturbations?
A: The authors follow Reiter (2009) in including the distribution and value functions in the state space, defining a nonlinear difference equation over these objects. Higher-order perturbation of this system proceeds using the algorithms of Andreasen et al. (2018) and Levintal (2017), with second-order terms solved via a generalized Sylvester equation using Kim et al.&amp;rsquo;s (2008) doubling algorithm. The implementation handles up to 3,200 variables at second order and 220 variables at third order. For the second-order solution, the Bayer-Luetticke (2020) state-space reduction and its refinement in Bayer et al. (2024) yield results identical to the full unreduced system.&lt;/p&gt;
&lt;p&gt;Q: What is the state-space reduction procedure and how much does it compress the system?
A: The full system uses 402 states and 412 controls (persistent calibration). A copula representation of the distribution reduces this to 213 states and 412 controls; adding DCT compression of the value function gives 213 states and 98 controls; further adding a factor representation from the first-order solution yields 111 states and 98 controls — a 75% reduction. The R-squared-like IRF statistic remains 1.00 across all reductions, and ergodic moments are identical (capital: 25.54, Gini: 0.61 for the persistent calibration).&lt;/p&gt;
&lt;p&gt;Q: Does DEGM produce different first-order impulse responses than the lottery method?
A: For first-order perturbations, DEGM and the lottery method converge to the same solution as the number of gridpoints increases, but DEGM converges faster. For the first-order dynamics of the wealth distribution (wealth Gini IRFs), DEGM reaches convergence with nk=40 gridpoints while the lottery method requires nk=160. For aggregate capital stock IRFs, both methods converge quickly at first order. Quantitative differences become significant only at second and higher orders.&lt;/p&gt;
&lt;p&gt;Q: What calibration is used for the investment risk model?
A: Capital depreciation deviates from its steady-state value by a shock with second moment sigma_delta = 0.005 and third moment tau_delta = 0.012. This corresponds to a 0.4% quarterly probability that a disaster destroys 7.5% of the capital stock and causes a 10% drop in annual GDP, consistent with the evidence in Barro (2006). The model is solved under both a persistent income calibration (beta=0.98, rho=0.98, sigma_epsilon=0.14, implied Gini=0.66) and a transitory income calibration (beta=0.99, rho=0.88, sigma_epsilon=0.18, implied Gini=0.42).&lt;/p&gt;
&lt;p&gt;Distributional Endogenous Gridpoint Method (DEGM): A numerical method for evolving the joint CDF of agents over the state space by constructing an interpolant at endogenous gridpoints A*_{i,j} = a*(A_i, Y_j) — the optimal policy values — at which CDF values are known analytically as F_t(A_i | Y_j), thus updating the distribution through interpolation rather than integration and preserving nonlinearities up to the order of the spline.&lt;/p&gt;
&lt;p&gt;Lottery Method (LM): Young&amp;rsquo;s (2010) standard technique that replaces the continuous distribution with a discrete counterpart and represents optimal policy functions as probability weights over nearby gridpoints, yielding a single transition matrix A* such that f_{t+1} = f_t * A*. The transition matrix is linear in optimal policies, which zeroes out the second derivative of the distributional dynamics with respect to policies and causes systematic misprediction of distributional dynamics under higher-order perturbation.&lt;/p&gt;
&lt;p&gt;Kolmogorov Forward Equation (Distributional Dynamics): The law of motion for the joint CDF F_t(a, y) describing how the distribution of households over assets and income evolves given optimal policies and the income transition process. In DEGM, this equation is split into a sub-period for asset choices (where endogenous gridpoints allow integration-free updating) and a sub-period for income transitions (handled by quadrature over the discretized income process).&lt;/p&gt;
&lt;p&gt;Higher-Order Perturbation Solution: A Taylor expansion of the model&amp;rsquo;s nonlinear equilibrium conditions around the non-stochastic steady state beyond first order. Second-order solutions capture precautionary motives and mean deviations from the steady state; third-order solutions additionally capture asymmetric effects of shocks, requiring DEGM&amp;rsquo;s nonlinear distributional representation to produce accurate results.&lt;/p&gt;
&lt;p&gt;Aggregate Investment Risk (Capital Depreciation Shocks): Shocks to the aggregate capital depreciation rate calibrated following Barro (2006) as a 0.4% quarterly probability of a disaster that destroys 7.5% of the capital stock and causes a 10% annual GDP drop. Proposed as a replacement for near-linear Krusell-Smith aggregate productivity shocks to generate genuine nonlinearities in aggregate capital dynamics while remaining equally parsimonious.&lt;/p&gt;
&lt;p&gt;State-Space Reduction: A sequence of compression techniques — copula representation of the wealth distribution, discrete cosine transform (DCT) compression of the value function, and factor representation from the first-order solution — that reduce the Reiter (2009) system from 402 states and 412 controls to 111 states and 98 controls (a 75% reduction) with no measurable loss of accuracy in impulse responses or ergodic moments.&lt;/p&gt;
&lt;p&gt;Shape-Preserving Interpolation: Interpolation methods (linear spline or piecewise cubic hermitian splines) that maintain the monotonicity of the CDF when constructing the interpolant from endogenous gridpoints. Cubic hermitian splines additionally preserve differentiability, making the distributional dynamics smooth enough for third-order perturbation and capturing all nonlinear effects that the lottery method misses.&lt;/p&gt;</description></item><item><title>An Equilibrium Analysis of the Effects of Neighborhood-Based Interventions on Children</title><link>https://macropaperwarehouse.com/papers/an-equilibrium-analysis-of-the-effects-of-neighborhood-based-interventions-on-children/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/an-equilibrium-analysis-of-the-effects-of-neighborhood-based-interventions-on-children/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; How should governments design neighborhood-based policies to improve long-run outcomes for children, once one accounts for general equilibrium (GE) forces—endogenous rents, neighborhood quality, wages, and distortionary taxation—that small-scale experimental studies cannot identify?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The paper embeds neighborhood effects into a quantitative, heterogeneous-agent overlapping-generations (OLG) model with endogenous location choice and child skill development. The economy has three building blocks: (1) a dynastic life-cycle structure in which parents choose a neighborhood (from two options: a disadvantaged n=1 and an advantaged n=2) and allocate time to child development, with child skills produced by a nested CES aggregator combining parental time and neighborhood quality (proxied by per-capita income in the tract); (2) a GE Aiyagari incomplete-markets framework with endogenous labor supply, wage uncertainty, and progressive labor taxation; and (3) a government that finances housing vouchers or place-based wage subsidies by adjusting the labor income tax parameter, with all additional net expenses fully offset by tax revenue. Housing supply is upward-sloping (elasticity 1.75, from Saiz 2010), so rents are endogenous.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and calibration.&lt;/strong&gt; The model is estimated by simulated method of moments to match U.S. data from the 2000s, drawing on the PSID, NLSY, ATUS, the 2012–2016 ACS, and the Opportunity Atlas (Chetty et al. 2018). Neighborhoods are mapped to Census tracts divided into bottom-10-percent and top-90-percent median household income groups within each commuting zone. Key targeted moments include the income gap between neighborhoods (108 percent higher mean individual income in n=2), the 30 percent higher incomes for children from low-income families raised in the better neighborhood, and a 32 percent gap in weekly parental time with children across neighborhoods.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Validation.&lt;/strong&gt; Before policy counterfactuals, the calibrated model is validated against two bodies of reduced-form evidence. First, a simulated small-scale, single-generation, partial-equilibrium voucher experiment generates 23 percent higher income for children—close to the 31 percent MTO experimental estimate from Chetty et al. (2016), with the difference largely explained by a smaller poverty-rate contrast (18 vs. 22 percentage points) in the simulation. Second, a simulated 20 percent place-based wage subsidy generates 17–21 percent earnings gains for adult residents of n=1, consistent with Busso et al.&amp;rsquo;s (2013) quasi-experimental EZ estimates of 17–24 percent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings — housing vouchers.&lt;/strong&gt; The welfare-maximizing voucher program features a 100 percent subsidy rate, targets households with children and wages below the 80th percentile (fourth quintile), and is financed by progressive labor taxes. In the long-run steady state this policy raises 12.5 percent more children in the advantaged neighborhood, increases labor productivity by 1.1 percent, reduces income inequality (variance of log after-tax lifetime earnings) by 6.3 percent—comparable in magnitude to the Sweden–U.S. after-tax inequality gap—and raises upward mobility by 27.7 percent (roughly half its standard deviation across U.S. Census tracts). The average marginal tax rate must increase by 15.7 percent to fund the program. Despite this, long-run welfare rises by 3.4 percent in consumption equivalence units. A decomposition shows that intergenerational dynamics add 11.5 percentage points to welfare (relative to a short-run, single-generation scenario), while taxation subtracts 10.2 percentage points, and rent plus neighborhood-quality effects together subtract only 1.4 percentage points—leaving the net long-run GE gain similar to the short-run partial-equilibrium gain of 3.5 percent. Crucially, non-targeting children generates welfare losses of 5.0 percent, confirming that restriction to households with children is essential.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings — place-based wage subsidies.&lt;/strong&gt; A 12 percent wage subsidy to workers in the disadvantaged neighborhood yields the highest steady-state welfare gain of 0.7 percent. This is approximately one-fifth of the gain achievable with the optimal voucher. The subsidy induces substantial resorting toward n=1, reducing the share of children in n=2 by 6.7 percent while raising neighborhood quality in n=1 by 19.7 percent. Income inequality falls by 8.7 percent and upward mobility rises by 20.4 percent. However, in a short-run partial-equilibrium setup, the wage subsidy has a negative welfare effect of −1.0 percent because it draws parents (and their children) into the disadvantaged area; the positive net effect only emerges through long-run intergenerational channels (+2.5 percentage points) and equilibrium neighborhood-quality adjustments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Political economy.&lt;/strong&gt; Because voucher gains are concentrated among young cohorts (those aged 16–43 at introduction), only 33 percent of incumbent adults would rationally vote for the housing voucher program. In contrast, the place-based wage subsidy provides positive average welfare gains for all age cohorts alive at introduction, yielding estimated majority support from over 63 percent of adults. This creates a fundamental political economy tradeoff: the policy with the larger long-run social gains lacks majority democratic support, while the policy with broader support delivers smaller long-run gains.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-are-the-two-market-frictions-that-justify-government-intervention-in-the-model"&gt;Q1. What are the two market frictions that justify government intervention in the model?&lt;/h3&gt;
&lt;p&gt;A1: The first friction is the absence of intergenerational borrowing markets: parents cannot borrow against their child&amp;rsquo;s future income, which limits the parent&amp;rsquo;s willingness to pay the higher rent in n=2 to give their child a developmental advantage. Housing vouchers act as a tax-financed substitute for this missing contract by paying the rent premium and recovering the cost through taxes on the high-earning adults the children become. The second friction is a neighborhood externality: individuals do not internalize the effect of their own income on the neighborhood quality experienced by neighbors&amp;rsquo; children. Place-based wage subsidies partially correct this externality by subsidizing work in the disadvantaged area, raising local income per capita and thereby improving the neighborhood quality index for all children resident there.&lt;/p&gt;
&lt;h3 id="q2-how-is-neighborhood-quality-defined-and-modeled-and-why-is-this-specification-chosen"&gt;Q2. How is neighborhood quality defined and modeled, and why is this specification chosen?&lt;/h3&gt;
&lt;p&gt;A2: Neighborhood quality sn is defined as total income per capita (the sum of labor and capital income) for all residents of neighborhood n, including non-workers. This specification is intended to capture multiple mechanisms: school quality (which depends on local tax bases), role-model effects from productive adults, and social organization effects through adult supervision of children. The formulation includes retired and non-working residents, which means the arrival of children mechanically reduces neighborhood quality per capita in the model, partially capturing a crowding channel. Formally, the neighborhood spillover function takes the power form f(sn) = A * sn^ζ, where ζ governs the elasticity of child development to neighborhood quality.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-paper-validate-the-models-key-mechanism--the-neighborhood-effect-on-children"&gt;Q3. How does the paper validate the model&amp;rsquo;s key mechanism — the neighborhood effect on children?&lt;/h3&gt;
&lt;p&gt;A3: The validation mimics the MTO RCT within the calibrated model: the government provides a 100 percent rent voucher usable only in n=2 to households in n=1 with incomes below the 10th percentile, holding prices and neighborhood qualities fixed (as in a small-scale experiment). The model generates 25 percent voucher take-up and a 23 percent increase in children&amp;rsquo;s income in their late 20s. This compares to the experimental MTO estimate of approximately 31 percent. The paper attributes most of the gap to the smaller poverty-rate contrast in the simulation (18 percentage points) relative to MTO (22 percentage points), and shows that plotting the simulated result against the site-specific MTO estimates in a scatterplot of child income gains against neighborhood poverty reductions places the model prediction on the fitted line through the experimental data.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-quantitative-role-of-long-run-intergenerational-dynamics-in-the-voucher-program-relative-to-other-ge-channels"&gt;Q4. What is the quantitative role of long-run intergenerational dynamics in the voucher program, relative to other GE channels?&lt;/h3&gt;
&lt;p&gt;A4: The decomposition in Table 5 isolates four GE channels. Starting from a short-run partial-equilibrium welfare gain of 3.5 percent (for the children of a single treated generation), allowing the economy to operate for the long run while holding prices and taxes fixed raises welfare to 15.0 percent — an increase of 11.5 percentage points — because improved skills in one generation create higher-skilled, higher-income parents who invest more in the next generation. Introducing housing market price adjustments (rents rise by 3.9 percent in n=2) reduces welfare by only 0.6 percentage points. Allowing neighborhood quality to adjust (quality in n=2 falls by 4 percent as lower-income families move in) reduces welfare by an additional 0.8 percentage points. Adding full taxation to balance the government budget reduces welfare by 10.2 percentage points, from 13.6 to 3.4 percent. The four channels nearly cancel, leaving the long-run GE steady-state gain close to the short-run single-generation gain.&lt;/p&gt;
&lt;h3 id="q5-why-does-the-optimal-voucher-program-require-targeting-to-families-with-children-and-what-happens-without-this-restriction"&gt;Q5. Why does the optimal voucher program require targeting to families with children, and what happens without this restriction?&lt;/h3&gt;
&lt;p&gt;A5: When the voucher is extended to all households regardless of children (Column 6 of Table 4), nearly 82.6 percent of the population receives a subsidy, pushing almost everyone to n=2. Rents in n=2 rise by 5.3 percent. To finance this much broader program, the average marginal tax rate must increase by 44 percent, far exceeding the 15.7 percent required for the children-targeted program. The large tax increase suppresses labor supply and income, which reduces neighborhood quality in n=2 by 11.6 percent. The net effect is a welfare loss of 5.0 percent. The intuition is that the benefit of the voucher program flows primarily through child skill development, so subsidizing adults without children is fiscally expensive without producing the intergenerational gains that justify the cost.&lt;/p&gt;
&lt;h3 id="q6-what-drives-the-difference-in-long-run-welfare-gains-between-vouchers-34-percent-and-place-based-wage-subsidies-07-percent"&gt;Q6. What drives the difference in long-run welfare gains between vouchers (3.4 percent) and place-based wage subsidies (0.7 percent)?&lt;/h3&gt;
&lt;p&gt;A6: The primary channel is labor productivity. The optimal voucher program raises labor productivity by 1.1 percent by increasing the average neighborhood quality to which children are exposed by 1.2 percent. The wage subsidy raises productivity by only 0.2 percent because it induces resorting toward the disadvantaged neighborhood, meaning children&amp;rsquo;s average neighborhood quality actually decreases by 0.2 percent despite large improvements in n=1&amp;rsquo;s quality (up 19.7 percent), since fewer children reside in n=1 after the subsidy draws their parents there. Inequality reduction is not the source of the gap: the wage subsidy actually reduces inequality more (8.7–8.9 percent) than the voucher (6.3 percent), but this inequality effect does not translate into larger aggregate welfare because productivity effects dominate.&lt;/p&gt;
&lt;h3 id="q7-how-does-the-wage-subsidy-produce-positive-long-run-welfare-when-it-generates-negative-welfare-in-the-short-run"&gt;Q7. How does the wage subsidy produce positive long-run welfare when it generates negative welfare in the short run?&lt;/h3&gt;
&lt;p&gt;A7: In the short run, the wage subsidy draws parents into the disadvantaged neighborhood to exploit higher wages, which reduces the share of children in the advantaged neighborhood n=2 and lowers children&amp;rsquo;s late-life productivity (welfare of −1.0 percent for treated children in the single-generation scenario). Two long-run channels flip the sign. First, the subsidy is permanent, so children themselves receive it as adults, providing a direct wage income benefit. Second, the sustained presence of higher-income workers in n=1 raises neighborhood quality there durably (by 19.7 percent at the steady state), which benefits the children who reside in n=1. Together these intergenerational effects add 2.5 percentage points to welfare, while taxation costs reduce it by only 1.4 percentage points, yielding a net gain of 0.7 percent.&lt;/p&gt;
&lt;h3 id="q8-what-determines-the-political-economy-divide-between-the-two-policies"&gt;Q8. What determines the political economy divide between the two policies?&lt;/h3&gt;
&lt;p&gt;A8: For the housing voucher, welfare gains are concentrated among younger incumbent adults (ages 16–43), particularly those who are about to have or already have children, while older adults tend to lose because they face higher taxes without benefiting from improved neighborhood quality for their (now independent) children. This concentration implies only 33 percent of incumbent adults would support the voucher under the model&amp;rsquo;s welfare metric. For the place-based wage subsidy, average welfare gains are positive for every age cohort alive at introduction (though larger for younger cohorts), because the wage subsidy raises incomes for workers in n=1 immediately and benefits from equilibrium rent declines in n=1 that allow all residents to benefit. Over 63 percent of adults would support the wage subsidy. The paper notes that if the government could borrow to initially finance the voucher program and pay for it later (as in Daruich 2020 for early childhood programs), majority support for the voucher could potentially be achieved.&lt;/p&gt;
&lt;h3 id="q9-how-sensitive-are-the-welfare-results-to-the-key-calibrated-parameters"&gt;Q9. How sensitive are the welfare results to the key calibrated parameters?&lt;/h3&gt;
&lt;p&gt;A9: The sensitivity analysis (Table 9, following Andrews et al. 2017) shows that individual parameters would need to change substantially to overturn the conclusion that vouchers generate larger steady-state welfare gains than wage subsidies. For example, the altruism parameter β̃ would need to increase by 22 percent to eliminate the voucher welfare gain, which would require average parental transfers to rise to 198 percent of income — far from the empirical target of 125.4 percent. Using the more conservative tract-level housing supply elasticity from Baum-Snow and Han (2021) of 0.3–0.4 (about 80 percent below the baseline Saiz 2010 estimate of 1.75) would reduce the voucher welfare gain from 3.37 to approximately 2.57 percent, not reversing the qualitative conclusion. The parameters with the largest influence on welfare gains are the labor disutility parameter µ and the altruism parameter β̃; the housing supply elasticity matters more for the voucher than the wage subsidy because easier housing supply accommodates growth in n=2 without displacement under the voucher.&lt;/p&gt;
&lt;h3 id="q10-what-does-the-transition-path-of-the-voucher-program-look-like-and-why-do-welfare-gains-initially-dip-before-recovering"&gt;Q10. What does the transition path of the voucher program look like, and why do welfare gains initially dip before recovering?&lt;/h3&gt;
&lt;p&gt;A10: When the voucher is unexpectedly introduced, the first newborn cohort gains approximately 4 percent welfare, but gains for subsequent cohorts initially dip to around 3 percent before stabilizing at 3.4 percent by the 20th post-introduction cohort. The dip occurs because moving costs slow resorting: immediately after introduction, rents in n=2 begin rising and neighborhood quality there begins falling as low-income families move in, but the capital stock adjustment (which would counteract these effects by raising GDP) lags the resorting. The rebound comes as capital accumulates in n=2 over time and as intergenerational productivity gains build through successive cohorts of better-skilled parents. Labor productivity jumps noticeably for the first cohort born to parents who received the voucher (approximately 28 years after introduction) and again for the first cohort born to grandparents who received it, visibly demonstrating the intergenerational mechanism. In contrast, the wage subsidy&amp;rsquo;s welfare gains are approximately constant at 0.7 percent across all cohorts because the key channels (neighborhood quality improvement in n=1 and wage gains) materialize rapidly and remain stable throughout the transition.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Neighborhood quality (sn):&lt;/strong&gt; In this paper, neighborhood quality is not school quality or amenities in a generic sense but is explicitly defined as total income per capita — the sum of labor income and capital income — for all residents of neighborhood n, including non-workers. This endogenous measure rises when higher-income or more productive residents move in and falls when lower-income residents or additional children arrive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intergenerational borrowing constraint:&lt;/strong&gt; The inability of parents to borrow against their child&amp;rsquo;s future income, modeled as a non-negativity constraint on the monetary transfer from parent to child (transfer ≥ 0). This is the paper&amp;rsquo;s first key market friction: without it, a poor parent who moved to a better neighborhood would smooth consumption across generations by having the high-earning child compensate the parent. The constraint prevents this, reducing parental investment below the socially efficient level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption equivalence (veil of ignorance):&lt;/strong&gt; The welfare metric used throughout the policy analysis. It is defined as the percentage change in consumption that would make a newborn individual indifferent between the pre-policy and post-policy steady states, computed before knowing their position in the skill or income distribution. This is the paper&amp;rsquo;s measure of long-run steady-state welfare.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Parental investment aggregator (CES):&lt;/strong&gt; A nested constant-elasticity-of-substitution function that determines how parental time τ and neighborhood quality sn combine to form the effective investment input I into child skill development: I = Ā[αI f(sn)^γ + (1 − αI)τ^γ]^(1/γ). The elasticity parameter 1/(1 − γ), estimated at 0.41, governs the degree of complementarity between time and neighborhood quality; a lower elasticity (γ = −1.43) implies the two inputs are complements, so parents with children in better neighborhoods also spend more time with them.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Place-based wage subsidy:&lt;/strong&gt; A neighborhood-specific wage premium (denoted w̃s) paid to all workers who both live and work in the disadvantaged neighborhood n=1, raising their effective wage to w1 = (1 + w̃s)w2. This policy targets the neighborhood externality by increasing the income of residents in n=1, which raises neighborhood quality and provides an incentive for higher-skilled workers to relocate to (or remain in) the disadvantaged area.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Upward mobility:&lt;/strong&gt; Measured in this paper as the probability that a child born to parents in the bottom 20 percent of the income distribution reaches the top 20 percent of the income distribution during the working stage of their own life. This is distinct from mean income rank measures; it specifically tracks cross-quintile transitions in the model&amp;rsquo;s stationary distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Equilibrium decomposition:&lt;/strong&gt; A simulation-based method in which GE channels are progressively activated. Starting from a short-run, partial-equilibrium, single-generation baseline (analogous to an RCT), the authors sequentially allow: (i) long-run intergenerational dynamics while holding prices fixed; (ii) housing market price adjustments; (iii) neighborhood quality adjustments; (iv) tax and production-price adjustments. Each step&amp;rsquo;s change in outcomes identifies the quantitative contribution of that specific channel.&lt;/p&gt;</description></item><item><title>Anatomy of the Phillips Curve: Micro Evidence and Macro Implications</title><link>https://macropaperwarehouse.com/papers/anatomy-of-the-phillips-curve-micro-evidence-and-macro-implications/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/anatomy-of-the-phillips-curve-micro-evidence-and-macro-implications/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper addresses a fundamental puzzle in macroeconomics: why do estimates of the New Keynesian Phillips curve (NKPC) slope differ sharply depending on whether real marginal cost or the output gap is used as the real activity variable? The conventional, output gap-based NKPC yields very flat slope estimates (e.g., 0.006 to 0.024 in Hazell et al. 2022 and Rotemberg and Woodford 1997), which has led to the widespread view that the Phillips curve is &amp;ldquo;flat,&amp;rdquo; at least during the pre-pandemic period. The authors argue that this view conflates two distinct structural relationships: the elasticity of inflation with respect to real marginal cost, and the elasticity of marginal cost with respect to the output gap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors assemble a unique quarterly micro-level dataset covering 4,598 manufacturing firms in Belgium over 84 quarters (1999:Q1–2019:Q4), totaling 132,915 observations. The dataset combines product-level domestic prices and quantities from the PRODCOM administrative database, customs data on foreign competitors&amp;rsquo; prices, and firms&amp;rsquo; variable production costs (labor costs from social security declarations plus intermediate input costs from VAT declarations). Intermediate inputs account for approximately 75 percent of total variable costs on average and are the most volatile cost component (within-firm coefficient of variation 1.77, versus 0.77 for labor costs).&lt;/p&gt;
&lt;p&gt;Their estimation strategy follows a &amp;ldquo;bottom-up&amp;rdquo; approach. Starting from a theoretical framework with heterogeneous firms subject to Calvo (1983) nominal rigidities and strategic complementarities in price setting (imperfect competition including dynamic oligopoly and Kimball demand), they derive a forward-looking dynamic pass-through regression linking a firm&amp;rsquo;s current price to discounted present values of its own marginal costs and competitors&amp;rsquo; prices, plus a lagged price level that serves as an error-correction term. This is Model A; robustness variants include Model B (absorbing competitor prices via industry-by-time fixed effects), Model C (imposing an AR(1) process for marginal cost), and Model A-U (unrestricted lagged-price coefficient).&lt;/p&gt;
&lt;p&gt;The structural parameters governing the NKPC slope — the degree of nominal rigidity (θ) and the strength of strategic complementarities (Ω) — are estimated jointly via GMM. Instruments for marginal cost are four-quarter-lagged firm-level total factor productivity (TFPQ), and instruments for competitors&amp;rsquo; prices exploit variation in EU-area export prices to third-country destinations and bilateral exchange rates between non-EU competitor currencies and the Euro. Sector-by-time fixed effects and firm fixed effects absorb confounding trends, shifting trend inflation, and permanent markups.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The baseline estimate (Model A) yields θ = 0.711 (SE 0.014), implying that prices remain fixed for approximately three to four quarters on average, consistent with Nakamura and Steinsson (2008) Belgian PPI data (0.72). The strategic complementarity parameter is Ω = 0.570 (SE 0.059), indicating that competitor price dynamics reduce the pass-through of own marginal cost shocks by approximately half relative to the no-complementarities benchmark.&lt;/p&gt;
&lt;p&gt;These structural estimates imply a slope of the marginal cost-based NKPC of λ = 0.052 (SE 0.007), tightly estimated and robust across specifications: λ = 0.077 in Model B, λ = 0.069 in Model C, and λ = 0.056 in the unrestricted Model A-U. This slope is two to ten times larger than existing estimates of the conventional output gap-based NKPC slope (κ ≈ 0.024, Rotemberg and Woodford 1997; κ ≈ 0.006, Hazell et al. 2022).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reconciling the High Cost-Based Slope with the Flat Output-Based Slope&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper shows that the output-based slope κ equals the product of the cost-based slope λ and the output elasticity of marginal cost σ_y: κ = λ · σ_y. Using Bartik-style instruments based on high-frequency ECB monetary policy surprises interacted with industry-level sensitivities, the authors estimate σ_y using two models. Model D yields σ_y = 0.406 and κ = 0.021; Model E (directly regressing changes in marginal cost on changes in output) yields σ_y = 0.112 and κ = 0.006. These estimates are consistent with, and overlap with, Rotemberg and Woodford (1997) and Hazell et al. (2022) during the pre-pandemic sample period. The low elasticity of marginal cost to output is attributed to near-constant short-run returns to scale at the firm level and wage rigidity that mutes general equilibrium effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Aggregate Inflation Dynamics&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Feeding an aggregate marginal cost index (constructed as a Törnqvist-weighted average of firm-level marginal costs) into the model-implied inflation expression produces a series that tracks Belgian manufacturing PPI inflation well: marginal cost fluctuations alone account for approximately 70 percent of inflation variation (R² = 0.68, correlation 0.8), without appealing to unobservable cost-push shocks or inflation lags.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model Validation via Supply Shocks&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A validation exercise using identified oil shocks (Känzig 2021 — measured as unexpected OPEC-day movements in oil futures prices) confirms the model. A one-standard-deviation shock to oil prices (a 15.7 percent increase in Brent crude) raises firms&amp;rsquo; real marginal costs by approximately 1.5 to 3 percent within the first three quarters, before reverting. The price response peaks at approximately 3 percent after six quarters, consistent with nominal rigidities generating a delayed but persistent response. Impulse-response matching yields λ_IRF = 0.042 (SE 0.005), within the confidence bands of the micro-level estimate λ = 0.052, validating the bottom-up approach.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;All estimates are drawn from Belgian manufacturing firms over 1999–2019, a period of moderate inflation during which Calvo pricing provides a good approximation of firm behavior. The authors note that the elasticity of marginal cost to output may be time-varying and nonlinear, and that during large aggregate shocks (such as the post-pandemic inflation surge), both the frequency of price adjustment and the sensitivity of marginal cost to output can rise substantially, requiring state-dependent pricing models (addressed in a companion paper, Gagliardone et al. 2025).&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-primitive-formulation-of-the-nkpc-and-how-does-it-differ-from-the-conventional-formulation"&gt;Q1. What is the primitive formulation of the NKPC, and how does it differ from the conventional formulation?&lt;/h3&gt;
&lt;p&gt;A1: The primitive NKPC features real marginal cost (in log-deviation from its steady state) as the real activity variable: π_t = λ·mc_t + β·E_t{π_{t+1}} + u_t, where λ is the slope depending on nominal rigidities and strategic complementarities. The conventional formulation uses the output gap (or unemployment gap) as a proxy for marginal cost, which is valid only under specific conditions including perfectly flexible wages. When those conditions fail, the output gap is a poor proxy for marginal cost, typically leading to downward bias in slope estimates. Even when a proportionality holds, the output-based slope κ equals λ multiplied by σ_y (the output elasticity of marginal cost), so the two slopes carry different economic content.&lt;/p&gt;
&lt;h3 id="q2-what-structural-parameters-govern-the-slope-of-the-cost-based-nkpc-and-what-is-the-formula"&gt;Q2. What structural parameters govern the slope of the cost-based NKPC, and what is the formula?&lt;/h3&gt;
&lt;p&gt;A2: The slope is λ = &lt;a href="1%e2%88%92%ce%a9"&gt;(1−θ)(1−βθ)/θ&lt;/a&gt;, where θ is the Calvo probability of price non-adjustment (capturing nominal rigidity) and Ω = Γ/(1+Γ) is the strategic complementarities parameter derived from the markup elasticity Γ with respect to relative prices. High nominal rigidity (high θ) flattens the slope by making individual price adjustments less frequent; strong strategic complementarities (high Ω) flatten it further because firms mute their price response to marginal cost in order to avoid deviating from competitors. The discount factor β is calibrated at 0.99 for quarterly data.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-dynamic-pass-through-regression-differ-from-the-static-long-run-pass-through-regressions-used-in-prior-literature"&gt;Q3. How does the dynamic pass-through regression differ from the static (long-run) pass-through regressions used in prior literature?&lt;/h3&gt;
&lt;p&gt;A3: The dynamic pass-through regression (Model A) includes the firm&amp;rsquo;s lagged price as a regressor, which functions as an error-correction term controlling for persistent deviations between the price and the optimal reset price. Failing to include this term with quarterly data leads to omitted variable bias of magnitude −θ·Var(Δp_ft), since the cointegration error is autocorrelated with coefficient θ. Static pass-through regressions (as in Amiti, Itskhoki and Konings 2019 using annual data) are appropriate only when nominal rigidities can be ignored (θ ≈ 0); with quarterly data and θ ≈ 0.711, the orthogonality condition of the static model fails and the dynamic framework is necessary.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-baseline-estimates-of-the-structural-parameters-and-how-robust-are-they"&gt;Q4. What are the baseline estimates of the structural parameters, and how robust are they?&lt;/h3&gt;
&lt;p&gt;A4: The baseline Model A yields θ = 0.711 (SE 0.014) and Ω = 0.570 (SE 0.059), implying prices fixed for approximately three to four quarters and competitor-price influence roughly equal to own marginal cost influence. The implied NKPC slope is λ = 0.052 (SE 0.007). Robustness checks across six specifications (Models B, C, A-U, variable SR-RTS controls, Translog TFPQ, eight-quarter-lagged instrument) yield λ in the range 0.044 to 0.077, with all estimates statistically significant and within each other&amp;rsquo;s confidence bands. The unrestricted model (A-U) cannot reject the restriction Ϛ = θ on the lagged-price coefficient (p-value 0.90).&lt;/p&gt;
&lt;h3 id="q5-what-is-the-short-run-elasticity-of-a-firms-own-price-to-a-permanent-marginal-cost-shock-and-how-do-nominal-rigidities-and-strategic-complementarities-each-contribute"&gt;Q5. What is the short-run elasticity of a firm&amp;rsquo;s own price to a permanent marginal cost shock, and how do nominal rigidities and strategic complementarities each contribute?&lt;/h3&gt;
&lt;p&gt;A5: The short-run pass-through elasticity is (1−Ω)(1−θ) ≈ (1−0.570)(1−0.711) ≈ 0.125. This is substantially below one because both forces dampen price adjustment: nominal rigidity (1−θ ≈ 0.289) means most firms cannot adjust in any given quarter, and strategic complementarities (1−Ω ≈ 0.430) mean that adjusting firms reduce their pass-through to avoid deviating from competitors&amp;rsquo; prices. Without strategic complementarities (Ω = 0), the elasticity would be roughly 0.289; without nominal rigidities (θ = 0), it would be roughly 0.430; both together produce the observed 0.125.&lt;/p&gt;
&lt;h3 id="q6-how-is-marginal-cost-measured-in-the-data-and-why-is-the-inclusion-of-intermediate-input-costs-important"&gt;Q6. How is marginal cost measured in the data, and why is the inclusion of intermediate input costs important?&lt;/h3&gt;
&lt;p&gt;A6: Marginal cost is proxied by average variable cost per unit of output: the log-nominal marginal cost equals ln(TVC_ft/Y_ft) + ln(1+ν_ft), where TVC is the sum of intermediate input costs (from VAT declarations) and labor costs (wage bill from social security declarations), and Y_ft is a quantity index. Intermediate inputs account for approximately 75 percent of total variable costs on average and are the most volatile component (within-firm coefficient of variation 1.77 vs 0.77 for labor). The authors note that DSGE models typically feature only labor as a variable input, but accounting for intermediates is pivotal because intermediate goods price shocks were among the most important drivers of the post-pandemic inflation surge.&lt;/p&gt;
&lt;h3 id="q7-what-instruments-are-used-for-marginal-cost-and-competitors-prices-and-what-are-the-identifying-assumptions"&gt;Q7. What instruments are used for marginal cost and competitors&amp;rsquo; prices, and what are the identifying assumptions?&lt;/h3&gt;
&lt;p&gt;A7: The instrument for marginal cost is the four-quarter lagged firm-level TFPQ (physical total factor productivity), estimated as the residual from a gross-output production function. Its relevance depends on TFP persistence (confirmed); the exclusion restriction requires that persistent TFP variation is orthogonal to current and future demand shocks after removing permanent demand components (via firm fixed effects) and industry trends (via sector-by-time fixed effects). Two instruments for competitors&amp;rsquo; prices exploit international trade variation: (i) sales-weighted average export prices of EU-area competitors to non-Belgium, non-EU destinations (orthogonal to Belgian demand shocks by construction), and (ii) bilateral exchange rate movements between non-EU competitor currencies and the Euro. All instruments pass the Cragg-Donald and Kleibergen-Paap F-statistics (strongly rejecting weak instruments) and Hansen-Sargan over-identification tests (failing to reject validity).&lt;/p&gt;
&lt;h3 id="q8-what-evidence-supports-the-validity-of-the-tfpq-instrument-against-capacity-utilization-concerns"&gt;Q8. What evidence supports the validity of the TFPQ instrument against capacity utilization concerns?&lt;/h3&gt;
&lt;p&gt;A8: The authors run two empirical tests. First, regressing marginal cost on four-quarter-lagged capacity utilization yields a small, statistically insignificant elasticity (0.011, SE 0.052), suggesting the TFPQ instrument&amp;rsquo;s predictive power does not reflect capacity utilization variation. Second, re-estimating with &amp;ldquo;purified&amp;rdquo; TFPQ instruments adjusted for capital utilization (Column 4) and for both capital and labor utilization (Column 5) produces parameter estimates and NKPC slopes essentially unchanged from baseline. Additionally, regression residuals show only weak and short-lived autocorrelation (−0.09 at one-quarter lag, p=0.09; −0.01 at two-quarter lag, p=0.69), indicating demand shocks are highly transitory after conditioning on fixed effects.&lt;/p&gt;
&lt;h3 id="q9-how-does-the-model-track-aggregate-belgian-manufacturing-ppi-inflation-and-what-does-this-imply-for-cost-push-shocks"&gt;Q9. How does the model track aggregate Belgian manufacturing PPI inflation, and what does this imply for cost-push shocks?&lt;/h3&gt;
&lt;p&gt;A9: Using the reduced-form expression π_t = λ̃(mc_t^n − p_{t-1}) + α + θu_t, where the reduced-form slope λ̃ = 0.22 is evaluated at baseline structural estimates, the model produces a model-implied inflation series that accounts for approximately 70 percent of variation in manufacturing PPI inflation (R² = 0.68, correlation 0.8), without including inflation lags or cost-push shocks. The model captures the inflation drop during the 2008 financial crisis, the run-up in 2016, and the subsequent decline. This contrasts with the quantitative DSGE literature in which cost-push shocks (variation in desired price and wage markups) account for approximately 70 percent of inflation volatility (e.g., Primiceri, Schaumburg and Tambalotti 2006).&lt;/p&gt;
&lt;h3 id="q10-how-do-the-authors-estimate-the-output-elasticity-of-marginal-cost-σ_y-and-what-do-they-find"&gt;Q10. How do the authors estimate the output elasticity of marginal cost σ_y, and what do they find?&lt;/h3&gt;
&lt;p&gt;A10: They use two approaches. Model D is a pricing equation directly relating firm-level prices and nominal output (value added), estimated via GMM, instrumented with Bartik-style shifters based on high-frequency ECB monetary policy surprises (Altavilla et al. 2019) interacted with industry-level sensitivities. Model E directly regresses changes in nominal marginal cost on changes in nominal output, also instrumented. Model D yields σ_y = 0.406 (SE 0.099) and implied κ = 0.021 (SE 0.005); Model E yields σ_y = 0.112 (SE 0.026) and κ = 0.006 (SE 0.001). The low σ_y is consistent with near-constant short-run returns to scale at the firm level and wage rigidity muting general equilibrium labor-market feedback, at least during the moderate-inflation pre-pandemic period.&lt;/p&gt;
&lt;h3 id="q11-how-does-the-oil-shock-validation-exercise-confirm-the-cost-based-nkpc-slope-estimate"&gt;Q11. How does the oil shock validation exercise confirm the cost-based NKPC slope estimate?&lt;/h3&gt;
&lt;p&gt;A11: Following Känzig (2021), the authors identify oil shocks as unexpected movements in Brent crude oil futures around OPEC meeting days, normalizing to a one-standard-deviation shock (15.7 percent Brent increase). Local linear projection IRFs show that firms&amp;rsquo; real marginal costs rise 1.5 to 3 percent within three quarters and then revert, while prices peak at approximately 3 percent increase after six quarters (consistent with nominal rigidity delaying the price response). Impulse-response matching — minimizing the weighted distance between empirical and model-implied price IRFs — yields λ_IRF = 0.042 (SE 0.005), which is close to and within the confidence bands of the micro-level estimate λ = 0.052, validating the bottom-up estimation approach.&lt;/p&gt;
&lt;h3 id="q12-what-do-the-estimates-imply-about-why-the-conventional-nkpc-appears-flat-in-normal-times"&gt;Q12. What do the estimates imply about why the conventional NKPC appears flat in normal times?&lt;/h3&gt;
&lt;p&gt;A12: The flat conventional NKPC slope (κ ≈ 0.006–0.024) does not reflect limited transmission of marginal cost fluctuations to inflation — that transmission is high (λ ≈ 0.052–0.077). Rather, flatness reflects a weak link between the output gap and marginal cost during the pre-pandemic period (σ_y ≈ 0.112–0.406), attributable to near-constant short-run returns to scale in production and wage rigidity. This decomposition matters for policy: supply shocks that directly raise marginal cost will pass through strongly to inflation even when output does not move much, whereas demand shocks that operate through the output-cost channel face attenuated transmission.&lt;/p&gt;
&lt;h3 id="q13-under-what-conditions-does-the-cost-based-phillips-curve-decompose-cleanly-into-a-product-of-the-two-elasticities"&gt;Q13. Under what conditions does the cost-based Phillips curve decompose cleanly into a product of the two elasticities?&lt;/h3&gt;
&lt;p&gt;A13: The decomposition κ = λ · σ_y requires assuming that real wages are flexible and determined in general equilibrium at the industry level, with real wages increasing in industry output with elasticity σ_w; that the natural level of output is defined as the equilibrium under flexible prices and constant desired markups; and that the firm&amp;rsquo;s marginal product of labor depends on productivity and output with a common short-run returns-to-scale parameter ν (homogeneous across firms and time-invariant). Under these assumptions (which parallel those used to derive the conventional NKPC in the standard NK model), the output elasticity of marginal cost is σ_y = σ_w + ν, and the theoretical restriction κ = λ · σ_y holds exactly.&lt;/p&gt;
&lt;h3 id="q14-how-do-macroeconomic-complementarities-from-aggregate-decreasing-returns-to-scale-affect-the-nkpc-slope"&gt;Q14. How do macroeconomic complementarities from aggregate decreasing returns to scale affect the NKPC slope?&lt;/h3&gt;
&lt;p&gt;A14: If aggregate SR-RTS fall below unity, the NKPC slope formula gains an additional term Θ = 1/(1+γν(1−Ω)) &amp;lt; 1, where ν is inversely related to average SR-RTS and γ is the within-industry elasticity of substitution. However, empirical estimates of sectoral SR-RTS range from 0.93 to 0.98, with an aggregate estimate of approximately 0.965 (implying ν ≈ 0.036). Given this and calibrating γ = 4, Θ ≈ 0.941, so macroeconomic complementarities would reduce the NKPC slope by only about 6 percent — well within the confidence bounds of the baseline estimates. The authors conclude that the constant-returns assumption in their main framework is a good approximation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Primitive (cost-based) NKPC slope (λ):&lt;/strong&gt; The coefficient linking inflation to real marginal cost in the underlying New Keynesian pricing equation, defined as λ = &lt;a href="1%e2%88%92%ce%a9"&gt;(1−θ)(1−βθ)/θ&lt;/a&gt;. It captures how strongly firms&amp;rsquo; aggregate price setting responds to movements in real marginal cost per unit of output, holding the discount factor, nominal rigidity, and strategic complementarities fixed. Estimated at 0.052 (tightly, range 0.044–0.077 across specifications) for Belgian manufacturing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Calvo probability of price non-adjustment (θ):&lt;/strong&gt; The parameter from Calvo (1983) staggered price setting capturing the share of firms that cannot change their price in a given period, equal to one minus the per-period probability of price adjustment. In this paper, θ is estimated directly from the dynamic pass-through regression coefficient on lagged prices, yielding θ ≈ 0.711, implying prices fixed approximately three to four quarters on average.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Strategic complementarities parameter (Ω):&lt;/strong&gt; Defined as Ω = Γ/(1+Γ), where Γ is the elasticity of a firm&amp;rsquo;s desired markup with respect to its own relative price. Captures the extent to which a firm weights competitors&amp;rsquo; prices (rather than its own marginal cost) when resetting its price. High Ω means firms strongly mute price responses to own cost changes to avoid relative price deviations from competitors. Estimated at Ω ≈ 0.570, implying competitor prices and own marginal cost enter the reset price with roughly equal weight.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dynamic pass-through regression:&lt;/strong&gt; A forward-looking pricing equation (Model A) relating observed firm prices to the discounted present values of own marginal costs and competitors&amp;rsquo; prices, plus lagged own price as an error-correction term. The structural parameters θ and Ω are identified jointly from the regression coefficients, using GMM with instruments for the present values. The dynamic specification is necessary at quarterly frequency because the error-correction term (omitted in static pass-through models) is non-negligible when θ &amp;gt; 0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Output elasticity of marginal cost (σ_y):&lt;/strong&gt; The elasticity of firm-level real marginal cost with respect to the firm-level output gap, defined under the assumptions that real wages are flexible and industry-level, equal to σ_y = σ_w + ν (wage elasticity with respect to industry output plus the short-run returns-to-scale parameter). This parameter bridges the cost-based and output-based Phillips curve slopes via κ = λ · σ_y. Estimated from micro data using monetary policy shock instruments at σ_y ≈ 0.112–0.406 in the pre-pandemic period.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Short-run returns to scale (SR-RTS):&lt;/strong&gt; The extent to which a firm&amp;rsquo;s marginal cost rises with output scale in the short run, parameterized by ν in the cost function MC^n_ft = C_{it} · A_{ft} · Y_ft^ν. If ν = 0, marginal cost is independent of output scale (constant returns), which the authors assume in their baseline. Firm- and sector-level estimates from Translog production functions yield SR-RTS ≈ 0.93–0.98 across sectors (aggregate ≈ 0.965), broadly consistent with the constant-returns assumption and implying modest macroeconomic complementarities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reduced-form aggregate pass-through slope (λ̃):&lt;/strong&gt; A composite parameter capturing the contemporaneous pass-through of aggregate real marginal cost (defined as nominal marginal cost relative to the lagged price level) into quarterly inflation under the assumption that nominal marginal cost follows a random walk. Evaluated at θ ≈ 0.70 and Ω ≈ 0.52 (median across models), λ̃ = 0.22. This is distinct from the structural NKPC slope λ because it also captures the persistence of cost shocks.&lt;/p&gt;</description></item><item><title>Are Inflationary Shocks Regressive? A Feasible Set Approach</title><link>https://macropaperwarehouse.com/papers/are-inflationary-shocks-regressive-a-feasible-set-approach/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/are-inflationary-shocks-regressive-a-feasible-set-approach/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; The paper asks whether inflationary shocks are regressive, and demonstrates that the answer depends critically on the &lt;em&gt;source&lt;/em&gt; of the shock. A single aggregate inflation statistic conceals radically different distributional consequences depending on whether inflation is driven by an oil supply contraction or by expansionary monetary policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Framework.&lt;/strong&gt; The authors develop a &amp;ldquo;feasible set approach&amp;rdquo; grounded in the envelope theorem. They show that the first-order money-metric welfare effect of any macroeconomic shock on a household is summarized by the present discounted value of changes to five components of the household&amp;rsquo;s budget constraint: (1) consumption prices, (2) wage income, (3) asset dividends, (4) asset prices, and (5) government transfers. Because the envelope theorem implies that endogenous substitution responses are not welfare-relevant to a first order, no assumption about the utility function&amp;rsquo;s form or the economy&amp;rsquo;s general equilibrium structure is required. The framework is valid for generic stationary shocks that do not directly shift household preferences.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Strategy.&lt;/strong&gt; The welfare formula requires two inputs: (i) impulse response functions (IRFs) for all prices, dividends, wages, and unemployment, estimated using internal-instrument SVAR methods applied to two identified shocks — the Kanzig (2021) oil supply news shock (instrumented by oil futures surprises around OPEC announcements) and the Gertler-Karadi (2015) monetary policy shock (instrumented by fed funds futures surprises in 30-minute windows around FOMC announcements) — and (ii) cross-sectional data on consumption bundles, labor income, and asset portfolios from the CEX, CPS, SCF, and SIPP for three education groups (high school or less, some college, college-educated) across the full lifecycle. The baseline cross-section uses 2019 data. Shocks are normalized to produce comparable aggregate inflation responses: a 10% WTI oil price increase and a 25 basis point decline in the one-year Treasury yield each generate roughly 15–16 basis points of CPI-U inflation on impact, rising to approximately 34–35 basis points after two quarters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt; Oil supply contractions are regressive and monetary expansions are progressive, and this divergence is primarily driven by the asset price channel, not the consumption price or labor income channels.&lt;/p&gt;
&lt;p&gt;For the 10% oil supply shock: middle-aged households with high school education or less must be paid approximately $870 (around 2% of annual consumption) to be made whole relative to their pre-shock utility; college-educated middle-aged households, by contrast, gain the equivalent of approximately $833 (1.1% of annual consumption). Younger college-educated households (still net equity accumulators) gain around $572.&lt;/p&gt;
&lt;p&gt;For the 25 basis point monetary rate cut: low-education households approximately break even (net welfare effect near $23), while middle-aged college-educated households must be paid approximately $4,051 (around 5.5% of annual consumption) to restore their pre-shock utility. Older college-educated households must be paid approximately $851.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why asset prices dominate.&lt;/strong&gt; Oil supply contractions reduce equity prices (S&amp;amp;P500 falls approximately 2% one year post-shock) and depress dividends (approximately 82 basis points), while leaving house prices and bond prices largely unaffected. Because middle-aged college-educated households are the primary accumulators of equities, they benefit from the price decline (cheaper future accumulation), making oil shocks progressive through this channel — but regressive overall once the consumption and labor income channels (both mildly regressive) are included. Monetary expansions do the opposite: equity prices rise approximately 3 percentage points on impact, house prices rise approximately 1.5% after three years, and dividends increase. These asset price increases hurt those in the accumulation phase — disproportionately middle-aged college-educated households — creating a progressive distributional pattern.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption and labor income channels.&lt;/strong&gt; Both shocks generate disproportionate inflation in motor fuel and fuel and utilities, and low-education households spend a larger share of their budget on these goods, making the consumption channel mildly regressive for both shocks. The labor income channel differs sharply: oil shocks raise unemployment (approximately 0.15 log points for low-education households two years post-shock) and reduce weekly earnings by 0.2–0.6 log points, mildly harming low-education workers; monetary expansions reduce unemployment (approximately 0.83 log points for low-education workers one year post-shock) and similarly benefit low-education households through the labor market, pushing toward progressivity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; Results apply to short-run first-order welfare effects of identified stationary macroeconomic shocks (four-year horizon). The framework does not incorporate uncertainty shocks, preference shocks, or the role of hedging motives in portfolio choice. Results concern policy shocks rather than policy rules.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Robustness.&lt;/strong&gt; Qualitative conclusions hold across six alternative specifications: incorporating borrowing constraints (with or without empirical death rates), adjusting for unemployment insurance replacement rates (approximately 6% true average replacement rate), allowing for log-linear trends in no-shock choices, and dropping aggregate CPI controls from IRF estimation.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-feasible-set-approach-and-how-does-it-differ-from-prior-work-on-inflation-incidence"&gt;Q1. What is the &amp;ldquo;feasible set approach&amp;rdquo; and how does it differ from prior work on inflation incidence?&lt;/h3&gt;
&lt;p&gt;A: The feasible set approach measures welfare effects through changes in the household&amp;rsquo;s entire budget constraint — consumption prices, wage income, asset dividends, asset prices, and government transfers — rather than focusing on any single channel. Prior work either examined the Fisher channel (net nominal positions), or consumption price heterogeneity, or labor income responses in isolation. The key insight is that the envelope theorem implies substitution responses are not welfare-relevant to a first order, so the money-metric welfare change is simply the discounted sum of changes in the five budget constraint components evaluated at pre-shock choices, without requiring knowledge of the utility function&amp;rsquo;s form or the economy&amp;rsquo;s general equilibrium structure.&lt;/p&gt;
&lt;h3 id="q2-why-is-the-asset-price-channel--rather-than-consumption-prices--the-dominant-channel-in-both-shocks"&gt;Q2. Why is the asset price channel — rather than consumption prices — the dominant channel in both shocks?&lt;/h3&gt;
&lt;p&gt;A: Asset holdings are large relative to annual consumption (net worth averages $1.5 million for college-educated and $260,000 for high-school-educated households in 2019), so even modest percentage movements in asset prices generate large dollar welfare effects. By contrast, the budget shares on the goods most responsive to both shocks (motor fuel, fuel and utilities) are relatively modest, so the consumption channel, while mildly regressive, is quantitatively small relative to the portfolio channel. The portfolio channel accounts for roughly 0.5% of consumption gains for middle-aged college-educated households under the oil shock, while the consumption channel produces losses of only about 0.1% for college-educated and 0.25% for low-education households.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-direction-of-the-equity-price-response-differ-between-oil-and-monetary-shocks-and-why-does-this-create-opposite-distributional-effects"&gt;Q3. How does the direction of the equity price response differ between oil and monetary shocks, and why does this create opposite distributional effects?&lt;/h3&gt;
&lt;p&gt;A: An oil supply contraction reduces equity prices (approximately 2% decline one year post-shock) and dividends (approximately 82 basis points decline), while a monetary expansion raises equity prices (approximately 3 percentage points on impact, approximately 4% higher after four quarters) and increases dividends. The welfare effect of asset price changes falls on those who &lt;em&gt;trade&lt;/em&gt; the asset, not those who merely hold it at a constant level: middle-aged college-educated households are the primary net &lt;em&gt;accumulators&lt;/em&gt; of equity, so falling prices benefit them (they can buy more cheaply) while rising prices hurt them. This is the principal reason oil shocks appear progressive through the portfolio channel — but regressive overall — while monetary expansions are regressive through the portfolio channel and progressive overall.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-precise-welfare-numbers-for-oil-supply-shocks-by-education-group-baseline-ages-2265"&gt;Q4. What are the precise welfare numbers for oil supply shocks by education group (baseline, ages 22–65)?&lt;/h3&gt;
&lt;p&gt;A: From Table 3 (baseline row, lifecycle-weighted averages for ages 25–65): households with high school or less experience a welfare loss of approximately $798; those with some college experience a loss of approximately $816; and college-educated households experience a welfare &lt;em&gt;gain&lt;/em&gt; of approximately $494. These numbers reflect the sum of the consumption, labor income, portfolio, and transfer channels over a 16-quarter horizon, discounted at the one-year Treasury yield.&lt;/p&gt;
&lt;h3 id="q5-what-are-the-precise-welfare-numbers-for-monetary-policy-shocks-by-education-group-baseline-ages-2565"&gt;Q5. What are the precise welfare numbers for monetary policy shocks by education group (baseline, ages 25–65)?&lt;/h3&gt;
&lt;p&gt;A: From Table 3 (baseline row): households with high school or less experience a small welfare &lt;em&gt;gain&lt;/em&gt; of approximately $23; those with some college experience a welfare loss of approximately $1,278; and college-educated households experience a welfare loss of approximately $3,055. These losses for college-educated households are driven overwhelmingly by rising equity and house prices that raise the cost of planned asset accumulation.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-life-cycle-interact-with-the-distributional-incidence-of-both-shocks"&gt;Q6. How does the life cycle interact with the distributional incidence of both shocks?&lt;/h3&gt;
&lt;p&gt;A: There is substantial heterogeneity within education groups across the life cycle because asset accumulation and decumulation patterns are age-dependent. Under oil shocks, younger college-educated households (who are net equity accumulators) gain approximately $572, middle-aged college-educated households gain approximately $833, while older college-educated households lose approximately $69 (because they hold large equity positions and lose dividend income). Under monetary shocks, middle-aged college-educated households lose the most (approximately $4,051) because they are simultaneously accumulating equities and housing, both of which become more expensive. Older college-educated households lose less (approximately $851) because rising dividends on existing holdings partially offset the asset price cost. Low-education households are approximately flat across the life cycle under monetary shocks.&lt;/p&gt;
&lt;h3 id="q7-how-does-the-consumption-channel-compare-across-education-groups-and-across-the-two-shocks"&gt;Q7. How does the consumption channel compare across education groups and across the two shocks?&lt;/h3&gt;
&lt;p&gt;A: The consumption channel is mildly regressive for both shocks, but of similar absolute magnitude across the two shocks because both generate similar inflation in motor fuel and fuel and utilities — the goods with the largest price response. Low-education households spend a larger share on motor fuel and fuel and utilities; as a result, they lose approximately 0.25% of consumption from the consumption channel under the oil shock, compared with less than 0.1% for college-educated households. For monetary shocks, the consumption channel affects all household types roughly equally in proportional terms.&lt;/p&gt;
&lt;h3 id="q8-how-does-the-labor-income-channel-differ-between-oil-and-monetary-shocks-across-education-groups"&gt;Q8. How does the labor income channel differ between oil and monetary shocks across education groups?&lt;/h3&gt;
&lt;p&gt;A: Oil shocks raise unemployment disproportionately for low-education workers (approximately 0.15 log point increase after two years, roughly 0.68 standard deviations, compared with near-zero response for college-educated workers) and reduce weekly earnings by 0.2–0.6 log points across groups. Monetary expansions reverse this: a 25 basis point rate cut reduces log unemployment by approximately 0.83 log points for low-education workers and approximately 1.96 log points for college-educated workers after one year, with limited response in conditional wages. Thus the labor income channel pushes toward regressive incidence for oil shocks and toward progressive incidence for monetary expansions, though in both cases it is quantitatively smaller than the portfolio channel.&lt;/p&gt;
&lt;h3 id="q9-what-is-the-role-of-housing-in-the-portfolio-channel"&gt;Q9. What is the role of housing in the portfolio channel?&lt;/h3&gt;
&lt;p&gt;A: Housing behaves simultaneously as a durable consumption good and a financial asset. A house price increase raises welfare for households planning to &lt;em&gt;decumulate&lt;/em&gt; (sell) housing (primarily older households) through the portfolio channel, but also raises the implicit rental cost for those who &lt;em&gt;use&lt;/em&gt; housing — a negative consumption-side effect. Monetary expansions raise house prices by approximately 1.5% after three years. College-educated households accumulate housing at a faster rate and earlier in the life cycle than low-education households, making them more exposed to the cost of rising house prices during the accumulation phase. This amplifies the progressive pattern of monetary shocks through the portfolio channel.&lt;/p&gt;
&lt;h3 id="q10-how-does-the-paper-handle-the-dual-role-of-durable-goods-vehicles-and-housing"&gt;Q10. How does the paper handle the dual role of durable goods (vehicles and housing)?&lt;/h3&gt;
&lt;p&gt;A: Durable goods are treated as both a consumption good and a financial asset. The utility-relevant consumption price of a durable is proportional to the price times the depreciation rate per unit of use, capturing the &amp;ldquo;implicit rent&amp;rdquo; of ownership. On the asset side, the durable enters the portfolio channel like a zero-dividend financial asset. This allows the framework to correctly attribute, for example, that a rise in house prices hurts net accumulators (through the portfolio channel) while also raising the implicit cost of housing services (through the consumption channel), rather than treating house price appreciation as an unambiguous welfare gain for homeowners.&lt;/p&gt;
&lt;h3 id="q11-what-happens-to-the-main-conclusions-when-borrowing-constraints-are-introduced"&gt;Q11. What happens to the main conclusions when borrowing constraints are introduced?&lt;/h3&gt;
&lt;p&gt;A: Incorporating net worth constraints (with either constant or empirical death rates) dampens the portfolio channel for young and middle-aged college-educated households, because rising asset prices relax borrowing constraints for these households, partially offsetting the welfare cost of more expensive accumulation. Under constant death rates with borrowing constraints, college-educated households&amp;rsquo; oil shock welfare gain falls from +$494 to +$76; under empirical death rates, it becomes a loss of -$394. For monetary shocks, the college-educated loss falls from -$3,055 to -$1,718 (constant death rate) or -$1,036 (empirical death rates). Despite these quantitative changes, the qualitative conclusion — oil shocks are regressive, monetary expansions are progressive — holds across all specifications.&lt;/p&gt;
&lt;h3 id="q12-what-is-the-implication-of-these-findings-for-the-policy-interaction-between-oil-shocks-and-monetary-tightening"&gt;Q12. What is the implication of these findings for the policy interaction between oil shocks and monetary tightening?&lt;/h3&gt;
&lt;p&gt;A: If the monetary authority responds to oil-price-induced inflation with unexpected interest rate increases, it may exacerbate the distributional consequences of the initial oil shock. An oil supply contraction is already regressive (harming low-education households through consumption prices and labor market effects); a disinflationary monetary tightening would additionally harm low-education households through the labor income channel (higher unemployment, lower wages) while partially benefiting college-educated households through lower asset prices. The paper notes this policy interaction as noteworthy, while cautioning that the results concern identified policy &lt;em&gt;shocks&lt;/em&gt; rather than policy &lt;em&gt;rules&lt;/em&gt;.&lt;/p&gt;
&lt;h3 id="q13-how-are-the-two-shocks-calibrated-to-be-comparable"&gt;Q13. How are the two shocks calibrated to be comparable?&lt;/h3&gt;
&lt;p&gt;A: The oil shock is normalized to a 10% increase in WTI crude oil prices (approximately one standard deviation of monthly oil price growth). The monetary shock is normalized to a 25 basis point decline in the one-year Treasury yield — chosen because it generates approximately the same aggregate CPI-U inflation response as the oil shock (approximately 15–16 basis points on impact, rising to approximately 34–35 basis points after two quarters). This normalization allows the paper to attribute the different distributional outcomes to the &lt;em&gt;source&lt;/em&gt; of inflation rather than to differences in the aggregate inflation magnitude.&lt;/p&gt;
&lt;h3 id="q14-what-role-does-the-transfer-channel-play-and-for-whom"&gt;Q14. What role does the transfer channel play, and for whom?&lt;/h3&gt;
&lt;p&gt;A: The transfer channel is small relative to the other three channels for the vast majority of working-age households, because transfer income is less than $100 per month for most households under age 65. Social Security payments — the bulk of transfer income — are explicitly indexed to the CPI; the paper models them as moving with CPI with a one-year lag. The transfer channel exclusively benefits older households (those receiving Social Security), and its quantitative effect is modest even there. Transfer income is more than 20 times smaller than labor and asset income for prime-age households of all education groups.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Feasible set approach.&lt;/strong&gt; The paper&amp;rsquo;s organizing framework, in which the first-order welfare impact of a macroeconomic shock is measured by how the shock changes the household&amp;rsquo;s budget constraint (consumption prices, wage income, asset dividends, asset prices, and government transfers) evaluated at the household&amp;rsquo;s pre-shock choices. Substitution responses are not welfare-relevant to a first order by the envelope theorem.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Money-metric welfare gain.&lt;/strong&gt; The willingness-to-pay measure used throughout: the welfare change from a shock divided by the household&amp;rsquo;s marginal utility of consumption at time zero, expressed in time-zero dollars. Interpreted as an equivalent variation — the amount the household must be paid or would give up to be indifferent to receiving the shock. Used because it places households with very different utility functions on a common dollar scale.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Portfolio channel.&lt;/strong&gt; The component of the welfare formula capturing the effect of asset price and dividend changes on household welfare. Asset price changes are welfare-relevant only for households that &lt;em&gt;trade&lt;/em&gt; (accumulate or decumulate) the asset: rising prices benefit sellers and harm buyers; falling prices benefit buyers and harm sellers. This is distinct from the &amp;ldquo;Fisher channel&amp;rdquo; in prior literature, which focuses on net nominal positions rather than on which households are in the accumulation versus decumulation phase.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Internal instrument SVAR.&lt;/strong&gt; The time-series estimation procedure used throughout: the pre-estimated identified shock series (oil supply news or monetary policy surprise) is included as a variable ordered first in a recursive structural VAR for each outcome variable. This separates shock identification (using the published instruments and controls from Kanzig 2021 and Gertler-Karadi 2015) from IRF estimation for each outcome variable, allowing the use of the full available sample for each outcome series.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Oil supply news shock (Kanzig 2021).&lt;/strong&gt; An identified supply shock to oil markets, constructed from changes in oil price futures in tight windows around OPEC production announcements. Used to capture exogenous cost-push inflation driven by supply constraints rather than demand.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Monetary policy shock (Gertler-Karadi 2015).&lt;/strong&gt; An identified demand-side shock, constructed from federal funds rate futures surprises in 30-minute windows around FOMC announcements, instrumented into a monetary SVAR. Captures exogenous interest rate cuts that generate aggregate demand expansion and inflation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Borrowing constraint wedge.&lt;/strong&gt; An additional term that appears in the welfare formula when households face net worth constraints. Proportional to the Lagrange multiplier on the net worth constraint, it discounts future periods more heavily when constraints bind, and adds a term for the welfare value of relaxed constraints when asset prices rise. Identified from deviations from perfect consumption smoothing using CEX lifecycle consumption data.&lt;/p&gt;</description></item><item><title>Artificial intelligence and cognitive inequality</title><link>https://macropaperwarehouse.com/papers/artificial-intelligence-and-cognitive-inequality/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/artificial-intelligence-and-cognitive-inequality/</guid><description>&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Summary blocked:&lt;/strong&gt; No freely available working paper or full text was found. Metadata record only; paper suppressed from site until a working paper is located.&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>Artificial intelligence and technological unemployment</title><link>https://macropaperwarehouse.com/papers/artificial-intelligence-and-technological-unemployment/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/artificial-intelligence-and-technological-unemployment/</guid><description>&lt;p&gt;Wang and Wong develop a continuous-time labor-search model to assess the dynamic effects of generative AI (GenAI) on labor productivity and unemployment. The paper is motivated by conflicting empirical evidence: micro studies find productivity gains of 14% (Brynjolfsson, Li, and Raymond 2025) and 55.8% faster coding (Peng et al. 2023), while macro estimates suggest modest TFP gains of at most 0.064% annually (Acemoglu 2024), and occupation-level evidence shows a 13% relative employment decline in AI-exposed jobs (Brynjolfsson, Chandar, and Chen 2025).&lt;/p&gt;
&lt;p&gt;The model distinguishes GenAI from earlier automation technologies by its learning-by-using mechanism: AI capability grows at rate µ per employed worker (law of motion dAt/At = µHt − δ), raises employed workers&amp;rsquo; productivity, and creates a displacement threat through renegotiation. When renegotiation fails, AI replaces the worker, generating technological unemployment. Firms renegotiate wages at a rate ρµAt proportional to AI&amp;rsquo;s learning rate and the job&amp;rsquo;s exposure ρ. The joint surplus condition governs whether replacement occurs: AI replaces a worker if and only if πA (AI&amp;rsquo;s net present value per output) exceeds the post-renegotiation joint surplus St.&lt;/p&gt;
&lt;p&gt;The model admits three steady states: (i) a some-AI steady state with finite AI capability, persistent AI adoption (It = 1), expanded job creation but declining employment at H∞ = δ/µ; (ii) an unbounded-AI equilibrium with sustained endogenous growth, no displacement (It = 0), and employment at H∞ = α/(α+σ); and (iii) a no-AI equilibrium reverting to the Mortensen-Pissarides benchmark. In the benchmark model (exogenous job-finding rate, AI-augmented productivity), multiple steady states can coexist—global indeterminacy—when condition (28) holds. In the full model (endogenous job creation via free entry), both global and local indeterminacy are possible, and a continuum of oscillatory transition paths converge to the some-AI steady state.&lt;/p&gt;
&lt;p&gt;Calibrated to U.S. data, targeting a pre-AI unemployment rate of 5%, AI elasticity of productivity εy = 1.069 (from Czarnitzki et al. 2023), initial AI productivity boost of 14% (Brynjolfsson et al. 2025), worker exposure ρ = 0.618 (Brynjolfsson et al. 2018&amp;rsquo;s machine learning suitability index), AI replacement cost ϕ = 0.0043 (from U.S. business GenAI spending), AI learning rate µ = 0.632, and AI error rate δ = 0.462 (Moore&amp;rsquo;s law half-life of 1.5 years), the model converges to a some-AI steady state. The long-run results are: a 23% employment loss (H∞ = 0.732 vs. H0 = 0.95), AI capability improvement of 321%, and labor productivity gain of 366%. Approximately half of the employment loss—11.5 percentage points—occurs within the first five years, alongside a 49.3% output gain and 45.5% AI capability improvement over that period.&lt;/p&gt;
&lt;p&gt;Untargeted moments are validated: the model implies 7.08% labor productivity growth over the first 10 years (consistent with Briggs and Kodnani 2023) and an AI elasticity of vacancies averaging 0.16 over the first five years (consistent with Acemoglu et al. 2022).&lt;/p&gt;
&lt;p&gt;On welfare, equilibria are inefficient even when the Hosios condition holds. AI introduces four externalities beyond standard matching frictions: job destruction via displacement, productivity enhancement for employed workers, feedback from AI learning depending on employment, and direct effects on matching surpluses. A constrained-optimal subsidy to jobs at risk of AI displacement is 26.6% in the short run and exceeds 50% in the long run. In the full model, the Hosios condition requires fixing firm bargaining power θ to the vacancy elasticity of matching ξ, but an additional per-output transfer T = µApωA to firm-worker matches is necessary to correct AI adoption inefficiency.&lt;/p&gt;
&lt;p&gt;Q: What is the core mechanism by which AI generates unemployment in this model?
A: AI capability grows through a learning-by-using process (dAt/At = µHt − δ), improving as it observes employed workers. As capability rises, firms gain a displacement option that arrives at rate ρµAt per matched pair. When renegotiation over wages fails—i.e., when the AI&amp;rsquo;s NPV πA exceeds the joint surplus—firms replace workers with AI, causing unemployment. This creates a feedback loop: higher employment accelerates AI learning, which increases displacement pressure and reduces employment.&lt;/p&gt;
&lt;p&gt;Q: What are the three steady states and what distinguishes them?
A: The some-AI steady state features finite AI capability, persistent displacement (It = 1), and long-run employment H∞ = δ/µ; it involves technological unemployment. The unbounded-AI steady state features infinite AI capability, no displacement (It = 0), endogenous productivity growth, and employment H∞ = α/(α+σ) as in the standard Mortensen-Pissarides model. The no-AI steady state has A∞ = 0 with the same H∞ = α/(α+σ) but no AI contribution. Employment is higher in the unbounded-AI equilibrium than in the some-AI equilibrium.&lt;/p&gt;
&lt;p&gt;Q: What does the calibration imply for long-run employment and productivity?
A: The calibrated full model converges to a some-AI steady state with a 23% employment loss (H∞ = 0.732), a 321% improvement in AI capability, and a 366% gain in labor productivity. The parameters yield a unique equilibrium under the baseline calibration (πA = 1.949 &amp;gt; sAI = 0.8735 confirms some-AI existence). These results reflect a large worker replacement effect under the calibrated AI learning and error rates, while the job creation effect is relatively modest.&lt;/p&gt;
&lt;p&gt;Q: How fast does technological unemployment materialize?
A: Approximately half of the total 23% employment loss occurs within the first five years; specifically, employment falls by 11.5 percentage points over that period. Over the same five years, AI capability improves by 45.5% and output rises by 49.3%. Over the first 10 years, AI capability improvement accumulates to 94.0% and output gain to 103% (approximately double the five-year output gain).&lt;/p&gt;
&lt;p&gt;Q: How does the full model differ from the benchmark model in transition dynamics?
A: In the full model, job-finding rates are endogenous: firms post vacancies until a free-entry condition (κyt = ftΠt) is satisfied, tying job-finding rate αt to the surplus ratio st via αt = α(st). This endogeneity implies that as AI raises labor productivity, firms create more vacancies, slowing the employment decline relative to the benchmark model with a fixed job-finding rate. At the same time, AI capability grows faster in the full model because higher employment accelerates AI learning.&lt;/p&gt;
&lt;p&gt;Q: What is global indeterminacy and when does it arise?
A: Global indeterminacy occurs when both the some-AI and unbounded-AI steady states coexist, so the long-run outcome depends on initial conditions or expectations. In the benchmark model this requires condition (28): 0 &amp;lt; r + σ + α(1−θ) − (1−b)/πA ≤ εy(µα/(α+σ) − δ). In the full model, global indeterminacy is plausible when firm bargaining power rises to θ = 0.95 given the baseline AI replacement cost ϕ = 0.0043. The region of global indeterminacy is larger when firm bargaining power is higher.&lt;/p&gt;
&lt;p&gt;Q: What is local indeterminacy and what does it imply for transition paths?
A: Local indeterminacy means there is a continuum of equilibrium paths converging to the some-AI steady state in the neighborhood of that steady state, rather than a unique saddle path. In the full model, under alternative parameters (θ = 1, ξ = 0.765, εy = 6), the eigenvalues feature a negative real root and two complex roots with negative real parts, yielding oscillatory local dynamics in employment and AI capability. This implies short-run cycles in productivity and unemployment, consistent with the wide range of empirical findings on AI&amp;rsquo;s labor-market effects.&lt;/p&gt;
&lt;p&gt;Q: Why does the Hosios condition fail to deliver efficiency in this model?
A: The Hosios condition eliminates the standard matching externality by setting firm bargaining power to the vacancy elasticity of matching. But AI introduces four additional externalities: (i) job destruction through displacement, (ii) productivity enhancement for employed workers, (iii) feedback from AI learning that depends on aggregate employment, and (iv) direct effects on matching surpluses and job-finding rates. These externalities mean the standard Hosios rule alone is insufficient; additional instruments are required.&lt;/p&gt;
&lt;p&gt;Q: What is the constrained-optimal policy response?
A: In the simple model, the constrained optimal AI adoption threshold differs from the equilibrium threshold because firm bargaining power θ distorts adoption decisions: AI is over-adopted when πA &amp;gt; (1−b)/(r+σ+α(1−θ)) and under-adopted when (1−b)/(r+σ+α) &amp;lt; πA ≤ (1−b)/(r+σ+α(1−θ)). In the full model, constrained optimality requires setting θ = ξ (Hosios) plus a per-output subsidy T = µApωA to firm-worker matches exposed to AI displacement. This targeted subsidy is 26.6% in the short run and exceeds 50% in the long run.&lt;/p&gt;
&lt;p&gt;Q: How does AI compare to computers in this model&amp;rsquo;s counterfactual?
A: The paper reports that exogenous productivity growth from computers reduced unemployment only modestly—by 0.16 percentage points. By contrast, AI&amp;rsquo;s learning-by-using and displacement features imply a nearly 20% long-run employment loss in a comparable counterfactual. The key distinction is that computers lack the self-learning improvement and associated renegotiation-triggered displacement that characterize GenAI in this model.&lt;/p&gt;
&lt;p&gt;Q: How is AI exposure parameterized and what does it capture?
A: The exposure parameter ρ captures the degree to which a job is subject to AI-driven replacement risk. It is calibrated using Brynjolfsson et al. (2018)&amp;rsquo;s suitability for machine learning (SML) index: on a 1–5 scale, SML averages 3.47 across 964 O*NET occupations, translating to (3.47−1)/(5−1) = 61.8%, so ρ = 0.618. The effective exposure measure is ρµ, which is higher when facing a faster-learning AI.&lt;/p&gt;
&lt;p&gt;Q: What is the predator-prey analogy in the model&amp;rsquo;s dynamics?
A: The dynamical system for AI capability (At) and employment (Ht) in the simple model resembles the Lotka-Volterra predator-prey system. Employment (prey) feeds AI learning; as AI capability (predator) grows, it displaces workers faster, reducing employment; lower employment then slows AI learning, causing capability to decay; and the cycle repeats with diminishing magnitude until the steady state is reached. This mechanism operates only when the AI learning rate µ is neither too high nor too low, with the convergence path being a spiral when µα &amp;lt; 4δ²(1 − δ(α+σ)/(µα)).&lt;/p&gt;
&lt;p&gt;Q: What is the labor-share implication of the unbounded-AI equilibrium?
A: In the unbounded-AI steady state, employment is higher than in the some-AI steady state (H^AJJ &amp;gt; H^AI) and labor productivity grows without bound. However, the labor share is lower in the unbounded-AI equilibrium if the firm&amp;rsquo;s bargaining power θ is sufficiently low. This implies that while workers are not fully displaced and rising AI-augmented productivity sustains employment, workers&amp;rsquo; income share may still decline even in the more favorable unbounded scenario.&lt;/p&gt;
&lt;p&gt;Technological unemployment: A phenomenon in which AI adoption raises labor productivity and expands job creation, yet still causes sizable employment losses because the worker displacement effect (driven by renegotiation failure when AI&amp;rsquo;s NPV πA exceeds the joint surplus) dominates the job-creation effect. In the calibrated model this amounts to a 23% employment loss despite a 366% productivity gain.&lt;/p&gt;
&lt;p&gt;Learning-by-using AI: The model&amp;rsquo;s representation of GenAI as a technology whose capability At grows through reinforced learning from employed workers at rate µ per worker, so aggregate AI growth is µHt, offset by deterioration at rate δ. This distinguishes GenAI from earlier automation technologies (computers, robotics) that do not self-improve through usage.&lt;/p&gt;
&lt;p&gt;Some-AI steady state: A long-run equilibrium with finite AI capability (gA∞ = 0), persistent AI adoption (It = 1), and employment pinned at H∞ = δ/µ—the ratio of AI&amp;rsquo;s error rate to its learning rate. Characterized by expanded job creation but lower employment than the no-AI benchmark, constituting the model&amp;rsquo;s primary calibrated outcome.&lt;/p&gt;
&lt;p&gt;Unbounded-AI steady state: A long-run equilibrium with infinite AI capability (A∞ = ∞), no displacement (It = 0), and endogenous growth at rate gA = µH^AJJ − δ. Employment equals the Mortensen-Pissarides level H∞ = α/(α+σ), and labor productivity grows without bound, complementing Aghion, Jones, and Jones (2019)&amp;rsquo;s idea production framework.&lt;/p&gt;
&lt;p&gt;Global indeterminacy: Coexistence of multiple steady states (some-AI and unbounded-AI) such that the long-run equilibrium depends on initial conditions or expectations rather than being uniquely determined. Arises in the benchmark model when condition (28) holds and becomes more likely with higher firm bargaining power θ.&lt;/p&gt;
&lt;p&gt;Local indeterminacy: A continuum of equilibrium transition paths converging to a single steady state from nearby initial conditions, rather than a unique saddle path. Arises in the full model under certain parameter configurations (e.g., θ = 1, ξ = 0.765, εy = 6), implying oscillatory short-run dynamics in employment and AI capability.&lt;/p&gt;
&lt;p&gt;AI exposure (ρ): A firm-level parameter capturing the degree to which a job-match is subject to AI-driven displacement risk. The displacement option arrives at rate ρµAt per matched pair; ρ is calibrated at 0.618 using the average suitability-for-machine-learning score across O*NET occupations. The effective exposure measure is the product ρµ.&lt;/p&gt;
&lt;p&gt;Renegotiation-proof displacement: Proposition 1&amp;rsquo;s result that the joint surplus Snt is independent of the renegotiation round n, so the AI adoption decision It is also round-invariant. This simplifies the model to a single indicator function: AI replaces the worker if and only if πA exceeds the joint surplus St, regardless of how many renegotiation rounds have occurred.&lt;/p&gt;</description></item><item><title>Auctions with Frictions: Recruitment, Entry, and Limited Commitment</title><link>https://macropaperwarehouse.com/papers/auctions-with-frictions-recruitment-entry-and-limited-commitment/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/auctions-with-frictions-recruitment-entry-and-limited-commitment/</guid><description>&lt;p&gt;This paper develops an auction model that jointly incorporates three frictions pervading informal price-formation processes: (1) costly recruitment by the seller, (2) costly participation by bidders, and (3) the seller&amp;rsquo;s inability to commit to a recruitment level or reserve price. The authors argue these frictions are especially prevalent in markets for idiosyncratic assets such as mergers and acquisitions, real estate, and home repair contracting, where auction houses like Christie&amp;rsquo;s and Sotheby&amp;rsquo;s command fees of 20–30% of revenues precisely because they reduce the underlying inefficiencies.&lt;/p&gt;
&lt;p&gt;The model features a single seller who exerts recruitment effort gamma at cost gamma*s, generating a Poisson-distributed number of contacted bidders with mean gamma. Each contacted bidder independently decides whether to pay entry cost c &amp;gt; 0 to learn their private value and participate in a first-price auction (FPA). The seller cannot commit to gamma (which is unobservable to bidders) or to a reserve price. Two scenarios are analyzed: PO (participation-observable, where bidders observe the number of entrants before bidding) and PU (participation-unobservable).&lt;/p&gt;
&lt;p&gt;The central tension is between the seller&amp;rsquo;s incentive to recruit more bidders to intensify competition and raise revenue, and bidders&amp;rsquo; rational concern that excessive recruitment makes entry unprofitable. Because the seller cannot commit, this tension generates several novel inefficiency results.&lt;/p&gt;
&lt;p&gt;In the PO scenario, the seller&amp;rsquo;s marginal revenue from recruitment Ro&amp;rsquo;(lambda) is single-peaked, meaning there is a minimum profitable participation scale lambda_o below which the seller will never recruit. Combined with a maximum participation level lambda-bar_c above which bidders will not enter (defined by U(lambda-bar_c) = c, where U is the bidder&amp;rsquo;s expected payoff), no-trade equilibrium is the unique outcome whenever lambda-bar_c &amp;lt; lambda_o — even for arbitrarily small recruitment cost s. This result holds because with unobservable effort, bidders correctly anticipate the seller will target participation above lambda_o, making entry unprofitable. When lambda-bar_c &amp;gt; lambda_o, three regimes arise: (i) no trade if s exceeds a threshold s-bar_o; (ii) an interior equilibrium with full entry (q* = 1) and lambda* = lambda_o(s) for intermediate s; and (iii) for small s, an equilibrium with lambda* = lambda-bar_c and partial entry q* = Ro&amp;rsquo;(lambda-bar_c)/s &amp;lt; 1. In regime (iii), total recruitment cost lambda*(s/q*) equals the constant lambda-bar_c * Ro&amp;rsquo;(lambda-bar_c) regardless of s — so even as s approaches zero, wasteful recruitment costs do not vanish, because they are determined by incentive constraints rather than by technology.&lt;/p&gt;
&lt;p&gt;In the PU scenario, a no-trade equilibrium always exists for all parameter values, because the seller cannot credibly disclose participation, creating self-reinforcing expectations of zero competition. The seller&amp;rsquo;s recruitment incentive xi(lambda) is strictly weaker than Ro&amp;rsquo;(lambda) in the PO scenario (proven via revenue equivalence: Ro&amp;rsquo;(lambda) = xi(lambda) + a positive term reflecting how greater participation induces more aggressive bidding). This yields ranking reversals: for intermediate s and small c, the PO scenario dominates PU; but for small s or large c, the PU scenario&amp;rsquo;s weaker recruitment incentive reduces wasteful over-recruitment, making PU preferable. These comparisons translate directly to a comparison of FPA and SPA with unobservable participation: the two formats are not equivalent in the presence of recruitment and entry frictions because they generate different recruitment incentives.&lt;/p&gt;
&lt;p&gt;A sampling-curse mechanism drives near-complete market unraveling when sellers have privately known recruitment costs drawn from a continuous uniform distribution on [0, s_o]. Because low-cost sellers recruit more, a contacted bidder believes the seller is more likely to have low costs — and hence to have recruited many other bidders — making entry unprofitable. Proposition 3 establishes a threshold c-hat such that for c in (c-hat, c-bar), as the lower bound of the cost distribution approaches zero, the fraction of seller types that remain inactive approaches one — near-complete unraveling — even though each type would be active if its cost were commonly known.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s main modeling innovation relative to the existing literature?
A: The paper&amp;rsquo;s central novelty is combining all three frictions — costly recruitment by the seller, costly participation by bidders, and limited seller commitment — in one model. The existing literature had studied entry and recruitment separately; Szech (2011) examined costly recruitment with costless entry; McAfee and McMillan (1987) and Levin and Smith (1994) studied costly entry with an exogenously given number of potential bidders; Milgrom (1987) and McAfee and Vincent (1997) studied limited commitment to a reserve price with a fixed bidder set. None combine all three.&lt;/p&gt;
&lt;p&gt;Q: What is the &amp;ldquo;minimum profitable scale&amp;rdquo; result and why does it arise?
A: Because the seller cannot commit to a reserve price, the first few bidders are complementary — they stimulate competitive bidding, causing the seller&amp;rsquo;s marginal revenue Ro&amp;rsquo;(lambda) to be initially increasing, then decreasing (single-peaked). This means the seller&amp;rsquo;s profit Pi_o(lambda, q) is maximized either at zero or at a participation level above a minimum scale lambda_o, defined by Ro&amp;rsquo;(lambda_o) = s-bar_o. The seller will never choose a participation level between 0 and lambda_o.&lt;/p&gt;
&lt;p&gt;Q: Under what conditions does the market completely shut down in the PO scenario?
A: No-trade is the unique equilibrium outcome whenever lambda-bar_c &amp;lt; lambda_o, where lambda-bar_c is defined by U(lambda-bar_c) = c (the participation break-even level) and lambda_o is the seller&amp;rsquo;s minimum profitable scale. This condition arises when entry costs c are large enough relative to the competitive dynamics. Importantly, no trade occurs for every recruitment cost s &amp;gt; 0, including arbitrarily small s — commitment failure alone can cause complete market breakdown even when recruiting bidders is nearly costless.&lt;/p&gt;
&lt;p&gt;Q: What is the inefficiency in regime (iii) of Proposition 2 (small s, PO scenario)?
A: When s &amp;lt; Ro&amp;rsquo;(lambda-bar_c), equilibrium has lambda* = lambda-bar_c and q* = Ro&amp;rsquo;(lambda-bar_c)/s &amp;lt; 1. The total recruitment cost is lambda* * (s/q*) = lambda-bar_c * Ro&amp;rsquo;(lambda-bar_c), a strictly positive constant independent of s. As s approaches zero, total recruitment effort and its cost do not vanish — they are pinned by incentive constraints. This waste could be avoided if the seller could commit to an effort level below lambda-bar_c, illustrating that commitment failure creates persistent inefficiency even when the technology of recruitment is inexpensive.&lt;/p&gt;
&lt;p&gt;Q: Why does a no-trade equilibrium always exist in the PU scenario but not always in the PO scenario?
A: In the PU scenario, if bidders expect zero participation, they bid zero conditional on being contacted; the seller then has no incentive to recruit, validating the expectation. This equilibrium is self-sustaining for all parameter values (Claim 2). In the PO scenario, the equilibrium refinement (requiring that off-path beliefs not support negative seller payoff at lambda = 0 when trade equilibria exist) rules out no-trade equilibria when lambda-bar_c &amp;gt; lambda_o and s is not too large; specifically, Proposition 2 shows that no-trade equilibrium is unique only when s &amp;gt; s-bar_o or lambda-bar_c &amp;lt; lambda_o.&lt;/p&gt;
&lt;p&gt;Q: What drives the ranking reversal between PO and PU scenarios?
A: The core result is Claim 3: Ro&amp;rsquo;(lambda) &amp;gt; xi(lambda) for all lambda &amp;gt; 0, meaning the marginal incentive to recruit is strictly stronger under PO than PU. This follows from revenue equivalence: Ro&amp;rsquo;(lambda) = xi(lambda) + (d/d lambda-hat) Ru(lambda, beta_{lambda-hat})|_{lambda-hat=lambda}, and the second term is strictly positive because greater expected participation induces more aggressive bidding. For intermediate s and small c, stronger PO recruitment incentives support higher participation and revenue. For small s or large c, those same stronger incentives generate wasteful over-recruitment in PO, making PU preferable.&lt;/p&gt;
&lt;p&gt;Q: How does the paper connect its PO/PU comparison to a comparison of first- and second-price auctions?
A: In any standard auction where the highest-value bidder wins, payoff and revenue equivalence imply that the bidder payoff function U(lambda) and seller revenue Ro(lambda) are identical. In particular, the dominant-strategy equilibrium of the SPA (where bidders bid their true values regardless of participation) generates the same outcomes as the PO equilibrium, because with truthful bidding the observability of participation is irrelevant. Therefore, comparing PO and PU with an FPA is equivalent to comparing the SPA and FPA with unobservable participation. The two formats are not revenue-equivalent when recruitment and entry frictions are present: their ranking depends on s and c in exactly the way described for PO vs. PU.&lt;/p&gt;
&lt;p&gt;Q: What is the &amp;ldquo;sampling curse&amp;rdquo; and how does it cause market unraveling?
A: The sampling curse arises when sellers have privately known recruitment costs. Because a lower-cost seller optimally recruits more bidders, the probability of any given bidder being contacted is higher when the seller has a lower cost. Conditional on being contacted, a bidder therefore believes the seller more likely has a low cost and thus has recruited many competitors, reducing the value of entry. In the binary-type case (Claim 8), if sL is sufficiently small relative to sH, the low-cost seller must recruit so many bidders that entry becomes unattractive; the resulting low q* makes the marginal recruitment cost sH/q* prohibitively high for the high-cost type, driving it out (lambda*_H = 0).&lt;/p&gt;
&lt;p&gt;Q: What does Proposition 3 establish about near-complete unraveling with a continuum of seller types?
A: With seller costs uniformly distributed on [s-bar, s_o], Proposition 3 establishes a threshold c-hat strictly between 0 and c-bar such that: (i) for c in (c-hat, c-bar), as s-bar approaches zero, the fraction of seller types with zero recruitment approaches one — near-complete market unraveling; (ii) for c &amp;lt; c-hat, all seller types remain active regardless of how small s-bar is. This is striking because for any commonly known s in (0, s_o), the PO scenario supports trade for all c &amp;lt; c-bar; unraveling arises purely from the interaction of private cost information and the sampling curse, not from any type&amp;rsquo;s cost being intrinsically too high.&lt;/p&gt;
&lt;p&gt;Q: What does the welfare analysis say about equilibrium efficiency?
A: The welfare-maximizing participation level lambda_w satisfies U(lambda_w) = c + s (equating the marginal bidder&amp;rsquo;s surplus to the full social cost of one more participant), with full entry q_w = 1. In equilibrium under PO, q* &amp;lt; 1 in some cases (wasted recruitment) and lambda* differs from lambda_w for almost all (s, c) pairs — both excessive participation (lambda* &amp;gt; lambda_w) and deficient participation (lambda* &amp;lt; lambda_w) can arise. Full efficiency requires Ro&amp;rsquo;(lambda*) = s and U(lambda*) = s + c simultaneously, but since both U and Ro&amp;rsquo; are independent of s and c as parameters, these equalities generically fail.&lt;/p&gt;
&lt;p&gt;Q: Does the seller benefit from being able to commit to recruitment effort?
A: Claim 10 shows that with observable effort in the PO scenario, the seller commits to gamma-hat = min{lambda-bar_c, lambda_o(s)} when lambda-bar_c &amp;gt;= lambda_o, and to lambda-bar_c (if profitable) when lambda-bar_c &amp;lt; lambda_o. Commitment strictly improves the seller&amp;rsquo;s profit whenever gamma-hat = lambda-bar_c: it enables positive trade when lambda-bar_c &amp;lt; lambda_o and Ro(lambda-bar_c) &amp;gt; lambda-bar_c * s (otherwise impossible without commitment), and it saves recruitment costs when lambda-bar_c &amp;gt; lambda_o and Ro&amp;rsquo;(lambda-bar_c) &amp;gt; s. However, the commitment outcome is always welfare-inefficient: lambda-bar_c &amp;gt; lambda_w whenever s &amp;gt; 0.&lt;/p&gt;
&lt;p&gt;Q: What anecdotal evidence do the authors cite for the model&amp;rsquo;s relevance?
A: Subramanian (2010) and Boone and Mulherin (2004, 2009) show that the majority of merger and acquisition auctions are &amp;ldquo;informal&amp;rdquo; — mixtures of auctions and negotiations rather than structured processes with rules laid out in advance — and that sellers are typically unable to credibly commit to participation levels. Milgrom (2003) states from consulting experience that marketing an auction is often more critical than clever mechanism design. Fees of 20–30% of revenues paid to intermediaries like Christie&amp;rsquo;s and Sotheby&amp;rsquo;s are offered as quantitative evidence of the magnitude of the inefficiencies that such intermediaries reduce. Home repair contracting is cited as a familiar informal-auction setting where both recruitment and entry costs are material.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Recruitment effort (gamma):&lt;/strong&gt; The seller&amp;rsquo;s costly action of contacting potential bidders, modeled as a Poisson process with mean gamma at cost gamma*s; unobservable to bidders in the baseline model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Participation-observable (PO) vs. participation-unobservable (PU) scenarios:&lt;/strong&gt; The two variants of the model; in PO, bidders observe the total number of entrants n before bidding; in PU, they do not observe n and the seller cannot credibly disclose it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Minimum profitable scale (lambda_o):&lt;/strong&gt; The smallest positive participation level the seller will ever choose in equilibrium, defined as the value where Ro&amp;rsquo;(lambda_o) equals the peak of the average revenue curve s-bar_o. The seller always recruits either zero bidders or at least lambda_o, due to the initial complementarity of bidders (they stimulate each other&amp;rsquo;s bids) under no-commitment-to-reserve-price.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Break-even participation level (lambda-bar_c):&lt;/strong&gt; The maximum participation level at which a bidder&amp;rsquo;s expected gross payoff U(lambda) equals the entry cost c; bidders will not enter if they expect participation above lambda-bar_c.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sampling curse:&lt;/strong&gt; The adverse-selection mechanism arising when sellers have privately known recruitment costs: because low-cost sellers recruit more, a contacted bidder infers the seller is more likely to have a low cost and thus to have recruited many competitors, making entry less attractive and potentially driving higher-cost seller types out of the market.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;xi(lambda):&lt;/strong&gt; The seller&amp;rsquo;s marginal revenue with respect to recruitment in the PU scenario, defined as the total derivative of Ru(lambda, beta_{lambda-hat}) evaluated where actual and expected participation coincide (lambda-hat = lambda). Strictly less than Ro&amp;rsquo;(lambda) for all lambda &amp;gt; 0, reflecting that in PU the seller loses the ability to leverage bidder aggression via observable competition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wasteful recruitment:&lt;/strong&gt; The equilibrium phenomenon in which total recruitment cost lambda*(s/q*) remains at the positive constant lambda-bar_c * Ro&amp;rsquo;(lambda-bar_c) even as s approaches zero, because incentive constraints — not technology — pin the equilibrium effort level.&lt;/p&gt;</description></item><item><title>Automated credit limit increases and consumer welfare</title><link>https://macropaperwarehouse.com/papers/automated-credit-limit-increases-and-consumer-welfare/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/automated-credit-limit-increases-and-consumer-welfare/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Should regulators restrict banks from proactively raising credit card limits using machine-learning algorithms, and if so, how? The paper asks: to what extent are bank-initiated credit limit increases directed toward revolving borrowers (those who carry interest-accruing balances month-to-month), and what are the welfare consequences of policies that constrain such increases?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The empirical analysis uses the Federal Reserve&amp;rsquo;s Capital Assessments and Stress Testing (Y-14M) regulatory data, January 2014 to December 2024, covering monthly account-level records for all credit cards issued by large stress-tested banks (assets &amp;gt; $100B). The 26 banks in the sample collectively represent more than 70% of U.S. credit card balances. A 0.5% sample yields more than 150 million observations across more than 3.6 million unique active credit cards. A key advantage of Y-14 over credit bureau data is that it identifies whether each limit change was bank-initiated or consumer-initiated — a distinction not available in other datasets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stylized Facts.&lt;/strong&gt; Credit limit increases are an important and understudied source of consumer credit. During the post-pandemic period, limit increases generate more than $40 billion of additional available credit per quarter, roughly 60% of the approximately $70 billion coming from new card originations; prior to the pandemic the figure was about $30 billion, or roughly half of new issuance. The number of accounts undergoing a limit increase each quarter is on average 30% higher than the number of new cards issued. Consistent with &amp;ldquo;low-and-grow&amp;rdquo; lending strategies, limit increases are disproportionately important for lower credit-score borrowers: average subprime credit limits rise from $700 at origination to $2,700 by five years after origination (a 285% increase) and to nearly $5,000 by eight years, while average superprime limits rise only from approximately $12,000 to $15,000 (a 25% increase). About 30% of total revolving balances are made possible by limit increases, with the share reaching 60% for subprime borrowers but only 12% for superprime borrowers. Approximately 75–80% of all limit increases — both by dollar amount and by number of cards — are bank-initiated rather than consumer-initiated. Banks that more frequently reference &amp;ldquo;artificial intelligence&amp;rdquo; or &amp;ldquo;machine learning&amp;rdquo; in their 10-K filings support a larger share of revolving balances through limit increases. Bank-initiated increases are roughly 1.5–2 times more prevalent among accounts that have revolved in the prior three months, whereas consumer-initiated increases show essentially no differential by revolving status.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Analysis.&lt;/strong&gt; Using a linear probability model with card-portfolio-group fixed effects, month fixed effects, and controls for credit score, income, prior limit changes, and other account characteristics, the authors show that the probability of a bank-initiated limit increase follows an inverse-U shape in revolving utilization: accounts with revolving utilization in the moderate range (roughly 0.2–0.7) are most likely to receive an increase, while those near zero or near 1.0 are not. An account with revolving utilization in the (0.2, 0.3] bin is approximately as likely to receive a limit increase as an account whose credit score just rose by 66 points. Transacting utilization, by contrast, follows a logistic growth pattern: the probability rises monotonically until about a utilization of 0.3 and is flat above that. An event study shows that after a bank-initiated limit increase, revolving utilization rebounds to its pre-increase level within approximately 8 months; on average, revolving balances increase by about 40% of the limit increase, with approximately 30% of the limit increase going toward revolving balances. This rebound occurs even for accounts with revolving utilization below the pre-increase mean of 0.28, indicating that the effect is not confined to liquidity-constrained borrowers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The authors develop a life-cycle consumption–saving model with credit card borrowing, uninsurable income and employment risk, potential default (Chapter 7 style), and heterogeneous preferences following Nakajima (2017) and Gul–Pesendorfer (2001, 2004). Two household types coexist: 60% with standard exponential-discounting preferences (calibrated β = 0.92) and 40% with temptation preferences (calibrated β = 0.96, temptation parameter λ = 0.28 from Kovacs et al., 2021). The credit limit increase function is calibrated using Y-14M data via a latent-variable formulation, replicating the empirical inverted-U relationship between revolving utilization and limit increase probability. The four internally calibrated targets are: share of households with revolving credit card debt (data: 45%, model: 41.8%); utilization rate conditional on debt (data: 35%, model: 28.9%); default probability (data: 0.94%, model: 0.94%); debt-to-income ratio (data: 8.6%, model: 6.8%).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings — Baseline.&lt;/strong&gt; Through the model, tempted agents are disproportionately likely to receive credit limit increases because they are more likely to revolve. For customers with utilization above 50%, the majority of credit limit increases are detrimental from the borrower&amp;rsquo;s own perspective. Standard agents almost always benefit from higher credit limits.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Counterfactual 1 — UK-style (prohibit limit increases for revolving borrowers).&lt;/strong&gt; This policy reduces the annual probability of limit increases from roughly 5.5% to approximately 1.0%. The default probability falls from about 0.9% to near zero. The debt-to-income ratio declines by roughly 2 percentage points. Aggregate welfare improves by 1.12% in consumption equivalent variation (CEV) when the social planner internalizes the psychological cost of temptation (0.98% without). Standard households incur a modest welfare loss of 0.21% from reduced consumption-smoothing flexibility, while tempted households gain approximately 3.12% in CEV.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Counterfactual 2 — Canada/EU-style (require consumer consent).&lt;/strong&gt; This policy reduces the annual limit-increase probability from 5.5% to approximately 1.9%. Aggregate welfare improves by 1.16% in CEV (1.04% without psychological costs). Standard households lose 0.19%, while tempted households gain approximately 3.19%. Under the baseline assumption of sophisticated tempted households, results are nearly identical to the UK-style policy. However, when the fraction of naïve tempted households is large, the consent-based policy becomes ineffective (naïve consumers accept limit increases they will regret), whereas the UK-style revolving-borrower ban remains welfare-improving regardless of the naïve share.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Robustness.&lt;/strong&gt; When the firm is allowed to re-optimize its credit limit increase policy, it endogenously reallocates more limit increases toward standard consumers. Welfare gains remain positive but are attenuated: the UK-style policy yields 0.21% CEV (vs. 1.12% in the baseline calibration) and the consent-based policy yields 0.27% CEV.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy Implications.&lt;/strong&gt; The U.S. lacks regulation of bank-initiated proactive credit limit increases (existing rules under ECOA and ability-to-pay provisions are largely non-binding for this purpose). The authors conclude that banks&amp;rsquo; revealed preference for targeting revolvers constitutes an implicit targeting of consumers with self-control issues, and that if a meaningful share of households have self-control issues, there are strong consumer protection grounds for regulating algorithmic credit limit increases.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-why-do-the-authors-use-y-14m-data-rather-than-credit-bureau-data-and-what-does-this-data-uniquely-enable"&gt;Q1. Why do the authors use Y-14M data rather than credit bureau data, and what does this data uniquely enable?&lt;/h3&gt;
&lt;p&gt;A: The Y-14M dataset allows the authors to distinguish between bank-initiated and consumer-initiated credit limit changes — a distinction not observable in credit bureau data. It also contains actual payment information enabling identification of revolvers (those carrying interest-accruing balances) rather than just total balances. The sample covers more than 70% of U.S. credit card balances and more than 150 million monthly observations over the January 2014 to December 2024 period.&lt;/p&gt;
&lt;h3 id="q2-how-large-are-credit-limit-increases-relative-to-new-card-originations-in-the-us-credit-card-market"&gt;Q2. How large are credit limit increases relative to new card originations in the U.S. credit card market?&lt;/h3&gt;
&lt;p&gt;A: During the post-pandemic period, limit increases produce more than $40 billion of additional available credit per quarter, roughly 60% of the approximately $70 billion created by new card originations. Prior to the pandemic the figure was approximately $30 billion, or about half of new issuance. On a count basis, the number of cards undergoing a limit increase each quarter is on average 30% higher than the number of new cards issued.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-low-and-grow-strategy-and-how-large-is-the-subsequent-credit-expansion"&gt;Q3. What is the &amp;ldquo;low-and-grow&amp;rdquo; strategy, and how large is the subsequent credit expansion?&lt;/h3&gt;
&lt;p&gt;A: The low-and-grow strategy involves originating higher-risk borrowers at low initial credit limits and then expanding limits based on observed borrowing behavior. For the average subprime credit card, the initial limit of $700 grows to $2,700 by five years after origination (a 285% increase) and to nearly $5,000 by eight years. For superprime borrowers, the initial limit of approximately $12,000 grows only to $15,000 (a 25% increase) by five years and then is approximately unchanged.&lt;/p&gt;
&lt;h3 id="q4-how-does-a-borrowers-revolving-status-affect-the-probability-of-receiving-a-bank-initiated-limit-increase"&gt;Q4. How does a borrower&amp;rsquo;s revolving status affect the probability of receiving a bank-initiated limit increase?&lt;/h3&gt;
&lt;p&gt;A: Bank-initiated increases are approximately 1.5–2 times more prevalent among accounts that have revolved at least once in the prior three months, compared to non-revolving accounts. By contrast, consumer-initiated increases show essentially no differential between revolvers and non-revolvers. This reveals a bank-side revealed preference for targeting revolvers.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-shape-of-the-relationship-between-revolving-utilization-and-the-probability-of-a-bank-initiated-limit-increase-and-how-large-is-its-economic-magnitude"&gt;Q5. What is the shape of the relationship between revolving utilization and the probability of a bank-initiated limit increase, and how large is its economic magnitude?&lt;/h3&gt;
&lt;p&gt;A: The relationship follows an inverted-U shape. Accounts with revolving utilization in bins between approximately 0.2 and 0.7 have the highest probability of receiving an increase; accounts near zero or near full utilization are as unlikely to receive an increase as zero-utilization accounts. The effect of being in the (0.2, 0.3] revolving utilization bin has approximately the same positive effect on the probability of receiving a limit increase as a 66-point increase in credit score, making it economically large relative to standard risk signals.&lt;/p&gt;
&lt;h3 id="q6-how-does-transacting-utilization-relate-to-bank-initiated-limit-increases-and-how-does-this-differ-from-revolving-utilization"&gt;Q6. How does transacting utilization relate to bank-initiated limit increases, and how does this differ from revolving utilization?&lt;/h3&gt;
&lt;p&gt;A: Transacting utilization follows a logistic growth pattern rather than an inverted-U. The probability of receiving a limit increase rises monotonically with transacting utilization until about a utilization of 0.3, above which the probability does not vary with utilization. This contrasts with revolving utilization, where very high utilization (above 0.9) is actually no more predictive than zero utilization.&lt;/p&gt;
&lt;h3 id="q7-what-does-the-event-study-show-about-borrowing-behavior-following-credit-limit-increases"&gt;Q7. What does the event study show about borrowing behavior following credit limit increases?&lt;/h3&gt;
&lt;p&gt;A: After a bank-initiated limit increase, revolving utilization (as a share of the credit limit) drops mechanically but then rebounds to pre-increase levels within approximately 8 months. On average, revolving balances increase by about 40% of the amount of the limit increase, with approximately 30% of each dollar of new credit limit going toward revolving balances. These magnitudes are somewhat larger than the 13% (Gross and Souleles, 2002) and 18% (Aydin, 2022) found in prior work, which the authors attribute to the non-causal nature of their event study, higher average utilization in their sample, and their focus on revolving rather than total utilization.&lt;/p&gt;
&lt;h3 id="q8-is-the-post-increase-borrowing-rebound-driven-by-liquidity-constrained-borrowers"&gt;Q8. Is the post-increase borrowing rebound driven by liquidity-constrained borrowers?&lt;/h3&gt;
&lt;p&gt;A: No. The authors show that limiting the sample to accounts with revolving utilization below the pre-increase mean of 0.28 — accounts that are unlikely to be liquidity constrained — yields very similar results. This finding is consistent with the presence of self-control issues rather than binding credit constraints.&lt;/p&gt;
&lt;h3 id="q9-what-are-the-key-modeling-assumptions-about-household-types-and-how-were-the-share-parameters-calibrated"&gt;Q9. What are the key modeling assumptions about household types, and how were the share parameters calibrated?&lt;/h3&gt;
&lt;p&gt;A: The model features two types: 60% with standard exponential-discounting preferences (estimated discount factor β = 0.92) and 40% with temptation preferences (β = 0.96, temptation parameter λ = 0.28 set from Kovacs et al., 2021). The 40% tempted share is internally estimated via the Method of Simulated Moments targeting four aggregate moments: share with revolving credit card debt (45% in data, 41.8% in model), utilization rate conditional on debt (35% vs. 28.9%), default probability (0.94% vs. 0.94%), and debt-to-income ratio (8.6% vs. 6.8%).&lt;/p&gt;
&lt;h3 id="q10-how-do-tempted-and-standard-households-differ-in-their-credit-card-usage-within-the-model"&gt;Q10. How do tempted and standard households differ in their credit card usage within the model?&lt;/h3&gt;
&lt;p&gt;A: In the model, 76% of tempted agents carry revolving credit card debt, with an average utilization rate of 73.6%, a debt-to-income ratio of 15.4%, and a default probability of 2.22%. Standard agents carry debt only 18.9% of the time, with average utilization of 4.1%, a debt-to-income ratio of 1.1%, and a default probability of 0.08%. Tempted agents also pay a substantially higher share of income on credit card interest.&lt;/p&gt;
&lt;h3 id="q11-how-does-the-model-capture-the-mechanism-by-which-credit-limit-increases-harm-tempted-households"&gt;Q11. How does the model capture the mechanism by which credit limit increases harm tempted households?&lt;/h3&gt;
&lt;p&gt;A: The Gul–Pesendorfer temptation utility function makes household welfare depend on both actual consumption and the most tempting consumption alternative available (the budget-set maximum). When credit limits rise, the most tempting alternative ˜c_t increases, which raises the utility cost of self-restraint even for households that do not succumb to temptation. This mechanism is distinct from hyperbolic discounting: temptation imposes a psychic cost even on those who ultimately choose not to over-borrow.&lt;/p&gt;
&lt;h3 id="q12-what-are-the-quantitative-welfare-effects-of-the-uk-style-policy-prohibiting-limit-increases-for-revolving-borrowers"&gt;Q12. What are the quantitative welfare effects of the UK-style policy prohibiting limit increases for revolving borrowers?&lt;/h3&gt;
&lt;p&gt;A: The policy yields an overall welfare gain of 1.12% in consumption equivalent variation (CEV) when the social planner internalizes the psychological cost of temptation (0.98% without). Standard households suffer a modest welfare loss of 0.21% from reduced consumption-smoothing flexibility. Tempted households gain approximately 3.12% in CEV, because the benefit from reduced temptation and lower interest expenditure outweighs the cost of reduced credit access.&lt;/p&gt;
&lt;h3 id="q13-what-are-the-quantitative-welfare-effects-of-the-canadaeu-style-consent-required-policy"&gt;Q13. What are the quantitative welfare effects of the Canada/EU-style consent-required policy?&lt;/h3&gt;
&lt;p&gt;A: The consent-based policy yields an overall welfare gain of 1.16% in CEV (1.04% without psychological costs). Standard households lose 0.19%, and tempted households gain approximately 3.19%. Under the baseline assumption of fully sophisticated tempted households, results are nearly identical to the UK-style ban.&lt;/p&gt;
&lt;h3 id="q14-how-sensitive-are-the-two-policy-counterfactuals-to-the-share-of-naïve-unaware-of-their-self-control-issues-tempted-households"&gt;Q14. How sensitive are the two policy counterfactuals to the share of naïve (unaware of their self-control issues) tempted households?&lt;/h3&gt;
&lt;p&gt;A: The UK-style ban on limit increases for revolving borrowers remains welfare-improving regardless of whether tempted households are sophisticated or naïve — the welfare impact is approximately flat as the naïve fraction rises from zero to one. The consent-based policy, by contrast, exhibits a negative linear relationship between the naïve fraction and welfare impact, with welfare gains disappearing as the naïve fraction approaches one. Naïve consumers accept limit increases they would regret, so the policy&amp;rsquo;s effectiveness depends on households accurately recognizing their own self-control issues.&lt;/p&gt;
&lt;h3 id="q15-what-happens-when-the-firm-is-allowed-to-re-optimize-its-credit-limit-increase-policy-in-response-to-regulation"&gt;Q15. What happens when the firm is allowed to re-optimize its credit limit increase policy in response to regulation?&lt;/h3&gt;
&lt;p&gt;A: With firm re-optimization, both counterfactual policies continue to improve welfare but the magnitudes are attenuated. The UK-style policy yields 0.21% CEV overall (tempted: 0.89%) and the consent-based policy yields 0.27% overall (tempted: 0.98%), compared to 1.12% and 1.16% without re-optimization. The re-optimizing firm reallocates more limit increases toward standard consumers, which reduces the number directed at tempted households but also limits the welfare gains from regulation.&lt;/p&gt;
&lt;h3 id="q16-what-do-lenders-10-k-filings-reveal-about-the-role-of-aiml-in-targeting-revolvers-for-limit-increases"&gt;Q16. What do lenders&amp;rsquo; 10-K filings reveal about the role of AI/ML in targeting revolvers for limit increases?&lt;/h3&gt;
&lt;p&gt;A: Banks that mention &amp;ldquo;artificial intelligence&amp;rdquo; or &amp;ldquo;machine learning&amp;rdquo; above the median number of times in their 2024 10-K filings support a higher share of revolving balances through credit limit increases, for all credit score groups. This difference is not driven by differences in credit limits at origination between higher-AI and lower-AI lenders, suggesting that AI/ML adoption affects the targeting of limit increases toward revolvers rather than the initial credit allocation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Revolving utilization.&lt;/strong&gt; In this paper, revolving utilization is defined as the portion of overall credit card utilization attributable to balances that the borrower carries from one month to the next without full repayment, thereby accruing interest. It is measured as revolving balances divided by credit limit, averaged over the prior three months. This is distinct from transacting utilization (new purchases as a share of limit) and is the primary signal banks use — implicitly, via their algorithms — to select accounts for proactive limit increases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bank-initiated vs. consumer-initiated credit limit increase.&lt;/strong&gt; A bank-initiated limit increase is one in which the lender proactively raises a borrower&amp;rsquo;s credit limit without a request from the borrower. A consumer-initiated increase is one explicitly requested by the borrower. The Y-14M data uniquely identify the source of each change. The paper documents that approximately 75–80% of all limit increases are bank-initiated, and that bank-initiated increases are strongly correlated with revolving utilization whereas consumer-initiated increases are not.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Low-and-grow strategy.&lt;/strong&gt; The practice of originating higher-risk borrowers at low initial credit limits and then expanding those limits over time based on observed borrowing behavior. In the paper this is a documented empirical pattern, not an assumption: subprime accounts start at an average $700 limit at origination and reach nearly $5,000 by eight years, a 285% increase versus only 25% for superprime accounts over the same horizon.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Temptation preferences (Gul–Pesendorfer).&lt;/strong&gt; A utility framework in which household welfare depends not only on actual consumption but also on the most tempting consumption alternative within the budget set. The disutility from temptation arises even when the household does not succumb — it reflects the psychological cost of self-restraint. In the paper, λ (set to 0.28) parameterizes the weight of this temptation cost relative to standard utility. Temptation preferences are time-consistent, which facilitates welfare analysis, and are preferred to hyperbolic discounting in this setting because they predict that individuals may pay to have tempting options removed even without acting on them.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Revealed preference for targeting revolvers.&lt;/strong&gt; The paper&amp;rsquo;s characterization of banks&amp;rsquo; credit limit increase behavior as reflecting a systematic preference for giving increases to revolving borrowers, inferred from the empirical pattern in the Y-14M data (the inverted-U shape between revolving utilization and limit increase probability). Because banks&amp;rsquo; algorithms are proprietary and unobserved, the paper interprets the observed allocation of limit increases as a revealed preference, consistent with banks&amp;rsquo; profit motive since revolvers generate the majority of credit card interest income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption equivalent variation (CEV).&lt;/strong&gt; The welfare metric used throughout the paper&amp;rsquo;s counterfactual analysis. CEV is defined as the percentage change in consumption in every period and state that would make households indifferent between the baseline policy regime and the counterfactual policy. A positive CEV indicates that the counterfactual policy improves welfare; a negative CEV indicates harm. The paper considers two versions: one in which the social planner internalizes the psychological cost of temptation (consistent with tempted households&amp;rsquo; actual preferences), and one in which the planner ignores that cost (λ = 0 for the planner) but households still face temptation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Persistent revolving debt (UK regulatory definition).&lt;/strong&gt; In the UK Financial Conduct Authority&amp;rsquo;s framework, a borrower is considered in &amp;ldquo;persistent revolving debt&amp;rdquo; when the cumulative amount paid toward interest and fees exceeds the cumulative amount of principal repaid over a 12-month period. The UK rule prohibits lenders from increasing credit limits for borrowers meeting this definition. The paper models a stylized version: any account currently carrying a revolving balance is ineligible for a bank-initiated limit increase in the UK-style counterfactual.&lt;/p&gt;</description></item><item><title>Automation and Rent Dissipation</title><link>https://macropaperwarehouse.com/papers/automation-and-rent-dissipation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/automation-and-rent-dissipation/</guid><description>&lt;p&gt;Acemoglu and Restrepo examine the effects of automation in economies where labor market distortions cause some workers to earn rents—wages above their opportunity cost or outside option. The central question is how the interplay between automation and these distortions shapes wages, inequality, and productivity. The paper makes three contributions: a theoretical framework identifying a rent dissipation mechanism, reduced-form empirical evidence using US data from 1980 to 2016, and a general equilibrium quantification of automation&amp;rsquo;s aggregate effects.&lt;/p&gt;
&lt;p&gt;The theoretical framework extends the task model of Acemoglu and Restrepo (2022) to incorporate task-specific wage wedges. In this setup, a firm employing labor of type g in task x pays a wage equal to the base wage multiplied by an exogenous wedge capturing rents from efficiency wages, bargaining, licensing, regulations, or norms. Because these wedges artificially inflate labor costs in high-rent tasks, firms have a stronger incentive to automate precisely those tasks—automation saves more in labor costs where rents are highest. Proposition 3 establishes that endogenous adoption decisions are tilted toward high-rent tasks: the rent distribution in automated tasks first-order stochastically dominates the rent distribution across all tasks. This targeting generates the rent dissipation mechanism. The equilibrium is inefficient on both the intensive margin (too little employment in high-rent tasks) and the extensive margin (excessive automation of high-rent tasks that a social planner would prefer to keep labor-intensive).&lt;/p&gt;
&lt;p&gt;The rent dissipation mechanism has three consequences identified theoretically. First, it amplifies average wage losses for exposed groups beyond what displacement alone would produce, pushing displaced workers toward lower-paying jobs. Second, it compresses within-group wage dispersion by concentrating losses at higher percentiles of the within-group distribution, generating a U-shaped pattern of wage changes: workers at low percentiles earn no rents and experience only base-wage adjustments, while workers between the 70th and 95th percentiles face the steepest declines due to loss of high-rent jobs. Third, it is inefficient: because the tasks targeted by automation are not those where wages reflect scarcity or skill but rather distortionary rents, a planner would have preferred more labor allocated to these tasks, and rent dissipation offsets part or all of the cost-saving productivity gains from automation.&lt;/p&gt;
&lt;p&gt;The empirical analysis covers 500 detailed demographic groups defined by education (five levels), gender, five age groups, five race/ethnicity groups, and nativity. Task displacement is measured as a weighted sum of industry-level automation exposure using three proxies: adjusted industrial robot penetration, specialized software services, and dedicated machinery in value added. Workers in the middle and lower-middle of the wage distribution lost 15–20% of their tasks to automation between 1980 and 2016, while post-college workers saw few tasks automated.&lt;/p&gt;
&lt;p&gt;A 10 percentage point increase in task displacement is associated with a 24% decline in group-level relative wages (β = −2.36, s.e. = 0.13), falling to 19% after controlling for gender, education, sectoral demand, and rent shifters (β = −1.90, s.e. = 0.29). The U-shaped pattern in within-group wage changes is clearly visible: wages decline by 25–30% per 10 percentage point task displacement at the 70th–90th percentiles, compared to only 16% at the 5th–40th percentiles. Decomposing the average wage effect, the base-wage component is β = −1.53 (s.e. = 0.33) and the rent-dissipation component is β = −0.37 (s.e. = 0.11), implying a rent dissipation rate of approximately 37%. Across multiple proxies for rents—inter-industry/occupation wage differentials, wage losses after job displacement, and quit rates—the average estimated rent dissipation rate is approximately 35%. Rent dissipation accounts for one-fifth of the overall relative wage decline experienced by groups exposed to automation.&lt;/p&gt;
&lt;p&gt;In the general equilibrium quantification (with elasticity of substitution λ = 0.5, average cost savings π = 30%, and average rent in automated tasks of 35%), automation accounts for 52% of the rise in between-group wage inequality since 1980: 42 percentage points via baseline displacement effects on labor demand, and 10 percentage points via rent dissipation. Cost savings from automation increased TFP by approximately 3% between 1980 and 2016, but inefficient rent dissipation offsets 60–90% of these gains, leaving net TFP gains of only 0.3–1.3% and net aggregate consumption gains of only 0.45–1.95% over the 36-year period.&lt;/p&gt;
&lt;p&gt;Q: What is the rent dissipation mechanism, and why does it arise?
A: Rent dissipation arises because labor market wedges make high-rent tasks artificially costly to staff with workers, giving firms a stronger incentive to automate precisely those tasks. When automation displaces workers from high-rent jobs, workers lose the premium above their opportunity cost that those jobs paid, amplifying wage losses beyond what displacement alone would cause. The mechanism is endogenous: firms do not randomly automate tasks but disproportionately target tasks where rents are highest, since doing so saves the most in labor costs. Proposition 3 formalizes this as first-order stochastic dominance of the rent distribution in automated tasks over the rent distribution in all tasks.&lt;/p&gt;
&lt;p&gt;Q: Why is rent dissipation inefficient?
A: In a distorted economy, high-rent tasks already feature too little employment at the equilibrium—firms under-hire in these tasks because the wage wedge makes labor artificially expensive. A social planner would want to allocate more labor to these tasks, not less. When automation further removes labor from high-rent tasks, it moves the economy further from the efficient allocation, dissipating rents that reflect distortions rather than true scarcity. The TFP formula shows that this inefficient targeting offsets part or all of the cost-saving gains from automation, and can even reduce aggregate productivity if the cost savings are small relative to the rent losses.&lt;/p&gt;
&lt;p&gt;Q: What is the U-shaped pattern of within-group wage changes, and what does it indicate?
A: The U-shaped pattern means that wage declines due to automation are smallest at the bottom percentiles of a group&amp;rsquo;s within-group wage distribution, largest in the 70th–95th percentile range, and then smaller again at the very top. Workers at low percentiles earn no rents, so they experience only the base-wage adjustment from reduced labor demand. Workers in the middle-upper range of the distribution hold the high-rent jobs that are disproportionately automated, so they lose both the base-wage component and the rent component of their wages. This pattern is directly visible in US data 1980–2016, with declines of 25–30% per 10 percentage point task displacement at the 70th–90th percentiles versus 16% at the 5th–40th percentiles.&lt;/p&gt;
&lt;p&gt;Q: How is task displacement measured, and which groups are most exposed?
A: Task displacement is measured as a weighted sum of industry-level automation exposure, accounting for each demographic group&amp;rsquo;s specialization in routine tasks within industries. Three proxies are used: the adjusted penetration of industrial robots, the increase in specialized software services, and the increase in dedicated machinery in value added. Workers in the middle and lower-middle of the wage distribution—broadly corresponding to non-college workers—lost 15–20% of their tasks to automation between 1980 and 2016. Post-college degree workers saw few tasks automated.&lt;/p&gt;
&lt;p&gt;Q: How large is the rent dissipation rate, and how robust is this estimate?
A: The baseline estimate from the U-shaped within-group wage change decomposition implies a rent dissipation rate (μ_Ag/μ_g − 1) of approximately 37% (β = −0.37, s.e. = 0.11). Using inter-industry and occupation wage differentials as a proxy for rents, the estimate is 39% (β = −0.39, s.e. = 0.11). Using wage losses after job displacement, the estimate is 20% (β = −0.20, s.e. = 0.04). After purging compensating differentials from the wage differential proxy the estimate remains 37%; after purging from the displacement-loss proxy it falls to 19%. Quit-rate evidence is consistent with rent dissipation: automation shifts workers toward higher-quit-rate jobs, which are lower-rent jobs. The average across proxies is approximately 35%.&lt;/p&gt;
&lt;p&gt;Q: How much of between-group wage inequality since 1980 does automation explain, and what share is due to rent dissipation specifically?
A: Automation accounts for 52% of the rise in between-group wage inequality in the US since 1980. Of this 52 percentage points, 42 percentage points are attributable to the baseline displacement effect working through reduced labor demand for exposed groups. The remaining 10 percentage points are attributable to rent dissipation—automation pushing exposed groups away from high-rent tasks into lower-paying employment. Rent dissipation thus accounts for roughly one-fifth (10/52) of automation&amp;rsquo;s total contribution to between-group inequality.&lt;/p&gt;
&lt;p&gt;Q: How large are the productivity gains from automation, and how much does rent dissipation offset them?
A: Cost savings from automation increased TFP by approximately 3% between 1980 and 2016. However, inefficient rent dissipation offsets 60–90% of these gains, because automation disproportionately targets high-rent tasks rather than tasks where the efficiency case is strongest. The net TFP increase attributable to automation is only 0.3–1.3% over the 36-year period, and the corresponding net increase in aggregate consumption is only 0.45–1.95%.&lt;/p&gt;
&lt;p&gt;Q: How does automation affect within-group versus between-group inequality, and why is this notable?
A: Automation increases between-group inequality by reducing relative wages of exposed groups (largely non-college workers) relative to unexposed groups, accounting for 52% of the rise in between-group inequality since 1980. At the same time, automation reduces within-group wage dispersion for exposed groups by compressing wages at higher percentiles. This contrasts with the standard view that inequality is fractal—rising at all levels of aggregation due to skill-biased demand—and helps explain why within-group inequality has risen steadily for college workers since the 1980s while remaining flat and then declining for non-college workers since the 1990s.&lt;/p&gt;
&lt;p&gt;Q: What do the propagation matrix and rent-impact matrix represent in the general equilibrium analysis?
A: The propagation matrix encodes how task reallocation due to automation in one demographic group creates competition for marginal tasks across other groups, transmitting the wage effects of automation to groups not directly displaced. The rent-impact matrix encodes how this task reallocation changes the rent composition of employment across groups. Both matrices are estimated from US data on task shares and group-level wage elasticities and are used to translate partial-equilibrium estimates of task displacement and rent dissipation into general equilibrium effects on wages and productivity for all demographic groups simultaneously.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of inefficient rent dissipation?
A: Because rent dissipation is inefficient, the social value of automation is lower than what firms and consumers are willing to pay—firms capture all the labor cost savings but do not internalize the welfare cost of destroying high-rent jobs that the distorted equilibrium already under-supplies. Second-best interventions should address the underlying distortions generating rents rather than trying to slow automation directly. The paper suggests that strengthening labor market institutions supporting worker rents in non-automatable tasks could partially counteract the adverse distributional consequences of automation.&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to Bound and Johnson (1992) and Borjas and Ramey (1995)?
A: Bound and Johnson (1992) decompose changes in the US wage structure between 1979 and 1988 into technology, supply, and rent components (modeled as exogenous industry wedges), finding that 10–20% of between-group wage changes reflect rent losses. Borjas and Ramey (1995) estimate that trade increased the college premium by 1.3–2.6 log points between 1976 and 1990, with 15–33% due to loss of rents from trade-exposed jobs. Both are comparable to this paper&amp;rsquo;s finding that rent dissipation accounts for one-fifth of the wage effect of automation, though Bound and Johnson&amp;rsquo;s estimates include all factors affecting rents while this paper isolates automation specifically.&lt;/p&gt;
&lt;p&gt;Worker rents: Wages above a worker&amp;rsquo;s opportunity cost or outside option, arising from efficiency wages, bargaining, licensing, regulations, or norms. Modeled as task-specific multiplicative wedges (μ_gx ≥ 1) that force firms to pay more than the base wage for labor in particular tasks. Explicitly excludes compensating differentials and skill premia.&lt;/p&gt;
&lt;p&gt;Rent dissipation: The loss of above-opportunity-cost wages experienced by workers displaced from high-rent tasks into lower-paying employment. Occurs because automation endogenously targets high-rent tasks where labor is most expensive, and pushes workers into tasks where rents are lower. Quantified as the ratio of average rents in automated tasks to average rents across all tasks, minus one (approximately 35% in US data 1980–2016).&lt;/p&gt;
&lt;p&gt;Task displacement: The share of tasks performed by a demographic group that are automated away, measured as a weighted sum of industry-level automation exposure accounting for the group&amp;rsquo;s specialization in routine tasks. Distinct from employment loss because it captures reallocation of tasks from labor to capital within the production function.&lt;/p&gt;
&lt;p&gt;U-shaped within-group wage change profile: The pattern whereby automation generates the largest wage declines at intermediate-to-upper percentiles (70th–95th) of an exposed group&amp;rsquo;s within-group wage distribution, with smaller declines at the bottom, because high-percentile workers disproportionately hold high-rent jobs targeted by automation. Predicted theoretically and confirmed empirically in US data 1980–2016.&lt;/p&gt;
&lt;p&gt;Propagation matrix: A matrix estimated from US data on task shares and group-level wage elasticities that encodes how automation of tasks performed by one demographic group creates competition for marginal tasks with other groups, transmitting wage effects across the demographic distribution in general equilibrium.&lt;/p&gt;
&lt;p&gt;Inefficient automation targeting: The mechanism by which labor market distortions cause firms to automate high-rent tasks that a social planner would prefer to keep labor-intensive, since the distorted equilibrium already features too little employment in those tasks. Results in rent dissipation offsetting 60–90% of automation&amp;rsquo;s direct TFP gains from cost savings.&lt;/p&gt;
&lt;p&gt;Rent-impact matrix: A matrix that encodes how task reallocation due to automation changes the rent composition of employment across demographic groups, used alongside the propagation matrix to compute general equilibrium effects of automation on wages and productivity accounting for distortions.&lt;/p&gt;</description></item><item><title>Bargaining and Inequality in the Labor Market</title><link>https://macropaperwarehouse.com/papers/bargaining-and-inequality-in-the-labor-market/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/bargaining-and-inequality-in-the-labor-market/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; How prevalent is individual wage bargaining in the labor market, what determines firms&amp;rsquo; bargaining strategies, how do bargaining encounters unfold for workers, and does heterogeneity in bargaining behavior translate into wage inequality—including the gender wage gap?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Setting.&lt;/strong&gt; The paper develops and validates novel linked survey data for Germany. A firm survey was fielded by the ifo Institute to senior HR professionals and managers in two waves (September 2021 and January 2022), yielding 772 complete responses across all major sectors and regions. These responses were linked—with consent obtained from 72% of firms—to German Social Security records (the Integrated Employment Biographies, IEB) covering 416,821 full-time employees at matched firms in 2020, and to Orbis balance sheet data for firm productivity proxies. A separate worker survey was fielded by the IAB to 135,000 full-time German workers, with 9,756 completing it; nearly 10,000 responses were used for analysis, with 7,079 workers employed at surveyed firms. The worker survey elicited detailed bargaining histories for workers who had received an outside offer in the prior six months, bargaining at the start of current employment (for workers with tenure of three years or less), and responses to a hypothetical salary expectation scenario.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Definition of Individual Bargaining.&lt;/strong&gt; The authors define a firm as having a &amp;ldquo;bargaining strategy&amp;rdquo; if it differentiates pay between workers in the same position it perceives to have similar productivity—encompassing both variation in initial offers (which may reflect firms using information on workers&amp;rsquo; salary expectations) and back-and-forth negotiation. Elicitation distinguishes four employee groups (recent labor market entrants, experienced non-managers, managers, and bottleneck-occupation workers) and two contexts (new external hires and incumbent workers who receive an outside offer).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Prevalence of Bargaining.&lt;/strong&gt; Approximately 50% of surveyed firms are willing to differentiate base wages for recent labor market entrants, more than 80% for experienced non-managers and managers, and nearly all for workers in bottleneck occupations they are struggling to fill. For incumbent workers facing outside offers, 57% of firms would increase pay for recent entrants, and more than 80% for experienced incumbents, managers, and bottleneck workers. In total, 80% of workers in the sample are in positions where individual bargaining is possible.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Magnitude of Wage Differentiation.&lt;/strong&gt; For new external hires, the typical firm expects a gap between the highest and lowest offers of 3% for recent entrants, 5% for experienced non-managers, and 10% for managers (conditional on a gap: 6%, 10%, and 12% respectively). For incumbent workers responding to outside offers, the typical firm will adjust pay by 3% for recent entrants, 6% for experienced non-managers, and 10% for managers (conditional on responding: 6%, 7%, and 14% respectively). Forty-four percent of firms report that variation in initial offers is at least as important as back-and-forth negotiation in determining workers&amp;rsquo; final pay.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Predictors of Firm Bargaining Strategies.&lt;/strong&gt; Contrary to models predicting more productive firms are more likely to bargain (Doniger 2015; Postel-Vinay and Robin 2004; Flinn and Mullins 2021), firms that bargain are not more productive—as proxied by firm age, size, or assets per employee—nor do they pay higher mean wages. A variance decomposition shows that employee-group dummies alone explain 33% of variation in bargaining strategies for new hires, comparable to more than 500 firm dummies. Labor market factors—particularly whether a position is hard to fill—are systematically associated with bargaining willingness. Collective bargaining agreement (CBA) coverage and East German location are negatively correlated with bargaining flexibility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How Bargaining Unfolds.&lt;/strong&gt; In 57% of worker-firm interactions, the worker provides salary expectations before the firm makes its initial offer; 29% of firms require this information. About one-third of applicants ask for more after the initial offer, requesting on average a 3% increase; conditional on asking, about half of firms raise the offer, but fewer than one-third match what was requested, with the typical worker improving the offer by 1.5%. The majority of outside offers are rejected: only 9% of workers who received an outside offer in the prior six months chose to move to a new firm. Of the 91% who remained at their incumbent firm, 13% successfully renegotiated their pay. Back-and-forth dynamics—where offers are accepted or rejected only after multiple rounds—are consistent with models of two-sided incomplete information.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Worker Heterogeneity and Wage Inequality.&lt;/strong&gt; Workers with better self-assessed outside options are 9 percentage points more likely to ask for an increase after the initial offer and 7 percentage points more likely to successfully negotiate a raise, relative to same-occupation coworkers with worse outside options. Women are 6 percentage points less likely to successfully negotiate their pay upward and show lower salary expectation provision rates, including in a hypothetical scenario in which pay range information is equalized. These gender differences in bargaining are not explained by women negotiating more over non-wage amenities; controlling for outside options and risk tolerance shrinks the female coefficient by at most 15%. Among surveyed workers, after controlling for occupation-establishment fixed effects, there is no gender wage gap at firms that do not bargain, but a 4–5 percentage point gender wage gap at firms that do bargain. Across specifications, firms that engage in individual bargaining have a 3 percentage point higher gender wage gap. A simple decomposition suggests that at surveyed firms, 44% of the residual gender pay gap can be attributed to bargaining. For workers at bargaining firms, a 10 percentage point higher pay premium at the prior firm is associated with 0.5 percent higher pay at the current firm, conditional on occupation-establishment fixed effects; this relationship is statistically insignificant for workers at non-bargaining firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; Results apply to full-time private-sector workers in Germany between ages 25 and 50, with the firm sample over-representing medium and large firms (median size 50–249 employees). CBA coverage in the sample (41%) reflects Germany&amp;rsquo;s institutional context where firms retain the right to pay above CBA floors. Results are robust to re-weighting to match the overall distribution of German firm size and sector.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-how-do-the-authors-define-individual-bargaining-and-why-is-this-definition-broader-than-standard-labor-economics-usage"&gt;Q1. How do the authors define &amp;ldquo;individual bargaining&amp;rdquo; and why is this definition broader than standard labor economics usage?&lt;/h3&gt;
&lt;p&gt;The authors define a firm as having a bargaining strategy if it differentiates pay between workers in the same position it perceives to have similar productivity, covering both tailoring of initial offers and back-and-forth negotiation. Standard labor economics definitions typically condition on wages being set ex post once outside options are revealed, and focus on back-and-forth negotiation alone. The authors&amp;rsquo; definition is most analogous to standard definitions of price discrimination. Empirically, the vast majority of firms that differentiate initial offers (93%) are also willing to engage in back-and-forth negotiation.&lt;/p&gt;
&lt;h3 id="q2-how-was-the-firm-survey-designed-to-elicit-bargaining-strategies-reliably-and-what-is-the-protocol-question"&gt;Q2. How was the firm survey designed to elicit bargaining strategies reliably, and what is the &amp;ldquo;protocol question&amp;rdquo;?&lt;/h3&gt;
&lt;p&gt;The protocol question asked: &amp;ldquo;How much more could a person maximally receive compared to the fixed compensation you would have offered based on the person&amp;rsquo;s qualification/fit for the position alone?&amp;rdquo; with options ranging from &amp;ldquo;0%/no adjustments possible&amp;rdquo; to &amp;ldquo;more than 40%.&amp;rdquo; Wording was developed through over 100 conversations with HR professionals; &amp;ldquo;qualifications and fit&amp;rdquo; was the phrase most closely aligned with HR professionals&amp;rsquo; concept of productivity. The survey was fielded by the ifo Institute—an organization with decades of experience surveying this population—with a 51% response rate, 83% completion rate, and median response time of 11 minutes.&lt;/p&gt;
&lt;h3 id="q3-what-validation-exercises-support-the-reliability-of-the-elicited-firm-bargaining-measures"&gt;Q3. What validation exercises support the reliability of the elicited firm bargaining measures?&lt;/h3&gt;
&lt;p&gt;Four exercises are reported. First, intra-respondent reliability: the cross-tabulations between the protocol and incidence questions show most mass on or below the diagonal (incidence-implied spread no greater than the protocol-implied flexibility). Second, inter-respondent reliability: among 37 firms with multiple respondents, there is significant overlap in independently provided answers. Third, external validity using publicly available data: for 90% of firms reporting no CBA, no CBA evidence is found; for 99% reporting no pay information in job ads, none is found in online postings; for 82% reporting no salary expectation elicitation, no evidence of it appears in online application forms. Fourth, the elicited firm strategies are highly correlated with the matching workers&amp;rsquo; survey responses—e.g., workers at firms stating they elicit salary expectations are significantly more likely to report having provided these expectations.&lt;/p&gt;
&lt;h3 id="q4-is-firm-productivity-associated-with-whether-a-firm-engages-in-individual-bargaining"&gt;Q4. Is firm productivity associated with whether a firm engages in individual bargaining?&lt;/h3&gt;
&lt;p&gt;No. Firms that bargain and those that do not are similar with respect to firm size, firm age, and total assets per employee, and they also do not differ significantly in their AKM wage premium. These findings are inconsistent with theoretical models predicting that more productive firms are more likely to set pay via bargaining (Doniger 2015; Postel-Vinay and Robin 2004; Flinn and Mullins 2021). The result holds for both binary and continuous measures of bargaining, and is not overturned by machine learning prediction attempts.&lt;/p&gt;
&lt;h3 id="q5-what-firm-characteristics-other-than-productivity-predict-bargaining-strategies"&gt;Q5. What firm characteristics other than productivity predict bargaining strategies?&lt;/h3&gt;
&lt;p&gt;CBA coverage is negatively correlated with wage flexibility—CBA-covered firms report less flexibility even for managers who are typically exempt from CBAs and for groups not covered by CBAs, suggesting institutional norms or culture matter. Firms headquartered in East Germany are less likely to bargain with workers in all groups. Publicly traded firms (stock-based corporations) are more likely to set wages flexibly. These correlations are consistent with the view that managerial style and firm culture (rather than productivity) shape wage-setting strategies.&lt;/p&gt;
&lt;h3 id="q6-what-does-the-variance-decomposition-say-about-the-relative-importance-of-firm-versus-market-factors-in-predicting-bargaining-strategies"&gt;Q6. What does the variance decomposition say about the relative importance of firm versus market factors in predicting bargaining strategies?&lt;/h3&gt;
&lt;p&gt;Employee-group dummies alone explain 33% of the variation in bargaining strategies for new hires. After adjusting for the number of fixed effects used, four employee-group dummies explain as much variation as more than 500 firm dummies. Adding firm characteristics or coarse industry dummies does not significantly improve the adjusted R-squared relative to a model containing only group dummies. This supports models emphasizing market-level factors (worker replaceability, labor market tightness) over firm-level factors.&lt;/p&gt;
&lt;h3 id="q7-how-common-is-it-for-workers-to-provide-salary-expectations-before-receiving-an-initial-offer-and-what-do-firms-do-with-this-information"&gt;Q7. How common is it for workers to provide salary expectations before receiving an initial offer, and what do firms do with this information?&lt;/h3&gt;
&lt;p&gt;In 57% of worker-firm interactions, the worker provides salary expectations before the firm makes its initial offer. Twenty-nine percent of firms require this information; most ask for it. Forty-four percent of firms report that variation in initial offers is at least as important as subsequent back-and-forth negotiations in determining workers&amp;rsquo; final pay. HR professionals and prior research indicate firms interpret variation in stated expectations as reflecting outside options rather than productivity.&lt;/p&gt;
&lt;h3 id="q8-what-fraction-of-outside-offers-are-rejected-and-what-happens-when-workers-stay-at-the-incumbent-firm"&gt;Q8. What fraction of outside offers are rejected, and what happens when workers stay at the incumbent firm?&lt;/h3&gt;
&lt;p&gt;Only 9% of workers who received one or more outside offers in the prior six months chose to move to a new firm. Of the 91% who remained at the incumbent firm, 13% successfully renegotiated their pay at the incumbent. A follow-up survey fielded in spring 2024 corroborates this finding, showing approximately 80% of workers who received an outside offer remained at the incumbent firm; even recoding all job-to-job transitions as accepted offers implies no more than 26% of offers lead to a transition.&lt;/p&gt;
&lt;h3 id="q9-what-do-the-back-and-forth-dynamics-imply-for-appropriate-theoretical-models-of-wage-bargaining"&gt;Q9. What do the back-and-forth dynamics imply for appropriate theoretical models of wage bargaining?&lt;/h3&gt;
&lt;p&gt;That many offers are accepted or rejected only after multiple rounds of negotiation is difficult to rationalize with models assuming either firms or workers have perfect information, which typically predict immediate acceptance or rejection. The patterns are consistent with models of two-sided incomplete information (Perry 1986; Chatterjee and Samuelson 1983). Sixty-nine percent of HR professionals in the survey report that decision-makers at their firm only have market-level information on wages, not specific information on what competitors pay.&lt;/p&gt;
&lt;h3 id="q10-how-do-outside-options-predict-worker-bargaining-behavior-and-outcomes-controlling-for-occupation-establishment-fixed-effects"&gt;Q10. How do outside options predict worker bargaining behavior and outcomes, controlling for occupation-establishment fixed effects?&lt;/h3&gt;
&lt;p&gt;Workers who rated it &amp;ldquo;easy&amp;rdquo; or &amp;ldquo;very easy&amp;rdquo; to obtain a better outside offer are 9 percentage points more likely to ask for an increase after the initial offer and 7 percentage points more likely to successfully negotiate a raise relative to same-occupation-establishment coworkers who rated it &amp;ldquo;difficult&amp;rdquo; or &amp;ldquo;very difficult.&amp;rdquo; The same pattern persists during the employment spell: workers with better outside options are 9 percentage points more likely to initiate and 8 percentage points more likely to succeed in renegotiation. These workers are not more likely to receive raises without asking.&lt;/p&gt;
&lt;h3 id="q11-how-does-risk-tolerance-predict-bargaining-and-how-does-it-compare-to-outside-options"&gt;Q11. How does risk tolerance predict bargaining, and how does it compare to outside options?&lt;/h3&gt;
&lt;p&gt;Workers with greater risk tolerance (those rating themselves 7 or above on a 10-point scale) are more likely to engage in wage negotiations and more likely to succeed both at the start of and during employment spells. Gaps in successful negotiations are somewhat larger than gaps in attempted negotiations, suggesting risk-tolerant workers also negotiate more effectively. However, outside options explain more of the between-worker variation in bargaining behavior than risk tolerance does.&lt;/p&gt;
&lt;h3 id="q12-what-are-the-gender-differences-in-bargaining-behavior-and-can-they-be-explained-by-differences-in-outside-options-or-risk-tolerance"&gt;Q12. What are the gender differences in bargaining behavior, and can they be explained by differences in outside options or risk tolerance?&lt;/h3&gt;
&lt;p&gt;Women are less likely to engage in back-and-forth negotiations and are 6 percentage points less likely to successfully negotiate pay upward during an employment spell. Women are also less likely to provide salary expectations and provide lower expectations as a fraction of their current salary in the hypothetical scenario, including when the salary range is provided—women are 6 percentage points less likely to provide expectations above the top of the stated range. Controlling for outside options and risk tolerance shrinks the female coefficient by at most 15%. There is no evidence that women substitute toward negotiating for non-wage amenities. The pattern is most consistent with women finding negotiation uncomfortable, not with a belief that it will not pay off or fear of backlash.&lt;/p&gt;
&lt;h3 id="q13-what-is-the-estimated-gender-wage-gap-attributable-to-individual-bargaining"&gt;Q13. What is the estimated gender wage gap attributable to individual bargaining?&lt;/h3&gt;
&lt;p&gt;Among surveyed workers, after controlling for occupation-establishment fixed effects, there is no gender wage gap at firms without individual bargaining (coefficient closes to zero), while a 4–5 percentage point gender wage gap persists at firms with individual bargaining. This difference is robust across measures of pay (total daily pay, base pay, pay conditioning on hours worked), alternative fixed effect specifications, and to including non-surveyed workers at surveyed firms. A simple decomposition suggests 44% of the residual gender pay gap at surveyed firms can be attributed to bargaining. Across the interaction specifications, bargaining firms have a 3 percentage point higher gender wage gap and—in one key specification—a 6 percentage point difference between the gender gaps at bargaining and non-bargaining firms.&lt;/p&gt;
&lt;h3 id="q14-how-does-a-workers-prior-firm-wage-premium-affect-current-wages-and-does-bargaining-status-matter"&gt;Q14. How does a worker&amp;rsquo;s prior firm wage premium affect current wages, and does bargaining status matter?&lt;/h3&gt;
&lt;p&gt;In a regression of log current wages on the AKM wage premium of the prior firm (conditional on occupation-establishment fixed effects), a 10 percentage point higher pay premium at the prior firm is associated with 0.5 percent higher pay at the new firm for workers at bargaining firms. For workers whose pay is not set via individual bargaining, the relationship between the prior firm&amp;rsquo;s pay premium and current pay is statistically insignificant. The result is consistent with the idea that during negotiations with a new firm, workers use their prior firm&amp;rsquo;s pay policy as an outside option.&lt;/p&gt;
&lt;h3 id="q15-how-do-akm-person-effects-relate-to-bargaining-behavior"&gt;Q15. How do AKM person effects relate to bargaining behavior?&lt;/h3&gt;
&lt;p&gt;Higher-person-effect individuals are more likely to have provided salary expectations when applying to their current firm and ask for a larger fraction of their current salary in the hypothetical scenario (conditional on their wage). These differences persist when controlling for occupation-establishment fixed effects and age and experience. Higher-person-effect workers are not more likely to receive raises without asking. These results are inconsistent with AKM person effects reflecting only productivity differences and instead suggest that fixed differences in individual bargaining behavior contribute to the variance in person effects—which Card, Heining, and Kline (2013) estimated explains a large share (40%) of the growth in German wage inequality.&lt;/p&gt;
&lt;h3 id="q16-are-the-bargaining-patterns-found-at-surveyed-firms-representative-of-bargaining-more-broadly"&gt;Q16. Are the bargaining patterns found at surveyed firms representative of bargaining more broadly?&lt;/h3&gt;
&lt;p&gt;Two robustness exercises support broader representativeness. First, similar bargaining dynamics are found when including a random sample of German workers employed at non-surveyed firms. Second, re-weighting the sample to match the overall distribution of firm size and sector in Germany yields similar results. Because medium and large firms are over-represented in the firm sample, and because small firms hire infrequently and are less likely to have formal bargaining strategies, the true prevalence of individual bargaining among all German firms may be somewhat lower.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Individual Bargaining Strategy (firm-level).&lt;/strong&gt; A firm has an individual bargaining strategy if it differentiates pay between workers in the same position that it perceives to have similar productivity. This definition encompasses both tailoring of initial offers (based on, e.g., workers&amp;rsquo; stated salary expectations) and back-and-forth negotiation. It is analogous to price discrimination rather than to the standard labor economics distinction between wage posting and Nash bargaining.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Protocol Question.&lt;/strong&gt; The main survey measure of firm bargaining strategies: firms are asked the maximum percentage by which pay could be increased for a new hire above the fixed compensation the firm would have offered based on qualifications and fit alone, with response bins from &amp;ldquo;0%/no adjustments&amp;rdquo; to &amp;ldquo;more than 40%.&amp;rdquo; A zero response is used to classify a firm as not bargaining.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Incidence Question.&lt;/strong&gt; A supplementary survey measure eliciting the expected spread (between highest and lowest offers) that the firm would make to ten candidates with identical qualifications and fit but differing stated salary expectations and competing offers. Used to validate the protocol question and to quantify the importance of initial-offer differentiation relative to back-and-forth negotiation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bottleneck Occupation.&lt;/strong&gt; A firm-defined category of workers in positions that are particularly difficult to fill, drawing on an official German Federal Employment Agency designation. In the paper, bargaining willingness is systematically higher for workers in these positions than for other workers at the same firm, providing evidence that labor market tightness drives bargaining strategies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Outside Offer Renegotiation.&lt;/strong&gt; Wage renegotiation at the incumbent firm triggered by a worker receiving an outside offer, without a change in job tasks. The paper documents this is empirically more common than actual job-to-job transitions: of workers receiving outside offers, 91% remain at the incumbent firm, and 13% of those who remain successfully renegotiate their pay.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;AKM Person Effect.&lt;/strong&gt; A worker fixed effect estimated from a two-way fixed effects regression of log wages on worker and firm fixed effects (following Abowd, Kramarz, and Margolis 1999). In this paper, AKM person effects are taken from Bellmann et al. (2020), estimated over 2010–2017 German population data. The paper provides evidence that these effects capture, in part, fixed differences in individual bargaining behavior rather than solely differences in productivity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;AKM Firm Effect (Wage Premium).&lt;/strong&gt; The firm fixed effect from the same two-way fixed effects regression, representing the pay premium a firm pays relative to what would be expected given its workforce composition. The paper uses the prior firm&amp;rsquo;s AKM effect as a measure of a worker&amp;rsquo;s outside option quality when testing whether prior-firm pay policy influences current pay under individual bargaining.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Salary Expectations (Gehaltsvorstellungen).&lt;/strong&gt; The wage figure a worker provides to a prospective employer, typically before the firm&amp;rsquo;s initial offer. Legally, German firms (like most US states) cannot ask for salary history but can ask for salary expectations. In the paper, 57% of worker-firm interactions begin with the worker providing expectations; firms report using these to tailor initial offers, interpreting variation in stated expectations as reflecting outside options rather than productivity.&lt;/p&gt;</description></item><item><title>Barriers to Global Capital Allocation</title><link>https://macropaperwarehouse.com/papers/barriers-to-global-capital-allocation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/barriers-to-global-capital-allocation/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Why do observed international investment positions and cross-country differences in rates of return to capital fail to conform to a frictionless capital-market benchmark? The paper asks how large the efficiency and distributional costs of barriers to global capital allocation are, and which frictions — capital income taxes, political risk, and geographic/cultural/linguistic distances — matter most.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The authors develop a multi-country dynamic spatial general equilibrium model in which the entire network of bilateral cross-border investment positions is endogenously determined. Production in each country i follows a three-factor Cobb-Douglas function in reproducible capital, labor, and natural resources, with country-varying income shares. Capital is the only mobile factor. A logit asset demand system governs portfolio shares: the share of country j&amp;rsquo;s savings invested in country i is proportional to the risk-adjusted expected return on capital in i, scaled by the capital stock of i, and inversely proportional to a bilateral portfolio wedge ∆ij. These wedges can be microfounded via either rational inattention (where wedges reflect the precision of prior beliefs about returns) or extreme-value-distributed transaction costs. The model admits multiple microfoundations but yields the same functional form and the same counterfactual welfare calculations regardless of interpretation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Frictions measured.&lt;/strong&gt; Three categories of frictions enter the empirical implementation: (a) bilateral capital income tax rates — a new dataset covering 225 countries (50,625 country pairs), constructed from corporate income tax rates and treaty-adjusted withholding tax rates on dividends and interest, further adjusted for effective tax rates accounting for tax-haven routing; (b) political risk, proxied by an ICRG composite index (excluding socioeconomic conditions) following Alfaro, Kalemli-Ozcan, and Volosovych (2008); (c) geo-political distance, comprising geographic distance, cultural distance (based on 496 World Values Survey questions across 116 countries), and linguistic distance (based on a language-family tree covering 6,737 languages and 242 countries). These distance measures are publicly available at geopoliticaldistance.org. The model covers 96 countries (9,216 dyads), representing 92% of world GDP in 2017.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Gravity Estimation.&lt;/strong&gt; Bilateral investment data (restated for tax havens using the nationality-basis methodology of Coppola et al. 2020 and Damgaard et al. 2019) are regressed on cultural, geographic, and linguistic distance with origin and destination fixed effects. In OLS, a one-standard-deviation increase in cultural distance (0.023 units) is associated with a 24.0% decrease in foreign assets; geographic distance (0.977 units in logs) with a 78.6% decrease; linguistic distance (0.174 units) with a 51.5% decrease. These magnitudes are robust across OLS, PPML, and IV (using religious distance as an instrument for cultural distance). Under IV, the standardized effect of cultural distance on log foreign assets rises to −76.5%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Tax haven analysis.&lt;/strong&gt; A Tobit regression of the share of bilateral investment routed through tax havens on the estimated tax saving from routing through havens yields coefficients of 0.413–0.999 for equity and 1.001–1.777 for debt (across specifications with varying fixed effects), confirming that tax incentives are a primary driver of the discrepancy between residency-based and nationality-based bilateral positions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model fit (untargeted moments).&lt;/strong&gt; The calibrated baseline model produces: (i) a correlation of 0.658 between model-implied and empirical rates of return to capital (vs. 0.325 for the frictionless benchmark), with a standard deviation of 0.417 (vs. 0.091 frictionless; data: 0.496); (ii) a correlation of 0.947 between model-implied and empirical capital per employee (vs. 0.918 frictionless); (iii) a correlation of 0.94 between model-implied and empirical home bias; the model reproduces the mean home bias of 3.973 vs. 4.006 in data and standard deviation of 1.065 vs. 1.224, while the frictionless benchmark produces exactly zero home bias for all countries. Portfolio-share MSE: 1.16 (baseline) vs. 1.86 (frictionless).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Counterfactual findings.&lt;/strong&gt; Removing all measured barriers raises world GDP by 6.8% relative to the observed equilibrium (equivalent to stating that the distorted equilibrium is 6.8% below the frictionless benchmark). Geo-political distance alone accounts for most of this: when only distance frictions are retained, world GDP is 5.2% below the frictionless level. Capital taxes alone reduce world GDP by 2.6% below frictionless; political risk alone by 0.4%. The standard deviation of log capital per employee is 51.5% higher than it would be without barriers; the standard deviation of log output per employee is 22.5% higher. In the frictionless equilibrium, capital flows from rich to poor countries (the correlation between net foreign assets and development doubles in absolute value), accounting for the Lucas (1990) puzzle. In short-term (one-period) counterfactuals holding wealth fixed, the GDP gain from full barrier removal is 3.6%; the inequality effect remains similar (standard deviation of log capital per employee 48.4% higher with barriers).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; The model focuses on steady-state outcomes; dynamic transition effects are analyzed in extensions but are smaller. Quantitative conclusions are conditioned on: (i) the model sample of 96 countries covering 92% of world GDP in 2017; (ii) the conservative OLS coefficient estimates used for baseline calibration (IV estimates are larger and would amplify results); (iii) the assumption that the logit demand system captures frictions regardless of their microfoundation; (iv) omission of goods-trade frictions from the baseline (when included, the world GDP effect falls to 3.7% and the capital inequality effect to 23.3%).&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-theoretical-prediction-about-cross-country-rates-of-return-when-investment-barriers-exist"&gt;Q1. What is the core theoretical prediction about cross-country rates of return when investment barriers exist?&lt;/h3&gt;
&lt;p&gt;A: In the model&amp;rsquo;s frictionless benchmark (Propositions 1 and 2), all origin countries hold identical portfolios and risk-adjusted expected returns are equalized across destinations. When bilateral frictions are introduced, countries that are more &amp;ldquo;peripheral&amp;rdquo; (harder to access for foreign investors due to high geo-political distance or political risk) receive less inward capital and therefore command higher physical rates of return to capital. Countries that are easily accessible (&amp;ldquo;central&amp;rdquo;) attract more capital and exhibit lower rates of return. The Dual Efficiency Theorem establishes that capital is efficiently allocated if and only if marginal products of capital are equalized across countries, which requires that taxes are uniform and that portfolio wedges satisfy a specific cancellation condition.&lt;/p&gt;
&lt;h3 id="q2-how-are-portfolio-wedges-measured-and-what-is-the-identifying-strategy"&gt;Q2. How are portfolio wedges measured, and what is the identifying strategy?&lt;/h3&gt;
&lt;p&gt;A: Portfolio wedges ∆ij are decomposed into a geo-political distance component and a political risk component. The geo-political distance component is specified as a log-linear function of geographic distance, cultural distance, and linguistic distance, with coefficients (β_g, β_c, β_l) estimated from a gravity regression of log bilateral investment on these distances, controlling for origin and destination fixed effects. Because political risk varies only by destination country, it cannot be separately identified from destination fixed effects in the bilateral regression; its elasticity is therefore taken from Alfaro, Kalemli-Ozcan, and Volosovych (2008). The key identification advantage of bilateral data is that origin and destination fixed effects absorb all country-level confounders, so the distance coefficients are identified purely from within-origin, within-destination variation across country pairs.&lt;/p&gt;
&lt;h3 id="q3-what-do-the-ols-gravity-regressions-find-and-are-the-coefficients-stable-across-specifications"&gt;Q3. What do the OLS gravity regressions find, and are the coefficients stable across specifications?&lt;/h3&gt;
&lt;p&gt;A: In the baseline OLS specification (Table 2, column 1), the estimated coefficients on cultural distance, geographic distance, and linguistic distance are −11.944, −1.579, and −4.162 respectively (all significant at the 1% level). In standardized terms, a one-standard-deviation increase in cultural distance reduces foreign assets by 24.0%, geographic distance by 78.6%, and linguistic distance by 51.5%. Adding a rich set of control variables (colonial ties, legal origin, currency pegs, trade agreements, effective tax rates) leaves these magnitudes broadly similar: standardized effects on foreign assets are −26.4%, −80.1%, and −47.6%, respectively. Results are also robust across OLS and PPML specifications and across years 2013–2017. Effects are quantitatively similar for foreign equity and foreign debt, though linguistic distance has a somewhat smaller effect on debt.&lt;/p&gt;
&lt;h3 id="q4-how-does-the-instrumental-variable-strategy-address-reverse-causality-in-cultural-distance-and-what-does-it-find"&gt;Q4. How does the instrumental variable strategy address reverse causality in cultural distance, and what does it find?&lt;/h3&gt;
&lt;p&gt;A: The authors instrument cultural distance with religious distance (based on historical trees of religious affiliation), assuming religious history affects international investment only through its contemporary effect on differences in values and beliefs as captured by the World Values Survey. The instrument is a strong predictor of cultural distance (passes weak-instrument tests comfortably). Under IV, the standardized effect of a one-standard-deviation increase in cultural distance on log foreign assets rises from −24.0% (OLS) to −76.5% (IV). The authors use conservative OLS estimates for their baseline calibration, so the IV results imply the headline counterfactual effects are likely understated.&lt;/p&gt;
&lt;h3 id="q5-how-does-the-model-predict-home-bias-and-how-well-does-it-match-the-data"&gt;Q5. How does the model predict home bias, and how well does it match the data?&lt;/h3&gt;
&lt;p&gt;A: Home bias is defined as the log difference between the domestic portfolio share and the country&amp;rsquo;s share in the world capital stock. In the frictionless model, Proposition 1 implies that all countries hold identical foreign portfolios, so the model produces exactly zero home bias for every country. The baseline model, by incorporating bilateral frictions, generates home bias endogenously without targeting it. The model-implied home bias correlates with the empirically measured home bias at 0.94 across countries and matches both the mean (3.973 model vs. 4.006 data) and standard deviation (1.065 vs. 1.224) closely. The model also predicts, consistent with Lau, Ng, and Zhang (2010), that home bias and rates of return on capital are positively correlated (model-implied ρ = 0.55), and that rates of return on capital correlate negatively with the log of GDP per employee (model-implied ρ = −0.70).&lt;/p&gt;
&lt;h3 id="q6-what-is-the-quantitative-decomposition-of-the-world-gdp-loss-by-type-of-barrier"&gt;Q6. What is the quantitative decomposition of the world GDP loss by type of barrier?&lt;/h3&gt;
&lt;p&gt;A: World GDP in the observed (distorted) equilibrium is measured at $112.9 trillion (PPP), which is 6.8% below the frictionless counterfactual. When all barriers are present except geo-political distance, world GDP is 5.2% below frictionless — meaning distance frictions account for the largest share. When all barriers are present except political risk, world GDP is only 0.4% below frictionless. When all barriers are present except taxes, world GDP is 2.6% below frictionless. These are not exactly additive because the distortions interact; the results confirm that geo-political distance (cultural, linguistic, and geographic) constitutes the dominant source of global capital misallocation among the three measured frictions.&lt;/p&gt;
&lt;h3 id="q7-how-do-barriers-affect-the-cross-country-distribution-of-capital-and-income"&gt;Q7. How do barriers affect the cross-country distribution of capital and income?&lt;/h3&gt;
&lt;p&gt;A: The standard deviation of log capital per employee is 51.5% higher in the distorted equilibrium than in the frictionless counterfactual; the standard deviation of log output per employee is 22.5% higher. When only geo-political distance distortions are maintained, dispersion in log capital per employee is 38.2% higher and in log output per employee 15.9% higher. Maintaining only taxes raises the dispersion in log capital per employee by 12.9% and log output per employee by 6.0%; maintaining only political risk raises them by 7.3% and 3.8%, respectively. In the frictionless equilibrium, the poorest countries gain the most: some of the poorest countries see capital per employee increase by an order of magnitude and income per employee double.&lt;/p&gt;
&lt;h3 id="q8-does-the-model-account-for-the-lucas-puzzle-capital-not-flowing-from-rich-to-poor-countries"&gt;Q8. Does the model account for the Lucas puzzle (capital not flowing from rich to poor countries)?&lt;/h3&gt;
&lt;p&gt;A: Yes. In the observed distorted equilibrium, net foreign asset positions correlate only weakly with the level of development, consistent with Lucas&amp;rsquo;s (1990) observation that capital fails to flow from rich to poor countries. In the frictionless counterfactual, the absolute value of the correlation between net foreign asset positions and log GDP per employee doubles, and capital indeed flows from rich to poor countries as neoclassical theory predicts. The distortions from taxes, political risk, and geo-political distance thus account for the absence of a strong correlation between net positions and development in the data.&lt;/p&gt;
&lt;h3 id="q9-how-do-extensions-incorporating-goods-trade-frictions-capital-controls-and-currency-hedging-costs-affect-the-headline-findings"&gt;Q9. How do extensions incorporating goods-trade frictions, capital controls, and currency hedging costs affect the headline findings?&lt;/h3&gt;
&lt;p&gt;A: Adding goods-trade frictions (country-specific prices for output and capital installation following Monge-Naranjo et al. 2019) reduces the world GDP effect to 3.7% (from 6.8% baseline) and the dispersion of log capital per employee to 23.3% higher (from 51.5%), but the overall pattern of results is preserved. Replacing political risk with capital controls (using Jahan and Wang 2016 de-jure capital account openness) yields a comparable world GDP loss of 6.6% and a geo-political distance effect of 6.2%, very close to the 6.8% and 5.2% in the baseline. Adding currency hedging costs leaves world GDP loss and inequality effects essentially unchanged relative to baseline. None of these extensions materially alters the headline conclusions.&lt;/p&gt;
&lt;h3 id="q10-how-do-the-authors-validate-the-model-against-nationality-based-versus-residency-based-bilateral-investment-data"&gt;Q10. How do the authors validate the model against nationality-based versus residency-based bilateral investment data?&lt;/h3&gt;
&lt;p&gt;A: The model is calibrated to nationality-based positions (restated for tax havens). The MSE for fitting nationality-based external portfolio shares is 1.16, while the MSE for residency-based positions is 1.22. The model was not explicitly designed to distinguish between the two, yet it naturally produces better predictions for nationality-based positions because its frictions incorporate the incentives for indirect investment routing through tax havens. This cross-validation supports the methodological approach of using nationality-restated data and confirms the internal consistency of the model&amp;rsquo;s treatment of tax-haven routing.&lt;/p&gt;
&lt;h3 id="q11-what-are-the-implications-for-global-tax-policy-coordination"&gt;Q11. What are the implications for global tax policy coordination?&lt;/h3&gt;
&lt;p&gt;A: In the presence of information frictions, simple harmonization of capital tax rates across countries does not improve capital allocation efficiency and could worsen it. The Dual Efficiency Theorem implies that efficient capital allocation in a world with information frictions requires that taxes, risk premia, and information frictions satisfy a joint cancellation condition. From a normative perspective, a global social planner maximizing world GDP should impose lower capital tax rates in countries that are &amp;ldquo;peripheral&amp;rdquo; in the network of informational distances, in order to offset the disadvantage created by information frictions for those countries.&lt;/p&gt;
&lt;h3 id="q12-how-is-the-elasticity-parameter-η-calibrated-and-how-sensitive-are-the-results"&gt;Q12. How is the elasticity parameter η calibrated, and how sensitive are the results?&lt;/h3&gt;
&lt;p&gt;A: The elasticity of substitution among countries&amp;rsquo; assets, η, is calibrated at 18.5 based on Koijen and Yogo (2020)&amp;rsquo;s demand-price elasticities for long-term debt (3.1, converted to a gross-return elasticity of approximately 30), short-term debt (25.2, converted to approximately 24.3), and equity (1.3, converted to approximately 14.8), with weights reflecting the composition of global portfolios. The baseline gravity coefficients are calibrated from OLS with controls (cultural: −13.129, geographic: −1.645, linguistic: −3.850), chosen as conservative estimates relative to IV or PPML. Sensitivity analysis using PPML or IV estimates of β yields broadly similar steady-state GDP losses (around 6%), confirming robustness.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Portfolio wedge (∆ij):&lt;/strong&gt; A bilateral distortionary term in the logit asset demand system that captures all frictions reducing the ability of investors from country j to invest in country i. Decomposed empirically into a geo-political distance component and a political risk component. A wedge of 1 means no friction; larger values reduce the share of investment flowing from j to i. Can be interpreted either as prior-belief imprecision under rational inattention or as systematic transaction costs under the extreme-value microfoundation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Geo-political distance:&lt;/strong&gt; A composite of geographic distance (population-weighted geodesic distance), cultural distance (expected disagreement in World Values Survey responses between randomly drawn individuals from two countries, constructed with the &amp;ldquo;flex&amp;rdquo; method using up to 496 questions), and linguistic distance (normalized tree distance in the Ethnologue language family graph, covering 6,737 languages). Distinct from simple physical distance: it captures the informational and transactional barriers that arise from societal dissimilarity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dual Efficiency Theorem:&lt;/strong&gt; A theoretical result (Theorem in Section 2.8) establishing that capital efficient allocation, equalization of marginal products of capital across countries, and uniform taxes combined with a specific cancellation condition on portfolio wedges are mutually equivalent statements in steady-state equilibrium. This is not a restatement of the First Welfare Theorem; it is a statement about GDP (not welfare) and does not require risk premia to be equalized.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effective bilateral tax rate (τij):&lt;/strong&gt; The composite bilateral tax rate on capital after accounting for tax-haven routing. Firms in the destination country optimally choose the share of capital issued through tax havens (solving a quadratic cost optimization), trading off the lower tax rate available through havens against an increasing quadratic routing cost. The effective rate is therefore lower than the statutory (de jure) rate when the tax-haven rate is lower than the statutory rate, with the gap depending on the estimated βth coefficient from the Tobit regressions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Logit asset demand system:&lt;/strong&gt; A portfolio allocation rule in which the share of country j&amp;rsquo;s savings invested in destination country i is proportional to the risk-adjusted expected return raised to the power η (the elasticity of substitution) times the destination capital stock, divided by the portfolio wedge and summed over all destinations. Microfounded either by rational inattention (Matejka and McKay 2015; Pellegrino 2023) or by extreme-value-distributed transaction costs. Produces portfolio gravity analogous to trade gravity when combined with the market clearing conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Home bias:&lt;/strong&gt; Defined as the log difference between a country&amp;rsquo;s domestic portfolio share (πii, the share of domestic savings invested at home) and that country&amp;rsquo;s share of world capital stock (ki/K). In the frictionless benchmark, home bias is exactly zero for all countries by Proposition 1. The baseline model generates home bias endogenously as a consequence of portfolio wedges and reproduces both the level and cross-sectional distribution of empirically observed home bias without targeting these moments directly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Core-periphery structure:&lt;/strong&gt; An emergent property of international capital markets under investment barriers: countries that are easily accessible to international investors (low geo-political distance, low political risk, favorable tax treatment) are &amp;ldquo;central&amp;rdquo; and attract capital inflows, driving their rates of return to capital lower; &amp;ldquo;peripheral&amp;rdquo; countries that are less accessible have smaller capital stocks and higher rates of return, compensating investors for overcoming barriers. This structure generates persistent capital misallocation and cross-country income inequality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nationality-based vs. residency-based bilateral investment positions:&lt;/strong&gt; Residency-based data (e.g., raw IMF CPIS) attributes investment to the immediate counterparty country, including tax-haven shell companies. Nationality-based data (Coppola et al. 2020; Damgaard et al. 2019; Beck et al. 2024) reattributes investment to the country of the ultimate investor and ultimate issuer, bypassing offshore centers. The model fits nationality-based positions better (MSE 1.16 vs. 1.22 for residency-based) because it incorporates frictions that generate incentives for indirect routing, which is what nationality restatement is designed to undo.&lt;/p&gt;</description></item><item><title>Bayesian inference in proxy SVARs with incomplete identification: Re-evaluating the validity of monetary policy instruments</title><link>https://macropaperwarehouse.com/papers/bayesian-inference-in-proxy-svars-with-incomplete-identification-re-evaluating-the-validity-of-monetary-policy-instruments/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/bayesian-inference-in-proxy-svars-with-incomplete-identification-re-evaluating-the-validity-of-monetary-policy-instruments/</guid><description>&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Summary blocked:&lt;/strong&gt; No freely available working paper or full text was found. Metadata record only; paper suppressed from site until a working paper is located.&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>Beyond the headline: How personal exposure to inflation shapes the financial choices of households</title><link>https://macropaperwarehouse.com/papers/beyond-the-headline-how-personal-exposure-to-inflation-shapes-the-financial-choices-of-households/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/beyond-the-headline-how-personal-exposure-to-inflation-shapes-the-financial-choices-of-households/</guid><description>&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Summary blocked:&lt;/strong&gt; No freely available working paper or full text was found. Metadata record only; paper suppressed from site until a working paper is located.&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>Biased expectations and labor market outcomes: Evidence from German survey data and implications for the East–West wage gap</title><link>https://macropaperwarehouse.com/papers/biased-expectations-and-labor-market-outcomes-evidence-from-german-survey-data-and-implications-for-the-eastwest-wage-gap/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/biased-expectations-and-labor-market-outcomes-evidence-from-german-survey-data-and-implications-for-the-eastwest-wage-gap/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; The paper asks two questions: (1) How do workers&amp;rsquo; biased expectations about job finding and job separation shape the labor market equilibrium and wages? (2) Are differences in expectation biases across workers a quantitatively important driver of wage differentials, specifically the East–West German wage gap?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The empirical analysis uses the German Socio-Economic Panel (SOEP), a nationally representative longitudinal survey of approximately 30,000 participants per wave. The working-age sample (ages 25–65) covers nine biennial survey waves from 1999 to 2015, yielding 67,772 observations for job separation expectations and 6,423 for job finding expectations. Perceived transition probabilities are reported on a 0–100 scale in steps of 10 percentage points. Actual (statistical) transition probabilities are constructed by estimating probit models that predict realized transitions within 24 months using a rich set of individual, job, and employer characteristics, and are rounded to the nearest decile for consistency with the survey scale.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main empirical findings.&lt;/strong&gt; Employed workers in Germany overestimate their job separation probability by 6.4 percentage points on average (perceived: 19.8%; actual: 13.3%), a pessimistic bias significant at the 1% level. Unemployed workers overestimate their job finding probability by 8.2 percentage points on average (perceived: 57.0%; actual: 48.8%), an optimistic bias also significant at the 1% level. The East–West divergence is striking. East German workers exhibit a pessimistic job separation bias of 12.1 percentage points, compared to only 4.7 percentage points in the West, despite broadly similar actual separation rates (15.1% vs. 12.8%). For job finding, West Germans overestimate their probability by 12.9 percentage points, while East Germans overestimate by only 2.0 percentage points — meaning East Germans are also substantially less optimistic about re-employment. These East–West differences survive controls for compositional differences and alternative definitions of job separation (dismissals only; selected reasons; spell-based) and job finding (including those out of the labor force). The biases are stable over the 1999–2015 sample period with no discernible trend. A cohort analysis shows that the excess pessimism in East Germany is concentrated among cohorts who were already in the labor market at the time of German reunification (born in the 1950s and 1960s), consistent with persistent effects of the communist GDR experience. Individuals do not systematically learn over time: mean changes in individual-level absolute deviations between consecutive waves are close to zero. Individual deviations between perceived and actual rates have statistically significant but quantitatively negligible predictive power for subsequent transitions (a 1 pp higher perceived job separation is associated with only a 0.001 pp higher realized separation rate), ruling out private information as a first-order explanation for the biases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The authors extend the Diamond–Mortensen–Pissarides (DMP) frictional labor market framework by (i) allowing workers to hold biased perceived transition rates (λw for job finding, σw for job separation) while firms have rational expectations, and (ii) introducing wage contracts of explicit length T periods after which parties re-bargain. Common knowledge of each party&amp;rsquo;s perceived values is assumed, and generalized Nash bargaining is applied. The contract length T is a key parameter: there exists a critical threshold T* such that a pessimistic job separation bias raises the equilibrium wage for T &amp;lt; T* (the continuation-value effect dominates) and lowers it for T ≥ T* (the within-contract discounting effect dominates). An optimistic job finding bias unambiguously raises the equilibrium wage by inflating the perceived value of unemployment and hence the reservation wage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative results.&lt;/strong&gt; The model is calibrated to East Germany. The job separation bias (∆σ = 0.0194) and job finding bias (∆λ = 0.0044) are set to SOEP-based estimates. The critical threshold implied by calibrated parameter values is T* = 10 quarters. The baseline contract length, constructed from the share of permanent (88%) and temporary (12%) contracts in SOEP and average remaining tenure until retirement, is T = 67 quarters (a lower bound). This exceeds T*, so the pessimistic separation bias depresses wages in the baseline. A counterfactual experiment assigns West German bias levels to East German workers, while holding all other parameters fixed. For the preferred calibration range (γ ∈ {0.35, 0.50}, T ∈ {67, 106, 159}), East German wages rise by 1.07 to 2.36 percent. This corresponds to a reduction in the conditional East–West German wage gap (23 percent) of 4.6 to 10.6 percent, and a reduction in the unconditional gap (30 percent) of 3.6 to 7.9 percent. Although wages rise, equilibrium unemployment increases by 0.70 to 1.01 percentage points, widening the already large East–West unemployment gap (approximately 7 percentage points). Net of the unemployment effect, expected lifetime income (computed at actual, unbiased transition rates) rises by 0.7 to 1.88 percent for East German workers under West German biases, implying an unambiguous welfare gain. Under a biennial calibration (robustness), wages increase by up to 3.3 percent and expected lifetime income rises by up to 2.23 percent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; Results apply to a stationary environment (no aggregate fluctuations). Firms are assumed to have rational expectations; an extension shows results hold provided firm bias is smaller than worker bias. Workers are assumed homogeneous in their bias levels; learning is abstracted from. The quantitative magnitudes are sensitive to the workers&amp;rsquo; bargaining power γ and the contract length T, both of which are subject to uncertainty in calibration.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-how-are-actual-statistical-transition-probabilities-constructed-and-why-are-probit-predicted-probabilities-preferred-over-realized-sample-means"&gt;Q1. How are actual (statistical) transition probabilities constructed, and why are probit-predicted probabilities preferred over realized sample means?&lt;/h3&gt;
&lt;p&gt;A: Realized transition rates in the sample mix transitions for various idiosyncratic reasons that vary substantially across population groups, so raw sample means do not reflect the probability a given individual faces at interview time. The authors estimate probit models separately for job separation (employed sample) and job finding (unemployed sample), including a rich set of covariates — age, gender, education, tenure, firm size, unemployment experience, industry, survey year, and East Germany indicator, among others — and predict individual-level probabilities at the time of the interview. For consistency with the survey&amp;rsquo;s discrete response format, probit-predicted probabilities are rounded to the nearest decile (0%, 10%, &amp;hellip;, 100%). The bias is computed as the individual-level difference between perceived and probit-predicted actual probabilities, averaged over the sample.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-magnitude-and-direction-of-the-aggregate-expectation-biases-in-germany"&gt;Q2. What is the magnitude and direction of the aggregate expectation biases in Germany?&lt;/h3&gt;
&lt;p&gt;A: Employed workers overestimate job separation by 6.4 percentage points on average (perceived 19.8% vs. actual 13.3%), a pessimistic bias significant at the 1% level. Unemployed workers overestimate job finding by 8.2 percentage points (perceived 57.0% vs. actual 48.8%), an optimistic bias also significant at the 1% level. Both directions are statistically robust across alternative definitions of separation and finding, as well as to trimming extreme responses (0% and 100% answers) and adjusting for directional rounding.&lt;/p&gt;
&lt;h3 id="q3-how-large-are-the-eastwest-differences-in-expectation-biases-and-do-they-survive-controls-for-compositional-differences"&gt;Q3. How large are the East–West differences in expectation biases, and do they survive controls for compositional differences?&lt;/h3&gt;
&lt;p&gt;A: East German workers exhibit a pessimistic job separation bias of 12.1 percentage points, more than 2.5 times the West German level of 4.7 percentage points, despite actual separation rates being broadly comparable (15.1% vs. 12.8%). For job finding, West Germans are optimistic by 12.9 percentage points while East Germans are optimistic by only 2.0 percentage points, a difference of 10.9 percentage points. The paper states these differences persist after accounting for compositional differences between regions, and are robust across all alternative definitions of job separation (Dismissals, Selected, Spell) and job finding (out of U or O). The table of robustness results (Table 2) confirms that in all specifications, the pessimistic separation bias is substantially larger in the East and the optimistic finding bias is substantially smaller.&lt;/p&gt;
&lt;h3 id="q4-what-cohort-analysis-is-conducted-to-explore-the-origins-of-greater-east-german-pessimism"&gt;Q4. What cohort analysis is conducted to explore the origins of greater East German pessimism?&lt;/h3&gt;
&lt;p&gt;A: The authors conduct a regression of the individual-level bias on birth-cohort indicators, controlling for age, demographic, and economic characteristics. They find that the pessimistic job separation bias is most pronounced among cohorts born in the 1950s and 1960s — those who experienced adult working life in the communist GDR and lived through reunification — and is smaller for cohorts born before 1950 and substantially smaller for cohorts born after 1970. For job finding, the optimistic bias is comparably low among cohorts born in the 1960s and earlier, but rises significantly for later-born East German cohorts. This cohort pattern is consistent with a long-lasting &amp;ldquo;experience effect&amp;rdquo; of communist institutions and the reunification shock on beliefs, analogous to findings in the broader literature on the persistent effects of communism.&lt;/p&gt;
&lt;h3 id="q5-is-there-evidence-that-individuals-update-their-biased-expectations-over-time"&gt;Q5. Is there evidence that individuals update their biased expectations over time?&lt;/h3&gt;
&lt;p&gt;A: To assess learning, the authors use the panel dimension and compute for each individual in two consecutive survey waves the absolute value of the deviation between perceived and actual transition probabilities, then examine the change in this absolute deviation between waves. The histograms of individual-level changes show substantial dispersion but means close to zero in all four sub-groups (East/West, job separation/finding), indicating no systematic convergence of beliefs toward actual rates. Biases are also stable in the time-series dimension, with perceived and actual rates moving largely in parallel across survey waves from 1999 to 2015, leaving the aggregate bias level roughly constant.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-model-rule-out-private-information-as-an-alternative-explanation-for-the-biases"&gt;Q6. How does the model rule out private information as an alternative explanation for the biases?&lt;/h3&gt;
&lt;p&gt;A: If biases reflected private information about idiosyncratic risk not captured by observable characteristics, individual-level deviations between perceived and actual rates should predict subsequent realized transitions. The authors add the individual-level deviation as an additional regressor in the probit transition models. The estimated coefficients are statistically significant and positive, but quantitatively negligible: a 1 percentage point higher expected job separation probability is associated with only a 0.001 percentage point higher realized separation probability, and a 1 percentage point higher expected job finding probability with a 0.002 percentage point higher realized finding probability. These magnitudes are too small to materially alter the interpretation of the biases as reflecting systematic expectation errors rather than private information.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-role-of-contract-length-t-in-the-model-and-what-is-the-critical-threshold-t"&gt;Q7. What is the role of contract length T in the model, and what is the critical threshold T*?&lt;/h3&gt;
&lt;p&gt;A: The wage contract length T determines which of two opposing effects of pessimistic job separation expectations dominates in bargaining. The first (negative wage) effect: a pessimistic worker discounts future wages within the current contract more heavily than the firm does, so the worker values the contract less and accepts a lower wage. The second (positive wage) effect: a pessimistic worker also discounts the continuation value of future contracts more heavily, making it less attractive to remain in the match, so the firm must offer a higher wage to retain the worker. For short contract lengths (T &amp;lt; T*), the second (positive) effect dominates, so the pessimistic bias raises wages. For long contracts (T ≥ T*), the first (negative) effect dominates, so the pessimistic bias depresses wages. The critical threshold T* is the smallest positive integer such that T*/λw(θ) &amp;lt; β times a weighted sum involving σw and T*. Using calibrated parameter values for East Germany, T* = 10 quarters (2.5 years). The baseline contract length is T = 67 quarters (approximately 16.8 years), well above T*, placing the economy in the regime where pessimism depresses wages.&lt;/p&gt;
&lt;h3 id="q8-how-does-the-optimistic-job-finding-bias-affect-equilibrium-wages-and-unemployment"&gt;Q8. How does the optimistic job finding bias affect equilibrium wages and unemployment?&lt;/h3&gt;
&lt;p&gt;A: An optimistic job finding bias (λw &amp;gt; p(θ)) raises the perceived value of unemployment U because workers expect to escape unemployment sooner. A higher value of unemployment raises the worker&amp;rsquo;s outside option in bargaining, increases the reservation wage, and thereby pushes up the bargained wage. In general equilibrium, the job creation condition (which is unaffected by worker expectations) is unchanged, so the upward rotation of the wage curve reduces labor market tightness θ, raises equilibrium unemployment, and extends average unemployment duration. This comparative static holds unambiguously for any contract length T.&lt;/p&gt;
&lt;h3 id="q9-what-are-the-quantitative-results-of-the-counterfactual-experiment-assigning-west-german-biases-to-east-german-workers"&gt;Q9. What are the quantitative results of the counterfactual experiment assigning West German biases to East German workers?&lt;/h3&gt;
&lt;p&gt;A: The counterfactual assigns West German bias levels (smaller pessimistic separation bias, larger optimistic finding bias) to East German workers while holding all other parameters at East German calibrated values. For the preferred calibration with γ ∈ {0.35, 0.50} and T ∈ {67, 106, 159}, wages in East Germany rise by 1.07 to 2.36 percent. This implies a reduction in the conditional East–West wage gap (23 percent) of 4.6 to 10.6 percent and a reduction in the unconditional gap (30 percent) of 3.6 to 7.9 percent. Equilibrium unemployment in East Germany rises by 0.70 to 1.01 percentage points as a side effect. Net of the unemployment effect, ex-ante unbiased expected lifetime income rises by 0.7 to 1.88 percent, confirming a positive welfare effect of reducing East German pessimism to West German levels. Under the biennial calibration robustness check, wage increases reach up to 3.3 percent, the conditional wage gap narrows by up to 11 percent, and lifetime income rises by up to 2.23 percent.&lt;/p&gt;
&lt;h3 id="q10-how-is-the-bargaining-power-parameter-γ-calibrated-and-why-does-it-matter-for-the-results"&gt;Q10. How is the bargaining power parameter γ calibrated and why does it matter for the results?&lt;/h3&gt;
&lt;p&gt;A: The paper considers a range γ ∈ {0.35, 0.50, 0.65}, rather than a single calibrated value, because γ plays a crucial role in the sensitivity of wages to expectation biases. Lower bargaining power reduces the equilibrium wage directly; however, because lower wages spur job creation, the model requires a higher vacancy cost κ to match the empirical job finding rate, which in turn increases the elasticity of wages with respect to the bias (see the wage equation, which shows that the bias effect scales with κθ/p(θ)). The paper argues that γ = 0.65 is inconsistent with the empirical wage–bias relationship estimated in SOEP data (which is negative and about twice as negative in East Germany as in the West), while γ ∈ {0.35, 0.50} is consistent. Lower bargaining power is also argued to be realistic for East Germany given weaker union representation there relative to the West.&lt;/p&gt;
&lt;h3 id="q11-how-does-the-empirical-relationship-between-the-job-separation-bias-and-wages-serve-as-a-model-validation-target"&gt;Q11. How does the empirical relationship between the job separation bias and wages serve as a model validation target?&lt;/h3&gt;
&lt;p&gt;A: Using SOEP data, the authors regress log hourly wages on the individual-level difference between perceived and actual job separation rates, controlling for individual fixed effects and other covariates, and allow the slope to differ between East and West Germany. They find a statistically significant and negative relationship in both regions, with the effect approximately twice as large in East Germany as in the West. The estimate implies that if East German workers&amp;rsquo; job separation pessimism were reduced to West German levels, hourly wages in the East would be about 1 percent higher. This empirical gradient is used as an external validation check — not a calibration target — to assess which combinations of (γ, T) in the model are quantitatively plausible.&lt;/p&gt;
&lt;h3 id="q12-what-does-the-model-predict-about-the-general-equilibrium-effects-on-unemployment-from-reducing-east-german-pessimism"&gt;Q12. What does the model predict about the general equilibrium effects on unemployment from reducing East German pessimism?&lt;/h3&gt;
&lt;p&gt;A: Reducing East German pessimism — both the pessimistic separation bias and the low optimistic finding bias — shifts the wage curve upward in equilibrium. Because the job creation condition is unaffected by worker beliefs (firms have rational expectations), higher wages reduce the firm&amp;rsquo;s incentive to post vacancies, lowering labor market tightness θ. This leads to higher equilibrium unemployment and longer average unemployment duration. The counterfactual with West German biases implies that East German unemployment would rise by 0.70 to 1.01 percentage points, further widening the approximately 7 percentage point East–West unemployment gap. The authors note this is a welfare-relevant trade-off, but show that the wage gain dominates the unemployment cost in terms of expected lifetime income.&lt;/p&gt;
&lt;h3 id="q13-what-robustness-checks-are-performed-on-the-quantitative-results"&gt;Q13. What robustness checks are performed on the quantitative results?&lt;/h3&gt;
&lt;p&gt;A: The paper considers (i) a narrower definition of job separation (dismissals only) to match the most likely interpretation of the survey question; (ii) targeting the officially reported East German unemployment rate (14.5% average from the Federal Employment Agency) rather than the SOEP-implied rate of 8.6% as a calibration target; (iii) a biennial calibration frequency instead of quarterly. The main results — wage increases and narrowing of the wage gap — are quantitatively similar across these alternatives, with one exception: the biennial calibration yields substantially larger wage increases (up to 3.3%), a larger reduction in the conditional wage gap (up to 11%), and larger lifetime income gains (up to 2.23%).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Expectation bias (job separation / job finding).&lt;/strong&gt; In this paper, a bias in expectations is defined as a systematic average difference between an individual&amp;rsquo;s perceived transition probability and the actual (statistically predicted) transition probability for their demographic and job group. A pessimistic job separation bias means workers overestimate the probability of losing their job (σw &amp;gt; σ); an optimistic job finding bias means unemployed workers overestimate the probability of re-employment (λw &amp;gt; p(θ)). Biases are not attributed to private information but to systematic expectation errors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Actual (statistical) transition probability.&lt;/strong&gt; The paper defines actual transition probabilities not as raw sample transition rates but as individual-level predicted probabilities from probit models estimated on realized transitions within 24 months, conditional on a comprehensive set of individual, job, and employer characteristics observed at interview time. These are rounded to the nearest decile for comparability with the survey&amp;rsquo;s discrete response format.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage contract length (T).&lt;/strong&gt; The contract length T is the number of periods for which a bargained wage is fixed before the match parties re-bargain. A job match consists of a sequence of consecutive wage contracts of length T. The paper departs from the standard DMP assumption of period-by-period bargaining (T = 1) and shows that T is central to how job separation expectations feed into the bargained wage. A permanent job approximates T → ∞.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Critical contract length (T&lt;/em&gt;).&lt;/em&gt;* A theoretically derived threshold: the pessimistic job separation bias raises equilibrium wages for contract lengths T &amp;lt; T* and depresses wages for T ≥ T*. Specifically, T* is the smallest positive integer such that T*/λw(θ) &amp;lt; β times a weighted sum involving β, σw, and T*. In the East German calibration, T* = 10 quarters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Generalized Nash bargaining with common knowledge / agree to disagree.&lt;/strong&gt; The model assumes that both the worker and the firm know each other&amp;rsquo;s perceived values of the job match and outside options and accept them as the basis for bargaining, even though they differ. Workers use their biased perceived transition rates to value employment and unemployment; firms use actual rates. There is no private information. The paper refers to this as workers and firms &amp;ldquo;agreeing to disagree.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ex-ante unbiased expected lifetime income (EI_{W,U}).&lt;/strong&gt; A welfare measure defined as the present discounted value of income for an individual entering the economy, computed at actual (unbiased) job separation and job finding probabilities rather than at workers&amp;rsquo; perceived (biased) rates. This measure captures the net welfare effect of changing expectation biases because it correctly accounts for actual employment transitions, even though the behavioral responses in equilibrium are driven by biased perceptions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effective discount factor (β(1 − σw)).&lt;/strong&gt; When a worker holds pessimistic job separation expectations, future payoffs within the current contract are discounted not at the pure time discount factor β but at β(1 − σw), which is smaller when σw is larger. A more pessimistic worker therefore effectively discounts future wage payments more steeply, and this differential discounting relative to the firm (which uses β(1 − σ)) is the key mechanism generating the contract-length dependence of the wage effect.&lt;/p&gt;</description></item><item><title>Borrowing and Spending in the Money: Debt Substitution and the Cash-Out Refinance Channel of Monetary Policy</title><link>https://macropaperwarehouse.com/papers/borrowing-and-spending-in-the-money-debt-substitution-and-the-cash-out-refinance-channel-of-monetary-policy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/borrowing-and-spending-in-the-money-debt-substitution-and-the-cash-out-refinance-channel-of-monetary-policy/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Does monetary policy stimulate household borrowing and consumption by enabling cash-out mortgage refinancing (&amp;ldquo;the cash-out refinance channel&amp;rdquo;), or does it primarily induce substitution across borrowing products without meaningfully changing total new household borrowing?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Motivation.&lt;/strong&gt; Prior work (Eichenbaum, Rebelo and Wong 2022; Berger et al. 2021) interprets the strong positive correlation between a borrower&amp;rsquo;s refinance incentive and cash-out refinancing as evidence of a potent, path-dependent monetary policy transmission channel: when rates fall below a borrower&amp;rsquo;s outstanding mortgage rate (&amp;ldquo;in-the-money&amp;rdquo;), the incentive to refinance generates large cash-out activity and consumption. This interpretation presumes that mortgages are effectively the only household borrowing product and that cash-out refinancing reflects a stimulated demand for new borrowing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Alternative Hypothesis.&lt;/strong&gt; The authors argue instead that households have inelastic, exogenous liquidity needs (for consumption smoothing, housing repairs, health shocks, etc.) and satisfy those needs using whichever borrowing product is cheapest given the rate environment. When mortgage rates fall below a borrower&amp;rsquo;s outstanding rate, cash-out refinancing becomes the least-cost vehicle, so borrowers shift from credit cards, HELOCs, personal loans, and second liens (closed-end seconds) toward cash-out refinancing—substituting borrowing products rather than expanding total borrowing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The authors use the Equifax Credit Risk Insight Servicing McDash (CRISM) dataset, which anonymously matches credit bureau records to mortgage servicing data (McDash). The main sample is a 16.5% draw of fixed-rate, first-lien mortgage loans observed at monthly frequency during 2013, yielding approximately 35 million loan-month observations. For the long time-series analysis, the full 2006–2021 sample is used. Borrowing events are identified across five credit instruments: cash-out refinance, HELOC, closed-end second (CES), credit card, and personal loan, each requiring at least $5,000 in new credit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Identification Strategy.&lt;/strong&gt; The paper uses two complementary approaches to address the endogeneity of mortgage rates and borrower refinance incentives.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Taper Tantrum quasi-experiment (main):&lt;/em&gt; In late spring 2013, two FOMC communication events triggered an approximately 80 basis-point increase in the 30-year fixed mortgage rate over the course of one month. Critically, because the shock arose from changes in long-term rate expectations (LSAPs), short-term rates—and thus HELOC and consumer credit rates—were largely unchanged. The authors exploit cross-sectional variation in pre-Taper &amp;ldquo;rate gaps&amp;rdquo; (outstanding mortgage rate minus estimated current market rate) using a difference-in-differences design (equation 6) to compare how cash-out and alternative borrowing change after the shock for borrowers with different pre-existing refinance incentives.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Monetary policy surprise IV (2006–2021):&lt;/em&gt; Following Berger et al. (2021), the authors instrument for the aggregate share of borrowers with rate gaps between 0 and 2 percentage points using the Bu, Rogers and Wu (2021) (BRW) unified measure of Fed monetary policy shocks, which spans both conventional and unconventional policy. This approach tests whether substitution persists when both long and short rates move together.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Extensive margin (probability of borrowing):&lt;/em&gt; After the Taper Tantrum, the monthly probability of cash-out refinancing declines for all rate gap bins, most strongly for borrowers pushed out of the money by the rate increase (a roughly 0.0012 percentage-point monthly probability decline—more than 85 percent below baseline—for borrowers with pre-Taper rate gaps of approximately 1 percent). Simultaneously, the probability of other borrowing (HELOCs, credit cards, personal loans, CES) rises in a near-mirror image, especially for borrowers at intermediate rate gaps. The combined effect on total borrowing probability is negligible and shows little variation with rate gap.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Intensive margin (amount borrowed conditional on borrowing):&lt;/em&gt; Conditional on a cash-out refinance occurring after the Taper, the average extraction amount &lt;em&gt;increases&lt;/em&gt;, consistent with a borrower-selection effect: low-liquidity-need borrowers, who face the highest effective borrowing cost increase when they move out of the money, disproportionately exit cash-out refinancing, leaving behind a pool of high-liquidity-need borrowers. For borrowers with pre-Taper rate gaps of around 1 percent, the conditional cash-out amount rises about 20 percent after the Taper.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Aggregate borrowing elasticity:&lt;/em&gt; Combining extensive and intensive margin estimates via a hurdle model, a 1 percentage-point increase in mortgage rates reduces total new household borrowing by between 0 and 8 percent (the aggregate borrowing elasticity is not statistically significantly different from zero at the preferred estimate, with a lower-bound of −8 percent), compared with a cash-out probability elasticity of approximately −45 percent in absolute terms.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Debt paydown:&lt;/em&gt; About 10–12 percent of new mortgage debt from cash-out refinances is used to pay down other outstanding debt, and this share is constant across rate gap groups and is not affected by the Taper, implying the MPC from cash-out borrowing does not vary with the rate environment.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Conventional monetary policy:&lt;/em&gt; Using the BRW IV over 2006–2021, the IV first stage yields an F-statistic of approximately 11. The cash-out extensive margin responds positively to the in-the-money share (elasticity 3.5 in IV), while other borrowing responds negatively (elasticity −0.87 in IV), and the all-borrowing elasticity is 0.09 and statistically insignificant. The intensive margin results are directionally consistent: conditional cash-out amounts fall as more borrowers are in the money, while total borrowing amounts respond positively (but insignificantly). Substitution thus holds even when both long and short rates move together.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Implications for Path Dependence.&lt;/strong&gt; Because out-of-the-money borrowers substitute toward non-cash-out products, the non-linear dependence of cash-out refinancing on the distribution of outstanding mortgage rates does not translate into a correspondingly path-dependent total borrowing response. A back-of-the-envelope calculation using standard MPC assumptions (100 percent for cash-out, 80 percent for rate-term savings) and empirical refinancing frequencies and amounts (average first-lien equity extraction of $40,000 vs. average annual payment savings of $3,000 from rate-term refinancing, with rate-term frequency about 1.5x higher and semi-elasticity about 2x larger) implies that the potential near-term consumption stimulus from cash-out refinancing is approximately 5.5 times larger than from rate-term refinancing—making cash-out the dominant channel in principle. But because debt substitution substantially offsets the interest-rate sensitivity of cash-out refinancing, and because the path dependence of cash-out refinancing is largely eliminated by borrower substitution, the paper concludes that the overall path dependence of monetary policy is weaker than suggested by Berger et al. (2021) and Eichenbaum, Rebelo and Wong (2022).&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-rate-gap-and-why-does-it-capture-the-cash-out-refinance-incentive"&gt;Q1. What is the &amp;ldquo;rate gap&amp;rdquo; and why does it capture the cash-out refinance incentive?&lt;/h3&gt;
&lt;p&gt;The rate gap is defined as a borrower&amp;rsquo;s outstanding fixed mortgage rate minus an estimate of the 30-year fixed mortgage rate currently available to that borrower if they were to refinance (estimated from a regression of origination-period rates on LTV, credit score, loan type, investor type, and month fixed effects). A positive rate gap means the borrower is &amp;ldquo;in the money&amp;rdquo; for a rate-term refinance: they can reset their existing mortgage at a lower rate. The rate gap captures the degree of refinance incentive because resets the interest cost on the entire outstanding balance. Cash-out refinancing is especially attractive when the rate gap is positive because the rate reduction on the existing balance partially subsidizes the new borrowing, lowering its effective cost relative to alternative products.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-conceptual-model-of-debt-substitution-the-authors-propose"&gt;Q2. What is the conceptual model of debt substitution the authors propose?&lt;/h3&gt;
&lt;p&gt;The authors model a homeowner with an inelastic liquidity need l that arrives with probability λ. The borrower can satisfy this need through a cash-out refinance at mortgage rate r_m (resetting their entire mortgage at r_m, which implies an interest cost on the existing balance) or through an alternative product at rate r_a &amp;gt; r_m. The key trade-off is that a cash-out refinance saves on the rate for the liquidity need itself but incurs a cost or benefit depending on whether r_m exceeds or falls below the outstanding rate r_0. When the rate gap is negative (r_0 &amp;lt; r_m), the cash-out refinance penalizes the borrower on the existing balance; when the gap is positive (r_0 &amp;gt; r_m), it saves on the existing balance, further lowering the effective cost of the liquidity need. The model predicts that: (i) the probability of cash-out refinancing is nonlinear and step-like in the rate gap; (ii) the probability of alternative borrowing has the opposite pattern; (iii) higher mortgage rates raise the conditional cash-out amount through selection (low-l borrowers exit cash-out); and (iv) total borrowing is relatively insensitive to mortgage rates.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-taper-tantrum-provide-exogenous-variation-and-what-are-its-limitations"&gt;Q3. How does the Taper Tantrum provide exogenous variation, and what are its limitations?&lt;/h3&gt;
&lt;p&gt;The Taper Tantrum began in late spring 2013 when two FOMC communication events—Chairman Bernanke&amp;rsquo;s congressional testimony and the subsequent FOMC meeting—shifted market expectations about the pace of tapering large-scale asset purchases (LSAPs). The 30-year fixed mortgage rate rose approximately 80 basis points within one month, driven by changes in long-term rate expectations. Because the shock was unanticipated and FOMC did not announce any concrete policy change, the scope for a &amp;ldquo;Fed information effect&amp;rdquo; biasing results is limited. The critical limitation is that the Taper Tantrum affected primarily long-term rates: HELOC rates and consumer credit rates (tied to the federal funds rate and bank prime rate, which were unchanged) were little affected. This means the estimated substitution elasticity holds when the rate spread between mortgage and alternative products widens, which is more directly applicable to unconventional monetary policy (LSAPs) than to conventional policy that moves rates across the full yield curve.&lt;/p&gt;
&lt;h3 id="q4-what-do-the-taper-tantrum-extensive-margin-results-show-and-what-pattern-confirms-substitution"&gt;Q4. What do the Taper Tantrum extensive margin results show, and what pattern confirms substitution?&lt;/h3&gt;
&lt;p&gt;Figure 4 plots the difference-in-differences coefficient β₂ + β₃ by pre-Taper rate gap bin for three outcome variables. The cash-out refinancing probability (blue line) declines for all rate gap bins, most sharply for intermediate rate gap values (borrowers pushed out of the money by the Taper). Borrowers with pre-Taper rate gaps of ~1 percent experience a decline in monthly refinancing probability of about 0.0012, or more than 85 percent below their baseline rate. Other borrowing (black line) shows an almost exact mirror-image pattern: it rises after the Taper, most strongly for the same intermediate rate gap borrowers. The total borrowing probability (red line) shows essentially no response and little variation across rate gap groups, implying substitution nearly completely offsets the cash-out decline.&lt;/p&gt;
&lt;h3 id="q5-how-do-the-intensive-margin-results-for-cash-out-refinancing-compare-to-the-extensive-margin-and-what-explains-the-difference"&gt;Q5. How do the intensive margin results for cash-out refinancing compare to the extensive margin, and what explains the difference?&lt;/h3&gt;
&lt;p&gt;After the Taper, the conditional cash-out amount &lt;em&gt;rises&lt;/em&gt; (the intensive margin effect is positive), while the cash-out probability falls (the extensive margin effect is negative). These opposite signs are consistent with borrower selection: borrowers with small liquidity needs face the steepest increase in effective borrowing cost when they move out of the money and so disproportionately exit cash-out refinancing, raising the average extraction amount among those who remain. For borrowers with pre-Taper rate gaps of ~1 percent, the conditional cash-out amount rises approximately 20 percent after the Taper. Figure 6 corroborates this by showing the increase in average extraction is driven by a sharp decline in small extraction amounts (relative to outstanding balance).&lt;/p&gt;
&lt;h3 id="q6-how-is-the-aggregate-borrowing-elasticity-computed-and-what-does-it-imply-about-monetary-policy-transmission"&gt;Q6. How is the aggregate borrowing elasticity computed and what does it imply about monetary policy transmission?&lt;/h3&gt;
&lt;p&gt;The authors combine extensive and intensive margin estimates using a two-tiered (hurdle) model that allows the decision to borrow and the decision of how much to borrow to respond differently to covariates. The total expected borrowing amount is the product of the estimated borrowing probability and the expected conditional borrowing amount. Pre- and post-Taper aggregate predicted borrowing is calculated for each rate gap group, and the percentage change is divided by the 80 basis-point rate increase to produce a semi-elasticity. The aggregate borrowing elasticity is not statistically significantly different from zero at the main estimate, and the lower-bound estimate (which avoids reliance on the Post dummy for aggregate borrowing) is at most −8 percent per percentage-point increase in rates. This compares with a cash-out probability elasticity of approximately −45 percent, illustrating that substitution accounts for the overwhelming majority of the observed cash-out response.&lt;/p&gt;
&lt;h3 id="q7-why-is-the-brw-monetary-policy-shock-iv-important-for-generalizing-the-taper-tantrum-findings"&gt;Q7. Why is the BRW monetary policy shock IV important for generalizing the Taper Tantrum findings?&lt;/h3&gt;
&lt;p&gt;The Taper Tantrum moved only long rates, whereas conventional monetary policy moves both long and short rates. When short rates rise, the alternative borrowing products (HELOCs, credit cards, personal loans) become more expensive, which could dampen substitution in two ways: (a) the rate spread between mortgage and alternative products narrows, reducing the range of borrower-amount combinations for which substitution makes financial sense; and (b) higher absolute borrowing costs on alternative products may reduce total borrowing among borrowers who would otherwise substitute. The BRW IV, which spans 2006–2021 and reflects shocks to the full yield curve (conventional and unconventional), addresses whether substitution holds when both rate types move. The IV results in Table II (F-statistic ~11) confirm that the cash-out probability elasticity is 3.5 (IV), the other-borrowing elasticity is −0.87 (IV), and the all-borrowing elasticity is 0.09 and statistically insignificant, broadly consistent with the Taper Tantrum findings.&lt;/p&gt;
&lt;h3 id="q8-does-the-share-of-cash-out-proceeds-used-for-debt-paydown-vary-with-the-rate-environment-and-why-does-this-matter"&gt;Q8. Does the share of cash-out proceeds used for debt paydown vary with the rate environment, and why does this matter?&lt;/h3&gt;
&lt;p&gt;An event study finds that total household debt increases by about 88 percent of the increase in mortgage balance in the first two months after a cash-out refinance, implying approximately 12 percent debt paydown; by six months out, the net paydown stabilizes at around 8 percent. Crucially, this share is constant across rate gap groups and does not change after the Taper Tantrum. This constancy implies that the marginal propensity to consume (MPC) out of cash-out refinances does not vary with the rate environment, and therefore the path-dependence of the cash-out channel cannot be attributed to compositional changes in how borrowers use extracted funds.&lt;/p&gt;
&lt;h3 id="q9-why-does-the-paper-argue-cash-out-refinancing-has-far-greater-near-term-consumption-potential-than-rate-term-refinancing-and-what-are-the-implications-for-path-dependence"&gt;Q9. Why does the paper argue cash-out refinancing has far greater near-term consumption potential than rate-term refinancing, and what are the implications for path dependence?&lt;/h3&gt;
&lt;p&gt;A back-of-the-envelope calculation uses: (1) empirical frequencies (rate-term refinance probability is ~1.5x higher than cash-out); (2) near-term liquidity per event (average first-lien cash-out extraction ~$40,000 vs. annual payment savings ~$3,000 from rate-term); (3) semi-elasticities (rate-term has ~2x higher semi-elasticity to rates than cash-out per the IV estimates); and (4) standard MPC assumptions (100% for cash-out, 80% for rate-term savings). The calculation implies the consumption stimulus potential from cash-out refinancing is approximately 5.5 times that of rate-term refinancing per percentage-point change in rates. Because the paper shows the path-dependence of cash-out refinancing is largely offset by substitution, and because cash-out is the dominant near-term channel, the overall path-dependence of monetary policy is weaker than prior models predict.&lt;/p&gt;
&lt;h3 id="q10-what-are-the-key-robustness-checks-and-how-do-they-address-potential-confounds"&gt;Q10. What are the key robustness checks and how do they address potential confounds?&lt;/h3&gt;
&lt;p&gt;Three main robustness exercises are reported. First, a QE1 robustness (Appendix) uses the large decline in mortgage rates after the first LSAP announcement in 2008 as an alternative shock, finding consistent substitution patterns (households shift into cash-out refinancing from other borrowing when pushed into the money). Second, a placebo test shifts the sample back six months and estimates the same specification over the twelve months preceding the Taper; Figure 8 shows no differential substitution by rate gap during this stable-rate period, supporting the interpretation that the Taper Tantrum rate increase drives the cross-sectional substitution pattern. The placebo does reveal a negative Post dummy for other borrowing, consistent with a possible pre-trend in other borrowing, which motivates the lower-bound elasticity calculation that avoids reliance on this coefficient. Third, the authors show that results are little changed when adjustable-rate mortgages (~10 percent of outstanding mortgages in 2013) are included in the sample.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Rate Gap:&lt;/strong&gt; The difference between a borrower&amp;rsquo;s outstanding fixed mortgage rate and the estimated current 30-year fixed mortgage rate available to that borrower if they were to refinance (adjusting for borrower-specific LTV and credit score). A positive rate gap means the borrower is &amp;ldquo;in the money&amp;rdquo; for a rate-term refinance. This is the paper&amp;rsquo;s central measure of refinance incentive, determining whether cash-out refinancing or an alternative borrowing product is the cost-minimizing option for satisfying a given liquidity need.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Debt Substitution:&lt;/strong&gt; The paper&amp;rsquo;s core mechanism: households shift their new borrowing across products (cash-out refinance, HELOC, CES, credit card, personal loan) in response to changes in relative borrowing costs, without proportionally changing total new borrowing. When the rate gap is positive, cash-out refinancing is the cheapest way to borrow (it lowers the rate on the existing balance while providing liquidity), so borrowers substitute from alternative products into cash-out. When the rate gap is negative or mortgage rates rise, borrowers substitute in the opposite direction, keeping their original mortgage rate intact by using alternative products.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cash-Out Refinance Channel of Monetary Policy:&lt;/strong&gt; The theoretical transmission mechanism by which monetary easing lowers mortgage rates, incentivizes in-the-money borrowers to refinance and extract home equity at reduced cost, and thereby stimulates consumption. Prior literature (Eichenbaum, Rebelo and Wong 2022) treats this channel as path-dependent and quantitatively important because it depends on the distribution of outstanding mortgage rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Path Dependence of Monetary Policy:&lt;/strong&gt; The property by which the same monetary policy shock generates different aggregate borrowing or consumption responses depending on the historical distribution of outstanding fixed mortgage rates, which reflects prior monetary policy. A large share of in-the-money borrowers (due to a prior rate-cutting cycle) amplifies the cash-out refinance channel; a large share of out-of-the-money borrowers weakens it. The paper shows this path dependence is substantially attenuated by debt substitution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;In-the-Money Borrower:&lt;/strong&gt; A borrower whose outstanding mortgage rate exceeds the current market mortgage rate (positive rate gap), creating a financial incentive to refinance. In-the-money status interacts with borrowing product choice because a cash-out refinance resets the interest cost on the entire existing balance, generating implicit savings that partially subsidize new liquidity extraction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Hurdle (Two-Tiered) Model:&lt;/strong&gt; An estimation approach that allows the decision to borrow (extensive margin) and the amount borrowed conditional on borrowing (intensive margin) to respond differently to covariates. The authors use this model to combine extensive and intensive margin estimates into a single aggregate borrowing elasticity, avoiding the distortion that arises from using dollar volume as a dependent variable when intensive and extensive margins have opposite responses to the rate gap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Taper Tantrum (2013):&lt;/strong&gt; A quasi-experimental shock used as the paper&amp;rsquo;s main source of exogenous variation. In late spring 2013, Federal Reserve communications about tapering large-scale asset purchases (LSAPs) caused the 30-year fixed mortgage rate to increase approximately 80 basis points within one month. Because the shock operated through long-term rate expectations, it moved mortgage rates without significantly affecting HELOC or consumer credit rates (tied to the unchanged federal funds and bank prime rates), enabling the authors to estimate substitution holding alternative product rates approximately fixed.&lt;/p&gt;</description></item><item><title>Bridging micro and macro production functions: The fiscal multiplier of infrastructure investment</title><link>https://macropaperwarehouse.com/papers/bridging-micro-and-macro-production-functions-the-fiscal-multiplier-of-infrastructure-investment/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/bridging-micro-and-macro-production-functions-the-fiscal-multiplier-of-infrastructure-investment/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper investigates the fiscal multiplier of infrastructure investment, specifically by incorporating firm-level investment decisions — a dimension absent from prior literature. The central analytical challenge is bridging the micro (firm-level) and macro (state-level) production functions for infrastructure, given that public capital is non-rivalrous: it can be used simultaneously by all firms without being depleted. The paper demonstrates that this non-rivalry generates a systematic discrepancy between firm-level and aggregate-level estimates of the elasticity of substitution between private and public capital, and it shows how this discrepancy shapes the magnitude of the fiscal multiplier.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors build and estimate a heterogeneous-firm general equilibrium model. Firms operate a constant-elasticity-of-substitution (CES) production function using private capital, non-rivalrous public capital (infrastructure), and labor. Firms are subject to idiosyncratic productivity shocks and make lumpy investment decisions subject to both fixed and convex capital adjustment costs, following Cooper and Haltiwanger (2006) and Winberry (2021). The economy has two regions — one with poor infrastructure and one with good infrastructure — motivated by the near-invariant cross-state distribution of infrastructure spending observed in U.S. data.&lt;/p&gt;
&lt;p&gt;The model is estimated via an extended Simulated Method of Moments (SMM) that treats market clearing prices as additional parameters estimated simultaneously with structural parameters, reducing computational cost relative to standard GE estimation. Estimation uses a multi-block Metropolis-Hastings algorithm. Target moments include lumpy investment fraction (0.14, from Zwick and Mahon 2017), average investment-to-capital ratio (0.10), standard deviation of i/k (0.16), private-to-infrastructure capital ratio (0.75, from BEA), high-infrastructure region&amp;rsquo;s private capital share (0.83, from Census BDS), and total working hours (0.33).&lt;/p&gt;
&lt;p&gt;The identification of the key parameter — the firm-level elasticity of substitution between private and public capital (λ) — comes from the relative size of private capital stocks across the two infrastructure groups: under greater complementarity, regions with more infrastructure should hold relatively more private capital.&lt;/p&gt;
&lt;p&gt;External validation is provided by estimating the state-level elasticity from the model&amp;rsquo;s simulated data using a nonlinear least squares method following An et al. (2019), and comparing it to empirical state-level estimates from actual U.S. state data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Firm-level vs. aggregate-level elasticity gap.&lt;/strong&gt; The estimated firm-level elasticity of substitution is λ = 1.185, implying gross substitutability between private and public capital at the firm level. The state-level elasticity implied by the same model is 0.48 (or 0.35 in a decreasing-returns-to-scale specification), implying gross complementarity. The empirical state-level counterpart estimated from actual U.S. data is 0.445. The paper proves theoretically (Proposition 1) that, given non-rivalry and under mild conditions, firm-level gross substitutability implies aggregate-level gross complementarity. Proposition 2 further shows that this same mechanism micro-founds the increasing-returns-to-scale assumption in Baxter and King&amp;rsquo;s (1993) Cobb-Douglas aggregate production function.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Fiscal multiplier (baseline, 2-year horizon).&lt;/strong&gt; The aggregate output multiplier over a 2-year horizon in the heterogeneous-firm general equilibrium model is &lt;strong&gt;1.088&lt;/strong&gt; in response to a one-time unexpected infrastructure spending shock equal to 1% of steady-state GDP, financed by a lump-sum tax. The corresponding partial-equilibrium output multiplier (holding prices fixed at steady state) is 1.858; the gap reflects crowding out of private investment induced by the general equilibrium interest rate response. In the baseline, the interest rate rises by 0.39% after the shock; the investment multiplier is -0.043.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Comparison with representative-agent model.&lt;/strong&gt; When the same implied returns-to-scale parameters are used in a representative-agent model (following Baxter and King 1993), the output multiplier is 0.991 and the investment multiplier is -0.157, both substantially lower than the heterogeneous-firm baseline. The key mechanism: under convex adjustment costs, the Jensen&amp;rsquo;s inequality effect implies that heterogeneous firms face a greater average adjustment burden than the representative firm, making their investment less responsive to the general equilibrium crowding-out pressure.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Sensitivity to elasticity of substitution.&lt;/strong&gt; Across the heterogeneous-firm model: at λ = 3 (high substitutability), the output multiplier falls to 0.672; at λ = 0.5 (complementarity), it rises to 1.364. The multiplier is significantly more sensitive to λ in the heterogeneous-firm model than in the representative-agent model, because non-rivalry amplifies the effect of any given elasticity value through each firm&amp;rsquo;s production function.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Cross-state distribution of gains.&lt;/strong&gt; Under the baseline spending allocation (81% to Good states, 19% to Poor states), per $1 of infrastructure spending, Good states receive $1.072 of the $1.088 total output gain, while Poor states receive only $0.016. In a counterfactual with equal spending across states, the total output multiplier falls to 0.873, Good states&amp;rsquo; output multiplier falls to 0.810, and Poor states&amp;rsquo; output multiplier rises to approximately 0.062 (about four times the baseline level of 0.016). This quantifies a sharp efficiency-equality trade-off in the allocation of infrastructure investment.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Employment and earnings effects.&lt;/strong&gt; Compared to steady state, the baseline fiscal shock produces an average annual increase of 0.304% in employment and 0.389% in wages, yielding a $0.713 increase in earnings and a $0.148 increase in consumption per $1 of fiscal spending in general equilibrium. In partial equilibrium (no price changes), earnings increase by $1.294 and consumption by $0.605 per $1 spent.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results are conditional on: (i) lump-sum tax financing of the fiscal shock; (ii) a one-time unexpected (MIT) shock with no persistence; (iii) a closed-economy framework with endogenous real interest rate; (iv) the estimated two-region structure calibrated to U.S. state-level infrastructure data; (v) firm-level investment dynamics calibrated to Compustat and BDS moments. The authors note that incorporating time-to-build assumptions (tested in an appendix) reduces the aggregate fiscal multiplier, consistent with Ramey (2020).&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-theoretical-result-connecting-firm-level-and-aggregate-level-elasticities-and-what-is-the-intuition"&gt;Q1. What is the core theoretical result connecting firm-level and aggregate-level elasticities, and what is the intuition?&lt;/h3&gt;
&lt;p&gt;A: Proposition 1 proves that, given non-rivalrous public capital and mild data conditions (at least one firm has private capital below total infrastructure, and aggregate private capital exceeds total infrastructure), if the firm-level elasticity of substitution λ ≥ 1 (gross substitutes), then the aggregate-level elasticity ξ &amp;lt; 1 (gross complements). The intuition is that a marginal increase in public capital raises the marginal product of private capital for every firm simultaneously due to non-rivalry; the sum of these MPK gains across all firms exceeds any single firm&amp;rsquo;s gain. To represent this amplified benefit within an aggregate production function, a stronger complementarity is required than what any single firm faces. Put differently, non-rivalry means aggregate private and public capital &amp;ldquo;look&amp;rdquo; more complementary than they truly are at the firm level.&lt;/p&gt;
&lt;h3 id="q2-how-does-non-rivalry-micro-found-the-baxter-king-aggregate-production-function"&gt;Q2. How does non-rivalry micro-found the Baxter-King aggregate production function?&lt;/h3&gt;
&lt;p&gt;A: Proposition 2 shows that if firms use a CES production function with gross substitutability (λ ≥ 1) and non-rivalrous public capital, then fitting aggregate output with a Cobb-Douglas production function (as in Baxter and King 1993, H(K,N,L) = zK^α L^{1-α} N^ζ) yields ζ &amp;gt; 0, implying increasing returns to scale (IRS). This is the paper&amp;rsquo;s micro-foundation for a widely-used but previously ad hoc assumption in the macro-fiscal literature. The corollary states that both gross complementarity in the aggregate CES function and IRS in the aggregate Cobb-Douglas follow from the same non-rivalry mechanism at the firm level.&lt;/p&gt;
&lt;h3 id="q3-why-does-the-heterogeneous-firm-model-produce-a-higher-output-multiplier-than-the-representative-agent-model"&gt;Q3. Why does the heterogeneous-firm model produce a higher output multiplier than the representative-agent model?&lt;/h3&gt;
&lt;p&gt;A: Two mechanisms drive the difference. First, due to Jensen&amp;rsquo;s inequality and the convexity of adjustment costs, heterogeneous firms face a higher average adjustment burden than the representative (average) firm; this means heterogeneous firms are less responsive to interest rate changes that crowd out investment. The investment multiplier is -0.043 in the heterogeneous-agent baseline versus -0.157 in the representative-agent model. Second, the fixed adjustment cost (present in the baseline but absent from the representative-agent model) further dampens investment sensitivity via the extensive margin. Because less private investment is crowded out, more of the direct output boost from infrastructure spending survives into the aggregate multiplier, yielding 1.088 versus 0.991.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-novel-estimation-procedure-and-why-is-it-necessary"&gt;Q4. What is the novel estimation procedure and why is it necessary?&lt;/h3&gt;
&lt;p&gt;A: Standard SMM applied to GE models requires solving for market-clearing prices for every candidate parameter vector, creating a nested optimization loop that is computationally prohibitive. The authors extend SMM by treating market-clearing prices (wage w and marginal utility of consumption p) as additional parameters and appending market-clearing conditions as additional target moments — effectively requiring those moments to equal zero. A multi-block Metropolis-Hastings algorithm jointly draws from the price block and the parameter block. This approach generates posterior draws that simultaneously satisfy market clearing and fit empirical moments, without the inner loop. The resulting market-clearing accuracy is e^{-4} at the posterior mean.&lt;/p&gt;
&lt;h3 id="q5-how-is-the-firm-level-elasticity-of-substitution-λ-identified-from-the-data"&gt;Q5. How is the firm-level elasticity of substitution (λ) identified from the data?&lt;/h3&gt;
&lt;p&gt;A: λ is identified from the cross-state difference in private capital stocks between high- and low-infrastructure regions. Under the model, if private and public capital are more complementary (lower λ), high-infrastructure regions should attract relatively more private capital. The data moment used is the Good region&amp;rsquo;s share of aggregate private capital (0.83 from Census BDS data). This identification strategy is analogous to Bartik-instrument approaches in the empirical literature, where a parameter governing cross-state sensitivity to aggregate shocks is identified from cross-sectional variation.&lt;/p&gt;
&lt;h3 id="q6-how-is-the-model-validated-externally"&gt;Q6. How is the model validated externally?&lt;/h3&gt;
&lt;p&gt;A: The authors compute the state-level elasticity from the estimated model by fixing firm-level parameters and re-estimating only the elasticity and regional productivity from the model&amp;rsquo;s simulated state-level data, using the same NLLS estimator as An et al. (2019). The model-implied state-level elasticity is 0.349 (DRS specification) or 0.482 (CRS specification). The empirical estimate from actual U.S. state-level data following the same estimator is 0.445. Both indicate gross complementarity at the state level, consistent with the theoretical prediction. This external validation is not used in the estimation itself, providing an independent check.&lt;/p&gt;
&lt;h3 id="q7-what-are-the-roles-of-extensive-vs-intensive-investment-margins-in-the-crowding-out-effect"&gt;Q7. What are the roles of extensive vs. intensive investment margins in the crowding-out effect?&lt;/h3&gt;
&lt;p&gt;A: Table 9 decomposes the investment multiplier of -0.043 by investment margin. When only the extensive margin (the discrete decision of whether to invest) is allowed to respond, the investment multiplier is -0.032 — approximately 74% of the baseline crowding-out effect. When only the intensive margin (investment size conditional on adjusting) responds, the multiplier is -0.011 — about 25% of the total. Thus the extensive margin is the dominant channel through which higher interest rates crowd out private investment. When both margins are held fixed, the output multiplier rises to 1.139, confirming that investment crowding-out reduces the output multiplier by about 0.05.&lt;/p&gt;
&lt;h3 id="q8-how-does-the-elasticity-of-substitution-affect-the-fiscal-multiplier-quantitatively-and-why-does-this-matter-more-in-the-heterogeneous-firm-model"&gt;Q8. How does the elasticity of substitution affect the fiscal multiplier quantitatively, and why does this matter more in the heterogeneous-firm model?&lt;/h3&gt;
&lt;p&gt;A: In the heterogeneous-firm GE model: λ = 3 gives an output multiplier of 0.672, λ = 1.185 (baseline) gives 1.088, and λ = 0.5 gives 1.364 — a range of 0.692. In the representative-agent model, the comparable range across the implied ζ values is much narrower (0.970 to 0.998). The amplification in the heterogeneous-firm model occurs because non-rivalry means each firm&amp;rsquo;s production function directly incorporates the public capital stock, so the elasticity parameter has first-order consequences for every firm&amp;rsquo;s investment incentive response to a fiscal shock. This heightened sensitivity underscores why accurately estimating λ at the firm level — rather than importing a state-level estimate — is critical for quantifying infrastructure multipliers.&lt;/p&gt;
&lt;h3 id="q9-what-is-the-efficiency-equality-trade-off-in-cross-state-infrastructure-allocation"&gt;Q9. What is the efficiency-equality trade-off in cross-state infrastructure allocation?&lt;/h3&gt;
&lt;p&gt;A: Under the baseline allocation (81% of infrastructure spending to Good states, 19% to Poor states), per $1 of infrastructure spending, the Good states receive $1.072 of output gains and Poor states receive only $0.016. In the equal-spending counterfactual, the total output multiplier falls from 1.088 to 0.873. The Poor states&amp;rsquo; output multiplier rises from $0.016 to $0.062 (approximately fourfold), while the Good states&amp;rsquo; falls from $1.072 to $0.810. The Poor states also see earnings multipliers more than double (from $0.017 to $0.042). This trade-off arises because Good states have both more private capital (benefiting from non-rivalry) and higher estimated TFP — so each dollar of infrastructure is more productive there. Equal allocation reduces aggregate efficiency while partially mitigating regional inequality.&lt;/p&gt;
&lt;h3 id="q10-how-do-the-papers-multiplier-estimates-compare-to-the-existing-literature"&gt;Q10. How do the paper&amp;rsquo;s multiplier estimates compare to the existing literature?&lt;/h3&gt;
&lt;p&gt;A: In partial equilibrium (no GE adjustment), the authors find an output multiplier of 1.858, consistent with Chodorow-Reich&amp;rsquo;s (2019) cross-sectional multiplier of approximately 1.8. Once the general equilibrium interest rate effect is included, the multiplier falls to 1.09, which falls within the 0.6-1.2 range from Ramey (2011). Literature using representative-agent models without non-rivalry (e.g., Ramey 2020) typically reports multipliers of 0.3 to 0.8 using returns-to-scale parameters of 0.07-0.12; the paper shows these correspond to fiscal multipliers of 0.847-0.882 in the representative-agent framework. The heterogeneous-firm model, once it incorporates the non-rivalry-corrected elasticities, yields a meaningfully higher multiplier of 1.088.&lt;/p&gt;
&lt;h3 id="q11-what-role-does-time-to-build-play-and-how-does-the-paper-handle-it"&gt;Q11. What role does time-to-build play, and how does the paper handle it?&lt;/h3&gt;
&lt;p&gt;A: The baseline model assumes a time-to-build period s = 1 year (one-year lag before new infrastructure is productive). The paper notes in Appendix H that incorporating extended time-to-build reduces the aggregate fiscal multiplier, operating through two channels: a news effect (agents adjust behavior upon anticipating future infrastructure) and a general equilibrium effect endogenous to the news effect. This finding is consistent with Ramey (2020). The baseline results are therefore reported under the minimal one-year time-to-build assumption, with longer lags serving as a robustness check.&lt;/p&gt;
&lt;h3 id="q12-what-is-the-role-of-region-specific-tfp-heterogeneity-in-the-model"&gt;Q12. What is the role of region-specific TFP heterogeneity in the model?&lt;/h3&gt;
&lt;p&gt;A: The model includes two regions that differ both in infrastructure levels and in region-specific productivity (TFP) levels. The TFP of the Good region is estimated to be approximately double that of the Poor region (x = 2.064 for Good vs. 1 for Poor). This productivity difference is estimated to partially capture heterogeneous congestion effects (which are not separately modeled) and is estimated jointly with the infrastructure elasticity. The productivity differential is identified from the Good region&amp;rsquo;s share of aggregate output (0.849 in the data). The large TFP gap is also the reason why equal spending on Poor states generates a much smaller output gain than spending on Good states: not only is infrastructure utilization lower (fewer firms), but underlying productivity is also lower.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Non-rivalry of public capital&lt;/strong&gt;: The property by which infrastructure stock (Nj,t) enters each firm&amp;rsquo;s production function at the full regional level, not divided among firms. Formally, a single marginal unit of public capital raises every firm&amp;rsquo;s marginal product of private capital simultaneously, so the aggregate marginal product gain summed across firms exceeds any single firm&amp;rsquo;s gain. This is the central mechanism driving the micro-macro elasticity discrepancy in the paper.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Firm-level elasticity of substitution (λ)&lt;/strong&gt;: The elasticity governing the degree of substitutability between private capital (k) and public infrastructure (N) in the firm&amp;rsquo;s CES production function. At λ = 1 the production function is Cobb-Douglas; λ &amp;gt; 1 is gross substitutability; λ &amp;lt; 1 is gross complementarity. In the paper&amp;rsquo;s estimation, λ = 1.185, meaning private and public capital are gross substitutes at the firm level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Gross substitutability vs. gross complementarity&lt;/strong&gt;: Two inputs are gross substitutes (complements) if an increase in the quantity of one raises (lowers) the demand for the other, holding output price fixed. In the paper&amp;rsquo;s framework, private and public capital are gross substitutes at the firm level (λ = 1.185 &amp;gt; 1) but gross complements at the state level (ξ ≈ 0.48 &amp;lt; 1), with non-rivalry explaining the inversion upon aggregation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Convex adjustment cost&lt;/strong&gt;: A cost C(I,k) = (µ/2)(I/k)² · k that scales quadratically with the investment rate. In the heterogeneous-firm model, this cost plays a critical role: by Jensen&amp;rsquo;s inequality, heterogeneous firms&amp;rsquo; average adjustment burden under a convex cost exceeds that of the representative (average) firm, making aggregate investment less sensitive to interest rate changes and thereby dampening crowding out.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fixed adjustment cost (ξ)&lt;/strong&gt;: A one-time overhead cost drawn from a uniform distribution [0, ξ̄], paid only when a firm makes a large-scale investment outside the &amp;ldquo;inaction band&amp;rdquo; [−νk, νk]. This cost generates lumpy investment at the firm level, with about 14% of firms making lumpy investments in any given year. It also creates an extensive margin of investment adjustment that accounts for approximately 74% of the baseline crowding-out effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fiscal multiplier (as defined in this paper)&lt;/strong&gt;: The ratio of the present value of aggregate output deviations from steady state to the present value of the fiscal spending shock, both summed over a T-year horizon. For the short run, T = 2 years; for the long run, T = 5 years. This is computed as a perfect-foresight transition path response to a one-time MIT shock equal to 1% of steady-state GDP.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;MIT shock (one-time unexpected shock)&lt;/strong&gt;: An unanticipated, non-persistent one-period deviation in infrastructure spending. The term &amp;ldquo;MIT shock&amp;rdquo; refers to a deterministic transition experiment where agents have perfect foresight about all future values after the initial shock occurs. This contrasts with persistent policy rules and allows isolating the dynamic effects of a one-time fiscal impulse.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Extended SMM with market-clearing moments&lt;/strong&gt;: The paper&amp;rsquo;s estimation innovation. Rather than solving for market-clearing prices at each parameter candidate (the standard costly inner loop), wages (w) and marginal utility of consumption (p) are treated as parameters with associated moments being the market-clearing conditions set to zero. A multi-block Metropolis-Hastings algorithm draws from the price block and the parameter block separately, generating posterior draws that jointly satisfy market clearing and empirical moment conditions.&lt;/p&gt;</description></item><item><title>Can Deficits Finance Themselves?</title><link>https://macropaperwarehouse.com/papers/can-deficits-finance-themselves/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/can-deficits-finance-themselves/</guid><description>&lt;p&gt;The paper asks whether a government can run a deficit today — issuing &amp;ldquo;stimulus checks&amp;rdquo; — and allow debt to return to its initial level without any future tax hike or spending cut. In environments combining &lt;strong&gt;(i) nominal rigidity&lt;/strong&gt; and &lt;strong&gt;(ii) a violation of Ricardian equivalence&lt;/strong&gt; (due to finite lives or liquidity constraints), this is possible through two complementary self-financing channels: (a) a Keynesian boom in real activity that expands the tax base and automatically raises revenue at existing tax rates; and (b) a surge in inflation that erodes the real value of outstanding nominal government debt. The paper&amp;rsquo;s headline result is that &lt;strong&gt;self-financing increases monotonically as fiscal adjustment is delayed&lt;/strong&gt;, converging to &lt;strong&gt;full self-financing&lt;/strong&gt; in the limit: if monetary policy does not lean too heavily against the fiscal stimulus, the initial deficit eventually returns debt to trend with no required future adjustment. Calibrated to empirical evidence on intertemporal MPCs, the speed of fiscal adjustment, the Phillips curve slope, and the monetary reaction, the model finds self-financing up to &lt;strong&gt;ν ≈ 0.95&lt;/strong&gt; — with the tax base channel dominant and inflation contributing negligibly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Environment&lt;/strong&gt; (Section 2): Baseline is a perpetual-youth overlapping-generations (OLG) version of the textbook New Keynesian model. Households survive from one period to the next with probability ω ∈ (0,1]; when ω=1 the model reduces to the standard PIH-RANK benchmark in which Ricardian equivalence holds and no self-financing occurs. When ω&amp;lt;1, two properties of consumer demand emerge: (i) consumers discount future disposable income at a rate higher than the interest rate (&amp;ldquo;discounting&amp;rdquo;), so a distant future tax hike barely affects today&amp;rsquo;s spending; (ii) consumers spend transfers relatively quickly (&amp;ldquo;front-loading&amp;rdquo;), so the Keynesian boom plays out before the promised tax hike arrives. The supply block is exactly the standard NKPC. Fiscal policy follows a rule in which taxes respond to income through a fixed tax rate τy (tax base channel) and to debt through a speed-of-adjustment coefficient τd ∈ (0,1) (with τd→0 meaning indefinitely delayed adjustment). Monetary policy keeps (expected) real rates constant in the baseline — a &amp;ldquo;neutral&amp;rdquo; benchmark that neither offsets nor amplifies the fiscal stimulus.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Self-financing result&lt;/strong&gt; (Sections 3–4): Starting from a date-0 deficit shock ε0 (lump-sum transfer of 1% of steady-state output), define the &lt;strong&gt;degree of self-financing&lt;/strong&gt; ν as the fraction of ε0 financed by the tax base and debt erosion channels; 1−ν equals the discounted present value of future tax hikes required to stabilize debt. The central results are:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Theorem 1 (baseline, φ=0)&lt;/strong&gt;: If ω&amp;lt;1 and τy&amp;gt;0, ν increases monotonically as τd→0, with ν→1 in the limit. Intuition via two-period analogy: when cumulative short-run MPC → 1, the Keynesian multiplier → 1/τy, and the induced tax revenue → 1 — exactly financing the original ε0.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Proposition 3&lt;/strong&gt;: For any given τd or delay H, ν is strictly decreasing in ω: larger departures from permanent income (smaller ω) deliver faster and larger Keynesian booms and hence greater self-financing.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Theorem 2 (general monetary policy)&lt;/strong&gt;: Under a general real rate rule rt = φ·yt, there exists a threshold φ̄ ∈ (0, τy/(β·D^ss/Y^ss)) such that: if φ&amp;lt;φ̄, full self-financing is achieved in the limit; if φ&amp;gt;φ̄, ν is bounded strictly below 1 by ν̄(φ). If the monetary authority perfectly stabilizes output and inflation (φ→∞), ν=0 by construction.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Theorem 3 (general aggregate demand)&lt;/strong&gt;: With generalized demand ct = Md·dt + My·(yt−tt) + δ·Et[Σ(βω)^k(yt+k−tt+k)], self-financing holds whenever (i) ω&amp;lt;1 and (ii) Md&amp;gt;1−β and My·(1 + δ·βω/(1−βω)) ≥ 1. This nests the baseline OLG model, hybrid spender-OLG models, and approximately represents quantitative HANK models.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Distinction from FTPL&lt;/strong&gt;: The Fiscal Theory of the Price Level (Cochrane) breaks Ricardian equivalence through equilibrium selection in a PIH-RANK setting; the self-financing here operates under the &lt;em&gt;conventional&lt;/em&gt; equilibrium, with an active monetary authority and passive fiscal authority. The inflation channel is not the focal mechanism — the tax base channel is dominant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Calibration&lt;/strong&gt; (Table 1, hybrid OLG-spender model, quarterly frequency):&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;Consumer spending&lt;/em&gt;: share of hand-to-mouth (HtM) spenders µ = 0.073; OLG survival rate ω = 0.865; jointly matched to average MPC = 0.2 and short-run MPC slope from Fagereng, Holm, and Natvik (2021)&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Fiscal adjustment&lt;/em&gt;: τd ∈ {0.085, 0.026, 0.004} (fast to slow; from Galí et al. 2007, Bianchi-Melosi 2017, Auclert-Rognlie 2020 respectively; equivalent to H ∈ {12, 23, 43} quarters under the non-Markovian rule)&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Monetary policy&lt;/em&gt;: real rate feedback φ = 0 (neutral baseline)&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Nominal rigidities&lt;/em&gt;: NKPC slope κ = 0.0062 (Hazell et al. 2022 point estimate)&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Standard parameters&lt;/em&gt;: EIS σ=1 (log utility); β = 0.998 (1% annual real rate); tax feedback τy = 0.33 (DeLong-Summers benchmark: 33 cents of surplus per dollar of output); liquid wealth D^ss/Y^ss = 1.04 (Kaplan et al. 2018)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Quantitative results&lt;/strong&gt; (Figure 3, Table 2):&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;For empirically calibrated τd range, &lt;strong&gt;ν reaches up to 0.95&lt;/strong&gt;, nearly full self-financing in the most realistic (slow adjustment) specification&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Virtually all self-financing (≈95–100%) occurs through the tax base channel&lt;/strong&gt; — the flat NKPC (κ=0.0062) limits inflation and debt erosion to a negligible share; with steeper NKPC (κ=0.1), about &lt;strong&gt;20% of self-financing comes through date-0 inflation&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;The quantitative fiscal multiplier at τd=0.085 is &lt;strong&gt;1.11&lt;/strong&gt;, consistent with Ramey (2011) empirical estimates for transfers with relatively quick adjustment&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Table 2 (νmax as function of monetary ψ and NKPC κ)&lt;/strong&gt;: Full self-financing (νmax = 1) is attainable when ψ ≤ 1.25 and κ = 0.0062; drops to νmax = 0.63 at ψ=1.5 and κ=0.0062; drops to νmax = 0.22 with κ=0.1 and ψ=1; approaches 0 with both aggressive monetary and flexible prices. Key lesson: moderate monetary reaction combined with flat NKPC (consistent with evidence) supports near-full self-financing.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Robustness&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;HANK model&lt;/em&gt;: same conclusions as hybrid spender-OLG; intertemporal MPCs nearly identical (Wolf, 2021; Auclert et al., 2023)&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Distortionary fiscal adjustment&lt;/em&gt;: negligible impact, since the required adjustment itself vanishes in the limit&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Government purchases&lt;/em&gt;: same self-financing logic applies (Keynesian boom raises tax revenue)&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Investment&lt;/em&gt;: Keynesian cross applies to consumption; net of investment aggregate demand follows the same law of motion — self-financing result unchanged&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions&lt;/strong&gt;: Self-financing requires Ricardian equivalence to fail (ω&amp;lt;1); in the PIH-RANK benchmark (ω=1), neither self-financing channel is operative. Monetary accommodation is assumed neutral or weak; aggressive offsetting (φ&amp;gt;φ̄) prevents full self-financing. The paper is purely positive: whether deficits are optimal is a separate normative question. Results are log-linearized dynamics; the quantitative conclusions depend on discipline from empirical MPC evidence, NKPC estimates, and fiscal adjustment speed. The self-financing mechanism operates through aggregate demand and is not driven by r&amp;lt;g or by seigniorage from a convenience yield.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-two-period-intuition-for-full-self-financing"&gt;Q1. What is the two-period intuition for full self-financing?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In a two-period economy with fully myopic consumers (MPC=1), a date-0 transfer of ε stimulates output by y = MPC/(1−MPC·(1−τy)) · ε, generating tax revenue τy·y; with MPC→1 the output multiplier converges to 1/τy and tax revenue converges to exactly ε — full self-financing via the tax base.&lt;/strong&gt; The infinite-horizon economy with ω&amp;lt;1 mirrors this intuition when fiscal adjustment is delayed far enough: the &amp;ldquo;short run&amp;rdquo; cumulative MPC approaches 1 (by discounting and front-loading), the Keynesian cross delivers a multiplier of 1/τy, and the additional tax revenue precisely repays the deficit, with no future tax hike needed.&lt;/p&gt;
&lt;h3 id="q2-why-does-the-degree-of-self-financing-ν-increase-as-fiscal-adjustment-is-delayed"&gt;Q2. Why does the degree of self-financing ν increase as fiscal adjustment is delayed?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;As the gap H between the date-0 transfer and the promised future tax hike widens, two effects amplify the Keynesian boom: (i) near-term demand is less dampened by anticipation of the future tax hike (discounting makes far-ahead taxes nearly irrelevant to today&amp;rsquo;s spending); and (ii) the general equilibrium income feedback — the Keynesian cross — has more time to play out before being curtailed by the eventual tax hike, amplifying the total output and revenue response.&lt;/strong&gt; The longer the delay, the larger the short-run cumulative MPC, and the larger the fraction of the deficit self-financed through the tax base.&lt;/p&gt;
&lt;h3 id="q3-why-does-aggressive-monetary-policy-block-self-financing"&gt;Q3. Why does aggressive monetary policy block self-financing?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;If the monetary authority raises real interest rates in response to the fiscal boom (φ&amp;gt;0), it discourages household spending, slowing and shrinking the Keynesian boom; above the threshold φ̄, the real rate increase is strong enough to counteract the tax base feedback before the cumulative MPC can converge to 1, meaning full self-financing becomes impossible and some future fiscal adjustment is always required.&lt;/strong&gt; Conversely, monetary accommodation (φ&amp;lt;0) accelerates the boom and permits full self-financing with less delay, while perfectly stabilizing output and inflation (φ→∞) entirely shuts down both self-financing channels.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-role-of-the-nkpc-slope-in-determining-which-channel-operates"&gt;Q4. What is the role of the NKPC slope in determining which channel operates?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;When the NKPC is flat (κ=0.0062, the Hazell et al. 2022 estimate), a large output boom generates negligible inflation, so debt erosion contributes almost nothing and the tax base channel carries essentially all the self-financing; when the NKPC is steep (κ=0.1, consistent with supply-constrained post-COVID), the same boom generates materially more inflation, shifting the financing split so that ~20% comes through debt erosion while ~80% still comes through the tax base.&lt;/strong&gt; The overall degree of self-financing ν is affected only through the monetary response: a steeper NKPC triggers a more aggressive real rate response, moderating the boom, but this is captured in the analysis of Theorem 2 and Table 2.&lt;/p&gt;
&lt;h3 id="q5-how-does-this-paper-relate-to-and-differ-from-the-fiscal-theory-of-the-price-level-ftpl"&gt;Q5. How does this paper relate to and differ from the Fiscal Theory of the Price Level (FTPL)?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The FTPL (Cochrane) achieves deficit financing through inflation in a PIH-RANK environment by abandoning the Taylor principle and exploiting equilibrium selection; this paper requires no such departure — both monetary and fiscal policy follow conventional active/passive assignments, and the equilibrium studied is the unique bounded one.&lt;/strong&gt; The key difference is in the consumer block: Ricardian equivalence fails here through finite lives or liquidity constraints (empirically grounded), not through equilibrium selection. Moreover, while FTPL highlights the debt erosion (inflation) channel, this paper finds the tax base (real activity) channel is dominant under empirically calibrated flat Phillips curves.&lt;/p&gt;
&lt;h3 id="q6-what-new-conditions-on-aggregate-demand-ensure-self-financing-extends-beyond-the-olg-baseline"&gt;Q6. What new conditions on aggregate demand ensure self-financing extends beyond the OLG baseline?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Theorem 3 identifies two sufficient conditions: (1) &amp;ldquo;positive geometric discounting&amp;rdquo; (ω&amp;lt;1 in the generalized demand block), ensuring that far-ahead future taxes have negligible effect on current demand; and (2) &amp;ldquo;sufficient front-loading&amp;rdquo; (Md &amp;gt; 1−β and My·(1 + δ·βω/(1−βω)) ≥ 1), ensuring that income is spent quickly enough for the Keynesian feedback to deliver self-financing before debt explodes.&lt;/strong&gt; The classical PIH-RANK fails condition (1); the spender-saver model with any margin of PIH consumers fails condition (2); the OLG baseline satisfies both; and the hybrid spender-OLG (the quantitative workhorse) satisfies both for any ω&amp;lt;1.&lt;/p&gt;
&lt;h3 id="q7-is-a-margin-of-truly-pih-consumers-fatal-for-self-financing"&gt;Q7. Is a margin of truly PIH consumers fatal for self-financing?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Yes — introducing any strictly positive mass of PIH consumers breaks self-financing entirely, creating a discontinuity: ν=0 whenever µ_PIH &amp;gt; 0, no matter how small.&lt;/strong&gt; The intuition is that PIH consumers never fully spend any income received in finite time (they smooth it across their infinite horizon), so the cumulative MPC never reaches 1 and the Keynesian boom cannot fully finance the deficit. However, the discontinuity is fragile: replacing literal PIH consumers with &amp;ldquo;near-PIH&amp;rdquo; consumers (finite but large ω) restores ν→1 in the limit as H→∞ and is consistent with empirical evidence on high MPCs for liquid households.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;fiscal self-financing&lt;/strong&gt; : the property that a deficit-financed government transfer raises output and inflation sufficiently to replenish government revenue (via the tax base channel) and reduce the real debt burden (via the inflation/debt erosion channel), allowing debt to return to steady state without future tax increases; the degree ν ∈ [0,1] measures what fraction of the initial deficit is self-financed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;tax base channel&lt;/strong&gt; : the mechanism by which a Keynesian boom in real activity — triggered by the deficit-financed transfer — automatically raises tax revenue (by τy dollars per dollar of additional output) without any change in tax rates; dominant over the debt erosion channel whenever the NKPC is flat (empirically, κ ≈ 0.006).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;discounting and front-loading&lt;/strong&gt; : the two consumer demand properties necessary for self-financing; &amp;ldquo;discounting&amp;rdquo; (ω&amp;lt;1) means far-ahead future taxes barely affect current spending, allowing the deficit to stimulate demand even with a promised future tax hike; &amp;ldquo;front-loading&amp;rdquo; means the income response is spent quickly, so the Keynesian boom plays out before the delayed tax hike arrives, raising tax revenue sufficiently to finance the deficit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;speed of fiscal adjustment&lt;/strong&gt; (τd) : the quarterly feedback from public debt to tax revenue in the fiscal rule; τd→0 means indefinitely delayed adjustment and maximum self-financing; empirically disciplined values range from τd=0.085 (fast, Galí et al. 2007) to τd=0.004 (slow, Auclert-Rognlie 2020), with νmax ≈ 0.95 across this range under neutral monetary policy and flat NKPC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;hybrid spender-OLG model&lt;/strong&gt; : the paper&amp;rsquo;s quantitative workhorse, combining a fraction µ of hand-to-mouth spenders with OLG perpetual-youth consumers; jointly calibrated to match the impact and short-run MPCs from Fagereng et al. (2021), while also providing a close proxy for aggregate demand in quantitative HANK models (Auclert et al. 2023; Wolf 2021).&lt;/p&gt;</description></item><item><title>Can Trade Policy Mitigate Climate Change?</title><link>https://macropaperwarehouse.com/papers/can-trade-policy-mitigate-climate-change/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/can-trade-policy-mitigate-climate-change/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;Farrokhi and Lashkaripour (2025) study the interaction between trade policy and climate change. The central research question is whether and how countries can use trade policy — specifically import tariffs — to address carbon leakage arising from domestic carbon pricing. When a country prices carbon domestically, production and emissions can shift to countries without carbon pricing, partially offsetting domestic emissions reductions. The paper asks how optimal import tariffs should be designed to internalize this leakage, how they relate to standard terms-of-trade tariffs, and what additional gains multilateral coordination can deliver.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology and Data.&lt;/strong&gt; The paper develops a multi-country, multi-sector trade model in which carbon emissions are proportional to output with sector-specific emission intensities, and countries choose trade taxes and subsidies strategically in Nash equilibrium alongside domestic carbon prices. The model is calibrated to 43 countries and 56 sectors using the 2014 baseline from the World Input-Output Database (WIOD 2016) for trade flows and input-output linkages, IEA data for sector-level carbon emissions, and GTAP for trade elasticities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt; The paper&amp;rsquo;s first key result is that the optimal unilateral import tariff decomposes additively into a standard terms-of-trade component and a carbon leakage correction component. The carbon leakage correction is proportional to the emission intensity of imports from the exporting country in that sector and to the gap between the social cost of carbon and the actual domestic carbon price in the exporting country, divided by the import price. This decomposition implies that countries have incentives to impose import tariffs beyond those justified by standard terms-of-trade arguments, specifically to correct for the carbon embodied in imports from countries with insufficient carbon pricing.&lt;/p&gt;
&lt;p&gt;The paper derives a sufficient statistic for the optimal carbon tariff that depends only on observable trade elasticities and emission intensities, enabling calibration without full structural estimation beyond the model&amp;rsquo;s standard parameters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative Magnitudes.&lt;/strong&gt; In the calibrated model, optimal unilateral carbon tariffs are on average 30% above standard optimal tariffs globally (28% above for the EU; 33% above for the US). The excess is largest in carbon-intensive sectors: petroleum products (41% above standard optimal), cement and non-metallic minerals (45% above standard optimal), basic metals (38% above standard optimal), and chemicals (32% above standard optimal). Imposing the optimal unilateral carbon tariff yields a welfare gain of +0.8% consumption equivalent for the imposing country, with trading partners losing on average 0.3%, and a net global gain of +0.4%.&lt;/p&gt;
&lt;p&gt;Multilateral coordination — a symmetric global carbon pricing agreement — eliminates the strategic motive for carbon trade wars, delivers an additional global welfare gain of +0.6% above the unilateral optimum, and eliminates 85% of the carbon leakage remaining under unilateral policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;CBAM Analysis.&lt;/strong&gt; The paper evaluates the EU Carbon Border Adjustment Mechanism (CBAM) against the theoretically optimal carbon tariff. The EU CBAM as currently implemented — covering only direct emissions — captures 60% of the theoretically optimal carbon tariff. Extending coverage to indirect (supply-chain) emissions would capture 85% of optimal. The welfare gain to the EU from CBAM relative to no border adjustment is +0.4%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions and Robustness.&lt;/strong&gt; Results are qualitatively robust to trade elasticity assumptions but quantitatively sensitive to them. Optimal carbon tariffs are regressive with respect to developing countries; multilateral coordination mitigates this distributional effect via income transfers. General equilibrium labor market effects reduce welfare gains by approximately 20% but do not change the qualitative ranking of policies.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-formal-structure-of-the-optimal-unilateral-import-tariff-in-the-presence-of-carbon-externalities"&gt;Q1. What is the formal structure of the optimal unilateral import tariff in the presence of carbon externalities?&lt;/h3&gt;
&lt;p&gt;The optimal import tariff from country j in sector s is tau*_js = tau^ToT_js + tau^carbon_js, where tau^ToT is the standard terms-of-trade optimal tariff (inverse of the export supply elasticity) and tau^carbon is a carbon leakage correction equal to e_js × (lambda_j − lambda*) / P_js. Here e_js is the emission intensity of country j in sector s, lambda_j is the social cost of carbon in the importing country, lambda* is the actual domestic carbon price in the exporting country, and P_js is the import price. Countries therefore have two distinct and additive incentives to impose import tariffs: the classical terms-of-trade motive and a novel carbon leakage correction motive.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-sufficient-statistic-result-and-why-does-it-matter-for-implementation"&gt;Q2. What is the sufficient statistic result and why does it matter for implementation?&lt;/h3&gt;
&lt;p&gt;The paper shows that the optimal carbon tariff can be expressed as a function of observable trade elasticities and emission intensities alone, without requiring estimation of structural parameters beyond those standard to the trade model. This sufficient statistic result matters because it means regulators can in principle calculate and implement the theoretically optimal carbon border adjustment using data that are already collected — sectoral emission intensities and trade elasticities — rather than relying on unobservable structural primitives.&lt;/p&gt;
&lt;h3 id="q3-by-how-much-do-optimal-carbon-tariffs-exceed-standard-optimal-tariffs-in-the-aggregate-and-in-the-most-carbon-intensive-sectors"&gt;Q3. By how much do optimal carbon tariffs exceed standard optimal tariffs in the aggregate and in the most carbon-intensive sectors?&lt;/h3&gt;
&lt;p&gt;Globally, optimal unilateral carbon tariffs are on average 30% above standard optimal tariffs (28% above for the EU, 33% above for the US). The excess is largest in highly carbon-intensive sectors: cement and non-metallic minerals (45% above), petroleum products (41% above), basic metals (38% above), and chemicals (32% above). These are precisely the sectors where emission intensities are highest, consistent with the carbon leakage correction being proportional to emission intensity.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-welfare-effects-of-unilateral-optimal-carbon-tariff-policy"&gt;Q4. What are the welfare effects of unilateral optimal carbon tariff policy?&lt;/h3&gt;
&lt;p&gt;For the country imposing the optimal unilateral carbon tariff, the welfare gain is +0.8% in consumption-equivalent terms relative to no carbon tariff. Trading partners lose on average 0.3%. The net global welfare gain is +0.4%. These numbers reflect the fact that unilateral carbon tariffs are partly beggar-thy-neighbor in structure — they improve the imposing country&amp;rsquo;s terms of trade in addition to correcting leakage — which is why multilateral coordination is needed to eliminate the strategic distortion.&lt;/p&gt;
&lt;h3 id="q5-what-additional-gains-does-multilateral-coordination-deliver-over-unilateral-policy"&gt;Q5. What additional gains does multilateral coordination deliver over unilateral policy?&lt;/h3&gt;
&lt;p&gt;Multilateral coordination — modeled as a symmetric global carbon pricing agreement — generates an additional global welfare gain of +0.6% above the unilateral optimum. It also eliminates 85% of the carbon leakage that persists under unilateral policy. The mechanism is that coordination removes the strategic motive for trade wars over carbon policy: under unilateral policy, each country has an incentive to impose carbon tariffs partly for terms-of-trade reasons, but under a coordinated agreement these beggar-thy-neighbor components are internalized.&lt;/p&gt;
&lt;h3 id="q6-how-well-does-the-eus-cbam-as-actually-implemented-capture-the-theoretically-optimal-carbon-border-adjustment"&gt;Q6. How well does the EU&amp;rsquo;s CBAM as actually implemented capture the theoretically optimal carbon border adjustment?&lt;/h3&gt;
&lt;p&gt;The EU CBAM as implemented — covering only direct emissions from covered sectors — captures 60% of the theoretically optimal carbon tariff. Extending the CBAM to include indirect emissions embedded in supply chains would raise this to 85% of optimal. The remaining gap (15% under the extended CBAM) reflects the difficulty of accounting for all upstream emission intensities across complex global supply chains.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-welfare-gain-to-the-eu-from-cbam-relative-to-no-border-adjustment"&gt;Q7. What is the welfare gain to the EU from CBAM relative to no border adjustment?&lt;/h3&gt;
&lt;p&gt;The welfare gain to the EU from implementing CBAM (relative to having no carbon border adjustment at all) is +0.4% in consumption-equivalent terms. This figure corresponds to the direct CBAM as implemented, covering only direct emissions.&lt;/p&gt;
&lt;h3 id="q8-how-sensitive-are-the-results-to-trade-elasticity-assumptions-and-what-are-the-distributional-implications-for-developing-countries"&gt;Q8. How sensitive are the results to trade elasticity assumptions, and what are the distributional implications for developing countries?&lt;/h3&gt;
&lt;p&gt;The results are qualitatively robust to trade elasticity assumptions but quantitatively sensitive — the magnitude of optimal carbon tariffs and welfare effects depends on the specific elasticities used. On distributional grounds, optimal carbon tariffs are regressive with respect to developing countries, meaning developing economies bear disproportionate costs from carbon border adjustments. Multilateral coordination partially mitigates this distributional concern through income transfers implied by the symmetric global agreement.&lt;/p&gt;
&lt;h3 id="q9-how-do-general-equilibrium-labor-market-effects-alter-the-conclusions"&gt;Q9. How do general equilibrium labor market effects alter the conclusions?&lt;/h3&gt;
&lt;p&gt;General equilibrium labor market effects reduce the welfare gains by approximately 20% relative to the baseline estimates, but do not change the qualitative ranking of policies (unilateral carbon tariff better than no border adjustment; multilateral coordination better than unilateral). This suggests that the core policy conclusions are robust to incorporating labor market general equilibrium effects, even if the precise magnitudes are somewhat smaller.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Carbon Leakage.&lt;/strong&gt; In this paper, carbon leakage refers specifically to the shift in production and emissions to countries without domestic carbon pricing that occurs when one country implements a carbon price. It is the mechanism by which domestic carbon pricing is partially offset, motivating the use of trade policy as a complementary instrument.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Carbon Leakage Correction (tau^carbon).&lt;/strong&gt; The component of the optimal import tariff that is distinct from the standard terms-of-trade tariff. It equals emission intensity × (social cost of carbon − domestic carbon price in exporter) / import price. It corrects for the fact that imports from countries with insufficient carbon pricing embody unpriced carbon externalities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Terms-of-Trade Tariff (tau^ToT).&lt;/strong&gt; The standard optimal import tariff arising from a large country&amp;rsquo;s ability to manipulate its terms of trade. Equal to the inverse of the export supply elasticity of the trading partner. The paper establishes that carbon tariffs add to — rather than replace — this classical component.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sufficient Statistic for Optimal Carbon Tariff.&lt;/strong&gt; A formula expressing the optimal carbon tariff as a function of observable trade elasticities and emission intensities, without requiring estimation of unobservable structural parameters beyond those standard to the trade model. The term is used in the paper&amp;rsquo;s specific sense of an empirically implementable formula that is exact within the model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Emission Intensity.&lt;/strong&gt; Sector-specific carbon emissions per unit of output in a given country, denoted e_js for country j and sector s. Used as the key observable that scales the carbon leakage correction component of the optimal tariff.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Multilateral Coordination.&lt;/strong&gt; Modeled as a symmetric global carbon pricing agreement in which all countries simultaneously adopt optimal carbon pricing. In the paper&amp;rsquo;s framework, this eliminates the strategic motive for unilateral carbon trade wars and achieves additional welfare gains and leakage reductions beyond what any single country can achieve unilaterally.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Carbon Border Adjustment Mechanism (CBAM).&lt;/strong&gt; The EU policy instrument that imposes a carbon price on imports from sectors covered by the EU Emissions Trading System, evaluated in the paper against the theoretically optimal carbon tariff. The paper distinguishes between the direct-emissions-only CBAM as implemented (capturing 60% of optimal) and a hypothetical full CBAM including indirect supply-chain emissions (capturing 85% of optimal).&lt;/p&gt;</description></item><item><title>Cap‐and‐Trade and Carbon Tax Meet Arrow–Debreu</title><link>https://macropaperwarehouse.com/papers/capandtrade-and-carbon-tax-meet-arrowdebreu/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/capandtrade-and-carbon-tax-meet-arrowdebreu/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Anderson and Duanmu (2025) ask how general equilibrium (GE) interactions — factor reallocation across sectors, capital misallocation under climate uncertainty, and the distributional incidence of damages — alter the social cost of carbon (SCC) relative to the partial equilibrium (PE) estimates embedded in standard integrated assessment models (IAMs). The paper also characterizes conditions for Pareto improvements through climate policy and derives the optimal carbon tax in second-best environments with pre-existing distortions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Framework&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors build a dynamic Arrow-Debreu economy with L goods, K capital stocks (including climate stocks), and T periods. The climate module specifies that the carbon stock evolves as S_{t+1} = S_t + sum_j e_j(q_j) − alpha·S_t, and climate damage functions D_j(S_t) = 1 − d_j·(S_t − S_0) reduce sector-specific production possibilities sets. Firms and households take the climate trajectory as given and do not internalize their own emissions&amp;rsquo; impact, generating the externality. Under standard regularity conditions, the authors prove existence of a competitive equilibrium and establish that it is inefficient: output is too high and climate-intensive sectors are too large relative to the social optimum.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;General Formula for the SCC&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper derives a general SCC formula — SCC_t = Sum_{tau &amp;gt;= t} beta^(tau−t) · [dW/dS_tau / (dW/dY_t)] — that decomposes into four components: (1) the standard direct productivity-loss term, (2) a GE factor-reallocation term capturing inefficient reallocation as damages shift relative prices, (3) a capital-misallocation term reflecting distortions in investment from climate uncertainty, and (4) a distribution term reflecting the welfare losses from the regressive incidence of climate damages. All three correction terms are positive under standard conditions, so the GE SCC exceeds the PE SCC. The paper shows that this formula nests existing IAM frameworks as special cases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Calibrating to three leading IAMs, the authors find that general equilibrium interactions raise the SCC by 15–40% above standard PE estimates:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;DICE-calibrated: GE correction of &lt;strong&gt;18%&lt;/strong&gt; above the PE estimate.&lt;/li&gt;
&lt;li&gt;FUND-calibrated: GE correction of &lt;strong&gt;15%&lt;/strong&gt; above the PE estimate.&lt;/li&gt;
&lt;li&gt;PAGE-calibrated: GE correction of &lt;strong&gt;40%&lt;/strong&gt; above the PE estimate, the largest correction owing to greater sector heterogeneity in that model.&lt;/li&gt;
&lt;li&gt;Median calibration: a PE SCC of &lt;strong&gt;$51/tCO₂&lt;/strong&gt; rises to a GE SCC of &lt;strong&gt;$62/tCO₂&lt;/strong&gt;.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Decomposing the aggregate GE correction: factor reallocation across sectors accounts for &lt;strong&gt;55%&lt;/strong&gt;, capital misallocation due to climate uncertainty for &lt;strong&gt;30%&lt;/strong&gt;, and the distributional regressivity of damages for &lt;strong&gt;15%&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Second-Best Policy and Uncertainty&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In environments with pre-existing distortions, the optimal carbon tax deviates from the SCC: revenue recycling through labor tax cuts generates additional welfare gains of &lt;strong&gt;10–15%&lt;/strong&gt; of carbon tax revenue; undertaxed capital implies the optimal carbon tax should be set above the SCC (double dividend); and in monopolistically competitive sectors the optimal carbon tax is below the SCC because the carbon tax amplifies monopoly distortions. Under climate uncertainty, the SCC carries a risk premium proportional to the variance of damage estimates times the coefficient of relative risk aversion, estimated at &lt;strong&gt;+$8–15/tCO₂&lt;/strong&gt; (15–25% of the base SCC).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The quantitative corrections are calibrated to DICE, FUND, and PAGE and therefore inherit those models&amp;rsquo; parameterizations of damage functions and discount rates. The GE factor-reallocation and capital-misallocation channels are larger when sectors are more heterogeneous in damage exposure — as is explicit in the PAGE result. Second-best corrections depend on the sign and magnitude of pre-existing distortions (labor taxes, capital taxes, market structure).&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-inefficiency-result-and-what-does-it-imply-about-the-competitive-equilibrium"&gt;Q1. What is the core inefficiency result, and what does it imply about the competitive equilibrium?&lt;/h3&gt;
&lt;p&gt;The paper&amp;rsquo;s efficiency theorem establishes that the competitive equilibrium is Pareto inefficient because firms and households take the climate trajectory as given and do not internalize the impact of their own emissions on the carbon stock. As a consequence, output is too high and climate-intensive sectors are too large relative to the social optimum. This externality is the fundamental justification for climate policy in the model.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-papers-general-scc-formula-extend-existing-approaches-and-what-are-the-novel-terms"&gt;Q2. How does the paper&amp;rsquo;s general SCC formula extend existing approaches, and what are the novel terms?&lt;/h3&gt;
&lt;p&gt;The general formula SCC_t = Sum_{tau &amp;gt;= t} beta^(tau−t) · [dW/dS_tau / (dW/dY_t)] nests standard IAM SCC formulas as special cases. The novel terms relative to partial equilibrium are: (i) a GE reallocation term capturing losses from inefficient factor reallocation as climate damages change relative prices across sectors; (ii) a capital-misallocation term capturing distortions in investment arising from climate uncertainty; and (iii) a distribution term capturing welfare losses from the regressive incidence of damages. All three terms are positive under standard conditions, implying GE SCC &amp;gt; PE SCC in all calibrations.&lt;/p&gt;
&lt;h3 id="q3-how-are-the-quantitative-ge-corrections-decomposed-and-which-channel-dominates"&gt;Q3. How are the quantitative GE corrections decomposed, and which channel dominates?&lt;/h3&gt;
&lt;p&gt;Of the total GE correction above the PE baseline, factor reallocation across sectors contributes 55%, capital misallocation due to climate uncertainty contributes 30%, and the distributional regressivity of damages contributes 15%. Factor reallocation is the dominant channel because, as climate damages alter relative prices, production shifts toward less-damaged sectors in ways that are distorted by the original carbon externality — generating second-order losses absent from PE damage functions.&lt;/p&gt;
&lt;h3 id="q4-why-does-the-page-calibration-produce-a-larger-ge-correction-40-than-dice-18-or-fund-15"&gt;Q4. Why does the PAGE calibration produce a larger GE correction (40%) than DICE (18%) or FUND (15%)?&lt;/h3&gt;
&lt;p&gt;The paper attributes PAGE&amp;rsquo;s larger GE correction to greater sector heterogeneity in that model&amp;rsquo;s parameterization. When damage exposure is more heterogeneous across sectors, the relative-price effects of marginal carbon are larger, amplifying the factor-reallocation channel. DICE and FUND, with more uniform sector-level damage structures, exhibit smaller reallocation corrections.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-median-calibration-implication-for-the-scc-in-dollar-terms"&gt;Q5. What is the median-calibration implication for the SCC in dollar terms?&lt;/h3&gt;
&lt;p&gt;In the median calibration, a PE SCC of $51/tCO₂ rises to a GE SCC of $62/tCO₂, an increase of roughly $11/tCO₂ or approximately 22%. This figure is directly computable from observable trade elasticities and sector-level damage estimates.&lt;/p&gt;
&lt;h3 id="q6-how-should-the-carbon-tax-be-adjusted-when-pre-existing-labor-market-distortions-are-present-and-what-is-the-magnitude-of-the-welfare-gain-from-revenue-recycling"&gt;Q6. How should the carbon tax be adjusted when pre-existing labor market distortions are present, and what is the magnitude of the welfare gain from revenue recycling?&lt;/h3&gt;
&lt;p&gt;When labor taxes create a pre-existing wedge, using carbon tax revenue to reduce labor taxes generates additional welfare gains of 10–15% of total carbon tax revenue — the double dividend in the labor market dimension. The optimal carbon tax in this case includes the SCC plus a correction term for the labor-market distortion.&lt;/p&gt;
&lt;h3 id="q7-how-do-capital-market-distortions-alter-the-optimal-carbon-tax-relative-to-the-scc"&gt;Q7. How do capital market distortions alter the optimal carbon tax relative to the SCC?&lt;/h3&gt;
&lt;p&gt;If capital is undertaxed (a pre-existing distortion in capital markets), the optimal carbon tax is set above the SCC. The intuition is that a higher carbon tax partially offsets the under-taxation of capital by raising the effective cost of carbon-intensive investment, capturing a double-dividend in the capital market.&lt;/p&gt;
&lt;h3 id="q8-how-does-monopolistic-competition-modify-the-optimal-carbon-tax"&gt;Q8. How does monopolistic competition modify the optimal carbon tax?&lt;/h3&gt;
&lt;p&gt;For monopolistically competitive sectors, the optimal carbon tax is below the SCC. The reasoning is that applying a carbon tax to these sectors amplifies existing monopoly markups and associated distortions, so the social cost of the carbon tax exceeds the raw SCC in those sectors. The optimal policy trades off carbon correction against monopoly amplification.&lt;/p&gt;
&lt;h3 id="q9-what-is-the-risk-premium-in-the-scc-under-climate-uncertainty-and-how-is-it-estimated"&gt;Q9. What is the risk premium in the SCC under climate uncertainty, and how is it estimated?&lt;/h3&gt;
&lt;p&gt;The paper adds a term to the SCC proportional to the variance of damage estimates times the coefficient of relative risk aversion. Using empirical estimates of damage uncertainty, this risk premium is estimated at +$8–15/tCO₂, representing 15–25% of the base SCC. This term is absent from deterministic SCC calculations and constitutes a further reason standard PE estimates understate the true social cost.&lt;/p&gt;
&lt;h3 id="q10-what-is-the-papers-claim-regarding-computability-of-the-ge-correction"&gt;Q10. What is the paper&amp;rsquo;s claim regarding computability of the GE correction?&lt;/h3&gt;
&lt;p&gt;The paper states that the novel GE terms are computable from observable trade elasticities and sector-level damage estimates, implying the GE correction is not merely a theoretical construct but can be implemented in quantitative policy analysis using data sources already available to researchers and policymakers.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Social Cost of Carbon (General Equilibrium Formula)&lt;/strong&gt;
Defined in the paper as SCC_t = Sum_{tau &amp;gt;= t} beta^(tau−t) · [dW/dS_tau / (dW/dY_t)], the present discounted value of the marginal welfare loss from an additional unit of carbon, expressed relative to the marginal utility of current output. The paper&amp;rsquo;s version adds GE reallocation, capital-misallocation, and distributional terms absent from standard PE formulations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;GE Adjustment Factor&lt;/strong&gt;
The ratio of the general equilibrium SCC to the partial equilibrium SCC, expressed as GE/PE = 1 + phi_realloc + phi_capital + phi_distribution. Under standard conditions all three phi terms are positive, so the GE SCC strictly exceeds the PE SCC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Climate Damage Function (Sector-Specific)&lt;/strong&gt;
Specified as D_j(S_t) = 1 − d_j·(S_t − S_0), a sector-specific multiplicative reduction in the production possibilities set as the carbon stock rises above the pre-industrial level S_0. Heterogeneity in d_j across sectors is the driver of the factor-reallocation GE correction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Carbon Stock Evolution&lt;/strong&gt;
S_{t+1} = S_t + sum_j e_j(q_j) − alpha·S_t, where alpha is the natural decay rate of atmospheric carbon and e_j(q_j) is sectoral emissions as a function of output. Firms and households treat S_t as exogenous, generating the externality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Double Dividend&lt;/strong&gt;
In second-best environments, a carbon tax can generate two welfare gains simultaneously: correcting the carbon externality and reducing the deadweight loss from a pre-existing distortion (labor or capital tax). The paper finds revenue recycling via labor tax cuts yields 10–15% of carbon tax revenue as additional welfare gain; undertaxed capital implies the optimal carbon tax is set above the SCC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Risk Premium in the SCC&lt;/strong&gt;
An additive term in the SCC under climate uncertainty, proportional to the variance of damage estimates times the coefficient of relative risk aversion. Empirically estimated at +$8–15/tCO₂, representing 15–25% of the base SCC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Second-Best Optimal Carbon Tax&lt;/strong&gt;
Written as tau*_carbon = SCC + CORRECTION, where the correction depends on the sign and magnitude of pre-existing distortions. The correction is positive under undertaxed capital (raise above SCC), negative under monopolistic competition (lower below SCC), and augmented by revenue-recycling gains when labor taxes are present.&lt;/p&gt;</description></item><item><title>Cash or card? A structural model of payment choices</title><link>https://macropaperwarehouse.com/papers/cash-or-card-a-structural-model-of-payment-choices/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/cash-or-card-a-structural-model-of-payment-choices/</guid><description>&lt;p&gt;Lippi and Moracci (2026) ask how euro area households choose between cash and card payments, and whether existing theoretical models can explain observed behavior. They draw on ECB payment diary surveys (SUCH and SPACE waves I–III, 2015–2024) covering transaction-level records that include purchase size, payment method chosen, cash on hand before each transaction, and merchant acceptance of cards. This granular data allows the authors to isolate unforced payment choices — transactions in which the consumer had sufficient cash, the merchant accepted cards, and the consumer held a card — from mechanically constrained ones.&lt;/p&gt;
&lt;p&gt;The authors document three empirical patterns. First, roughly 39% of individuals in the sample violate the simple transaction-size threshold rule of Whitesell (1989): their largest unforced cash payment exceeds their smallest unforced card payment. Second, between 27% and 49% of unforced transactions are settled by card across survey waves, contradicting the &amp;ldquo;cash burns&amp;rdquo; policy of Alvarez and Lippi (2017) under which cards are used only when cash is exhausted. Third, and most novel, the probability of card use rises sharply as implied residual cash holdings (m′ = m − s) approach zero — that is, when a cash payment would nearly deplete the wallet. This suggests a precautionary motive: consumers maintain a cash buffer to cover purchases at merchants who do not accept cards.&lt;/p&gt;
&lt;p&gt;To rationalize these facts, the authors build an inventory-theoretic model with a compound Poisson expenditure flow (random arrival times and random transaction sizes drawn from a lognormal distribution), imperfect card acceptance (fraction ϕ of merchants accept cards, set at 0.89 for 2023–24), a fixed cost b per cash withdrawal, a fixed cost κ per card transaction (sign unrestricted), and a utility penalty u per missed purchase. The optimal policy takes an (s,S) form for withdrawals and a state-dependent threshold for payment choice. When 0 &amp;lt; κ &amp;lt; b, the agent uses cards for purchases large enough that paying cash would push balances below a threshold m̃, thereby avoiding a costly withdrawal or the risk of missing a future purchase. The critical transaction size above which cards are used, s(m), rises with cash on hand, generating the interaction the data reveals.&lt;/p&gt;
&lt;p&gt;The model is calibrated by minimum distance to four moments from the 2023–24 SPACE wave: average cash balances relative to daily expenditure, annual withdrawal frequency, the unforced card expenditure share, and realized purchase frequency. The estimated annual cost of managing consumption transactions for the average euro area household is approximately 15 euros — a remarkably small burden. Three counterfactual experiments quantify welfare implications. Removing card access raises the annual cost from 15 to about 50 euros, implying a card ownership value of roughly 35 euros per year. Near-universal card acceptance (ϕ = 0.99) reduces the annual cost by nearly 75%, from 15 to about 4 euros, while average cash holdings fall from 130% to about 20% of daily expenditure. A complete ban on cash would cost the average consumer approximately 60 euros per year more than the current mixed system. A cashless equilibrium requires both near-universal acceptance (ϕ above 99%) and card costs at or below zero (κ ≤ 0); neither condition alone is sufficient given the estimated magnitude of the missed-purchase cost u.&lt;/p&gt;
&lt;p&gt;Q: What is the central empirical puzzle the paper addresses?
A: Existing models predict either a pure transaction-size threshold (Whitesell 1989) or a pure cash-burns rule (Alvarez and Lippi 2017). The data shows both rules are violated: 39% of individuals with observed unforced transactions of both types violate the threshold rule, and 27–49% of unforced transactions are paid by card despite available cash. Neither model alone accounts for the novel finding that card usage spikes precisely when a cash payment would nearly exhaust the wallet.&lt;/p&gt;
&lt;p&gt;Q: What data does the paper use and what is its key advantage?
A: The authors use ECB payment diaries from four survey waves: SUCH (2015–16) and SPACE I, II, III (2019, 2021–22, 2023–24). For each transaction the diary records payment method, purchase size, and cash on hand, along with merchant acceptance of each payment method. Critically, the combined information on cash holdings and acceptance allows the authors to distinguish forced from unforced payment choices, which is essential for identifying the behavioral determinants of payment method selection.&lt;/p&gt;
&lt;p&gt;Q: What is the novel empirical fact the paper contributes?
A: The paper documents that the probability of card use increases sharply as implied residual cash (m′ = m − s) approaches zero. This pattern holds across all survey waves. It is consistent with a precautionary motive: consumers use cards to avoid depleting a cash buffer that provides insurance for encounters with merchants who do not accept cards.&lt;/p&gt;
&lt;p&gt;Q: How does the theoretical model generate the precautionary motive for cash?
A: Cards are accepted in only fraction ϕ of stores; when a merchant does not accept cards and the consumer lacks cash, the purchase is missed at utility cost u. This creates an incentive to maintain positive cash balances. Combined with a fixed withdrawal cost b and a fixed card cost κ, the agent optimally targets a cash level m* and withdraws before the wallet empties (trigger m̄ &amp;gt; 0), holding a buffer against card-rejection events.&lt;/p&gt;
&lt;p&gt;Q: What is the key proposition characterizing the optimal payment policy?
A: Proposition 1 establishes three regimes. When κ ≤ 0, the card always dominates and is used for all purchases. When κ ≥ b, cash always dominates and cards are used only for forced transactions. In the intermediate case 0 &amp;lt; κ &amp;lt; b, a threshold m̃ ∈ (m̄, m*) divides behavior: for m &amp;lt; m̃ the agent uses cash for all transactions; for m ≥ m̃ the agent uses a card for any purchase exceeding a size threshold s(m), where s(m) is increasing in m. The threshold s(m) distinguishes this policy from Whitesell (1989)&amp;rsquo;s fixed threshold.&lt;/p&gt;
&lt;p&gt;Q: How does the payment threshold s(m) vary with cash on hand, and why?
A: s(m) is the purchase size above which the value loss from paying cash — pushing the agent closer to m̄ and raising the probability of a missed purchase or costly withdrawal — exceeds the fixed card cost κ. As m rises, a larger cash payment is needed to trigger this concern, so s(m) increases. This means card use becomes less frequent as cash balances grow for most of the state space, consistent with the empirical finding that cash probability rises with cash on hand.&lt;/p&gt;
&lt;p&gt;Q: What are the calibrated parameter values and what do they imply?
A: The withdrawal cost b is estimated at 0.003 EUR — very small. The per-transaction card cost κ is about 60% of b, meaning cards are cheaper to use per transaction than visiting an ATM. The cost of a missed purchase u is approximately 1 EUR. The arrival rate λ is calibrated so that about 2% of purchase opportunities are missed under the estimated card acceptance rate of 0.89. These values imply that the payment system imposes a small but non-trivial welfare burden, concentrated in the precautionary costs of maintaining cash.&lt;/p&gt;
&lt;p&gt;Q: What is the estimated annual cost of managing consumption transactions?
A: Under the optimal policy for 2023–24 parameters, the annual cost C is approximately 15 euros per household. This decomposes into opportunity costs of holding cash (RM), withdrawal costs (bn), card usage costs, and the disutility from missed purchases. The authors characterize this as &amp;ldquo;remarkably small,&amp;rdquo; suggesting the current payment system is relatively efficient from the household&amp;rsquo;s perspective.&lt;/p&gt;
&lt;p&gt;Q: How does this cost compare across demographic groups and over time?
A: Until 2019 the estimated annual cost was around 20 euros; it stabilized around 15 euros from 2021–22 onward, with the decline driven primarily by households holding less cash in the post-pandemic period. Across age groups, education levels, income brackets, and gender, each subgroup faces a very similar cost as a proportion of their expenditure, indicating limited distributional variation in payment system costs.&lt;/p&gt;
&lt;p&gt;Q: What is the welfare value of owning a payment card?
A: Setting ϕ = 0 (cash-only economy), the annual cost rises from 15 to approximately 50 euros. The value of card ownership is therefore approximately 35 euros per year. The savings come primarily from lower opportunity costs of holding cash (since card access reduces the precautionary motive) and lower disutility from missed purchases; withdrawal cost reductions play a negligible role.&lt;/p&gt;
&lt;p&gt;Q: What happens under near-universal card acceptance (ϕ = 0.99)?
A: Average cash holdings fall from about 130% of daily expenditure to about 20% of daily expenditure, a reduction of approximately 110 percentage points. The unconditional card expenditure share rises by 17 percentage points to about 93%, mostly through an increase in forced card transactions (agents more often lack cash). Unforced card expenditure falls by about 10 percentage points because the precautionary motive for using cards — preserving a cash buffer — weakens when acceptance is near-universal. The annual management cost falls by nearly 75%, from 15 to approximately 4 euros.&lt;/p&gt;
&lt;p&gt;Q: Under what conditions does a cashless economy emerge?
A: The model identifies two jointly necessary conditions: card acceptance near universal (ϕ above 99%) and card costs at or below zero (κ ≤ 0). Raising ϕ alone from the estimated 0.89 to 0.99 reduces cash use substantially but does not eliminate it, because the estimated cost of missed purchases u is large enough that consumers still maintain a small cash buffer. For κ ≤ 0, cash holdings M/e are insensitive to κ and depend only on ϕ. With current card usage costs, even near-universal acceptance would not produce a cashless economy.&lt;/p&gt;
&lt;p&gt;Q: What is the cost of a complete cash ban?
A: Under a cashless policy, the annual cost is approximately 75 euros — about 5 times the 15-euro baseline and about 25 euros more than the cash-only cost of 50 euros. A complete ban on cash would increase transaction management costs by approximately 60 euros per year for the average consumer. This is because at ϕ = 0.89, nearly 11% of purchase encounters would result in missed transactions.&lt;/p&gt;
&lt;p&gt;Q: How does card acceptance affect cash management in the model and data?
A: As ϕ falls, the precautionary motive for holding cash strengthens: the withdrawal trigger m̄ rises, average cash holdings increase, and withdrawals occur when the wallet is still substantially full. This prediction is qualitatively consistent with the empirical finding that in areas with lower card acceptance, individuals hold higher cash balances and withdraw at higher residual cash levels.&lt;/p&gt;
&lt;p&gt;Q: What are the main limitations the authors acknowledge?
A: Three caveats are identified. First, the model has no exogenous cash inflows (wage payments, gifts); incorporating Miller-Orr-style inflows could affect cash resilience estimates. Second, the card cost κ is fixed and independent of transaction size s; allowing κ(s) = κ₀ + κₛ·s would better capture reward-program economies relevant for the US. Third, merchant card acceptance is treated as exogenous; endogenizing it as a game between merchants would allow a joint welfare evaluation of acceptance decisions, payment choices, and cash management.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;Unforced transactions: Transactions in which both cash and card payments are feasible — specifically, cash holdings exceed the purchase size, the merchant accepts cards, and the consumer holds a card. Isolating unforced transactions is necessary to identify behavioral determinants of payment choice, stripping out mechanical constraints imposed by cash insufficiency or merchant non-acceptance.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Precautionary cash buffer: A positive cash balance maintained above the withdrawal trigger (m̄ &amp;gt; 0) to insure against purchases at merchants who do not accept cards. In the model, this buffer arises because card non-acceptance combined with insufficient cash results in a missed purchase at utility cost u; the precautionary motive is stronger when ϕ is lower.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Transaction-size threshold s(m): The purchase size above which a consumer with cash holdings m optimally pays by card (when cards are available and 0 &amp;lt; κ &amp;lt; b). Unlike the fixed threshold of Whitesell (1989), s(m) is increasing in m, generating a novel interaction between cash on hand and payment method choice that the ECB diary data confirms.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Cash burns policy: The policy of Alvarez and Lippi (2017) in which cards are used only when cash is fully exhausted (m = 0). The paper documents that 27–49% of unforced transactions are settled by card across survey waves, constituting a systematic violation of this rule that the model resolves by introducing transaction-size heterogeneity and a precautionary motive.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Imperfect card acceptance (ϕ): The exogenous fraction of merchants willing to accept card payments, set at 0.89 for 2023–24 in the calibration. Imperfect acceptance is the primary driver of the precautionary demand for cash; it also determines the frequency of missed purchases under a cashless policy and is the key parameter governing whether a cashless economy can emerge.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Annual transaction management cost (C): The total yearly household cost of operating within the payment system, defined as C = RM + bn + κ·(number of card purchases) + u·(number of missed purchases). Estimated at approximately 15 euros for the average euro area household in 2023–24, decomposed across opportunity costs of cash holdings, withdrawal costs, card usage costs, and missed-purchase disutility.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Ss withdrawal policy: The optimal cash replenishment rule characterized by a trigger level m̄ and a target level m*. The agent withdraws whenever cash falls to m̄, resetting balances to m*. A strictly positive trigger (m̄ &amp;gt; 0) reflects the precautionary motive: the agent refills before cash is exhausted in order to maintain insurance against card non-acceptance events.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;</description></item><item><title>Central bank reputation with noise</title><link>https://macropaperwarehouse.com/papers/central-bank-reputation-with-noise/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/central-bank-reputation-with-noise/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; How does noise in the mapping from central bank actions to realized inflation affect the existence and character of reputational equilibria in monetary policy? Specifically, can a central bank that faces uncertainty about whether it is perceived as &amp;ldquo;hawkish&amp;rdquo; or &amp;ldquo;dovish&amp;rdquo; sustain a pure strategy separating equilibrium, and how should each type behave as a function of its current reputation?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and Methodology.&lt;/strong&gt; Amador and Phelan build on the monopolistic-competition, cash-in-advance framework of Chari, Christiano, and Eichenbaum (1998) and extend it to allow for (i) two central bank types — hawkish (type 1, high penalty γ₁ for inflationary actions) and dovish (type 2, lower penalty γ₂ &amp;lt; γ₁) — whose identity is private information; (ii) type switching governed by a Markov process, with probability δ that a hawkish bank is replaced by a dovish one and probability ε that a dovish bank is replaced by a hawkish one; and (iii) noise between the central bank&amp;rsquo;s chosen action μᵢ and realized money growth μₐ, which is drawn from a density f(μₐ|μᵢ) with full support. The equilibrium concept is pure symmetric Markov perfect equilibrium, in which all strategies are functions only of the public Bayesian posterior ρ that the current central bank is hawkish. The paper proceeds analytically to characterize no-pooling results and then computationally to demonstrate existence of separating equilibria.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;No pooling equilibria exist (analytical).&lt;/strong&gt; Propositions 2 and 3 establish that no pure symmetric Markov equilibrium can have both types choosing the same positive action for any reputation ρ, as long as γ₁ ≠ γ₂ and Assumption 1 (pricing distortion sufficiently severe) holds. The intuition: if both types pool, realized inflation is uninformative, reputation does not change, and there are no dynamic incentives — but different static incentives (γ₁ ≠ γ₂) then imply different optimal actions, a contradiction.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Without sufficient noise, separating equilibria also fail to exist.&lt;/strong&gt; In the no-noise limit, Bayesian updating forces the dovish bank&amp;rsquo;s reputation to jump to its maximum after one period of mimicking the hawkish action, making mimicry cheap when the discount factor β is high or the type-persistence probability ε is low. This makes the incentive-compatibility constraint for the dovish bank very difficult to satisfy, potentially precluding existence of a separating equilibrium.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;With sufficient noise, pure strategy separating equilibria exist and have appealing properties (computational).&lt;/strong&gt; The benchmark parameterization sets α = 1, σ = 5, β = 0.99, h(μ) = 0.5μ², ε = δ = 0.02, and the noise distribution such that the hawkish type&amp;rsquo;s unconstrained target would deliver mean inflation of 2% and the dovish type&amp;rsquo;s 3%. Under these parameters:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;In the full-information (known-type) world: price P = 1.313 for the hawkish type and P = 1.338 for the dovish type, with E[log(c) − αc] = −1.0297 and −1.0320 respectively, versus the efficient benchmark of −1.&lt;/li&gt;
&lt;li&gt;In the reputational equilibrium, both types choose lower inflationary actions than they would absent reputation considerations — because reputation is valuable (higher ρ lowers household prices and thus improves welfare for both types).&lt;/li&gt;
&lt;li&gt;Both types&amp;rsquo; optimal actions are U-shaped in reputation ρ: they are most restrained — choosing the lowest inflationary actions — when ρ is middling (interior), because Bayesian updating is most sensitive (and thus the reputation cost of inflating is greatest) at interior beliefs, while it is difficult to move extreme beliefs.&lt;/li&gt;
&lt;li&gt;Average equilibrium inflation is 2.1%, which lies below the weighted average of unconstrained type targets (2.5% given equal switching probabilities), demonstrating that reputation concerns compress inflation outcomes.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Ergodic distribution of reputation remains interior.&lt;/strong&gt; Starting from ρ = 0.5, expected reputation conditional on being hawkish stays below 0.63 and conditional on being dovish stays above 0.38, reflecting that noise and type switching prevent reputation from collapsing to its extremes.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Welfare implications.&lt;/strong&gt; The hawkish type is made worse off by ongoing household uncertainty (relative to the reference game in which type is immediately revealed), while the dovish type is made better off. Households are better off under continuing uncertainty than under immediate revelation, unless reputation is near its maximum — because uncertainty suppresses inflationary temptations for both types.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; Results apply within a monopolistic-competition, cash-in-advance economy with discrete time, infinite horizon, and Markov strategies. The no-pooling result requires Assumption 1 (the pricing distortion is sufficiently severe that the central bank has a positive incentive to inflate from μ = 0). The no-noise existence failure is an informal argument holding fixed discount and type-switching parameters. Computational results are specific to the benchmark parameterization but are verified to be robust to variation in β, σ, γ₁, γ₂, ε, and δ.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-fundamental-time-inconsistency-problem-in-the-underlying-chari-et-al-1998-economy-and-how-does-the-paper-extend-it"&gt;Q1. What is the fundamental time-inconsistency problem in the underlying Chari et al. (1998) economy, and how does the paper extend it?&lt;/h3&gt;
&lt;p&gt;A1: In the Chari et al. (1998) monopolistic-competition cash-in-advance economy, households exploit market power when setting prices, and the cash-in-advance constraint depresses consumption efficiency; this creates an ex-post temptation for the central bank to inflate and partially offset these distortions, even though in equilibrium such inflation is anticipated and only worsens inefficiencies. Equilibrium consumption equals (1/α) × ((σ−1)/σ) × (β/(1+μ)), compounding a monopoly distortion (σ−1)/σ &amp;lt; 1 and a cash-in-advance distortion β/(1+μ) &amp;lt; 1 below the efficient level 1/α. Amador and Phelan add household uncertainty about the central bank&amp;rsquo;s type — captured by the Bayesian posterior ρ that the bank is hawkish — allowing reputation to be endogenously determined and to feed back into equilibrium pricing.&lt;/p&gt;
&lt;h3 id="q2-why-does-reputation-matter-only-through-differences-in-inflation-costs-γᵢ-and-not-through-differences-in-effective-discount-factors-alone"&gt;Q2. Why does reputation matter only through differences in inflation costs γᵢ and not through differences in effective discount factors alone?&lt;/h3&gt;
&lt;p&gt;A2: Proposition 1 establishes that if γ₁ = γ₂ (equal inflation penalties), then even if the two types have different effective discount factors β₁ = β(1−δ) ≠ β₂ = β(1−ε), there exists a pooling Markov equilibrium in which both types choose the same action μ* and reputation plays no role. When both types have identical static incentives, they will always choose the same action given that reputation doesn&amp;rsquo;t affect payoffs in such an equilibrium. Hence the relevant dimension of heterogeneity for reputation to matter is the inflation cost parameter γᵢ, not patience.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-formal-argument-that-no-pooling-equilibrium-can-exist-when-γ--γ"&gt;Q3. What is the formal argument that no pooling equilibrium can exist when γ₁ ≠ γ₂?&lt;/h3&gt;
&lt;p&gt;A3: Propositions 2 and 3 provide the formal argument. If both types pool at any reputation ρ with a common positive action μ, Bayesian updating implies that ρ⁺ is independent of the money growth realization μₐ. The first-order condition for type i then reduces to the static condition (∂E[log(c) − αc|μ]/∂μ) = γᵢh&amp;rsquo;(μ), which cannot hold simultaneously for types 1 and 2 since γ₁ ≠ γ₂ and h&amp;rsquo;(μ) &amp;gt; 0 for μ &amp;gt; 0. This logic rules out pooling at the stationary reputation ρ* = ε/(δ+ε) in Proposition 2 and at any reputation where μ &amp;gt; 0 in Proposition 3.&lt;/p&gt;
&lt;h3 id="q4-why-does-noise-facilitate-the-existence-of-separating-equilibria"&gt;Q4. Why does noise facilitate the existence of separating equilibria?&lt;/h3&gt;
&lt;p&gt;A4: Without noise, if types separate, observing the hawkish action reveals the bank is hawkish with certainty, pushing reputation to its maximum (1−δ) in a single period. This makes mimicry extremely cheap for the dovish type when β₂ is large or ε is small: the incentive compatibility condition requires that the dovish type&amp;rsquo;s static gain from choosing its own action exceeds the value gain from jumping to the best possible reputation, which is a very stringent requirement. With noise, mimicry generates only a probabilistic shift in beliefs rather than a discrete jump to the extreme, so the dovish type must maintain the hawkish action repeatedly to achieve a reputational gain — making mimicry costly enough that the incentive compatibility condition can be satisfied.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-reference-game-and-what-analytical-purpose-does-it-serve"&gt;Q5. What is the &amp;ldquo;reference game&amp;rdquo; and what analytical purpose does it serve?&lt;/h3&gt;
&lt;p&gt;A5: The reference game is a variant in which the central bank&amp;rsquo;s type is fixed and is revealed to households immediately after they set prices at date t = 0. From t = 1 onward, the game reduces to the full-information, single-type game of Section 4. This allows the authors to isolate the &amp;ldquo;direct&amp;rdquo; effect of reputation — the fact that expected type affects equilibrium prices today — from the &amp;ldquo;indirect&amp;rdquo; or strategic effect of the central bank actively managing its reputation. In the numerical example, the reference-game prices form the upper dashed line in Figure 1, while the actual game&amp;rsquo;s prices form the lower solid line, with the gap between them attributable to the central bank&amp;rsquo;s incentive to restrain inflation in order to protect reputation.&lt;/p&gt;
&lt;h3 id="q6-what-are-the-equilibrium-price-and-welfare-levels-in-the-benchmark-numerical-example-and-how-do-they-compare-to-efficient-and-full-information-benchmarks"&gt;Q6. What are the equilibrium price and welfare levels in the benchmark numerical example, and how do they compare to efficient and full-information benchmarks?&lt;/h3&gt;
&lt;p&gt;A6: The efficient benchmark delivers log(c) − αc = −1 with consumption c* = 1/α = 1. Under full information with only the hawkish type present, P = 1.313 and E[log(c) − αc] = −1.0297; under only the dovish type, P = 1.338 and E[log(c) − αc] = −1.0320. In the reputational equilibrium, prices lie below the full-information mixed benchmark for any given ρ (the solid line in Figure 1 lies below the dashed reference-game line), reflecting that the central banks&amp;rsquo; desire to maintain reputation leads both types to restrain inflation beyond what the direct price effect alone would induce.&lt;/p&gt;
&lt;h3 id="q7-how-does-the-u-shape-of-optimal-central-bank-actions-in-reputation-arise-and-what-does-it-imply-for-policy"&gt;Q7. How does the U-shape of optimal central bank actions in reputation arise, and what does it imply for policy?&lt;/h3&gt;
&lt;p&gt;A7: The U-shape arises because Bayesian updating is most powerful at interior beliefs: for extreme reputations (near ε or 1−δ), any given realization of money growth moves the posterior relatively little, so the reputational cost of inflating is small. For interior (middling) reputations, the same action shifts the posterior substantially, making reputation more sensitive to inflation choices and thus increasing the marginal cost of inflating. Both types therefore choose their minimum inflationary actions at middling reputations. The policy implication is that a hawkish central bank with a very low reputation (following a run of high realized inflation outcomes) should not dramatically tighten, because further contraction does relatively little for its reputation until nature delivers enough favorable realizations to move it to a more interior range.&lt;/p&gt;
&lt;h3 id="q8-what-happens-to-the-ergodic-distribution-of-reputation-and-inflation-and-what-does-this-imply-about-the-persistence-of-reputational-dynamics"&gt;Q8. What happens to the ergodic distribution of reputation and inflation, and what does this imply about the persistence of reputational dynamics?&lt;/h3&gt;
&lt;p&gt;A8: Starting from ρ = 0.5, expected reputation remains in the interior: above 0.38 for the dovish type and below 0.63 for the hawkish type. The ergodic distribution of ρ (Figure 5) concentrates at interior values rather than the poles, showing that noise and type switching prevent reputation from stabilizing at extremes. The ergodic inflation distribution (Figure 6) has an average of 2.1%, compared to 2% under an all-hawkish world and 3% under an all-dovish world. Because ε = δ (types are equally likely in the long run), the unconstrained-type-weighted average would be 2.5%, so reputational incentives reduce equilibrium average inflation by approximately 0.4 percentage points.&lt;/p&gt;
&lt;h3 id="q9-who-gains-and-who-loses-from-ongoing-type-uncertainty-relative-to-immediate-revelation"&gt;Q9. Who gains and who loses from ongoing type uncertainty relative to immediate revelation?&lt;/h3&gt;
&lt;p&gt;A9: The hawkish type&amp;rsquo;s value function (Figure 3a) lies below the reference-game dashed line for intermediate reputations, indicating that the hawkish type is made worse off by uncertainty — it must bear the cost of restraining inflation beyond what is statically optimal in order to signal its type, but the households partially &amp;ldquo;blame&amp;rdquo; it for high realized inflation regardless. The dovish type (Figure 3b) is made better off under continuing uncertainty because its reputation benefits from households&amp;rsquo; inability to perfectly distinguish types. Households (Figure 3c) are better off under uncertainty unless reputation is very high, because uncertainty suppresses inflation temptations for both types and keeps prices lower.&lt;/p&gt;
&lt;h3 id="q10-what-happens-to-equilibrium-behavior-under-robustness-checks-on-key-parameters"&gt;Q10. What happens to equilibrium behavior under robustness checks on key parameters?&lt;/h3&gt;
&lt;p&gt;A10: When the discount factor β or the elasticity of substitution σ decreases, both types inflate more and prices rise. When the hawkish type&amp;rsquo;s penalty γ₁ decreases (becomes less hawkish), both types inflate more and prices rise. When the dovish type&amp;rsquo;s penalty γ₂ decreases (becomes more dovish), the dovish type inflates more and, somewhat counterintuitively, the hawkish type inflates less, leaving prices roughly unchanged but slightly higher. When switching probabilities ε or δ increase, prices rise and both types inflate more, analogously to a decrease in β. Across all robustness exercises, the dovish type never inflates less than the hawkish type — consistent with Proposition 1&amp;rsquo;s implication that the inflation-cost difference γ₁ − γ₂ is the fundamental driver of separation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Hawkish type (type 1):&lt;/strong&gt; A central bank that receives a relatively large negative payoff γ₁h(μᵢ) for taking inflationary actions, where γ₁ &amp;gt; γ₂. In the paper&amp;rsquo;s own sense, this type is not behavioral — it optimizes fully and can choose any action — but has a strong intrinsic cost to inflation, making it prefer lower money growth rates ceteris paribus.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dovish type (type 2):&lt;/strong&gt; A central bank with a lower penalty parameter γ₂ &amp;lt; γ₁ for inflationary actions. Like the hawkish type, it is fully strategic and optimizing, differing only in the magnitude of its intrinsic inflation cost.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reputation (ρ):&lt;/strong&gt; The Bayesian posterior probability that households assign to the current central bank being the hawkish type. It is the single payoff-relevant state variable in the Markov equilibrium, evolving through Bayes&amp;rsquo; rule applied to realized money growth and type-switching probabilities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pure symmetric Markov perfect equilibrium:&lt;/strong&gt; An equilibrium in which all households set the same price and consume the same amount (symmetry), and all strategies — prices P(ρ), central bank actions μ₁(ρ) and μ₂(ρ), and household consumption c(μₐ, ρ) — depend on history only through the current reputation ρ (Markov). The paper focuses exclusively on pure (non-mixed) strategy equilibria.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pooling equilibrium:&lt;/strong&gt; An equilibrium in which both types choose the same action μ₁(ρ) = μ₂(ρ) at some reputation ρ. The paper proves analytically that no pooling equilibrium can exist when γ₁ ≠ γ₂ and the pricing distortion is sufficiently severe (Assumption 1).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Separating equilibrium:&lt;/strong&gt; An equilibrium in which μ₁(ρ) ≠ μ₂(ρ) for all ρ, so that realized money growth outcomes are informative about type and reputation evolves non-trivially. The paper argues that sufficient noise is necessary for such equilibria to exist.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effective discount factor (βᵢ):&lt;/strong&gt; The discount factor net of type-switching: β₁ = β(1−δ) for the hawkish type (which survives as hawkish with probability 1−δ) and β₂ = β(1−ε) for the dovish type. Central banks care only about payoffs while they are active, so effective discounting captures both time preference and expected tenure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Noise (disconnection between actions and outcomes):&lt;/strong&gt; The stochastic wedge between the central bank&amp;rsquo;s chosen action μᵢ and realized money growth μₐ, governed by a density f(μₐ|μᵢ) with full support. In the paper&amp;rsquo;s framework, noise is not merely a nuisance but a structural feature that makes reputational equilibria possible by preventing single-period complete revelation of type.&lt;/p&gt;</description></item><item><title>Changing Opportunity: Sociological Mechanisms Underlying Growing Class Gaps</title><link>https://macropaperwarehouse.com/papers/changing-opportunity-sociological-mechanisms-underlying-growing-class-gaps/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/changing-opportunity-sociological-mechanisms-underlying-growing-class-gaps/</guid><description>&lt;p&gt;This paper documents sharp divergent trends in intergenerational economic mobility by race and class in the United States across the 1978 to 1992 birth cohorts, and investigates the causal mechanisms driving those changes. The core empirical facts are two: between 1978 and 1992 birth cohorts, the earnings gap between white children from high-income versus low-income families grew by approximately 28–30% (the &amp;ldquo;white class gap&amp;rdquo;), while the earnings gap between white and Black children from low-income families shrank by approximately 27–30% (the &amp;ldquo;white-Black race gap&amp;rdquo;). These twin trends — growing class gaps and shrinking race gaps — appear consistently across earnings, employment rates, educational attainment, SAT/ACT scores, incarceration, marriage, and mortality, and they hold in nearly every region of the country.&lt;/p&gt;
&lt;p&gt;The data are drawn from de-identified federal income tax returns linked to decennial census records and the Numident database, covering 57 million children born between 1978 and 1992, with information on parental and child incomes, employment, marital status, mortality, and residential location, supplemented by ACS educational attainment and linked SAT/ACT records covering 24.8 million students. Children&amp;rsquo;s outcomes are measured primarily as household income percentile ranks at age 27.&lt;/p&gt;
&lt;p&gt;In dollar terms, the white class gap (mean income difference between children raised at the 25th vs. 75th parental income percentile) grew from $17,720 to $20,950 in real 2023 dollars, while the white-Black race gap for low-income families fell from $20,810 to $14,910. The intergenerational rank-rank slope for white children increased from 0.23 to 0.29. The racial gap in intergenerational persistence of poverty — the probability of a child born to the bottom income quintile remaining there — shrank from 14.7 percentage points to 4.1 percentage points (a 72% reduction), driven roughly equally by improvement in Black children&amp;rsquo;s chances of escaping poverty and deterioration in low-income white children&amp;rsquo;s chances. The white class gap in early-adulthood mortality more than doubled, while the white-Black race gap in mortality fell by 77%.&lt;/p&gt;
&lt;p&gt;The paper systematically rules out three alternative explanations. Observable family characteristics (parental education, wealth, occupation, and marital status) explain only 7% of the growing white class gap and none of the shrinking white-Black race gap. Neighborhood-level common shocks, tested by including childhood county or Census tract-by-cohort fixed effects, similarly explain only 7% of the class gap and none of the race gap. The divergent trends persist even among children raised in the same Census tract, pointing to forces that operate differentially across race and class groups within the same neighborhood.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central finding is that changes in children&amp;rsquo;s outcomes across cohorts are strongly and positively correlated (r = 0.91 across subgroups) with changes in parental employment rates within the child&amp;rsquo;s social community, defined as families sharing the same race, class, and childhood county. Low-income white communities experienced sharp relative declines in parental employment rates; low-income Black communities experienced relative improvements. These community-level parental employment changes account for nearly all of the divergent trends.&lt;/p&gt;
&lt;p&gt;To establish causation, the paper exploits variation in the age at which children move to counties with changing parental employment rates. Children who moved at younger ages (before age 8) to counties where parental employment was increasing experienced larger improvements in earnings than those who moved at older ages (after age 13), consistent with a causal exposure effect with greater impact for longer durations of exposure. Sibling comparisons — comparing outcomes of younger versus older siblings who moved together — confirm that the age gradient reflects causal exposure rather than family-level selection.&lt;/p&gt;
&lt;p&gt;The social interaction mechanism is supported by two sources of variation: children&amp;rsquo;s outcomes are more strongly related to parental employment rates of their own birth cohort than adjacent cohorts (cohort specificity unlikely to be explained by resources), and outcomes are primarily driven by the employment rates of same-race, same-class community members, with cross-racial influence appearing only in counties where cross-racial interaction is greater (counties with small Black population shares or higher interracial marriage rates). The unified explanation the paper proposes is that children&amp;rsquo;s outcomes mimic those of the adults in their social communities, following Borjas (1992).&lt;/p&gt;
&lt;p&gt;Q: What are the precise magnitudes of the growing white class gap and shrinking white-Black race gap in income percentile ranks?
A: The white class gap — the difference in mean household income ranks between white children raised at the 25th versus 75th parental income percentiles — increased from 11.1 to 14.1 percentile ranks between the 1978 and 1992 birth cohorts, a 28% increase. The white-Black race gap for children from low-income families fell from 14.9 to 10.9 percentile ranks, a 27% decrease. The intergenerational rank-rank slope for white children increased from 0.23 to 0.29 (a 28% rise in persistence).&lt;/p&gt;
&lt;p&gt;Q: How did the trends in poverty persistence versus upward mobility differ?
A: The convergence in white-Black outcomes was driven almost entirely by changes in poverty persistence rather than upward mobility. The racial gap in the probability of remaining in the bottom income quintile shrank from 14.7 percentage points to 4.1 percentage points (a 72% reduction), with roughly half from Black children being less likely to remain at the bottom and half from white children being more likely to remain. By contrast, the white-Black gap in the probability of rising from the bottom quintile to the top quintile fell by only 1.9 percentage points (17%).&lt;/p&gt;
&lt;p&gt;Q: How widespread geographically were the divergent trends?
A: Outcomes declined for low-income white families in nearly every county, but the largest declines occurred in historically high-mobility areas such as the Great Plains and the coasts. For low-income Black families, outcomes improved in most areas, with the largest gains in historically low-mobility regions including the Southeast and the industrial Midwest. The correlation between county-level changes for low-income white versus low-income Black children is a positive 0.58, meaning the areas where Black families improved most tended to be areas where white families declined least, not most.&lt;/p&gt;
&lt;p&gt;Q: Do the trends persist when using non-rank, inflation-adjusted dollar outcomes?
A: Yes. The white class gap in mean household income grew from $17,720 to $20,950 in real 2023 dollars, and the white-Black race gap for low-income families narrowed from $20,810 to $14,910. The paper also reports similar patterns for individual earnings (as opposed to household income), ruling out changes in household composition as a driver.&lt;/p&gt;
&lt;p&gt;Q: What do the pre-labor-market outcomes show?
A: The divergent trends emerge before children enter the labor market. The white class gap in educational attainment grew by 20%, driven by growing gaps in four-year college completion. The white-Black race gap in educational attainment disappeared by the 1992 cohort, driven by narrowing gaps in high school graduation. The white class gap in the share of students taking the SAT/ACT increased by 12.1 percentage points between the 1980 and 1991 birth cohorts, while the white-Black race gap in SAT/ACT-taking decreased by 20.3 percentage points. The white class gap in mean SAT/ACT scores grew by 62% between the 1980 and 1997 birth cohorts among test-takers.&lt;/p&gt;
&lt;p&gt;Q: How large is the mortality dimension of these trends?
A: The white class gap in early-adulthood mortality (ages 24–27) more than doubled between the 1978 and 1992 birth cohorts, while the white-Black race gap in early-adulthood mortality decreased by 77%. These non-monetary outcomes are invariant to inflation and income measurement choices, confirming the robustness of the broader trends.&lt;/p&gt;
&lt;p&gt;Q: How much do family-level characteristics explain?
A: Controlling jointly for parental education, wealth, occupation, and marital status reduces the estimated growth in the white class gap by only 7% (from 3.37 to 3.13 percentile ranks). The same controls do not explain the shrinking white-Black race gap — the estimated reduction in the race gap actually becomes slightly larger (4.56 rather than 4.16 percentiles) after controlling for family characteristics, indicating that observable family factors work against the observed convergence.&lt;/p&gt;
&lt;p&gt;Q: How much do neighborhood-level common shocks explain?
A: Including childhood county fixed effects interacted with birth cohort explains only 7% of the growing white class gap and none of the shrinking white-Black race gap. Including Census tract fixed effects yields essentially identical results. The divergent trends persist among children growing up in the same Census tract, ruling out explanations based on differential exposure to neighborhood-level economic shocks.&lt;/p&gt;
&lt;p&gt;Q: What is the community-level parental employment correlation, and what does it explain?
A: Changes in children&amp;rsquo;s earnings, SAT/ACT scores, and educational attainment across cohorts are strongly positively correlated with changes in parental employment rates within the child&amp;rsquo;s community (same race, same class, same county), controlling for the employment status of the child&amp;rsquo;s own parents. The correlation between changes in children&amp;rsquo;s outcomes and changes in community parental employment rates across all race and class subgroups is 0.91. This single community-level factor — as proxied by parental employment rates — accounts for nearly all of the divergent trends by race and class.&lt;/p&gt;
&lt;p&gt;Q: What is the quasi-experimental design for estimating causal effects, and what does it assume?
A: The paper compares outcomes of children who moved to counties with increasing parental employment rates at younger versus older ages, across earlier versus later birth cohorts. The identification assumption is &amp;ldquo;constant selection by age&amp;rdquo;: any selection of families into moving to a given county in years when parental employment is higher may differ across cohorts, but those selection differences must not themselves vary systematically with the age at which children move. The paper treats this as a &amp;ldquo;constant selection by age&amp;rdquo; assumption standard in the neighborhood effects literature.&lt;/p&gt;
&lt;p&gt;Q: What do the causal exposure results show?
A: Children who moved before age 8 to communities where parental employment was increasing show systematically higher earnings in later birth cohorts, while children who made the same move after age 13 show little difference in earnings across cohorts. This pattern — larger effects at younger ages — is consistent with a causal exposure effect of growing up in an improving community, with effects proportional to the duration of exposure.&lt;/p&gt;
&lt;p&gt;Q: How do sibling comparisons validate the identification assumption?
A: When siblings move together to a community with increasing parental employment rates, the younger sibling — who receives more years of exposure to the higher-employment environment — earns significantly more than the older sibling. The earnings difference is proportional to the age gap between siblings. This rules out explanations based on fixed unobserved family characteristics and supports the constant-selection-by-age assumption.&lt;/p&gt;
&lt;p&gt;Q: What evidence distinguishes social interaction mechanisms from economic resource mechanisms?
A: Two sources of variation are used. First, children&amp;rsquo;s outcomes are much more strongly related to the parental employment rates of peers in their own birth cohort than peers in adjacent cohorts — a cohort-specificity that is implausible for economic resource channels (school budgets, local tax bases) which would not vary sharply across adjacent cohorts. Second, outcomes of low-income white children are driven primarily by the employment rates of low-income white parents, not by low-income Black or high-income white parents&amp;rsquo; employment rates, and vice versa for low-income Black children — consistent with interaction patterns being stratified by race and class.&lt;/p&gt;
&lt;p&gt;Q: What role does cross-racial interaction play?
A: In counties where Black children constitute a small share of the population (making cross-racial interaction more likely), Black children&amp;rsquo;s outcomes are also related to low-income white parental employment rates. Similarly, in counties with higher interracial marriage rates (a proxy for cross-racial interaction), Black children&amp;rsquo;s outcomes are related to white parental employment rates even after controlling for racial composition. This cross-sectional variation supports the interpretation that the influence channel is social interaction rather than parallel economic shocks.&lt;/p&gt;
&lt;p&gt;Q: How do the findings for Hispanic, Asian, and AIAN children compare?
A: Changes in economic mobility for Hispanic, Asian, and AIAN children between 1978 and 1992 birth cohorts were much more modest than for white and Black children. For children from low-income families, mean household income ranks were essentially unchanged for Asian children and rose by only about 0.5 percentiles for Hispanic and AIAN children. However, the same community-level parental employment rate mechanism explains the (smaller) changes for these groups as well; the correlation between changes in children&amp;rsquo;s outcomes and changes in community parental employment rates is 0.91 across all subgroups.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s unified theoretical account of all the divergent trends?
A: The paper concludes that a parsimonious theory — that children&amp;rsquo;s outcomes mimic those of the parents in their social communities, following Borjas (1992) — explains the divergent trends by race and class. Because social interaction is stratified by race and class even within neighborhoods, changes in parental outcomes in the parent generation propagate differentially to white versus Black and high-income versus low-income children, producing growing class gaps and shrinking race gaps through the same underlying mechanism.&lt;/p&gt;
&lt;p&gt;Q: What does the paper imply about the malleability of economic mobility disparities?
A: Because the causal exposure effects of community environments on children&amp;rsquo;s outcomes can be detected within a 14-year span (1978 to 1992 birth cohorts), the paper implies that differences in economic mobility by race and class may be malleable in policy-relevant timeframes. This is despite the fact that long-standing disparities partly trace back to historical factors such as slavery, Jim Crow laws, redlining, and the Great Migration.&lt;/p&gt;
&lt;p&gt;White class gap: The difference in mean household income ranks in adulthood for white children born to families at the 25th versus 75th percentiles of the national parental income distribution; increased from 11.1 to 14.1 percentile ranks (28%) between the 1978 and 1992 birth cohorts.&lt;/p&gt;
&lt;p&gt;White-Black race gap: The difference in mean household income ranks in adulthood for white versus Black children born to families at the 25th percentile of the national parental income distribution; decreased from 14.9 to 10.9 percentile ranks (27%) between the 1978 and 1992 birth cohorts.&lt;/p&gt;
&lt;p&gt;Social community: In this paper&amp;rsquo;s usage, other families who share the same race, class category, and childhood county as a given child; the unit within which community-level parental employment rates are measured and found to be predictive of children&amp;rsquo;s outcomes.&lt;/p&gt;
&lt;p&gt;Causal exposure effect: The effect on a child&amp;rsquo;s adult outcomes of an additional year spent growing up in a community with higher parental employment rates, estimated quasi-experimentally by comparing children who moved to counties with changing parental employment rates at younger versus older ages; larger effects at younger ages imply a causal, duration-sensitive exposure channel.&lt;/p&gt;
&lt;p&gt;Constant selection by age: The identification assumption underlying the quasi-experimental design; requires that any systematic differences in the types of families who move to a county when parental employment is high versus low do not themselves vary with the age at which children move to that county.&lt;/p&gt;
&lt;p&gt;Intergenerational rank-rank slope: The OLS slope coefficient from regressing child income percentile rank on parental income percentile rank; for white children, increased from 0.23 in the 1978 birth cohort to 0.29 in the 1992 birth cohort, indicating greater persistence of economic status.&lt;/p&gt;
&lt;p&gt;Cohort-specificity of community effects: The empirical pattern that children&amp;rsquo;s outcomes are more strongly related to the parental employment rates of peers in their own birth cohort than those of adjacent cohorts, used in the paper as evidence favoring social interaction over economic resource channels as the mediating mechanism.&lt;/p&gt;</description></item><item><title>Civil War–Induced Displacement and Human Capital</title><link>https://macropaperwarehouse.com/papers/civil-warinduced-displacement-and-human-capital/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/civil-warinduced-displacement-and-human-capital/</guid><description>&lt;p&gt;This paper examines the impact of conflict-driven forced displacement on human capital accumulation using the Mozambican civil war (1977–1992) as the empirical setting. During this war, over four million civilians — roughly a third of the population — fled to rural areas, cities, neighboring countries, or UN-managed refugee camps. The study advances on prior work in three dimensions: it uses the full post-war population census (12 million individuals) rather than a small survey; it studies multiple displacement trajectories in a single framework; and it separately identifies place-based exposure effects from a general uprootedness effect.&lt;/p&gt;
&lt;p&gt;The primary data source is the 1997 Mozambican census, which records each individual&amp;rsquo;s place of birth, residence in 1992 (the war&amp;rsquo;s end), and residence in 1997. Key outcomes are educational attainment and sectoral employment (agricultural versus services). The authors supplement the census with digitized colonial road and school maps, georeferenced conflict events, and landmine contamination data.&lt;/p&gt;
&lt;p&gt;The main identification strategy compares approximately 135,000 siblings (from 45,000 families) separated during the war, using the sibling who stayed behind as a within-family counterfactual. This design controls for household-level characteristics including religious and ethnic background, aspirations, and exposure to violence.&lt;/p&gt;
&lt;p&gt;The key findings are as follows. First, rural-born IDPs displaced to cities have a 7.3 percentage point higher likelihood of attending primary school and 0.53 more years of schooling compared to their siblings who stayed behind — roughly one-third of the non-displaced mean. Rural-born IDPs displaced to other rural areas also show gains, with a 3 percentage point higher likelihood of attending school and 0.24 additional years, supporting the uprootedness hypothesis even for displacements that did not reach urban centers. Urban-born IDPs forcibly relocated to the countryside — primarily through FRELIMO&amp;rsquo;s villagization scheme — experienced 9 percentage point lower primary school attendance and approximately 0.5 fewer years of schooling relative to siblings who remained in cities.&lt;/p&gt;
&lt;p&gt;External displacement (to camps in Malawi or Zimbabwe) generated no significant schooling gains relative to staying siblings, despite UN-built schools in camps, likely because scarce employment opportunities reduced perceived returns to education.&lt;/p&gt;
&lt;p&gt;Second, the paper jointly estimates place-based and uprootedness effects in a single within-family framework. Place effects are statistically significant: displacement to a district one standard deviation more developed than one&amp;rsquo;s birthplace raises schooling likelihood by approximately 3 percentage points (OLS) to 5 percentage points (2SLS reduced form). Crucially, a residual uprootedness effect of approximately 2–4 percentage points persists even after controlling fully for destination-origin differences in development and conflict intensity. This uprootedness effect is quantitatively comparable to being displaced to a district one standard deviation more developed than one&amp;rsquo;s birthplace.&lt;/p&gt;
&lt;p&gt;Third, a primary survey of 208 Nampula residents conducted in early 2020 — three decades after the war — confirms lasting educational gains. IDPs displaced to Nampula have a 10 percentage point higher likelihood of completing primary school relative to their siblings who stayed in the countryside, and their educational attainment converged to levels of urban-born, never-displaced residents despite large urban-rural education gaps. However, IDPs report significantly lower social capital, civic participation, and community trust than urban-born respondents, and score significantly worse on mental health indicators, including depression, loneliness, and pessimism. These psychosocial costs persist three decades after the war&amp;rsquo;s end.&lt;/p&gt;
&lt;p&gt;The findings apply to a low-income, post-colonial African setting characterized by widespread illiteracy (over 60%) and subsistence agriculture (over 85% of employment) at the war&amp;rsquo;s close. The results are robust to alternative age restrictions, extended family comparisons, dropping the oldest sibling, same-sex sibling pairs, and narrowing the age gap between sibling pairs to as few as two years.&lt;/p&gt;
&lt;p&gt;Q: What is the core identification strategy and why is it preferred over cross-sectional estimates?
A: The authors compare siblings within the same household who experienced different displacement trajectories during the war. Because siblings share household-level characteristics — parental preferences for education, ethnic and religious background, wealth, and local conflict exposure — the within-family design controls for confounders that would bias cross-sectional estimates. The within-family estimates are systematically smaller than cross-sectional ones (e.g., 7.3 pps vs. 24–30 pps for rural-to-urban displacement in primary school attendance), confirming that sorting was present even in the unpredictable civil war setting.&lt;/p&gt;
&lt;p&gt;Q: What do the results show for rural-born IDPs displaced to urban centers?
A: Within the sibling-pair framework, rural-born IDPs displaced to cities and towns have a 7.3 percentage point higher likelihood of attending primary school and 0.53 more years of schooling compared to their siblings who stayed in rural birthplaces, against a non-displaced sibling mean of approximately 20% primary school access and one year of formal schooling. These IDPs also show a 4 percentage point higher likelihood of non-agricultural employment five years after the war&amp;rsquo;s end.&lt;/p&gt;
&lt;p&gt;Q: What do the results show for rural-born IDPs displaced to other rural areas?
A: Even displacement to a different rural district — not a city — generates modest but statistically significant gains: a 3 percentage point higher likelihood of attending school and 0.24 additional years of schooling relative to siblings staying in their birthplace rural district. The authors interpret this as evidence for the uprootedness hypothesis, since rural Mozambique at the time was among the most impoverished and insecure environments in the world, meaning destination quality alone cannot explain the gain.&lt;/p&gt;
&lt;p&gt;Q: What do the results show for externally displaced refugees?
A: Refugees displaced to camps and settlements in Malawi, Zimbabwe, Tanzania, Zambia, and Swaziland show schooling levels statistically similar to their siblings who remained in their rural birthplaces, despite UN-built primary schools in camps. The authors attribute the absence of gains to low perceived returns to education stemming from scarce employment opportunities at displacement destinations. Externally displaced individuals do show a 5 percentage point lower likelihood of agricultural employment relative to staying siblings.&lt;/p&gt;
&lt;p&gt;Q: What are the consequences of urban-to-rural forced displacement?
A: Urban-born individuals forcibly relocated to the countryside — primarily through FRELIMO&amp;rsquo;s villagization and food production programs — have approximately 9 percentage point lower likelihood of attending primary school and 0.5 fewer years of schooling compared to siblings who remained in urban areas. These results indicate that FRELIMO&amp;rsquo;s coercive relocation policies imposed material human capital costs on the displaced.&lt;/p&gt;
&lt;p&gt;Q: How are place-based and uprootedness effects separated empirically?
A: The authors construct principal component indices for destination-origin differences in regional development (aggregating population density, Portuguese-speaking share, offspring mortality, road density, colonial market density, and school density) and conflict intensity (conflict events per capita and landmine contamination per capita). They then include these continuous exposure measures alongside a binary displacement indicator in within-family regressions. The coefficient on the binary displacement indicator — conditional on destination-origin development and conflict differences — isolates the uprootedness effect for individuals displaced to districts with identical characteristics to their birthplace.&lt;/p&gt;
&lt;p&gt;Q: What are the magnitudes of the place-based and uprootedness effects?
A: Under OLS, displacement to a district one standard deviation more developed than one&amp;rsquo;s birthplace raises schooling likelihood by approximately 3 percentage points. The residual uprootedness effect — displacement per se, controlling for destination quality — raises schooling likelihood by approximately 2 percentage points. Under 2SLS (instrumenting destination-origin development differences with the development of districts within 100 km of birthplace), the place-based effect rises to approximately 5 percentage points in the reduced form, and the uprootedness effect remains significant at approximately 4 percentage points. Both the uprootedness and place-based effects are of comparable magnitude.&lt;/p&gt;
&lt;p&gt;Q: What instrument is used in the 2SLS specifications and what is its first-stage strength?
A: The instrument exploits the fact that Mozambique&amp;rsquo;s heavily mined and rudimentary transportation network constrained civilian movement — the median displaced sibling ended up roughly 97 kilometers from birthplace. The authors instrument actual destination-origin development and conflict differences with the predicted differences based on the characteristics of districts within 100 km of the birthplace. The first-stage elasticity between actual and proximity-predicted differences in development is 0.86, and for conflict is 0.88, both precisely estimated.&lt;/p&gt;
&lt;p&gt;Q: What do the long-run survey results from Nampula show about educational persistence?
A: In a 2020 survey of 208 Nampula residents aged over 35, IDPs who fled to Nampula during the war have a 10 percentage point higher likelihood of completing primary school relative to their siblings who stayed in the countryside. Their educational attainment converges to the level of urban-born, never-displaced Nampula residents, despite large historical and contemporary urban-rural education gaps in northern Mozambique. The majority of IDPs (73%) report that extended relatives or friends advised them to attend school upon arriving in the city, and most believed education was necessary for urban employment.&lt;/p&gt;
&lt;p&gt;Q: What are the long-run psychosocial costs documented in the Nampula survey?
A: Even three decades after the war&amp;rsquo;s end, IDPs in Nampula report significantly lower social capital, civic participation, and community trust compared to urban-born never-displaced residents. IDPs also score significantly worse on mental health indicators including depression, loneliness, and pessimism. These findings suggest that forced displacement imposes persistent psychosocial costs that are not remediated by economic or educational convergence.&lt;/p&gt;
&lt;p&gt;Q: What drives displacement in the data, and does selection threaten identification?
A: Linear probability and multinomial logit models show that conflict intensity and geographic proximity (distance to the border for external displacement; distance to cities for urban displacement) are the primary correlates of displacement type, while differences in destination development are uncorrelated with displacement. Nevertheless, the overall explanatory power of these models is low, confirming many idiosyncratic and unpredictable features of the war. The within-family design addresses residual selection on household characteristics, and the 2SLS design addresses selection on destination-specific characteristics.&lt;/p&gt;
&lt;p&gt;Q: How do educational gains translate into sectoral employment outcomes?
A: Across specifications, gains in schooling move in tandem with a shift out of agriculture into services. Rural-to-urban IDPs have a 4 percentage point higher likelihood of non-agricultural employment five years after the war, while externally displaced show a 5 percentage point lower likelihood of agricultural employment. Urban-born IDPs displaced to the countryside are more likely to work in agriculture after the war. The authors interpret this co-movement as suggesting that conflict-driven human capital accumulation may contribute to structural transformation away from subsistence agriculture.&lt;/p&gt;
&lt;p&gt;Q: How robust are the within-family estimates?
A: The authors conduct six sensitivity checks: adding family fixed effects to cross-sectional regressions, restricting to individuals aged 12–18 in 1997 to address co-habitation concerns, extending comparisons to cousins and other relatives, dropping the oldest male sibling to minimize favoritism concerns, restricting to same-sex sibling pairs, and narrowing the age gap to two years. Across all permutations, the qualitative ordering is preserved: refugees show no significant schooling gains, rural-to-urban IDPs show gains of 5–6 percentage points in primary attendance and 0.35–0.5 extra years, rural-to-rural IDPs show small positive gains, and urban-to-rural IDPs show losses.&lt;/p&gt;
&lt;p&gt;Uprootedness hypothesis: The idea, traced in the paper to Stigler and Becker (1977) and earlier scholars, that forced displacement incentivizes human capital investment precisely because education is a mobile asset that cannot be expropriated — distinct from place-based effects of destination quality.&lt;/p&gt;
&lt;p&gt;Place-based (exposure) effects: The impact on human capital outcomes attributable to differences between the development level and conflict intensity of the displacement destination and the individual&amp;rsquo;s birthplace, measured as destination-origin differences in a principal component index of regional development.&lt;/p&gt;
&lt;p&gt;Separated siblings design: An identification strategy that compares siblings from the same household who experienced different displacement trajectories during the war, holding constant all household-level characteristics including parental preferences, ethnicity, religion, wealth, and local conflict exposure.&lt;/p&gt;
&lt;p&gt;Internal displacement (IDP): Conflict-driven movement within national borders to either rural areas or urban centers, constituting approximately 60% of global forced displacement and the majority of displacement in the Mozambican civil war context.&lt;/p&gt;
&lt;p&gt;Source text origin: A categorization of the working paper text used for summarization — distinguishing full PDF or HTML text from abstract-only text. Abstract-only text is a hard block for summary generation in the pipeline.&lt;/p&gt;
&lt;p&gt;Structural transformation: In this paper&amp;rsquo;s usage, the shift of workers out of subsistence agriculture into services associated with human capital accumulation triggered by conflict-driven displacement, treated as a potential mechanism of post-conflict recovery.&lt;/p&gt;
&lt;p&gt;Psychosocial costs of displacement: Long-run deficits in social capital, civic engagement, community trust, and mental health (depression, loneliness, pessimism) reported by IDPs three decades after displacement, persisting despite convergence in educational attainment and employment.&lt;/p&gt;</description></item><item><title>Climate change and the macroeconomics of bank capital regulation</title><link>https://macropaperwarehouse.com/papers/climate-change-and-the-macroeconomics-of-bank-capital-regulation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/climate-change-and-the-macroeconomics-of-bank-capital-regulation/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks two related questions about the intersection of climate policy and bank capital regulation. First, can differentiated bank capital requirements — imposing higher equity charges on loans to fossil energy firms — serve as a quantitatively meaningful climate policy instrument, in particular relative to carbon taxes? Second, how should optimal bank capital requirements respond to a carbon-tax-induced clean energy transition?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors build a quantitative multi-sector DSGE model with two layers of default: corporate default at the firm level and bank failure at the bank level. Three intermediate goods sectors are modeled — non-energy, fossil energy, and clean energy — linked via a nested CES final-good production structure. Banks collect deposits from households (who value deposits for liquidity services) and issue defaultable loans to all three sectors. Deposit insurance, combined with limited liability for bank owners, generates an inefficiently high bank risk-taking motive, creating a role for capital regulation. The Ramsey-optimal capital requirement balances the social benefit of liquid deposit provision to households against the social cost of bank failure.&lt;/p&gt;
&lt;p&gt;The model is calibrated to quarterly data, targeting a 0.7% annualized bank failure rate, a 2% annualized corporate default rate, a 30% loan recovery rate, a deposit spread of -100 basis points, and a baseline Ramsey-optimal equity requirement of 8% (consistent with Basel III). Sectoral parameters follow Bartocci, Notarpietro, and Pisani (2022) and Fried, Novan, and Peterman (2022): the energy-to-non-energy elasticity of substitution is 0.2, the clean-to-fossil energy elasticity is 3, and full abatement occurs at carbon taxes exceeding 125 $/tonne of carbon (ToC). The clean transition experiment imposes a linear carbon tax path from zero to 10 $/ToC over 40 quarters, announced as an unanticipated but fully credible shock.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Finding 1 — Fossil-penalizing capital requirements are quantitatively negligible as climate policy.&lt;/em&gt; Raising the capital requirement on fossil loans from the baseline 8% to 12% (a 150% risk-weight, consistent with current BB- treatment) reduces the fossil capital share within the energy sector by only 0.06 percentage points (from 80.00% to 79.94%) and cuts aggregate emissions by only 0.08%. A 1 $/ToC carbon tax, by contrast, achieves a 5.23% emission reduction while modestly reducing the fossil capital share to 79.80%. The difference arises because capital requirements affect only the size and financing cost of fossil firms, leaving abatement incentives unchanged; the loan-rate effect on fossil firms is small (loan rate rises from 124 bps to 128 bps), consistent with Kashyap, Stein, and Hanson (2010).&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Finding 2 — Sustainability-linked capital requirements remain insufficient.&lt;/em&gt; Conditioning the fossil capital requirement on firms&amp;rsquo; abatement effort (κ_f = 0.12 − η_t) induces an optimal abatement effort of 2.69% and an effective fossil requirement of approximately 9.5%. The implied emission reduction remains far below even a modest carbon tax: the authors state the induced emission reduction falls short by a factor of almost 100 relative to full abatement.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Finding 3 — Ramsey-optimal capital requirements decline monotonically along the transition (in the baseline real model).&lt;/em&gt; When a carbon tax gradually rises from zero to 10 $/ToC over 40 quarters, aggregate loan demand contracts permanently because clean, fossil, and non-energy goods are imperfect substitutes and the shock is recessionary for GDP. Banks reduce balance sheets, deposit supply falls, the deposit spread widens by approximately 8 basis points in the long run, and corporate default rates across all sectors rise by almost 0.1 percentage points from the baseline of 2.05% (in steady state). To counteract the deposit scarcity and associated firm risk-taking, the Ramsey-optimal capital requirement declines symmetrically and monotonically to a lower long-run level. Bank capital regulation cannot affect impact default rates because leverage decisions are made before the transition is announced.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Finding 4 — Nominal rigidities produce a temporary tightening before the long-run relaxation.&lt;/em&gt; When debt is denominated in nominal terms and Rotemberg price adjustment costs are added, the clean transition is inflationary in the short run (consistent with Ciccarelli and Marotta 2021). Inflation makes deposit financing more attractive, inducing firms to temporarily increase nominal loan issuance; real deposits rise briefly, the deposit spread narrows by around 2 basis points, and the optimal capital requirement tightens over the initial phase of the transition before converging to the same lenient long-run level as the baseline. The short-run tightening is followed by a permanent relaxation.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Finding 5 — Differentiated sector-specific capital requirements are only warranted when banks are not diversified across sectors.&lt;/em&gt; In the baseline, perfectly diversified banks face a symmetric aggregate loan demand contraction, so uniform adjustment suffices. When sector-specific banks are introduced (an extreme case meant to bound concentration effects), fossil banks experience a strong reduction in deposit supply while clean banks experience the opposite. The optimal response is temporarily tighter capital requirements for clean banks and relaxed requirements for fossil banks. In the long run, both converge to an aggregate risk-weight of approximately 99.85% relative to the baseline (a small but symmetric relaxation), very close to the diversified baseline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;All results are derived within a model calibrated to match broad financial-market and macroeconomic regularities rather than a specific country. Physical risk from climate change is abstracted away throughout. The carbon tax is set exogenously (not derived from a climate policy optimum). Firms cannot switch technologies, providing a conservative lower bound on the sectoral reallocation. Results are robust to halving the deposit demand elasticity parameter (γ_D = 0.6 versus 1.5 in the baseline) and to raising the energy/non-energy substitution elasticity to 3 from 0.2.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-trade-off-that-determines-the-optimal-level-of-bank-capital-requirements-in-this-model"&gt;Q1. What is the core trade-off that determines the optimal level of bank capital requirements in this model?&lt;/h3&gt;
&lt;p&gt;A: The optimal capital requirement balances two welfare-relevant effects of bank leverage. Tighter requirements reduce bank failure rates, limiting the resource losses (proportional to deposits under DIA management) and the inefficient risk-taking that deposit insurance induces. At the same time, tighter requirements force banks to reduce deposit-financed lending, shrinking the supply of liquid deposits that households value directly in utility. The Ramsey planner chooses the capital requirement that equates the marginal welfare benefit of lower bank failure against the marginal welfare cost of reduced deposit provision. In the baseline calibration this optimum is at 8%.&lt;/p&gt;
&lt;h3 id="q2-why-does-raising-capital-requirements-on-fossil-loans-have-such-a-small-effect-on-carbon-emissions"&gt;Q2. Why does raising capital requirements on fossil loans have such a small effect on carbon emissions?&lt;/h3&gt;
&lt;p&gt;A: Capital requirements affect the deposit-financing wedge for fossil loans — the share of loans that can be funded via cheap, deposit-financed sources — but they do not enter firms&amp;rsquo; first-order condition for abatement. Firms respond by modestly reducing leverage and investment (the loan rate for fossil energy firms rises from 124 bps to 128 bps), but the emission intensity of fossil production is unchanged. In equilibrium, the fossil capital share within the energy sector declines by only 0.06 percentage points (from 80.00% to 79.94%), reducing total emissions by 0.08%. A 1 $/ToC carbon tax produces a 5.23% emission reduction, many times larger, because carbon taxes directly alter the return to abatement and the profitability of fossil relative to clean production.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-sustainability-linked-capital-requirement-work-and-why-is-it-still-insufficient"&gt;Q3. How does the sustainability-linked capital requirement work and why is it still insufficient?&lt;/h3&gt;
&lt;p&gt;A: Under sustainability-linked capital requirements, the fossil loan charge is set as κ_f = κ̃ − η_t, so firms that abate more face lower capital requirements on their loans and thus lower financing costs. This creates a direct financial incentive for abatement that the simple penalizing factor lacks. With κ̃ = 0.12, the equilibrium abatement effort is 2.69% and the effective fossil requirement falls to approximately 9.5%. Despite this improvement relative to the plain fossil factor, the climate impact remains far smaller than even a modest carbon tax: the induced emission reduction falls short by a factor of almost 100 relative to full abatement. The fundamental limitation is that the feedback from abatement to financing cost is attenuated by deposit-financing wedge mechanics, making the instrument too weak to substitute for direct carbon pricing.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-impact-short-run-and-long-run-effects-of-the-clean-transition-on-default-rates-and-bank-failure"&gt;Q4. What are the impact, short-run, and long-run effects of the clean transition on default rates and bank failure?&lt;/h3&gt;
&lt;p&gt;A: On impact, the unexpected compliance cost increase raises fossil firms&amp;rsquo; default threshold, causing a sharp but short-lived uptick in fossil firm default rates (from 2.05% to approximately 2.08% in the baseline transition) and a brief increase in bank failure. Clean firm defaults fall slightly on impact due to higher clean energy prices. In the short run, clean firms increase risk-taking (higher leverage) because the relative attractiveness of debt financing improves as deposit spreads widen; fossil firms deleverage. In the long run, aggregate corporate default rates rise by almost 0.1 percentage points from the baseline of 2.05% (equivalently 2.7% in the Appendix B long-run analysis), driven by the widening of the deposit spread (approximately 8 bps), which raises the deposit financing wedge for all firms. Bank failure rates are always tied to binding capital requirements and revert quickly to their steady-state level.&lt;/p&gt;
&lt;h3 id="q5-why-can-bank-capital-regulation-not-mitigate-the-impact-default-spike-when-the-transition-is-announced"&gt;Q5. Why can bank capital regulation not mitigate the impact default spike when the transition is announced?&lt;/h3&gt;
&lt;p&gt;A: At the moment of announcement, leverage decisions for the current period have already been made. The bank capital requirement binds on new lending decisions but cannot alter the existing capital structure of banks or firms. Therefore the regulator faces a &amp;ldquo;bygone&amp;rdquo; on impact: changing the capital requirement in the announcement period does not affect current corporate default rates or bank failure rates. The regulator&amp;rsquo;s tool only becomes effective for lending decisions going forward, implying that the transition-induced impact default surge cannot be smoothed by macroprudential policy.&lt;/p&gt;
&lt;h3 id="q6-why-do-ramsey-optimal-capital-requirements-decline-along-the-transition-rather-than-tighten-to-address-higher-default-risk"&gt;Q6. Why do Ramsey-optimal capital requirements decline along the transition rather than tighten to address higher default risk?&lt;/h3&gt;
&lt;p&gt;A: The key channel is that aggregate loan demand contracts permanently as imperfect substitutability across sectors makes the carbon tax recessionary. Banks shrink their balance sheets, reducing deposit supply. The resulting deposit scarcity makes deposits more valuable to households (widening the spread), which also makes deposit financing cheaper for banks, partially offsetting the loan demand decline but at the cost of higher corporate leverage. The welfare loss from reduced liquidity provision and higher firm default rates dominates, so the planner relaxes capital requirements to stimulate deposit supply. The dominant effect is the large, permanent decline in credit demand, which makes it welfare-improving to allow banks to operate at lower capital ratios to rebuild deposit provision.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-role-of-the-deposit-financing-wedge-in-transmitting-carbon-tax-shocks-to-the-entire-corporate-sector"&gt;Q7. What is the role of the deposit financing wedge in transmitting carbon tax shocks to the entire corporate sector?&lt;/h3&gt;
&lt;p&gt;A: The deposit financing wedge (Ξ_t) reflects the benefit for banks of funding loans through deposits rather than equity, combining the liquidity premium households pay on deposits and the deposit insurance put (expected repayment is only 1 − F(μ_{t+1}) per unit of deposits issued). When aggregate loan demand falls due to carbon taxes, deposits become scarcer relative to their steady-state level, making the wedge larger. Through the loan pricing condition, all sectors — not just fossil — face more attractive deposit-financed debt, causing clean and non-energy firms to also increase their leverage and default risk along the transition. This is the mechanism through which a sector-specific shock has symmetric aggregate effects that shape optimal bank regulation.&lt;/p&gt;
&lt;h3 id="q8-how-do-nominal-rigidities-change-the-optimal-path-of-capital-requirements-along-the-clean-transition"&gt;Q8. How do nominal rigidities change the optimal path of capital requirements along the clean transition?&lt;/h3&gt;
&lt;p&gt;A: With Rotemberg price adjustment costs and nominally denominated debt, the clean transition is inflationary in the short run (consistent with empirical evidence in Ciccarelli and Marotta 2021). Inflation lowers the real value of outstanding nominal loan obligations, incentivizing firms across all sectors to temporarily increase nominal borrowing. Banks accommodate this demand by increasing deposit issuance, which briefly narrows the deposit spread by around 2 basis points. With deposit supply temporarily elevated, the regulator&amp;rsquo;s trade-off tilts toward reducing bank failure rather than stimulating deposit provision, so optimal capital requirements tighten during the inflationary phase before reverting to the lenient long-run path of the baseline model. The long-run level is unchanged.&lt;/p&gt;
&lt;h3 id="q9-under-what-conditions-are-sector-specific-capital-requirements-welfare-improving"&gt;Q9. Under what conditions are sector-specific capital requirements welfare-improving?&lt;/h3&gt;
&lt;p&gt;A: Sector-specific requirements are only welfare-improving when banks are not perfectly diversified across sectors, so that the transition has heterogeneous effects on sector-specific deposit supply and bank failure rates. In the baseline with perfectly diversified banks, the loan demand decline affects all banks uniformly, so a symmetric uniform adjustment is optimal. When sector-specific banks are introduced as an extreme case of carbon concentration, fossil banks experience a sharp reduction in deposit provision while clean banks see deposits temporarily increase. The planner responds by temporarily relaxing requirements for fossil banks and tightening them for clean banks. In the long run, both converge to approximately the same aggregate relaxation as the diversified baseline (aggregate risk-weight of 99.85%).&lt;/p&gt;
&lt;h3 id="q10-how-does-the-carbon-tax-shock-experiment-relate-to-the-perfect-foresight-transition-analysis"&gt;Q10. How does the carbon tax shock experiment relate to the perfect-foresight transition analysis?&lt;/h3&gt;
&lt;p&gt;A: In the carbon tax shock experiment, the tax level follows an AR(1) process with persistence ρ_τ = 0.9, starting from a long-run level of 10 $/ToC, with a one-standard-deviation shock implying an additional 10 $/ToC on impact. Fossil firm default rates spike from 2% to approximately 2.8% on impact and revert relatively quickly. Emissions decline by slightly more than 10% on impact and revert as the shock dissipates. The macroeconomic dynamics — GDP, investment, loan demand, and bank failure rate responses — closely resemble the impact and short-run effects of the perfect-foresight transition. Optimal capital requirements decline temporarily in both cases, confirming that the transition-path results are not an artifact of the specific perfect-foresight assumption.&lt;/p&gt;
&lt;h3 id="q11-what-is-the-forced-safety-effect-and-how-does-it-interact-with-the-models-capital-requirement-trade-off"&gt;Q11. What is the &amp;ldquo;forced safety effect&amp;rdquo; and how does it interact with the model&amp;rsquo;s capital requirement trade-off?&lt;/h3&gt;
&lt;p&gt;A: The &amp;ldquo;forced safety effect&amp;rdquo; (following Bahaj and Malherbe 2020) refers to the positive effect of tighter capital requirements on loan supply that operates through reducing bank failure probability. When banks are less likely to fail (lower F(μ_{t+1})), the expected bank productivity conditional on not failing — (1 − G(μ_{t+1})) — rises toward one, reducing the discount applied to future loan payoffs in the bank&amp;rsquo;s stochastic discount factor. This improves the profitability of lending and expands loan supply. In the model, this effect partially offsets the direct loan-supply reduction from higher equity requirements but does not dominate, so the overall effect of tighter requirements on deposit supply is still negative, preserving the core trade-off.&lt;/p&gt;
&lt;h3 id="q12-what-robustness-checks-are-performed-and-do-they-materially-change-the-main-results"&gt;Q12. What robustness checks are performed and do they materially change the main results?&lt;/h3&gt;
&lt;p&gt;A: The authors consider three main robustness checks. First, reducing the deposit demand elasticity parameter from γ_D = 1.5 to γ_D = 0.6 (recalibrating ω_D = 0.012 to preserve the -100 bp deposit spread target) has almost no effect on the optimal path of capital requirements. Second, raising the energy/non-energy substitution elasticity from ε̃ = 0.2 to ε̃ = 3 (and adjusting the energy weight to maintain a 10% energy share) produces much stronger fossil investment declines and smaller clean investment responses, but aggregate loan demand and bank deposits contract only slightly less, so the relaxation in capital requirements is slightly smaller than in the baseline. Third, recalibrating to a 2% annualized bank failure rate (versus the baseline 0.7%) does not materially change results. The conclusion that capital requirements should decline along the transition is robust across all specifications.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Deposit financing wedge (Ξ_t):&lt;/strong&gt; The gain for banks from funding loans via deposits rather than equity. It comprises two components: (i) the liquidity premium — households value deposits for their liquidity services, so the deposit rate lies below the risk-free rate; and (ii) the deposit insurance put — the expected repayment obligation per unit of deposits is only 1 − F(μ_{t+1}), not one, since the DIA covers depositors in the event of bank failure. A larger wedge makes deposit-financed lending more profitable, expanding loan supply. In this paper the wedge is the central transmission mechanism through which capital requirements and aggregate loan demand interact.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bank failure threshold (μ_t):&lt;/strong&gt; The realization of the bank-specific idiosyncratic risk shock below which a bank cannot service depositors and transfers all assets and liabilities to the deposit insurance agency. It depends on the ratio of deposit repayment obligations to the aggregate realized loan portfolio return. In the model the threshold increases when aggregate loan payoffs fall (as in a carbon tax shock), temporarily raising bank failure rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ramsey-optimal capital requirement:&lt;/strong&gt; The sequence of sector-specific (or uniform) capital ratios chosen by a benevolent government planner to maximize household welfare, treating the capital requirement as the sole policy instrument. In this model the Ramsey problem is solved nonlinearly along the perfect-foresight transition path. The planner internalizes that tighter requirements simultaneously reduce bank failure probability and shrink deposit supply; the optimum trades off these two objectives.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sustainability-linked capital requirement:&lt;/strong&gt; A capital requirement on fossil loans that explicitly depends on the abatement effort undertaken by fossil firms (κ_f = κ̃ − η_t), creating a direct financing-cost incentive for emission reduction. This contrasts with a plain fossil penalizing factor, which affects only the financing cost of fossil capital without altering abatement incentives. The paper shows that even sustainability-linked requirements are quantitatively negligible as climate policy relative to carbon taxes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Carbon compliance cost per unit of fossil production (ξ_t):&lt;/strong&gt; A summary statistic combining the direct carbon tax payment and the abatement cost at the optimal abatement effort. It measures the total policy-induced wedge that reduces the profitability of fossil capital and raises fossil firms&amp;rsquo; break-even default threshold. In the transition experiment, compliance costs rise from zero to approximately 4% of fossil production value as the tax increases from 0 to 10 $/ToC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Asset stranding channel:&lt;/strong&gt; The mechanism through which an unanticipated tightening of carbon policy raises fossil firms&amp;rsquo; default probability on impact (by increasing compliance costs above the level priced into existing loan contracts) and subsequently reduces their loan demand permanently. The paper contrasts its treatment of this channel — where stranding affects bank regulation through aggregate deposit supply effects — against models (such as Carattini, Melkadze, and Heutel 2023) where stranding causes an inefficient credit crunch via a financial accelerator.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Deposit spread (s^D_t):&lt;/strong&gt; Defined as the annualized difference between the deposit rate and the risk-free rate, expressed in basis points. Because households value deposits for liquidity services, the deposit rate lies permanently below the risk-free rate (spread is negative). In the baseline calibration the target is -100 bps. The spread widens (becomes less negative) when deposits become scarcer, which is the case along the carbon tax transition as bank balance sheets contract.&lt;/p&gt;</description></item><item><title>Comment on 'Asset Bubbles and Overlapping Generations' by Tirole</title><link>https://macropaperwarehouse.com/papers/comment-on-asset-bubbles-and-overlapping-generations-by-tirole/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/comment-on-asset-bubbles-and-overlapping-generations-by-tirole/</guid><description>&lt;p&gt;Tirole (1985) studied an overlapping generations model with capital accumulation and showed that the emergence of asset bubbles can resolve the capital over-accumulation problem when the economy is dynamically inefficient. His Proposition 1(c) claims that a bubble can emerge if and only if the dividend growth rate exceeds the bubbleless steady-state interest rate. This comment identifies an error in that proposition: the stated condition is necessary but not sufficient for bubble existence. The paper constructs an explicit counterexample in which the dividend growth rate exceeds the bubbleless interest rate but no bubble equilibrium exists, and separately constructs a case in which a bubble exists even when the condition in Proposition 1(c) fails. Corrected necessary and sufficient conditions for bubble existence are derived, and the implications of the correction for the welfare results and the relationship between dynamic inefficiency and bubbles are characterized.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-error-in-tiroles-proposition-1c"&gt;Q1. What is the error in Tirole&amp;rsquo;s Proposition 1(c)?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The error is in the sufficiency direction: Tirole argued that whenever the dividend growth rate exceeds the bubbleless interest rate, a bubble equilibrium exists; Pham and Toda construct a parameter configuration satisfying this condition where no bubble equilibrium exists, because the continuity argument used in Tirole&amp;rsquo;s proof fails at boundary parameter values.&lt;/strong&gt; The necessity direction — that bubble existence requires this rate comparison — is not challenged.&lt;/p&gt;
&lt;h3 id="q2-how-do-the-corrected-conditions-change-the-interpretation-of-dynamic-inefficiency"&gt;Q2. How do the corrected conditions change the interpretation of dynamic inefficiency?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Tirole&amp;rsquo;s original result linked bubbles tightly to dynamic inefficiency (r &amp;lt; g), providing a clean condition for when bubbles are both feasible and welfare-improving by absorbing excess saving. The correction weakens this link: bubble existence requires additional structural conditions beyond the rate comparison, meaning dynamic inefficiency is a necessary but not sufficient condition for bubbles in the Tirole framework.&lt;/strong&gt; Policy prescriptions based on the r &amp;lt; g condition for bubble welfare analysis need qualification.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;dynamic inefficiency&lt;/strong&gt; : the OLG condition in which the interest rate falls below the growth rate, making intergenerational transfers from young to old welfare-improving; related to but not sufficient for bubble existence under the corrected Tirole conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;bubble existence condition&lt;/strong&gt; : the necessary and sufficient conditions under which an asset bubble can emerge and persist in the Tirole OLG model; the corrected version requires more than the dividend-growth-rate-exceeds-interest-rate comparison of the original Proposition 1(c).&lt;/p&gt;</description></item><item><title>Comment on "Artificial Intelligence and Technological Unemployment" by Wang and Wong</title><link>https://macropaperwarehouse.com/papers/comment-on-artificial-intelligence-and-technological-unemployment-by-wang-and-wong/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/comment-on-artificial-intelligence-and-technological-unemployment-by-wang-and-wong/</guid><description>&lt;p&gt;This comment, written by J. Carter Braxton (University of Wisconsin), discusses the paper &amp;ldquo;Artificial Intelligence and Technological Unemployment&amp;rdquo; by Wang and Wong (2025), which develops and quantifies an equilibrium labor search model to evaluate the employment effects of spreading AI. Wang and Wong&amp;rsquo;s central finding is that improvements in AI quality will increase productivity by a factor of three while reducing employment by 23%, with approximately half of the employment decline occurring within the next five years. Braxton&amp;rsquo;s comment serves two purposes: first, to clarify the model&amp;rsquo;s structural channels through which AI affects employment; and second, to bring empirical evidence from the spread of computers in the 1980s–2000s to bear on the relative magnitude of those channels.&lt;/p&gt;
&lt;p&gt;Braxton identifies two competing forces within Wang and Wong&amp;rsquo;s framework. The &lt;strong&gt;job destruction channel&lt;/strong&gt; arises from endogenous separations: as AI quality improves, firms increasingly replace matched workers with AI, raising outflows from employment. The &lt;strong&gt;job creation channel&lt;/strong&gt; arises from the free-entry condition: rising AI quality increases firm profits on all matches, inducing firms to post more vacancies, which raises workers&amp;rsquo; job-finding rates and employment inflows. Whether aggregate employment rises or falls depends on which channel dominates — a quantitative question the authors resolve through calibration, finding the job destruction channel dominant. Braxton notes that three modeling choices (learning-by-using, the requirement that firms must be matched with a worker to adopt AI, and disembodied technological change) each push &lt;em&gt;against&lt;/em&gt; the job-destruction result, making the authors&amp;rsquo; findings more striking.&lt;/p&gt;
&lt;p&gt;Braxton then evaluates the relative strength of these channels using the historical spread of personal computers. Drawing on Bick, Blandin, and Deming (2024), he notes that workplace AI adoption in 2024 follows nearly the same time trend and income-distribution profile as computer adoption in 1984, making computers a plausible historical analog. Using the CPS Computer Supplement (1984–2003), Braxton measures the change in computer usage by occupation and regresses it against the change in employment-to-unemployment (EU) transition rates by occupation. The estimated coefficient is 0.0146 (robust SE 0.0064), indicating that occupations with higher computer adoption rates saw higher flows into unemployment — confirming that a job destruction channel was active during the computer era. However, regressing the change in log occupation-level employment (1980–2000 Census) on the change in computer usage yields a coefficient of 0.7761 (robust SE 0.2658), with a positive slope indicating that occupations more exposed to computers saw &lt;em&gt;higher&lt;/em&gt; employment growth. For the computer episode, therefore, the job creation channel dominated the job destruction channel — the opposite of Wang and Wong&amp;rsquo;s AI projection.&lt;/p&gt;
&lt;p&gt;Braxton also cites his own prior work showing that even when job creation and destruction balance in aggregate, workers displaced by technological change face lasting earnings losses and elevated permanent income risk, raising the question of how to optimally insure these workers.&lt;/p&gt;
&lt;p&gt;The comment concludes by identifying avenues for future research: introducing occupational heterogeneity (with some occupations more exposed to AI than others) and worker heterogeneity (skills that are complements versus substitutes to AI). The central open question is whether AI is qualitatively different from prior episodes of technological change, and if so, why.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q1: What are the two central channels through which AI quality affects employment in Wang and Wong&amp;rsquo;s model, and how do they operate?&lt;/strong&gt;
The job destruction channel operates through endogenous separations: as AI quality (At) improves, firms that are matched with workers are more likely to replace them with AI at rate ρ, adding the term ρµAt Ht It to outflows from employment in the law of motion for employment. The job creation channel operates through the free-entry condition: higher AI quality raises firm profits on all existing matches (because technological change is disembodied, benefiting matches formed today with future AI gains), inducing firms to post more vacancies, which via free entry reduces the firm&amp;rsquo;s matching probability but raises the worker&amp;rsquo;s job-finding rate αt and thereby increases employment inflows. The net employment effect depends on which channel quantitatively dominates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is Wang and Wong&amp;rsquo;s quantitative finding about the aggregate employment and productivity effects of AI?&lt;/strong&gt;
Using a calibrated equilibrium labor search model, Wang and Wong find that the spread of AI will increase productivity by a factor of three while reducing employment by 23%. Approximately half of the employment decline is projected to occur within the next five years. A version of the model holding job-finding rates fixed yields a similar result, indicating that through the lens of their model the job creation channel is quantitatively small and the job destruction channel dominates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What three modeling choices push against Wang and Wong&amp;rsquo;s job-destruction result, and why does Braxton view this as making the finding more striking?&lt;/strong&gt;
First, AI improves through &amp;ldquo;learning by using&amp;rdquo; — it learns from all output being produced — which creates an incentive for employment to remain elevated to accelerate AI learning, dampening job destruction. Second, firms can only adopt AI if currently matched with a worker, which creates an incentive for vacancy posting and pushes in favor of job creation. Third, AI improvements are disembodied (raising productivity in all matches, including those formed before the improvement), which increases the value of forming new matches today and strengthens job creation. Because each of these assumptions pushes against the job destruction result, Braxton argues that finding job destruction dominant despite these model features makes the result more striking.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does Braxton use the historical spread of computers to assess the job destruction and job creation channels?&lt;/strong&gt;
Braxton measures occupation-level computer adoption as the change in the share of CPS Computer Supplement respondents who reported using a computer at work between 1984 and 2003 (denoted ΔCPUo,84–03), using occupation codes from Autor and Dorn (2013). He then regresses the occupation-level change in EU transition rates (ΔEUo,84–03, from monthly CPS micro data) on ΔCPUo,84–03 to measure the job destruction channel, and separately regresses the change in log occupation-level employment (Δlog Eo,80–00, from the 1980 and 2000 Census IPUMS) on ΔCPUo,84–03 to assess the net employment effect. A positive coefficient on the employment regression indicates job creation dominates; a negative coefficient indicates job destruction dominates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What do the regression results show about the job destruction and job creation channels during the computer era?&lt;/strong&gt;
The job destruction regression yields a coefficient of β = 0.0146 (robust SE = 0.0064, R² = 0.0178), indicating that occupations with higher computer adoption rates did see higher employment-to-unemployment transition rates — the job destruction channel was present. However, the employment-level regression yields a coefficient of β = 0.7761 (robust SE = 0.2658, R² = 0.0348), with a positive slope indicating that occupations more exposed to computers experienced &lt;em&gt;higher&lt;/em&gt; employment growth between 1980 and 2000. Thus, for the computer episode, the job creation channel dominated the job destruction channel — the opposite of what Wang and Wong project for AI.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the basis for treating the computer episode as a relevant analog to the spread of AI?&lt;/strong&gt;
Braxton cites Bick, Blandin, and Deming (2024), who show that AI adoption in the workplace in 2024 is following nearly the same aggregate time trend as the spread of personal computers in the early 1980s. Moreover, the distribution of AI usage across the income distribution in 2024 is nearly identical to computer usage across the income distribution in 1984: for both technologies, workplace usage peaks between the 80th and 90th percentiles of the income distribution before declining modestly at the top. Bick et al. (2024) also show the similarities hold by education level and age.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Even if job creation and destruction balance in aggregate, what does prior work suggest about the distributional consequences for workers?&lt;/strong&gt;
Braxton and Taska (2023) show that workers in occupations more exposed to technological change (measured by changes in computer and software task requirements) suffered larger earnings losses following displacement. Braxton, Herkenhoff, Rothbaum, and Schmidt (2024, forthcoming AER) show that workers in occupations more exposed to technological change experienced larger increases in permanent income risk between the 1980s and 2010s. These findings imply that even if AI does not reduce aggregate employment, workers who are displaced will face deteriorating labor market prospects, raising the question of how to optimally provide insurance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What policy implication does Braxton draw from the distributional consequences of technological change?&lt;/strong&gt;
Braxton and Taska (2025, forthcoming Review of Economic Dynamics) show that technological change expands the motive for governments to provide retraining subsidies. Braxton argues that if AI represents an acceleration of technological change, even larger retraining subsidies — and potentially other forms of insurance — may be needed for displaced workers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What are the main avenues for future research identified in the comment?&lt;/strong&gt;
Braxton identifies two principal directions. First, introducing occupational heterogeneity into the Wang-Wong framework, so that some occupations are more exposed to AI displacement than others, would allow the model to generate richer distributional implications. Second, allowing worker heterogeneity in skills — distinguishing skill dimensions that are complements to AI from those that are substitutes — would permit the model to capture differential effects across the workforce. The overarching research question is whether AI is qualitatively different from prior technological change episodes, and if so, to identify the precise mechanisms that make it different.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Job destruction channel&lt;/strong&gt;: In the Wang-Wong model, the increase in endogenous separations driven by firms replacing matched workers with AI as AI quality improves. Formally, this is the term ρµAt Ht It in the law of motion for employment, representing separations that occur when a firm adopts AI and the worker and firm cannot renegotiate a mutually acceptable wage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Job creation channel&lt;/strong&gt;: The increase in vacancy posting and worker job-finding rates induced by rising AI quality. Because higher AI quality raises firm profits on all matches (via disembodied technological change), the free-entry condition implies firms post more vacancies, lowering the firm&amp;rsquo;s matching probability but raising the worker&amp;rsquo;s job-finding rate αt, increasing employment inflows.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Free-entry condition&lt;/strong&gt;: The equilibrium condition equating the cost of posting a vacancy (κt) to the expected benefit (the probability of matching ft times the firm&amp;rsquo;s match surplus Πt). This condition pins down the job-finding rate for workers: when firms find it more profitable to post vacancies, αt rises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Disembodied technological change&lt;/strong&gt;: The modeling assumption that AI quality improvements raise productivity in all existing matches, not just those formed after the improvement. This means future AI gains benefit matches formed today, increasing the incentive to create new matches and pushing in favor of the job creation channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Learning by using&lt;/strong&gt;: The mechanism in Wang-Wong whereby AI quality (At) improves as a function of current aggregate employment (Ht) and the learning rate µ. Because AI learns from all output being produced, maintaining higher employment accelerates AI improvement, creating a motive that partially offsets the job destruction channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Employment-to-unemployment (EU) transition rate&lt;/strong&gt;: The rate at which employed workers flow into unemployment in a given occupation, used by Braxton as the empirical measure of the job destruction channel during the computer episode. Measured from monthly CPS micro data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Capitalization effect&lt;/strong&gt;: The tendency for firms to post more vacancies today in anticipation of future productivity improvements, because the cost of posting is paid upfront while the benefits of a future-better-AI accrue to the match going forward. Referenced by Braxton as relevant to understanding the job creation channel in Wang-Wong&amp;rsquo;s framework (citing Pissarides (2000), Chapter 3).&lt;/p&gt;</description></item><item><title>Comment on: Is it AI or data that drives market power?</title><link>https://macropaperwarehouse.com/papers/comment-on-is-it-ai-or-data-that-drives-market-power/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/comment-on-is-it-ai-or-data-that-drives-market-power/</guid><description>&lt;p&gt;This paper is a published comment by Miao Ben Zhang (USC Marshall School of Business) on Mihet, Rishabh, and Gomes (2025), &amp;ldquo;Is It AI or Data That Drives Market Power?&amp;rdquo; Zhang identifies three contributions of the commented paper and benchmarks each against the existing literature, offering targeted suggestions for strengthening the analysis.&lt;/p&gt;
&lt;p&gt;The first contribution Zhang discusses is the commented paper&amp;rsquo;s distinction between raw data, AI capability, and processed data. Raw data is modeled as a by-product of production linearly related to firm size; processed data is modeled as the abundance of signals improving the precision of firms&amp;rsquo; next-period productivity predictions. The commented paper&amp;rsquo;s key modeling innovation is a formula linking raw data (n_{i,t}), firm-level AI capability (z_i), and processed data (n_{i,t}-tilde): processed data equals a weighted sum of an information entropy effect — e^(-z_i) * (-n_{i,t} * ln(n_{i,t})) — and an AI capability effect — (1 - e^(-z_i)) * n_{i,t} * e^(n_{i,t}). Zhang notes this formula implies that the marginal value of raw data can turn negative for firms with low AI capability, consistent with information-theoretic constraints from the rational inattention literature (Sims, 2003). Zhang requests more empirical support for this equation, specifically asking whether low-AI firms exhibit lower TFP than high-AI firms at similar data-intensity levels, and encouraging discussion of existing measures of data-processing ability such as human capital in data engineering and ML pipeline automation.&lt;/p&gt;
&lt;p&gt;The second contribution is the commented paper&amp;rsquo;s modeling of a secondary market for trading processed data among firms. Zhang notes that facilitating processed data markets — for example via APIs or structured knowledge sharing — can, per the commented paper&amp;rsquo;s simulation and empirical analysis, democratize innovation and reduce market concentration, enabling even low-AI firms to compete. Zhang flags that the paper is silent on firm acquisition as an alternative channel for accessing processed data, arguing this omission is significant given that processed data, unlike ideas or technologies, is less portable and cannot be obtained simply by poaching skilled employees.&lt;/p&gt;
&lt;p&gt;The third contribution is the commented paper&amp;rsquo;s empirical strategy. The commented paper constructs firm-level proxies for AI intensity and data intensity, then exploits two exogenous technological shocks — the advent of AWS cloud computing and transformer-based architectures — to identify causal effects of improvements in compute and processed data accessibility. The evidence shows that compute improvements disproportionately benefit data-rich firms, while processed data access disproportionately benefits low-AI firms. The central empirical message is that access to raw data tends to foster market concentration, whereas access to processed data tends to reduce market concentration. Zhang raises a measurement concern: the commented paper relies on firm-level Herfindahl-Hirschman Index (HHI) calculations based on time-varying, text-based industry definitions (Hoberg and Phillips, 2016). Zhang argues a positive effect on this HHI could reflect either genuine firm growth relative to competitors or reclassification of the firm into different, possibly more concentrated, sectors — making the HHI measure alone insufficient to support claims about product market concentration. Zhang recommends complementing this with industry-level concentration measures anchored to fixed baseline industry codes (FIC codes from Hoberg and Phillips, 2016), constructed at the FIC-year level, following the approach of Gutierrez and Philippon (2017) on industries&amp;rsquo; growth and median Q.&lt;/p&gt;
&lt;p&gt;No quantitative magnitudes from regressions or calibrations are reported in the comment itself, as this is a discussion piece rather than an original empirical paper. All claims above are drawn directly from the text.&lt;/p&gt;
&lt;p&gt;Q: What are the three contributions of Mihet, Rishabh, and Gomes (2025) that Zhang identifies?
A: First, the paper explicitly models the distinct roles of raw data, AI capability, and processed data, linking the information entropy literature to firm production. Second, it models a secondary market for trading processed data among firms, relevant for policy on data sharing platforms. Third, it empirically tests the model&amp;rsquo;s predictions using firm-level proxies and two exogenous technological shocks.&lt;/p&gt;
&lt;p&gt;Q: What is the core formula linking raw data, AI capability, and processed data in the commented paper?
A: Processed data (n_{i,t}-tilde) equals e^(-z_i) * (-n_{i,t} * ln(n_{i,t})) plus (1 - e^(-z_i)) * n_{i,t} * e^(n_{i,t}), where z_i is firm-level AI capability and n_{i,t} is raw data. The first term captures the information entropy effect (which can reduce or negate the value of raw data for low-AI firms) and the second captures the AI capability effect (where AI turns raw data into abundant useful signals).&lt;/p&gt;
&lt;p&gt;Q: Why can the marginal value of raw data turn negative, according to the framework?
A: Information-theoretic constraints — long studied through concepts like Shannon entropy and Sims&amp;rsquo;s rational inattention — imply that unprocessed raw data may harm rather than help firms that lack adequate processing capabilities. Zhang situates this in the broader macro-finance literature on information choice (Sims, 2003; Veldkamp, 2011).&lt;/p&gt;
&lt;p&gt;Q: What empirical suggestion does Zhang make regarding the raw data versus AI capability distinction?
A: Zhang asks whether, in the commented paper&amp;rsquo;s sample of publicly-traded firms with measures of data intensity and AI intensity, low-AI firms exhibit lower TFP (following Imrohoroglu and Tuzel, 2012) than high-AI firms when controlling for similar levels of data intensity. Zhang also encourages discussion of anecdotal evidence for negative information entropy effects and of existing measures of data processing ability such as human capital in data engineering, annotation, cleaning, or ML pipeline automation (Abis and Veldkamp, 2024).&lt;/p&gt;
&lt;p&gt;Q: What is the policy relevance of the secondary market for processed data?
A: The commented paper&amp;rsquo;s simulation and empirical analysis shows that facilitating processed data markets (e.g., via APIs or structured knowledge sharing) can democratize innovation and reduce market concentration, enabling even low-AI firms to compete. This aligns with recent literature on secondary markets for structured data and foundation model outputs (Gans, 2018, 2024; Conti et al., 2023, 2024; Athey, 2019). Platforms may have incentives to restrict processed data access, potentially reinforcing incumbent power (Carballa Smichowski et al., 2023).&lt;/p&gt;
&lt;p&gt;Q: What channel does Zhang argue the commented paper neglects in its analysis of market concentration?
A: Zhang argues the paper is silent on firm acquisition as an alternative means by which firms access processed data, noting that processed data is less portable than ideas or technologies — it cannot be obtained simply by poaching a skilled employee. Zhang contends this acquisition channel appears central to the paper&amp;rsquo;s focus on market concentration and encourages the authors to include a discussion of it.&lt;/p&gt;
&lt;p&gt;Q: What is the central empirical finding of the commented paper regarding raw versus processed data and market concentration?
A: Access to raw data tends to foster market concentration, while access to processed data tends to reduce market concentration. The evidence shows that compute improvements (proxied by the AWS shock) disproportionately benefit data-rich firms, while processed data accessibility (proxied by the transformer architecture shock) disproportionately benefits low-AI firms, consistent with theoretical predictions.&lt;/p&gt;
&lt;p&gt;Q: What is Zhang&amp;rsquo;s specific concern about the HHI measure used in the commented paper?
A: The commented paper constructs firm-level HHI using time-varying, text-based industry definitions (Hoberg and Phillips, 2016). Zhang argues a positive effect on this HHI is ambiguous: it could reflect genuine firm growth relative to competitors or reclassification of the firm into different, possibly more concentrated, sectors. Zhang concludes that the HHI measure alone is not strong enough to support claims about product market concentration.&lt;/p&gt;
&lt;p&gt;Q: What robustness check does Zhang recommend for the empirical analysis?
A: Zhang recommends constructing industry-level concentration measures at the FIC-year level using fixed baseline FIC codes from Hoberg and Phillips (2016), available at the Hoberg-Phillips Data Library. The authors could then analyze how industries with high versus low average or median AI intensity and data intensity respond to the two technological shocks in terms of concentration. Zhang cites Gutierrez and Philippon (2017) as an example of this approach and notes it would help distinguish within-industry dynamics from shifts in firm business focus, aligning with best practices from De Loecker, Eeckhout, and Unger (2020) on persistent market power.&lt;/p&gt;
&lt;p&gt;Raw data: A by-product of firms&amp;rsquo; production, modeled as linearly related to firm size; represents unprocessed observations that have not yet been transformed into useful signals. Distinguished from processed data, which is what actually improves productivity predictions.&lt;/p&gt;
&lt;p&gt;Processed data: Modeled as the abundance of signals that improves the precision of firms&amp;rsquo; predictions of their next-period productivity (following Farboodi and Veldkamp, 2022). Unlike ideas or technologies, processed data is less portable and cannot easily be transferred by poaching skilled employees.&lt;/p&gt;
&lt;p&gt;AI capability (z_i): Firm-level ability to transform raw data into processed data. Firms with low AI capability may receive negative marginal value from additional raw data due to information entropy effects; firms with high AI capability extract large gains from the same raw data.&lt;/p&gt;
&lt;p&gt;Information entropy effect: The component of the raw-to-processed-data transformation — e^(-z_i) * (-n_{i,t} * ln(n_{i,t})) — that captures the information-theoretic cost of possessing raw data without adequate processing capability. At low AI capability, this effect can reduce or negate the precision of signals.&lt;/p&gt;
&lt;p&gt;Secondary market for processed data: A market in which firms trade processed data, modeled in the commented paper as a platform or API-based exchange. The commented paper&amp;rsquo;s analysis shows this market can democratize innovation and reduce market concentration by enabling low-AI firms to access processed data they cannot produce internally.&lt;/p&gt;
&lt;p&gt;Firm-level HHI (text-based): Herfindahl-Hirschman Index calculated using time-varying, text-based industry definitions (Hoberg and Phillips, 2016). Zhang identifies a measurement ambiguity: a positive effect on this measure could reflect genuine competitive gains or reclassification into more concentrated sectors.&lt;/p&gt;</description></item><item><title>Committed to flexible fiscal rules</title><link>https://macropaperwarehouse.com/papers/committed-to-flexible-fiscal-rules/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/committed-to-flexible-fiscal-rules/</guid><description>&lt;p&gt;A central debate in fiscal policy is whether fiscal rules—numerical constraints on budget deficits or debt levels—impair a government&amp;rsquo;s ability to respond to adverse economic shocks, creating a fundamental trade-off between debt stabilization and macroeconomic stabilization. This paper uses data on large, random natural disasters as exogenous shocks to address the endogeneity of rule adoption and provides new empirical and theoretical evidence on this trade-off. Contrary to the trade-off hypothesis, countries with fiscal rules perform significantly better following such disasters than countries without rules: GDP and private consumption are persistently higher, and fiscal policy is significantly more expansionary. The superior performance is shown to depend on the existence of prior fiscal space and the presence of escape clauses in the rules. A model of sovereign default with endogenous fiscal space and tax plans rationalizes these findings: tight rules prevent myopic governments from accumulating excessive debt in good times, which creates fiscal space for deficit spending when disasters strike, keeping sovereign spreads lower and enabling more expansionary fiscal responses.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-identification-strategy-and-why-do-natural-disasters-solve-the-endogeneity-problem"&gt;Q1. What is the identification strategy and why do natural disasters solve the endogeneity problem?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Large natural disasters serve as a source of exogenous, random adverse economic shocks; by interacting disaster exposure with the presence or absence of fiscal rules, the paper identifies the effect of rules on macroeconomic performance without confounding from the non-random adoption of rules.&lt;/strong&gt; Endogeneity is a central concern in the fiscal rules literature because countries that adopt rules may differ in politically or economically relevant ways from those that do not (e.g., more disciplined political environments, stronger institutions). Using large disasters as quasi-experimental variation removes this concern: the timing and magnitude of natural disasters are uncorrelated with which countries happened to adopt fiscal rules, isolating the effect of rules on crisis response.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-main-empirical-findings"&gt;Q2. What are the main empirical findings?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Countries with fiscal rules show significantly higher output and private consumption following large natural disasters, and implement significantly more expansionary fiscal policy, compared to countries without rules—holding over a 1970Q1–2018Q4 quarterly panel—with confidence bands at the 68% and 90% levels based on 500 Monte Carlo draws.&lt;/strong&gt; The result directly contradicts the commonly held view that fiscal rules restrict governments&amp;rsquo; ability to respond to shocks. Moreover, the paper finds that the superior performance of rule-constrained countries is conditional on two features: the existence of fiscal space prior to the shock (low debt or deficit positions), and the presence of escape clauses that allow rules to be suspended during severe adverse events.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-model-mechanism"&gt;Q3. What is the model mechanism?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In the sovereign default model, a fiscal rule prevents a myopic government from over-borrowing in good times out of political economy considerations (e.g., electoral incentives to spend); this forced restraint creates fiscal space—lower debt, lower sovereign spreads—which allows the government to run deficits when a shock hits without triggering a default episode or a sharp rise in borrowing costs.&lt;/strong&gt; The model predicts that, relative to a no-rule economy, when a disaster strikes in a rule-constrained economy: sovereign spreads spike by less, the fiscal policy response is more expansionary, and output and consumption are higher. Escape clauses in the rules are important: they allow the government to depart from the rule explicitly in crisis situations without destroying the credibility of the rule in normal times.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-policy-implication-for-the-covid-19-fiscal-response"&gt;Q4. What is the policy implication for the COVID-19 fiscal response?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper&amp;rsquo;s findings directly address the suspension of fiscal rules during COVID-19: the theoretical and empirical results suggest that rules with escape clauses do not impair crisis response and may actually improve it, by ensuring fiscal space is available when needed.&lt;/strong&gt; The paper&amp;rsquo;s evidence implies that the COVID-era suspension of rules in many countries (including the EU&amp;rsquo;s Stability and Growth Pact) was not necessarily required to enable expansionary fiscal responses—countries with well-designed rules including escape clauses could have responded expansively while maintaining rule credibility.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;escape clause&lt;/strong&gt; : a provision in a fiscal rule that explicitly permits departure from the rule&amp;rsquo;s numerical target under defined circumstances (severe recessions, natural disasters, etc.); the paper finds that the presence of escape clauses is one of the two conditions for rule-constrained countries to outperform non-rule countries after adverse shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;fiscal space&lt;/strong&gt; : the buffer of low debt and deficit levels that allows a government to increase spending or cut taxes during a shock without triggering unsustainable debt dynamics or elevated sovereign spreads; the paper shows fiscal space is created by rules in good times and consumed in bad times.&lt;/p&gt;</description></item><item><title>Competition and the Phillips curve</title><link>https://macropaperwarehouse.com/papers/competition-and-the-phillips-curve/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/competition-and-the-phillips-curve/</guid><description>&lt;p&gt;Fujiwara and Matsuyama ask whether the well-documented flattening of the New Keynesian Phillips curve (NKPC) and the concurrent rise in market concentration and markup rates are causally linked or merely coincidental. Under the canonical New Keynesian model with CES demand, competition is irrelevant to the Phillips curve regardless of whether entry is endogenous — concentration neither changes its slope nor affects inflation directly. This paper overturns that irrelevance result by extending the canonical model in two directions: (1) incorporating endogenous firm entry and exit following Bilbiie, Ghironi, and Melitz (2008) and Bilbiie, Fujiwara, and Ghironi (2014), and (2) replacing CES with the Homothetic Single Aggregator (HSA) demand system (Matsuyama and Ushchev 2017, 2020b), a flexible, tractable class of homothetic demand systems that nests CES and Translog as special cases.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s theoretical results depend on two of Marshall&amp;rsquo;s laws of demand. The Second law states that the price elasticity of demand rises with the firm&amp;rsquo;s own price; the Third law states that the rate of increase in that elasticity falls with price. Together these conditions imply that the markup rate and pass-through rate are endogenous to the competitive environment.&lt;/p&gt;
&lt;p&gt;The main findings, delivered under both Rotemberg (1982) and Calvo (1983) pricing, are that higher entry costs — leading to market concentration — cause Phillips curve flattening through two distinct, complementary channels:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Structural (steady-state) effect.&lt;/strong&gt; Under Rotemberg pricing, the slope of the NKPC is proportional to the price elasticity zeta(z); market concentration reduces z, hence reduces zeta(z) under the Second law, directly flattening the curve. Under Calvo pricing, the slope is proportional to the pass-through rate rho(z); the Third law implies that concentration reduces rho(z), again flattening the curve. The Calvo–Rotemberg equivalence, which holds under CES to first order (Roberts 1995), breaks down under HSA: each pricing mechanism highlights a different channel.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Observational (omitted variable bias) effect.&lt;/strong&gt; Endogenous entry generates an endogenous cost-push shock through strategic complementarity in price setting. Because the number of firms N_t is omitted from a naive regression of inflation on real marginal cost, and because N_t is positively correlated with the marginal cost under the Second law, the omitted variable bias is negative — the estimated slope is biased downward. This bias is amplified with greater concentration under the Third law (Rotemberg case) and under both the Second and Third laws (Calvo case).&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Quantitatively, the paper simulates under three parametric HSA families — CES, Translog, and Co-PaTh (Constant Pass-Through). De Loecker, Eeckhout, and Unger (2020) document that aggregate markups rose from 21% above marginal cost to 61% — a rise of approximately 40 percentage points. The authors&amp;rsquo; simulations imply this increase corresponds to an entry cost roughly 3.5 times higher under Translog and roughly 2.5 times higher under Co-PaTh with pass-through rate rho = 0.5. Under these parameterizations, the accompanying market concentration can halve the slope of the NKPC. Impulse responses confirm that the responses of inflation to both technology shocks and monetary policy shocks become smaller as market concentration deepens.&lt;/p&gt;
&lt;p&gt;Scope conditions: results require departure from CES (the Second and/or Third law must hold); endogenous entry is necessary for the dynamic cost-push channel; the structural flattening requires only the Second law under Rotemberg but additionally the Third law under Calvo; the omitted variable bias requires the Second law under Rotemberg and both laws under Calvo. The model is closed-economy, with symmetric monopolistic competition and Rotemberg or Calvo price adjustment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the irrelevance result the paper overturns, and why does CES produce it?&lt;/strong&gt;
Under CES, the market share function takes the form s(z) = gamma * z^(1-theta), yielding a constant price elasticity zeta = theta and a pass-through rate rho = 1, regardless of the number of firms or entry costs. As a result, concentration neither alters the slope of the NKPC nor generates any endogenous cost-push shock; competition is simply irrelevant to inflation dynamics. This irrelevance holds even with endogenous entry under CES.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the Homothetic Single Aggregator (HSA) and why is it used?&lt;/strong&gt;
HSA is a class of homothetic demand systems, originally proposed by Matsuyama and Ushchev (2017), in which the market share of each intermediate input variety depends solely on its own price normalized by a single price aggregator A_t. This single aggregator serves as a sufficient statistic summarizing all competitive pressure effects on pricing behavior, including the markup rate and pass-through rate. HSA nests CES and Translog as special cases, is analytically tractable (equilibrium existence and uniqueness are straightforward to ensure with endogenous entry), and is flexible enough to accommodate both the Second and Third laws of demand.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What are Marshall&amp;rsquo;s Second and Third laws as defined in the paper?&lt;/strong&gt;
The Second law states that the price elasticity of demand zeta(z) is increasing in the normalized price z (equivalently, increasing in the single price aggregator A_t, which rises with fewer firms). The Third law, as defined by Matsuyama and Ushchev (2023b), states that the rate of increase in the price elasticity is decreasing in z. Together they ensure that both markup rates and pass-through rates respond systematically to changes in competitive pressure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does market concentration structurally flatten the NKPC under Rotemberg pricing?&lt;/strong&gt;
Under Rotemberg pricing, the slope of the NKPC equals (zeta(z) - 1) / chi, where chi is the Rotemberg price adjustment cost parameter. Higher entry costs reduce the equilibrium number of firms, which reduces competitive pressure and lowers z. Under the Second law, lower z reduces zeta(z), directly shrinking the slope coefficient. This is the steady-state effect of concentration: the structural slope of the curve declines because the price elasticity falls.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does market concentration structurally flatten the NKPC under Calvo pricing?&lt;/strong&gt;
Under Calvo pricing, the slope of the NKPC is positively related to the pass-through rate rho(z) rather than the price elasticity. The Third law implies that lower z (more concentration) reduces rho(z). Market concentration therefore causes structural flattening through the pass-through channel under Calvo. This is why the Calvo–Rotemberg equivalence — which holds to first order under CES — breaks down under HSA: Rotemberg highlights the Second law / price elasticity channel and Calvo highlights the Third law / pass-through channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the endogenous cost-push shock and how does it arise?&lt;/strong&gt;
When the number of operating firms N_t changes endogenously, it alters the single price aggregator A_t and therefore the competitive environment facing each firm. Under the Second law, firms exhibit strategic complementarity in price setting: a firm reduces its markup when other firms lower their prices (A_t falls with more entry). Consequently, movements in N_t directly enter the NKPC as an additional term — (1/chi) * (1 - rho(z)) / rho(z) * N_hat_t — acting as an endogenous cost-push shock. This channel is absent under CES because rho = 1 makes the coefficient zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does the endogenous cost-push shock create a negative omitted variable bias?&lt;/strong&gt;
A naive regression of inflation on real marginal cost omits the N_hat_t term. Under the Second law, N_t is positively correlated with the marginal cost (more entry drives markups down, consistent with marginal cost movements), so the omitted variable N_hat_t is positively correlated with the included regressor. Because the true coefficient on N_hat_t in the NKPC is negative, omitting it biases the estimated slope on marginal cost downward (negative omitted variable bias). The estimated relationship between inflation and marginal cost is therefore weaker than the true structural relationship.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How is the omitted variable bias amplified by concentration?&lt;/strong&gt;
Under the Third law (Rotemberg case) and under both the Second and Third laws (Calvo case), greater market concentration amplifies the magnitude of this negative bias. The intuition is that higher concentration makes the pass-through rate rho(z) smaller, which increases the coefficient on N_hat_t in the NKPC and thereby raises the magnitude of the bias when N_hat_t is omitted. Greater concentration thus generates both more structural flattening and more observational flattening simultaneously.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What are the quantitative magnitudes of Phillips curve flattening in the simulations?&lt;/strong&gt;
De Loecker, Eeckhout, and Unger (2020) document that aggregate markups rose from 21% above marginal cost to 61% — approximately 40 percentage points. The paper&amp;rsquo;s simulations imply this corresponds to an entry cost increase of roughly 3.5 times under Translog and roughly 2.5 times under Co-PaTh with rho = 0.5. According to Figure 2, the accompanying market concentration can halve the slope of the NKPC. The slope declines more steeply for demand systems with smaller pass-through rates (rho further from 1).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How do impulse responses change with market concentration?&lt;/strong&gt;
As entry costs rise (deeper concentration), the responses of the inflation rate to both technology shocks and monetary policy shocks become smaller in magnitude. Under the Second law, a positive technology shock increases the number of firms through a wealth effect, but strategic complementarity in price setting reduces markups, muting the inflation response relative to CES. The dynamic effect of endogenous entry thus weakens the transmission of real economic shocks to inflation — a supply side effect of monetary policy that parallels Baqaee, Farhi, and Sangani (2021) but operates through firm entry rather than the misallocation channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What is the cyclicality of the markup rate under HSA, and why is it ambiguous?&lt;/strong&gt;
Under CES with flexible prices, the markup is constant. Under CES with sticky prices, the markup is procyclical (marginal cost falls with a positive technology shock but the price is rigid in the short run). Under the Second law with flexible prices, a positive technology shock increases firm entry, which reduces markups, making the markup countercyclical. In a sticky price equilibrium under the Second and Third laws, the cyclicality is therefore ambiguous: it depends on the tension between nominal rigidities (pushing toward procyclicality) and the pass-through rate (pushing toward countercyclicality).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: Why do the three price indices in the model differ, and which is used for the NKPC?&lt;/strong&gt;
The model features three aggregate price measures: the final goods price (CPI) P_t, which captures productivity effects of entry; the single price aggregator A_t, which captures competitive effects of entry and is the reference price for firms; and the average price index (PPI) p_t, which is not affected by entry effects and is the measured price index. Because entry effects shift P_t and A_t in ways that are not directly observed, the paper evaluates NKPC responsiveness in terms of p_t (PPI inflation), the measurable index.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: How does this paper relate to Wang and Werning (2022) and Baqaee, Farhi, and Sangani (2021)?&lt;/strong&gt;
Wang and Werning (2022) use a dynamic oligopoly model with exogenous entry and CES/Kimball demand, showing that higher concentration amplifies real effects of monetary policy and generates inflation persistence and endogenous cost-push shocks. Baqaee, Farhi, and Sangani (2021) use monopolistic competition with exogenous entry and Kimball demand under Calvo pricing, showing flattening through real rigidities and a misallocation channel (supply side effects of monetary policy). This paper uses monopolistic competition with endogenous entry and HSA under both Rotemberg and Calvo pricing; it produces supply side effects through firm entry rather than misallocation, and uses HSA rather than Kimball because HSA more readily guarantees equilibrium uniqueness with endogenous entry.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: What parametric families of HSA are used in simulations and what are their properties?&lt;/strong&gt;
Three families are used: CES (constant price elasticity theta, pass-through rho = 1, benchmark); Translog (satisfies the Second law, variable markups and pass-through); and Co-PaTh or Constant Pass-Through (proposed by Matsuyama and Ushchev 2020a, constant pass-through rate rho in (0,1) under flexible prices, containing CES as a limit as rho approaches 1). For Calvo pricing, a fourth family — PEM (Power Elasticity of Markup, proposed by Matsuyama and Ushchev 2023b) — is used; PEM satisfies the Third law in its strong form and contains Co-PaTh as a limit case. Translog is noted to behave similarly to Co-PaTh with rho = 0.5.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q15: What are the policy implications for central banks?&lt;/strong&gt;
Rising market concentration, by flattening the NKPC both structurally and observationally, reduces the effectiveness of monetary policy in achieving price stability through real economic activity — consistent with the concerns expressed by Federal Reserve officials (Clarida, Daly, Williams) quoted in the paper. The results suggest that empirical estimates of the NKPC slope that omit endogenous entry dynamics will be systematically biased downward, potentially leading central banks to underestimate the true structural responsiveness of inflation to demand conditions. Competition policy and barriers to entry thus have macroeconomic consequences beyond standard allocative efficiency considerations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Homothetic Single Aggregator (HSA):&lt;/strong&gt; A class of homothetic demand systems in which the market share of each input variety depends solely on its own price normalized by a single price aggregator A_t, which serves as a sufficient statistic for all competitive pressure effects on firm pricing behavior including the markup rate and pass-through rate. Nests CES and Translog as special cases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Marshall&amp;rsquo;s Second Law of Demand (as used in the paper):&lt;/strong&gt; The condition that the price elasticity of demand zeta(z) is strictly increasing in the firm&amp;rsquo;s normalized price z. Under this condition, markup rates and pass-through rates vary endogenously with competitive pressure, and strategic complementarity in price setting arises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Marshall&amp;rsquo;s Third Law of Demand (as used in the paper):&lt;/strong&gt; The condition, defined by Matsuyama and Ushchev (2023b), that the rate of increase in the price elasticity is decreasing in z. This law determines how the pass-through rate responds to concentration changes and is the relevant condition for structural flattening under Calvo pricing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pass-through rate rho(z):&lt;/strong&gt; The fraction of a cost change that a monopolistically competitive firm passes through to its price under flexible pricing, defined as rho(z) = [1 - d&lt;em&gt;ln(zeta/(zeta-1))/d&lt;/em&gt;ln(z)]^(-1). Under CES, rho = 1 (complete pass-through); under the Second law, rho &amp;lt; 1 (incomplete pass-through); it declines with concentration under the Third law.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous cost-push shock:&lt;/strong&gt; The direct effect of changes in the endogenous number of firms N_t on inflation in the NKPC, arising from strategic complementarity in price setting under HSA. This term is absent under CES (where the coefficient is zero) and generates an omitted variable bias in naive regressions of inflation on marginal cost.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Steady-state (structural) flattening:&lt;/strong&gt; The reduction in the true structural slope of the NKPC caused by market concentration operating through lower price elasticity (Rotemberg channel) or lower pass-through rate (Calvo channel). This is the first of the paper&amp;rsquo;s two reasons for observed Phillips curve flattening.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Observational (omitted variable bias) flattening:&lt;/strong&gt; The downward bias in empirically estimated NKPC slopes arising because naive regressions omit the endogenous cost-push shock term. The bias is negative and is amplified by greater market concentration under the Third law and/or Second law depending on the pricing mechanism.&lt;/p&gt;</description></item><item><title>Complete Pass-Through in Levels</title><link>https://macropaperwarehouse.com/papers/complete-pass-through-in-levels/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/complete-pass-through-in-levels/</guid><description/></item><item><title>Consistent Evidence on Duration Dependence of Price Changes</title><link>https://macropaperwarehouse.com/papers/consistent-evidence-on-duration-dependence-of-price-changes/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/consistent-evidence-on-duration-dependence-of-price-changes/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; This paper asks two related questions. First, can one develop a robust, distribution-free estimator for the discrete-time mixed proportional hazard (MPH) model of duration with unobserved heterogeneity? Second, what does that estimator reveal about the shape of the hazard of price changes, the role of heterogeneity in shaping aggregate price dynamics, and the distinction between regular price changes and sales?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology.&lt;/strong&gt; The authors develop a linear generalized method of moments (GMM) estimator for the discrete-time MPH model, building on identification results in Honoré (1993). The model specifies that the probability a price spell ends at duration t, conditional on surviving to t, equals the product of a product-specific frailty parameter θ (unobserved, fixed over time) and a common baseline hazard bt. The estimator exploits repeated price spells per product via moment conditions that are linear in bt, making estimation and inference straightforward. It accommodates right- and left-censored data, competing risks, and spell-specific observable characteristics, without requiring any parametric assumption on the frailty distribution. The estimator is consistent as the number of products grows, even with a short time dimension. A Hansen-Sargan J-test of overidentifying restrictions and a test of the monotone-average-type prediction are also developed.&lt;/p&gt;
&lt;p&gt;The estimator is applied to two datasets: (1) IRI weekly store data (2001–2011), covering 30 product categories and more than 21 million products, yielding 684,919,778 pairs of durations; and (2) Online Micro Price data from Cavallo (2018), comprising approximately 250,000 products at daily frequency.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Baseline hazard and heterogeneity.&lt;/em&gt; In the pooled IRI data, the Kaplan-Meier hazard is steeply declining throughout the entire range from 2 to 60 weeks. In contrast, the estimated baseline hazard is roughly constant until week 4 and then declines only modestly, with a noticeable spike at week 52. The ratio of the Kaplan-Meier hazard to the baseline hazard — the average type, E[θ|t] — drops by approximately 60 percent within the first 20 weeks, and continues to decline, reaching roughly 0.3 of its initial value after one year. This decomposition reveals substantial unobserved heterogeneity that accounts for a large fraction of the observed decline in the Kaplan-Meier hazard.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Implications for structural models.&lt;/em&gt; The finding of a decreasing baseline hazard is inconsistent with canonical state-dependent pricing models (Golosov and Lucas, 2007), which predict an increasing hazard, conditional on a given firm&amp;rsquo;s type. The decreasing baseline hazard is instead broadly consistent with time-dependent pricing models, though not with a constant-hazard (Calvo, 1983) specification.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Monetary policy impulse response.&lt;/em&gt; In a calibrated time-dependent pricing model with strategic complementarity (α = 0, 0.5, 0.95), the aggregate price level dynamics in the estimated heterogeneous-firm MPH economy are close to those of a homogeneous-firm economy that uses the Kaplan-Meier hazard as the common price-change hazard. The homogeneous-firm approximation is substantially closer to the MPH economy than a Taylor (1979, 1980) staggered-contract economy with the same Kaplan-Meier hazard, particularly when strategic complementarity is strong (α = 0.95). The Calvo economy provides a poor approximation due to its exponential (constant-speed) price convergence structure.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Regular versus temporary price changes.&lt;/em&gt; Using the competing-risks extension with spell-specific observables — classifying spells by whether they start and end with a price increase (+) or decrease (−) — the authors separately estimate four baseline hazards. The baseline hazard for consecutive price increases (b++t) is relatively flat, especially for the first 6 weeks, then flat until week 45, with a spike near one year, consistent with price-plan models. The baseline hazard for reversals (particularly b−+t, price decreases followed by price increases, associated with sales) is steeply declining. The J-test statistics are substantially lower for price trends (J++ = 3,920; J−− = 3,401) than for reversals (J+− = 8,737; J−+ = 7,910), and markedly lower than the pooled-model J = 10,498, indicating that the MPH structure fits regular price changes considerably better than sales.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; Results are conditional on weekly store-level price data for mostly packaged consumer goods (30 IRI product categories). The analysis focuses on price spells of at least 2 weeks to avoid spurious duration-one spells from mid-week price changes. The maximum duration examined is 60 weeks. The comparison of estimation methods relies on the IRI data only; the Online Micro Price data confirm weekly decision-making through a spike in the daily hazard every 7 days. Comparisons with maximum likelihood estimates show that GMM recovers more heterogeneity (average type declines to 0.37 at 6 months by GMM versus 0.48 by continuous-time MLE), and that time aggregation explains most of the discrepancy between the two methods.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-mixed-proportional-hazard-mph-model-as-used-in-this-paper-and-what-does-the-estimator-identify"&gt;Q1. What is the mixed proportional hazard (MPH) model as used in this paper, and what does the estimator identify?&lt;/h3&gt;
&lt;p&gt;A1. The MPH model specifies that the hazard that a price spell ends at duration t, conditional on surviving to t, equals θ·bt, where θ is a product-specific frailty parameter drawn from an unknown distribution G and bt is a baseline hazard common to all products. The estimator, which is linear in bt, identifies the baseline hazard up to a multiplicative constant using moment conditions derived from repeated spell data, without restricting the shape of the frailty distribution. Identification relies on comparing the joint survival probabilities of two consecutive spells for the same product and exploits the symmetry implied by the MPH structure across spells.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-kaplan-meier-hazard-relate-to-the-baseline-hazard-and-what-does-this-relationship-imply-about-heterogeneity"&gt;Q2. How does the Kaplan-Meier hazard relate to the baseline hazard, and what does this relationship imply about heterogeneity?&lt;/h3&gt;
&lt;p&gt;A2. The paper proves that the Kaplan-Meier hazard Ht equals bt times E[θ|t], the mean frailty among spells surviving to duration t. Because higher-type products (those with a higher propensity to change prices) exit the pool of surviving spells earlier, E[θ|t] is strictly decreasing in t — a form of dynamic selection. The ratio Ht/bt, normalized to 1 at the start, falls to approximately 0.4 by week 20 in the pooled IRI data and to approximately 0.3 after one year, documenting that a large share of the decline in the Kaplan-Meier hazard reflects heterogeneity rather than structural negative duration dependence.&lt;/p&gt;
&lt;h3 id="q3-what-does-the-estimated-baseline-hazard-imply-about-structural-models-of-price-setting"&gt;Q3. What does the estimated baseline hazard imply about structural models of price setting?&lt;/h3&gt;
&lt;p&gt;A3. A decreasing baseline hazard is inconsistent with the canonical state-dependent model of Golosov and Lucas (2007), in which a firm&amp;rsquo;s hazard of price change is increasing in the time since the last change, because larger deviations from the desired price accumulate with duration. The decreasing baseline hazard is instead consistent with time-dependent pricing models and with price-plan models where within-plan switches are costless. The mild spike at week 52 in the baseline hazard is consistent with Taylor-type annual pricing rules.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-approximate-aggregation-result-for-monetary-policy-and-how-quantitatively-accurate-is-it"&gt;Q4. What is the approximate aggregation result for monetary policy, and how quantitatively accurate is it?&lt;/h3&gt;
&lt;p&gt;A4. In the time-dependent pricing model without strategic complementarity (α = 0), the impulse response of the aggregate price level to a monetary shock in a heterogeneous-firm economy is exactly the same as in a homogeneous-firm economy whose single firm uses the Kaplan-Meier survival function. This extends Carvalho and Schwartzman (2015) to an approximation in the case with strategic complementarity (α = 0.5 and α = 0.95). Numerically, the path of aggregate prices in the estimated MPH economy is close to that in the homogeneous-firm Kaplan-Meier economy, and substantially closer to it than to the Taylor-contract economy — the difference is most pronounced at horizons beyond about half a year when α = 0.95, where the Taylor economy shows notably slower initial convergence and faster later convergence relative to the MPH and homogeneous economies.&lt;/p&gt;
&lt;h3 id="q5-how-do-the-papers-results-differ-from-those-obtained-using-maximum-likelihood-estimation-of-the-continuous-time-mph-model"&gt;Q5. How do the paper&amp;rsquo;s results differ from those obtained using maximum likelihood estimation of the continuous-time MPH model?&lt;/h3&gt;
&lt;p&gt;A5. The GMM estimator recovers substantially more heterogeneity than maximum likelihood (MLE) applied to the continuous-time model with continuous records (assumed gamma frailty). The average type falls from 1 to 0.37 at six months under GMM, versus only 0.48 under MLE. The authors investigate two sources of this discrepancy: the assumed frailty distribution family (gamma) and time aggregation. They conclude that time aggregation is quantitatively more important in the IRI weekly data — that is, the continuous-time MLE approach fails to properly account for the discrete nature of the data-generating process, leading it to understate heterogeneity and recover a steeper baseline hazard.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-paper-distinguish-regular-price-changes-from-sales-without-directly-observing-a-sales-flag"&gt;Q6. How does the paper distinguish regular price changes from sales without directly observing a sales flag?&lt;/h3&gt;
&lt;p&gt;A6. The competing-risks extension classifies each spell by whether it starts with a price increase or decrease (observable characteristic χ ∈ {+, −}) and by whether it ends with a price increase or decrease (competing risk ρ ∈ {+, −}). Price trends — spells where the direction is the same at both the start and end (++ or −−) — are interpreted as regular price changes; price reversals (especially −+, i.e., price decrease followed by increase) are associated with sales. This approach is consistent with the statistical model used for estimation, avoids the bias from simply dropping suspected sales spells before estimation, and allows the MPH structure to hold only for the risks of interest even if it fails for others.&lt;/p&gt;
&lt;h3 id="q7-how-well-does-the-mph-model-fit-regular-price-changes-versus-sales"&gt;Q7. How well does the MPH model fit regular price changes versus sales?&lt;/h3&gt;
&lt;p&gt;A7. The J-test of overidentifying restrictions yields test statistics of J++ = 3,920 for consecutive price increases and J−− = 3,401 for consecutive price decreases, compared with J = 10,498 for the pooled model and J+− = 8,737 and J−+ = 7,910 for the reversal hazards. All rejections are at conventional significance levels (critical value 1,749 at 5%), but the rejection is substantially milder for price trends than for price reversals. For individual product categories, the model cannot be rejected for 8 categories (out of 30) for b++ and 21 categories for b−−, suggesting the MPH structure is a much better description of regular price changes than of sales.&lt;/p&gt;
&lt;h3 id="q8-what-role-do-one-week-price-spells-play-in-the-data-and-why-are-they-excluded"&gt;Q8. What role do one-week price spells play in the data, and why are they excluded?&lt;/h3&gt;
&lt;p&gt;A8. In the IRI data, prices are measured as the ratio of weekly revenue to quantity, so a price change occurring mid-week generates a spurious price spell of duration one week. If all spells including one-week spells are retained, the autocorrelation of spell durations is only 0.029 in levels and even negative (−0.042) in logs, which is inconsistent with a mixture model. Once one-week spells are excluded, the autocorrelation rises to 0.235 in levels and 0.233 in logs, and is stable when two-week spells are also excluded (0.248 and 0.256). The paper therefore sets the lower duration bound at T̲ = 2 weeks.&lt;/p&gt;
&lt;h3 id="q9-what-does-the-daily-online-micro-price-data-add-relative-to-the-weekly-iri-data"&gt;Q9. What does the daily Online Micro Price data add relative to the weekly IRI data?&lt;/h3&gt;
&lt;p&gt;A9. The daily data reveal a sharp spike in the price-change hazard every seven days, suggesting that even when prices are observed daily, the decision to change prices is made at the weekly frequency. This justifies the use of a discrete-time model with a one-week period. The estimates from daily and weekly aggregations of the same data are broadly similar, though weekly data recovers somewhat less heterogeneity than daily data. Aggregating IRI weekly data to monthly frequency understates heterogeneity even more, confirming that frequency matters for measuring heterogeneity.&lt;/p&gt;
&lt;h3 id="q10-what-are-the-computational-advantages-of-the-gmm-estimator-relative-to-maximum-likelihood"&gt;Q10. What are the computational advantages of the GMM estimator relative to maximum likelihood?&lt;/h3&gt;
&lt;p&gt;A10. Because the moment conditions are linear in the baseline hazard bt, the GMM estimator is obtained in closed form, making estimation fast and inference straightforward. On the pooled IRI sample, GMM estimation (including standard errors) required 70 minutes on a machine with 60 GB memory, whereas the maximum likelihood estimator required 15 hours on a machine with 256 GB memory and failed entirely on the 60 GB machine. The GMM approach also avoids the need to specify the frailty distribution family and guarantees a global solution (proved by the identification result), whereas the likelihood function is non-linear in bt and may have multiple local maxima.&lt;/p&gt;
&lt;h3 id="q11-what-is-the-shape-of-the-b-baseline-hazard-for-regular-price-increases-and-what-models-does-it-support"&gt;Q11. What is the shape of the b++ baseline hazard for regular price increases, and what models does it support?&lt;/h3&gt;
&lt;p&gt;A11. The baseline hazard for spells starting and ending with a price increase (b++) is decreasing during the first 6 weeks — dropping by almost 50% — and then flat until approximately week 45, with a pronounced spike at around one year. This shape is consistent with price-plan models (Eichenbaum, Jaimovich, and Rebelo, 2011) with Calvo-type switching between plans, where within-plan changes are costless and the hazard of between-plan switching is approximately constant. The annual spike is consistent with Taylor-type pricing. Approximately 76.8% of complete spells starting after a price increase last at most 6 weeks.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Baseline hazard (bt).&lt;/strong&gt; The component of the MPH hazard that is common to all products and may vary arbitrarily with elapsed duration t. It represents structural duration dependence — the tendency for a given product to be more or less likely to change price as a function of how long its current spell has lasted — net of heterogeneity. It is identified only up to a multiplicative constant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Frailty parameter (θ) / frailty distribution (G).&lt;/strong&gt; The product-specific scaling factor in the MPH model, fixed over all spells for a given product, that captures permanent unobserved differences in price-change frequency across products. The paper treats G as a nuisance parameter and does not require a parametric assumption on its shape. A higher θ means the product has a higher baseline propensity to change its price.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Average type (E[θ|t]).&lt;/strong&gt; The mean frailty parameter among spells that have survived to at least duration t. Because high-type products change price earlier and exit the pool of surviving spells first, the average type is provably strictly decreasing in t under the MPH model. It is measured as the ratio of the Kaplan-Meier hazard to the baseline hazard, and its rate of decline measures the importance of dynamic selection.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Kaplan-Meier hazard (Ht).&lt;/strong&gt; The probability that a randomly drawn spell ends at duration t, conditional on having lasted at least t periods. It mixes together structural duration dependence (captured by bt) and dynamic selection (captured by changes in the average type). It can be estimated without imposing the MPH structure, requiring only stationarity of the duration process.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Competing risks.&lt;/strong&gt; The framework in which a price spell can end for multiple distinct reasons — here, ending with a price increase or a price decrease — each with its own hazard function. The paper&amp;rsquo;s GMM approach allows the MPH structure to hold for only a subset of risks and observables, without imposing any structure on the remaining risks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Price trends vs. price reversals.&lt;/strong&gt; A classification of spells based on the direction of the surrounding price changes. Price trends are spells where the direction of the price change at the start and end of the spell is the same (++ or −−), interpreted as regular price changes. Price reversals are spells where the direction switches (e.g., −+, a price decrease followed by a price increase), associated with sales and other temporary price changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Strategic complementarity in pricing (α).&lt;/strong&gt; The degree to which a firm&amp;rsquo;s target price responds to the average price set by other firms. Parameterized by α ∈ [0, 1), where α = 0 yields the exact aggregation result (only the Kaplan-Meier hazard matters) and higher α increases aggregate price stickiness by making firms reluctant to deviate from the average price when few others are adjusting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dynamic selection.&lt;/strong&gt; The mechanism by which the composition of the pool of surviving price spells shifts toward lower-type (more price-sticky) products as duration increases, because higher-type products change price sooner and exit the pool. This is the source of the gap between the steeply declining Kaplan-Meier hazard and the more modestly declining baseline hazard.&lt;/p&gt;</description></item><item><title>Costs of Financing U.S. Federal Debt Under a Gold Standard: 1791-1933</title><link>https://macropaperwarehouse.com/papers/costs-of-financing-u.s.-federal-debt-under-a-gold-standard-1791-1933/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/costs-of-financing-u.s.-federal-debt-under-a-gold-standard-1791-1933/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;This paper constructs a new dataset of US federal bond prices and uses it to estimate the full term structure of yields on gold-denominated US federal debt from 1791 to 1933 — the entire gold standard era. The core research question is how the costs of financing US federal debt evolved over this period and what monetary, fiscal, and financial policy changes drove that evolution, with the ultimate aim of understanding how the US built fiscal capacity and transformed its debt from a &amp;ldquo;junk bond&amp;rdquo; into a global &amp;ldquo;safe asset.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology.&lt;/strong&gt; The authors compile monthly prices, quantities, and descriptions of all US Treasury securities from 1776 to 1960 (the Hall et al. 2018 dataset). Bonds with less than one year to maturity are excluded from the main estimation due to liquidity premia. The primary estimation uses a Dynamic Nelson-Siegel (DNS) model with stochastic volatility (Diebold and Li 2006; Hautsch and Yang 2012), estimated by Bayesian MCMC. A key methodological innovation is the addition of bond-specific idiosyncratic pricing errors (Assumption 3), which allows the authors to include bonds with heterogeneous contract features — call options, indefinite maturities, conversion features — that characterize 19th-century US debt without either dropping them from the sample or having their idiosyncrasies distort the common yield curve. The data are &amp;ldquo;big&amp;rdquo; in the time-series dimension but sparse in the maturity (cross-sectional) dimension, frequently offering fewer than five price observations per month; the DNS framework pools information across time to address this sparsity.&lt;/p&gt;
&lt;p&gt;For the greenback period (1862–1878), the authors extend the approach by modeling the greenback yield curve as a function of the gold yield curve and a time-varying VAR model of exchange rate expectations (Assumptions 4–5). Only nine greenback-denominated bonds exist in the sample, most of them short-term; the VAR is estimated jointly using exchange rate data and the relative prices of greenback and gold bonds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Long-run decline in yields.&lt;/strong&gt; The 10-year gold-denominated zero-coupon yield fell from approximately 8% in 1800 to approximately 2% in 1900, consistent with global secular decline trends, but the trajectory stabilized near 2% after 1900 — suggesting US debt began to play a distinctive &amp;ldquo;safe-asset&amp;rdquo; role from the turn of the 20th century.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;War spikes were much larger than previously understood.&lt;/strong&gt; The paper&amp;rsquo;s estimate of the 10-year gold yield reaches a peak of approximately 16% near the end of the Civil War. This is substantially higher than the Homer and Sylla (2004) peak of 6% at the start of the war. The discrepancy arises because Homer and Sylla used bonds trading at par — which did not exist during the Civil War — while this paper uses the full universe of bonds at monthly frequency.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Yield curve slope switched sign.&lt;/strong&gt; The term spread (10-year minus 2-year gold yield) was typically negative before the Civil War (inverted yield curve) and turned persistently positive afterward. The authors link this switch to a change in long-run inflation predictability: inflation was relatively hard to forecast before the Civil War and easier to forecast after, consistent with a negative inflation-risk premium in the pre-war period.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Default risk premium disappeared around 1905.&lt;/strong&gt; Comparing hypothetical gold-denominated US consols to UK consols (the 19th-century benchmark safe asset), US yields were persistently above UK yields until approximately 1905, when US yields fell below UK yields. This indicates that US federal debt acquired safe-asset characteristics well before World War I, foreshadowing the shift in global reserve asset status during and after Bretton Woods.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Nominal anchor during the Civil War.&lt;/strong&gt; Despite a 60% depreciation of the greenback against gold during the Civil War (100 greenback dollars could be purchased for as few as 40 gold dollars in summer 1864), investors expected greenbacks to eventually return to gold parity. Estimated long-run exchange rate expectations remained anchored at one-for-one parity throughout the period. This kept greenback-denominated bond yields flat at approximately 6% — bonds traded around par — explaining the &amp;ldquo;Civil War yield puzzle&amp;rdquo; noted by Friedman and Schwartz (1963).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Short-rate disconnect.&lt;/strong&gt; Short-maturity government bonds (less than one year) traded with a premium of approximately 0.25 to 0.5 percentage points relative to model-implied yields throughout most of the 19th century, reflecting scarcity of money-like assets. This premium effectively disappeared from the 1880s until World War I — coinciding with the National Banking Era — and then reappeared in the 1920s after the Federal Reserve created a secondary market for Certificates of Indebtedness.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-why-does-the-paper-restrict-estimation-to-bonds-with-maturity-greater-than-one-year"&gt;Q1. Why does the paper restrict estimation to bonds with maturity greater than one year?&lt;/h3&gt;
&lt;p&gt;Short-maturity Treasury notes exhibited particularly large estimated bond-specific pricing errors in preliminary analysis, which the authors attribute to a liquidity premium: short-term government debt was used for transactions and thus commanded a money-like premium that a common discount function cannot accommodate. To keep this liquidity premium from distorting estimates of the longer end of the curve, these bonds are excluded from the main estimation. Short-maturity bonds are then studied separately as an &amp;ldquo;out-of-sample&amp;rdquo; exercise (the short-rate disconnect).&lt;/p&gt;
&lt;h3 id="q2-how-does-the-dynamic-nelson-siegel-model-with-stochastic-volatility-solve-the-cross-sectional-sparsity-problem"&gt;Q2. How does the Dynamic Nelson-Siegel model with stochastic volatility solve the cross-sectional sparsity problem?&lt;/h3&gt;
&lt;p&gt;The DNS model parameterizes the entire yield curve at each date using only three latent factors — level (L), slope (S), and curvature (C) — which follow a driftless random walk. The stochastic volatility component, captured in the covariance matrix Σt, governs how much information is pooled across adjacent time periods. When Σt → 0, the yield curve is assumed constant (full pooling); when Σt → ∞, estimates are date-by-date (no pooling). By allowing Σt to vary, the model pools more heavily in sparse periods and less during wars when yields change rapidly. The companion paper (Payne et al. 2023a) confirms via information criteria that stochastic volatility and correlated shocks improve fit without overfitting.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-bond-specific-pricing-error-and-why-is-it-essential-for-historical-data"&gt;Q3. What is the bond-specific pricing error and why is it essential for historical data?&lt;/h3&gt;
&lt;p&gt;Assumption 3 adds to each bond i a Gaussian pricing error with mean zero and bond-specific standard deviation σ(i)_m (scaled by Macaulay duration to approximate yield-space errors). This allows bonds with idiosyncratic contract features — call options, conversion clauses, ambiguous payment currency — to inform the common yield curve without unduly distorting it. Bonds with larger σ(i)_m receive less weight in estimation. In modern datasets, researchers pre-select homogeneous bonds and use time-specific pricing errors; the historical sparsity prevents that approach here.&lt;/p&gt;
&lt;h3 id="q4-how-large-were-civil-war-yields-compared-to-prior-estimates-and-why-does-the-discrepancy-arise"&gt;Q4. How large were Civil War yields compared to prior estimates, and why does the discrepancy arise?&lt;/h3&gt;
&lt;p&gt;The paper&amp;rsquo;s posterior median for the 10-year gold zero-coupon yield peaks at approximately 16% near the end of the Civil War. Homer and Sylla (2004) report a peak of 6% at the start of the war. The discrepancy arises because Homer and Sylla used bonds trading close to par, but during the Civil War no federal bonds traded at gold-price par (Lincoln&amp;rsquo;s re-election was uncertain in summer 1864; 100 greenback dollars could be purchased for 40 gold dollars, implying 6% coupon bonds were priced at 40% of par, implying yields in excess of 15%). This paper uses the full universe of Treasury bonds at monthly frequency and allows all bonds — regardless of trading price — to inform the yield curve.&lt;/p&gt;
&lt;h3 id="q5-when-did-us-debt-cease-to-carry-a-default-risk-premium-relative-to-uk-debt-and-how-is-this-measured"&gt;Q5. When did US debt cease to carry a default risk premium relative to UK debt, and how is this measured?&lt;/h3&gt;
&lt;p&gt;The authors compare yields-to-maturity on gold-denominated UK consols to those on hypothetical gold-denominated US consols promising the same coupon flows. Because both countries were on a gold standard for most of the period and UK consols were the 19th-century safe asset, the spread is interpreted as a risk premium on US debt. US yields fell below UK yields persistently after approximately 1905, indicating that US debt was priced as a safe asset well before World War I. US yields were temporarily close to UK yields in the 1820s but the spread re-widened after the Jacksonian era, state defaults in the 1840s, and the Civil War. The spread closed only after Civil War disruptions resolved, the National Banking System matured, and gold-greenback parity was restored in 1879.&lt;/p&gt;
&lt;h3 id="q6-what-is-the-nominal-anchor-finding-during-the-greenback-era-and-what-econometric-method-uncovers-it"&gt;Q6. What is the &amp;ldquo;nominal anchor&amp;rdquo; finding during the greenback era, and what econometric method uncovers it?&lt;/h3&gt;
&lt;p&gt;During 1862–1878, the federal government issued non-convertible greenback dollars alongside gold bonds. The greenback depreciated substantially (to 40 cents per gold dollar in 1864), yet greenback-paying bonds traded near par, implying greenback yields near 6%. The authors model the greenback yield curve as a product of the gold discount function and a &amp;ldquo;multiplier&amp;rdquo; z(j)_t capturing the expected future gold-to-greenback exchange rate at each horizon j (Assumption 4). The exchange rate expectations are estimated via a time-varying VAR(2) model of the gold-to-greenback and gold-to-goods exchange rates (Assumption 5), jointly constrained by the prices of greenback bonds via an interest-rate parity condition. The resulting estimates show that throughout the greenback era — even during large wartime depreciations — investors&amp;rsquo; long-run expectations of the exchange rate remained anchored near gold parity, consistent with anticipated eventual resumption.&lt;/p&gt;
&lt;h3 id="q7-how-did-political-events-affect-exchange-rate-expectations-during-and-after-the-civil-war"&gt;Q7. How did political events affect exchange rate expectations during and after the Civil War?&lt;/h3&gt;
&lt;p&gt;The time-varying VAR captures shifts in exchange rate expectations associated with identifiable political events. Grant&amp;rsquo;s victory in 1869 (which resolved uncertainty about whether debts would be honored in gold) coincided with an increase in the price of greenbacks, a decrease in expected greenback appreciation, and a closing of the gap between greenback and gold 10-year yields. In the early 1870s, following the Panic of 1873 and uncertainty about resumption, investors came to expect that gold-greenback discrepancies would persist almost indefinitely, causing gold and greenback yields to converge. The Resumption Act of January 1875 then shifted 2-year and 10-year expectations back toward parity.&lt;/p&gt;
&lt;h3 id="q8-what-is-the-short-rate-disconnect-and-what-does-it-reveal-about-the-national-banking-era"&gt;Q8. What is the short-rate disconnect and what does it reveal about the National Banking Era?&lt;/h3&gt;
&lt;p&gt;The short-rate disconnect is the difference between observed yields-to-maturity for bonds with less than one year to maturity and the yields-to-maturity implied by the model estimated on bonds with more than one year maturity. A positive disconnect means short-maturity bonds yielded less than long-maturity bonds conditional on the model — indicating a liquidity premium on short-term debt. The authors find a persistent premium of 0.25 to 0.5 percentage points through most of the 19th century, reflecting scarcity of money-like assets when state bank notes circulated at variable discounts. The premium disappeared from approximately the 1880s to World War I, coinciding with the mature National Banking Era after greenback-gold parity was restored in January 1879. The authors interpret this as evidence that the National Banking Acts (1862–1866), which allowed National Banks to issue standardized bank notes backed by long-term US government bonds, ultimately succeeded in supplying liquid assets and equalizing the pricing of short- and long-term federal debt — but only after the currency risk from the greenback period had been resolved.&lt;/p&gt;
&lt;h3 id="q9-how-does-the-composite-long-term-yield-series-officer-williamson--homer-sylla-distort-historical-narratives"&gt;Q9. How does the composite long-term yield series (Officer-Williamson / Homer-Sylla) distort historical narratives?&lt;/h3&gt;
&lt;p&gt;The composite series combines Homer and Sylla US federal yields (1798–1861), New England Municipal bond yields (1862–1899), and corporate bond yields (1900–1940). The paper shows that this composite series substantially underestimates the increase in US federal borrowing costs during Civil War deficits (peak of 6% vs. this paper&amp;rsquo;s 16%) and overstates post-Civil War borrowing costs by mixing in riskier private obligations. The authors argue that earlier findings of no strong association between 19th-century interest costs and deficits (Evans 1985, 1987) may reflect the composite series&amp;rsquo; failure to accurately capture federal borrowing costs during large deficit episodes.&lt;/p&gt;
&lt;h3 id="q10-how-did-the-yield-curve-slope-change-after-the-civil-war-and-what-explains-it"&gt;Q10. How did the yield curve slope change after the Civil War and what explains it?&lt;/h3&gt;
&lt;p&gt;The term spread (10-year minus 2-year gold yield) was typically negative before the Civil War and positive after the late 1870s. Major wars caused sharp temporary decreases (inversions). The authors connect the sign switch to a change in long-run inflation dynamics documented in a companion paper (Payne et al. 2023b): long-run inflation was hard to predict before the Civil War and easier to predict after, suggesting gold bonds provided a better inflation hedge in the pre-war period (negative inflation-risk premium), which is consistent with asset pricing theory producing a downward-sloping yield curve. After the Civil War, as inflation became more predictable, the inflation-risk premium became positive and the yield curve turned upward-sloping.&lt;/p&gt;
&lt;h3 id="q11-what-did-the-national-banking-acts-seek-to-do-and-was-the-puzzle-of-bank-note-under-issuance-resolved"&gt;Q11. What did the National Banking Acts seek to do and was the puzzle of bank note under-issuance resolved?&lt;/h3&gt;
&lt;p&gt;The National Banking Acts (1862, 1863, 1865, 1866) authorized federally chartered banks to issue bank notes up to 90% of the par or market value of eligible US Treasury bonds deposited as collateral, subject to a 1% annual tax on notes outstanding (0.5% after 1900), compared to a 10% tax on state bank notes. The intended goals were to increase the supply of short-term liquid assets and to increase bank demand for long-term federal debt, thereby lowering long-term yields and eliminating the short-rate disconnect. A long-standing puzzle (Friedman-Schwartz, Cagan, Champ, Calomiris-Mason) held that yields on eligible Treasuries did not fall enough to equal the note tax rate, implying under-issuance. The paper&amp;rsquo;s analysis of the short-rate disconnect offers a resolution: if one focuses on the disconnect rather than the yield-tax spread, the National Banking Acts appear to have largely achieved their goals by the 1880s — but only after greenback-gold parity was restored, suggesting that currency devaluation risk had initially restrained bank note issuance, as hypothesized by Cagan (1965).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Dynamic Nelson-Siegel (DNS) model with stochastic volatility:&lt;/strong&gt; A parametric yield curve model (Diebold-Li 2006) parameterizing zero-coupon yields at each date as a function of three latent factors — level (L), slope (S), curvature (C) — following a driftless random walk. The paper extends this with time-varying shock volatilities (stochastic volatility) to allow the degree of information pooling across time periods to vary with institutional and wartime disruptions. Used here to handle cross-sectional sparsity in historical bond data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bond-specific pricing error:&lt;/strong&gt; A Gaussian pricing error with bond-specific standard deviation σ(i)_m (scaled by Macaulay duration) added to each bond&amp;rsquo;s observed price. Allows bonds with heterogeneous and idiosyncratic contract features (call options, conversion clauses) to inform a common discount function without distorting it, by automatically down-weighting &amp;ldquo;peculiar&amp;rdquo; bonds through higher estimated σ(i)_m.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Short-rate disconnect (liquidity premium):&lt;/strong&gt; The systematic difference between observed yields-to-maturity on bonds with less than one year to maturity and yields implied by a pricing kernel fitted on bonds with more than one year to maturity. Interpreted as a money-like convenience yield (liquidity premium) on short-term debt: when money-like assets are scarce, short-term bonds are overpriced (lower yields) relative to the term structure implied by longer maturities. Measured here as an out-of-sample fit residual from the DNS model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Denomination risk:&lt;/strong&gt; The risk that the unit of account in which bond payments are promised may change in value relative to gold. During the greenback era (1862–1878), bonds denominated in greenbacks carried denomination risk because greenbacks could depreciate against gold. The paper distinguishes denomination risk from default risk by estimating separate gold and greenback yield curves and modeling exchange rate expectations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nominal anchor:&lt;/strong&gt; The phenomenon in which long-run market expectations of the gold-to-greenback exchange rate remained anchored near gold parity (one-for-one) even during large short-run depreciations during the Civil War. Inferred from the observation that greenback-denominated bonds traded near par (yield ~6%) while the spot greenback depreciated by up to 60% against gold, implying investors anticipated eventual full appreciation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Default risk premium (US-UK yield spread):&lt;/strong&gt; The difference between yields on hypothetical gold-denominated US consols and yields on UK consols. Since both were on a gold standard (so inflation expectations are similar), and UK consols were the 19th-century benchmark safe asset, the spread is interpreted as the compensation investors demanded for the risk that the US might default or alter payment terms. Persistently positive until approximately 1905, then became negative.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Convenience yield:&lt;/strong&gt; An implicit yield that accrues to holders of money-like or safe assets because of their use in transactions or as collateral. In this paper, it emerges as the spread between yields on US federal bonds and other low-risk bonds in the late 19th century, reflecting increased demand for Treasuries as reserves under the National Banking System. Historically identified via the short-rate disconnect disappearing in the National Banking Era.&lt;/p&gt;</description></item><item><title>Customer accumulation, returns to scale, and secular trends</title><link>https://macropaperwarehouse.com/papers/customer-accumulation-returns-to-scale-and-secular-trends/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/customer-accumulation-returns-to-scale-and-secular-trends/</guid><description>&lt;p&gt;This paper asks how rising returns to scale in production contributed to three concurrent U.S. secular trends since 1980: declining business dynamism, rising markups, and growing firm expenditures on customer acquisition. The author constructs a firm dynamics model in the Hopenhayn (1992) tradition with endogenous entry and exit, heterogeneous markups, and customer accumulation grounded in directed search in the product market. Firms compete for customers through both prices and selling activities; larger firms gain a competitive edge when returns to scale rise because their marginal costs fall more than those of smaller firms—even though the technological shift is uniform across firms. This demand-based channel triggers winners-and-losers dynamics and the rise of superstar firms.&lt;/p&gt;
&lt;p&gt;The empirical foundation rests on Compustat data for U.S. publicly traded firms (1977–2014) and Business Dynamics Statistics (BDS) for aggregate and sector-level dynamism measures. Production-function estimation using Ackerberg, Caves, and Frazer (2015) augmented with sales-share controls documents that aggregate returns to scale rose from approximately 1.0 in 1980 to approximately 1.05 by 2014—a within-sector increase, not a reallocation effect. Over the same period, the cost-weighted markup rose by 42%, the firm entry rate fell by 33%, the excess reallocation rate fell by 29%, and selling costs relative to production costs rose by 60%–90% depending on the measure used.&lt;/p&gt;
&lt;p&gt;The model is calibrated to 1980 steady-state moments (firm life-cycle patterns, markups, entry and reallocation rates). A 5% increase in returns to scale—matching the empirical estimate—accounts for: a +15 percentage point rise in the average cost-weighted markup (vs. +42% in the data); a 33% decline in the entry rate (exactly matching the data); a 21% decline in the reallocation rate (vs. 29% in the data); and a 23% increase in selling costs relative to production costs (vs. 60%–90% in the data). The model also generates a 53% rise in the share of firms aged 11 years or older (vs. 50% in the data) and a 58% decline in the employment share of firms aged 5 years or younger (vs. 56% in the data), closely tracking the aging of the U.S. firm population. Firm-level responsiveness to productivity shocks declines by 0.08 in the model, versus about 0.01 in Compustat and 0.09 in Decker et al. (2020).&lt;/p&gt;
&lt;p&gt;Sector-level panel regressions with sector fixed effects confirm the model&amp;rsquo;s directional predictions: within-sector increases in returns to scale are associated with lower entry rates (coefficient −2.89, significant at 1%), lower reallocation rates (−1.16, significant at 1%), higher markups (+3.15, significant at 1%), and higher selling costs relative to production costs (+1.85 for the advertising-based measure; +8.52 for adjusted SG&amp;amp;A).&lt;/p&gt;
&lt;p&gt;A key scope condition is that the model yields a constrained-efficient allocation: directed search and full internalization of returns to scale imply decentralized equilibrium efficiency, making the paper a laboratory for assessing how far efficient firm responses to technological change can explain the secular trends without invoking market failures. The model fits the post-2000 transition dynamics better than the 1980s–1990s period, and explains a substantial but incomplete share of the trends, suggesting complementary—possibly inefficient—forces also contributed.&lt;/p&gt;
&lt;p&gt;Q: What is the core mechanism through which rising returns to scale generate winners-and-losers dynamics?&lt;/p&gt;
&lt;p&gt;A: The marginal cost of production under increasing returns to scale (alpha &amp;gt; 1) is MC(z,n) = l(n,z)^(1−alpha) × (1/alpha) × (W/e^z), which depends on firm size l(n,z). A uniform rise in alpha rotates the marginal cost schedule clockwise by firm size: larger firms see a proportionally larger cost reduction than smaller firms, even though the technological change is identical across all firms. Because firms compete for the same pool of customers, this asymmetric cost advantage allows large firms to offer lower prices while sustaining higher margins, attracting customers away from small firms. The result is a demand-based channel that generates winners-and-losers dynamics and increases market concentration.&lt;/p&gt;
&lt;p&gt;Q: How does the model capture customer accumulation, and why is it central to the paper&amp;rsquo;s argument?&lt;/p&gt;
&lt;p&gt;A: The model introduces directed search in the product market, where firms post advertisements and customers—including those already matched with a firm—choose which submarket to enter by trading off offered utility against matching probability. A constant-returns-to-scale matching function governs match creation; in submarket with tightness theta, customers match with probability m(theta) = theta(1+theta)^(−1) and firms attract customers with probability q(theta) = (1+theta)^(−1). The customer accumulation motive creates an investment-harvest trade-off: firms can either post high promised utility (low prices) to grow their customer base or extract surplus through high prices. Rising returns to scale amplify large firms&amp;rsquo; ability to resolve this trade-off favorably, linking the technological change directly to markup dynamics, entry incentives, and selling expenditures.&lt;/p&gt;
&lt;p&gt;Q: What is the directed search framework&amp;rsquo;s role in ensuring equilibrium uniqueness and efficiency?&lt;/p&gt;
&lt;p&gt;A: The author introduces firm-side commitment contracts—specifying price, separation probability, and continuation utility contingent on productivity realizations—combined with directed search. Because search is directed on both sides and firms fully internalize returns to scale, the decentralized equilibrium is constrained-efficient. This delivers uniquely determined heterogeneous prices in equilibrium (solving the indeterminacy problem common in customer-market models) and establishes the paper&amp;rsquo;s efficient-mechanism benchmark: it tests how far profit-maximizing firm responses to technological change—without any market failure—can account for the secular trends.&lt;/p&gt;
&lt;p&gt;Q: How are prices structured in the model, and what life-cycle pattern do they generate?&lt;/p&gt;
&lt;p&gt;A: Each firm charges two distinct prices in each period: one to incumbent customers (the same for all incumbents, since they are identical conditional on being attached to the same firm) and one to newly acquired customers (which varies based on the promised utility in the submarket searched). Firms that are expanding their customer base offer greater promised utility and therefore charge lower prices to attract customers; firms harvesting their existing base charge higher prices. Because firms enter small and grow, this dynamic generates a price life cycle: young firms invest via low prices and mature firms harvest through higher prices, which the model reproduces as a rising markup pattern over the firm life cycle—an untargeted moment the model fits well.&lt;/p&gt;
&lt;p&gt;Q: What does the calibration target and what untargeted moments does the model reproduce?&lt;/p&gt;
&lt;p&gt;A: The model is calibrated to 1980 using: the number of employees of entrant firms (pinning entry customer base n_e), employees of age-5 firms (pinning convex cost chi_1), share of firms aged 11+ years (pinning chi_2), average firm size (operating cost f), entry rate (entry cost kappa), excess reallocation rate (exit shock delta), and average cost-weighted markup (linear cost c). Untargeted moments reproduced include: a sales-weighted markup of 0.28 (vs. 0.25 in De Loecker et al. 2020), endogenous customer turnover of approximately 9% (vs. 15% in Gourio and Rudanko 2014), and an elasticity of customer base shrinkage to price of 0.08 (within the 0.01–0.16 range from Paciello et al. 2019). The model also matches markup and selling-cost life-cycle patterns that are typically overlooked.&lt;/p&gt;
&lt;p&gt;Q: How large is the quantitative contribution of the 5% rise in returns to scale to each secular trend?&lt;/p&gt;
&lt;p&gt;A: Comparing the 1980 steady state (alpha = 1) to the 2014 steady state (alpha = 1.05): the average cost-weighted markup rises by 15% in the model versus 42% in the data; the entry rate declines by 33% in the model, exactly matching the data; the reallocation rate declines by 21% in the model versus 29% in the data; and selling costs relative to production costs rise by 23% in the model versus 60%–90% in the data. The model thus explains a substantial share of each trend while leaving a residual requiring additional mechanisms.&lt;/p&gt;
&lt;p&gt;Q: How does the model explain the aging of U.S. firms, and how well does it match the data?&lt;/p&gt;
&lt;p&gt;A: The winners-and-losers mechanism shifts activity toward larger, older firms, which mechanically ages the firm population. The model generates a 53% increase in the share of firms aged 11 years or older (vs. 50% in the data) and a 58% decline in the employment share of firms aged 5 years or younger (vs. 56% in the data). This aging arises because rising returns to scale increase the cost of customer acquisition, acting as a barrier to entry that disproportionately hurts new, small firms while allowing large incumbents to remain viable at lower productivity thresholds.&lt;/p&gt;
&lt;p&gt;Q: What is the channel through which rising returns to scale reduce business dynamism specifically?&lt;/p&gt;
&lt;p&gt;A: The unequal reduction in marginal costs intensifies competition for customers and raises customer acquisition costs. This operates through two simultaneous effects on the exit threshold: (i) lower marginal costs allow large firms to remain viable at lower productivity levels despite higher customer acquisition costs; and (ii) heightened competition forces smaller firms to require higher productivity to survive in a market that has become increasingly costly to operate in. Higher customer acquisition costs therefore function as an endogenous barrier to entry, reducing the entry rate and the reallocation of resources across firms.&lt;/p&gt;
&lt;p&gt;Q: Does the model attribute the secular trends entirely to efficient firm behavior, and what does it conclude about residual explanations?&lt;/p&gt;
&lt;p&gt;A: No. The model is explicitly designed as a constrained-efficient benchmark, and the paper finds that while rising returns to scale account for a substantial share of the trends—particularly in magnitude—the transition dynamics show a less accurate fit before the 2000s. The author concludes that complementary mechanisms, likely involving inefficiencies (such as market power from horizontal product differentiation or barriers to entry beyond those captured by the model), played a significant role in the earlier evolution of these trends and in the portion of the trends not explained by the efficient channel.&lt;/p&gt;
&lt;p&gt;Q: What evidence supports the rising returns to scale finding, and what are its limitations?&lt;/p&gt;
&lt;p&gt;A: Production-function estimation using the Ackerberg-Caves-Frazer method with sales-share controls on Compustat data shows returns to scale rising from approximately 1.0 in 1980 to approximately 1.05 by 2014, driven primarily by within-sector increases rather than reallocation toward high-returns sectors. A translog production function finds limited evidence of heterogeneous increases across firm sizes within Compustat. However, Compustat predominantly covers large publicly traded firms; smaller firms outside the sample may have experienced minimal or no increase in returns to scale. If technology adoption involves fixed costs, the aggregate impact could be larger than estimated, meaning the quantitative exercises likely represent a conservative lower bound.&lt;/p&gt;
&lt;p&gt;Q: How does the paper relate to and extend the directed search literature in product markets?&lt;/p&gt;
&lt;p&gt;A: The paper builds on Gourio and Rudanko (2014) and Roldan-Blanco and Gilbukh (2020), where customers are locked in once matched, by introducing labor-search tools from Schaal (2017) to allow: (i) incumbent customer switching between firms at rates of 10%–25% annually (Gourio and Rudanko 2014), and (ii) a non-zero price sensitivity of incumbent customers (Paciello et al. 2019). It also allows firms to invest in demand through selling expenditures, which prior directed search models in product markets typically abstracted from, making it possible to study how technological changes affect customer reallocation and firms&amp;rsquo; cost structures jointly.&lt;/p&gt;
&lt;p&gt;Customer capital: The stock of customers a firm has accumulated through prior selling and pricing decisions; treated as a state variable that firms invest in (by offering low prices and spending on advertisements) or harvest from (by charging high markups), with a customer turnover rate estimated at 10%–25% annually in the literature.&lt;/p&gt;
&lt;p&gt;Directed search in the product market: A market structure in which both firms and customers choose which submarket (indexed by the promised utility level) to enter, trading off match probability against terms; delivers constrained-efficient equilibrium and uniquely determined heterogeneous prices.&lt;/p&gt;
&lt;p&gt;Investment-harvest trade-off: The firm&amp;rsquo;s dynamic choice between offering high promised utility (low prices, low current markups) to grow the customer base versus extracting surplus through high prices from an existing customer base; shaped by the firm&amp;rsquo;s current size, productivity, and the cost structure implied by returns to scale.&lt;/p&gt;
&lt;p&gt;Returns to scale (alpha): The curvature of the production function y = e^z × l^alpha; equals 1.0 under constant returns and approximately 1.05 by 2014 in the empirical estimates; the paper&amp;rsquo;s central technological change parameter, whose rise disproportionately reduces marginal costs for larger firms.&lt;/p&gt;
&lt;p&gt;Winners-and-losers dynamics: The reallocation of customers and market share from small to large firms triggered by the asymmetric cost advantage large firms obtain when returns to scale rise; the demand-based channel through which superstar firms emerge.&lt;/p&gt;
&lt;p&gt;Cost-weighted markup: The average markup aggregated using each firm&amp;rsquo;s costs as weights, as opposed to sales-weighted markup; the primary measure of market power used in the paper, rising by 42% in the data between 1980 and 2014.&lt;/p&gt;
&lt;p&gt;Constrained-efficient allocation: An equilibrium outcome in which, given the frictions present (search-and-matching in the product market), no social planner operating under the same constraints could improve welfare; the paper uses this as a benchmark to assess how far efficient firm responses explain secular trends without invoking market failures.&lt;/p&gt;
&lt;p&gt;Selling costs relative to production costs: The ratio of customer acquisition expenditures (advertising or adjusted SG&amp;amp;A) to cost of goods sold; rose by 60%–90% in the data between 1980 and 2014 and by 23% in the model&amp;rsquo;s steady-state comparison.&lt;/p&gt;</description></item><item><title>Default Options and Retirement Saving Dynamics</title><link>https://macropaperwarehouse.com/papers/default-options-and-retirement-saving-dynamics/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/default-options-and-retirement-saving-dynamics/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; Does automatic enrollment (auto-enrollment) in retirement savings plans increase lifetime wealth accumulation and welfare? The prior literature established large short-run participation effects but had not traced the policy&amp;rsquo;s consequences over a full working life.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The paper draws on two primary sources. First, a proprietary panel of 401(k) administrative records from nearly 600 U.S. firms, covering roughly 159,216 first-year employees across 86 firms (for the &amp;ldquo;increasing default&amp;rdquo; fact) and 6,415 employees across 34 firms (for structural estimation), observed between December 2006 and December 2017. Second, 12 successive waves (2006–2017) of the U.K. Annual Survey of Hours and Earnings (ASHE), a 1% nationally representative panel of approximately 200,000 private-sector employees per year, including 37,120 job-switchers, used to exploit the phased rollout of the U.K. Pension Act of 2008.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology.&lt;/strong&gt; The paper proceeds in three steps. (1) Three empirical stylized facts are documented using quasi-experimental variation (comparing employees hired before versus after changes in the default contribution rate within the same firm, and exploiting the staggered employer-size-based rollout of U.K. auto-enrollment). (2) A structural lifecycle model is estimated via the Method of Simulated Moments, using three preference parameters—intertemporal discount factor (δ), elasticity of intertemporal substitution (σ), and opt-out cost (k)—identified from the within-firm default variation in 34 U.S. firms. (3) The estimated model is used for out-of-sample validation and counterfactual welfare analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Three stylized facts.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Fact I — Increasing the default reduces participation.&lt;/em&gt; Among 159,216 first-year employees in 86 auto-enrollment firms, each percentage-point increase in the default contribution rate reduces 401(k) participation by approximately 1 percentage point and increases contributions strictly below the new default by 1 percentage point. When the default rose from 3% to 6%, workers were 3.2 percentage points more likely to contribute at 1% or 2% of salary. This &amp;ldquo;drop-out&amp;rdquo; pattern is consistent with an opt-out cost model but is inconsistent with loss-aversion and psychological-anchoring theories, both of which predict that raising the default should weakly increase low-end contributions.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Fact II — Non-autoenrolled workers catch up within three years.&lt;/em&gt; In the estimation sample of 34 U.S. firms offering a 50% match up to 6% and an auto-enrollment default of 3%, median cumulative employee 401(k) contributions of non-autoenrolled workers equal those of autoenrolled workers after three years of tenure. Because non-autoenrolled workers compensate for initial non-participation by contributing more later—earning similar cumulative employer match and tax benefits over the full three-year horizon—a modest opt-out cost suffices to explain the observed inertia. Previous studies (which examined only the first year of tenure and did not allow future contribution adjustment) inferred opt-out costs of $1,000–$2,200 or more; the dynamic model implies a cost of only approximately $250.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Fact III — Prior auto-enrollment reduces saving in the next job.&lt;/em&gt; Using the phased U.K. policy rollout, workers who were auto-enrolled in their previous job and then move to a new employer that has not yet implemented auto-enrollment participate 12.8 percentage points less and contribute 0.55% of salary less in the new plan relative to otherwise similar job-switchers from non-auto-enrollment employers. When the new employer also has auto-enrollment, no statistically significant difference is observed. Placebo rollout tests confirm the effect is not a pre-existing selection pattern. This negative spillover contradicts a &amp;ldquo;savings habit&amp;rdquo; hypothesis and suggests that auto-enrollment&amp;rsquo;s short-run boost overstates lifetime savings effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Structural estimation results.&lt;/strong&gt; The estimated quarterly discount factor is δ = 0.987 (approximately 0.949 annually), and the elasticity of intertemporal substitution is σ = 0.435, both standard in lifecycle models. The opt-out cost is estimated at &lt;strong&gt;$254&lt;/strong&gt; per contribution-rate change (standard error $11). Sensitivity exercises show that combining a short observation window (first year only), sticky contributions (no intra-job adjustment), no income uncertainty, immediate vesting, and penalty-free DC withdrawals yields an opt-out cost of $3,004—broadly matching the range in previous studies. The low baseline estimate is thus driven by the dynamic nature of decisions (ability to compensate later), the illiquidity of retirement accounts (which reduces their perceived value), and income uncertainty (which expands the inaction range).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Long-run wealth effects.&lt;/strong&gt; Simulating a universal 3% auto-enrollment policy, the model predicts that &lt;strong&gt;wealth at retirement changes by less than 2% for the top 7 income deciles&lt;/strong&gt;. For individuals in the top two deciles, total wealth at age 65 is actually reduced by less than 1% because many high earners who would voluntarily contribute above 3% are pulled down to the default. At the &lt;strong&gt;bottom decile&lt;/strong&gt;, however, auto-enrollment raises total retirement wealth by more than &lt;strong&gt;12%&lt;/strong&gt;; savings increases are concentrated in the first 20 years of working life and peak around age 45, where bottom-quintile workers hold an additional 20% of average annual lifetime earnings. Even at the bottom, approximately one-third of the early savings gains are offset by lower contributions after age 45, as the wealth effect dominates. Crowd-out of liquid savings is limited: for bottom-quintile individuals, &lt;strong&gt;89%&lt;/strong&gt; of the increase in retirement savings at age 65 passes through to total wealth; for middle-quintile individuals, &lt;strong&gt;62%&lt;/strong&gt; passes through.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Out-of-sample validation.&lt;/strong&gt; The U.S.-estimated model is not rejected (at the 10% level) in 8 of 11 response moments in the 86-firm sample where defaults were raised between two positive rates, covering over 85% of workers. Recalibrated to U.K. institutions (using δ and σ from the U.S. and k = £160 via the average USD/GBP exchange rate), the model replicates the roughly 30-percentage-point increase in both participation and contributions at the 1% U.K. default. The model also predicts a 9.6-percentage-point drop in participation when workers move from an auto-enrollment to an opt-in employer, close to the empirical 12.8 percentage points.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Welfare and optimal policy.&lt;/strong&gt; Under utilitarian preferences (policymaker shares individuals&amp;rsquo; discount rate, no redistributive motive), the opt-in regime is always preferred to auto-enrollment regardless of policy incidence, because matching and tax incentives already induce over-saving relative to individuals&amp;rsquo; revealed time preferences. Under &lt;strong&gt;paternalistic&lt;/strong&gt; preferences (social discount factor = 1) or &lt;strong&gt;inequality-averse&lt;/strong&gt; preferences (Pareto weights inversely proportional to income, with degree of inequality aversion ν = 1 following Saez 2002), an auto-enrollment default at or near the employer matching threshold (6% of income) maximizes social welfare. A 6% auto-enrollment default improves welfare by 0.3% in lifetime consumption-equivalent for the bottom decile even under a utilitarian policymaker when incidence is on employers. These optimal policy rankings are robust to whether the opt-out cost is treated as fully welfare-relevant (π = 1) or welfare-irrelevant (π = 0), and hold under three incidence scenarios (employer profit reduction, match-rate adjustment, wage adjustment).&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-mechanism-by-which-non-autoenrolled-workers-catch-up-at-the-median-and-why-does-this-reduce-the-implied-opt-out-cost-relative-to-prior-estimates"&gt;Q1. What is the core mechanism by which non-autoenrolled workers &amp;ldquo;catch up&amp;rdquo; at the median, and why does this reduce the implied opt-out cost relative to prior estimates?&lt;/h3&gt;
&lt;p&gt;A: Non-autoenrolled workers who do not contribute in their first year are not permanently forgoing employer matching and tax benefits; they can contribute more later in the same job and earn similar cumulative benefits. The paper shows that at the median and 75th percentile, cumulative employee 401(k) contributions among opt-in workers equal those of autoenrolled workers after three years of tenure in 34 U.S. firms offering a 50%-up-to-6% match at a 3% default. This dynamic substitutability means the opportunity cost of initial non-participation is far smaller than one-period back-of-the-envelope calculations suggest. Previous studies, which implicitly or explicitly assumed static contribution decisions or examined only the first year, inferred opt-out costs of $1,000–$2,200; in a fully dynamic model the same inertia requires only ~$254.&lt;/p&gt;
&lt;h3 id="q2-why-does-fact-i-higher-default-reduces-participation-specifically-rule-out-loss-aversion-and-anchoring-as-the-primary-mechanism-and-what-does-it-support-instead"&gt;Q2. Why does Fact I (higher default reduces participation) specifically rule out loss aversion and anchoring as the primary mechanism, and what does it support instead?&lt;/h3&gt;
&lt;p&gt;A: Under loss aversion, contributions above the default feel like losses while contributions below the default feel like gains. Raising the default shifts some contributions from the loss domain into the gain domain, making low contributions relatively less attractive. Proposition 2 demonstrates formally that loss-averse preferences predict a weakly lower fraction contributing below the new (higher) default — the opposite of what is observed. Similarly, Proposition 3 shows that psychological anchoring shifts preferences toward the new default, also predicting more participation at low rates when the default rises. Only the opt-out cost model (Proposition 1) predicts that a higher default causes some workers to incur the cost to switch &lt;em&gt;away&lt;/em&gt; from the default and end up at lower contribution rates, matching the empirical finding that each 1-percentage-point rise in the default increases contributions strictly below the old default by approximately 1 percentage point.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-quantitative-magnitude-of-the-opt-out-cost-and-what-modeling-assumptions-are-responsible-for-it-being-much-smaller-than-prior-estimates"&gt;Q3. What is the quantitative magnitude of the opt-out cost, and what modeling assumptions are responsible for it being much smaller than prior estimates?&lt;/h3&gt;
&lt;p&gt;A: The baseline estimate is $254 per contribution-rate change (s.e. $11), roughly an order of magnitude smaller than prior estimates of $1,000–$3,000+. Table 4 decomposes the sources of the difference: using only first-year data changes the estimate only slightly (to $226). Assuming contributions cannot be changed within a job (&amp;ldquo;sticky contributions&amp;rdquo;) raises the cost to $308 with four years of data or $712 with one year of data. Eliminating income uncertainty raises the estimate to $465. Assuming immediate vesting raises it to $344. Assuming penalty-free DC withdrawals raises it to $609. Combining all these restrictions simultaneously yields $3,004 — closely matching the prior literature. The three key drivers are thus: (1) the ability to adjust contributions over time within a job; (2) the illiquidity of the DC account (early-withdrawal penalties); and (3) income uncertainty widening the inaction range.&lt;/p&gt;
&lt;h3 id="q4-how-does-the-paper-validate-the-structural-model-out-of-sample-and-what-confidence-does-this-provide-in-the-long-run-predictions"&gt;Q4. How does the paper validate the structural model out of sample, and what confidence does this provide in the long-run predictions?&lt;/h3&gt;
&lt;p&gt;A: Two out-of-sample exercises are reported. First, the model estimated on 34 U.S. firms (introduction of auto-enrollment from 0% to 3% default) is used to predict workers&amp;rsquo; response when 86 other firms raised the default from one positive rate to a higher rate. The model prediction cannot be rejected at the 10% level in 8 of 11 response-moment cases, covering 71 of 86 firms and more than 85% of workers. Second, the model is re-calibrated to U.K. institutions (keeping U.S. preference estimates, setting k = £160 via exchange rate) and applied to the phased rollout of the U.K. Pension Act of 2008. The model replicates the roughly 30-percentage-point increase in both participation and contributions at the 1% default following the policy, and predicts a 9.6-percentage-point drop in participation when previously autoenrolled workers move to a new opt-in employer — compared with an empirical estimate of 12.8 percentage points (s.e. 5.5 pp).&lt;/p&gt;
&lt;h3 id="q5-what-are-the-distributional-implications-of-a-universal-3-auto-enrollment-policy-for-wealth-at-retirement"&gt;Q5. What are the distributional implications of a universal 3% auto-enrollment policy for wealth at retirement?&lt;/h3&gt;
&lt;p&gt;A: The effect is concentrated at the bottom. For the top 7 income deciles, retirement wealth at age 65 changes by less than 2% relative to the opt-in counterfactual. For the top two deciles, total wealth at age 65 is actually reduced by less than 1% because high-earning workers who would voluntarily contribute above 3% are pulled down to the default. For the bottom decile, the policy raises total retirement wealth by more than 12%. Even at the bottom, roughly one-third of the early savings gains are later offset by lower contributions after age 45 as the wealth effect dominates, so even 20-year empirical follow-ups may overstate the policy&amp;rsquo;s lifetime effect at the bottom.&lt;/p&gt;
&lt;h3 id="q6-how-large-is-crowd-out-of-liquid-savings-by-auto-enrollment-and-what-explains-the-limited-degree-of-substitution"&gt;Q6. How large is crowd-out of liquid savings by auto-enrollment, and what explains the limited degree of substitution?&lt;/h3&gt;
&lt;p&gt;A: Crowd-out is modest. For bottom-quintile workers, 89% of the increase in retirement savings at age 65 translates into higher total wealth; for middle-quintile workers, 62% passes through. The limited crowd-out arises because liquid assets serve a precautionary motive and DC accounts serve a lifecycle motive — the two assets are not close substitutes. Additionally, as in Kaplan and Violante (2014), the marginal propensity to consume out of liquid assets is high in the model, so autoenrolled workers reduce consumption rather than run down liquid balances. These predictions align with Beshears et al. (2021), who find no significant increase in unsecured debt after four years, and Chetty et al. (2014), who estimate an 80% pass-through to total savings in a different Danish policy.&lt;/p&gt;
&lt;h3 id="q7-why-do-previously-autoenrolled-workers-contribute-less-when-they-switch-to-an-opt-in-employer-and-how-is-this-consistent-with-the-model"&gt;Q7. Why do previously autoenrolled workers contribute less when they switch to an opt-in employer, and how is this consistent with the model?&lt;/h3&gt;
&lt;p&gt;A: The most plausible explanation, and the one consistent with the model&amp;rsquo;s out-of-sample predictions, is a standard wealth effect: workers auto-enrolled early accumulate more retirement wealth and therefore have less incentive to contribute in a new job. The model predicts a 9.6-percentage-point participation drop for AE-to-non-AE movers, close to the empirical 12.8 pp. An alternative explanation — that previously autoenrolled workers rationally expect their new employer to soon adopt auto-enrollment and thus delay active enrollment — is partially ruled out by the finding that the empirical estimate is closer to the model prediction for job-switchers whose new employer is not expected to adopt auto-enrollment in the next 12 months.&lt;/p&gt;
&lt;h3 id="q8-what-are-the-welfare-implications-of-auto-enrollment-under-utilitarian-paternalistic-and-inequality-averse-policymakers-and-how-robust-are-these-to-the-incidence-assumption"&gt;Q8. What are the welfare implications of auto-enrollment under utilitarian, paternalistic, and inequality-averse policymakers, and how robust are these to the incidence assumption?&lt;/h3&gt;
&lt;p&gt;A: Under utilitarian preferences (policymaker shares individuals&amp;rsquo; discount factor, no extra redistributive weight), the opt-in regime is always preferred regardless of whether the policy&amp;rsquo;s cost falls on employer profits, the match rate, or wages. The negative welfare effect is largest when incidence falls on wages (approximately 50% larger than under match-rate reduction). Under paternalistic preferences (social discount factor = 1), a 6% default (equal to the employer matching threshold) is optimal under all three incidence scenarios. Under inequality-averse preferences (ν = 1 Pareto weights), a 6% default is optimal when incidence falls on employers, and a 5% default when incidence falls on workers. These results are identical whether the opt-out cost is treated as fully welfare-relevant (π = 1) or welfare-irrelevant (π = 0). A 6% auto-enrollment default increases welfare by 0.3% in lifetime consumption-equivalent for the bottom income decile even under a utilitarian planner when incidence is on employers.&lt;/p&gt;
&lt;h3 id="q9-how-does-the-paper-address-heterogeneity-in-default-effects-across-age-and-income-groups-within-a-parsimonious-homogeneous-preference-model"&gt;Q9. How does the paper address heterogeneity in default effects across age and income groups within a parsimonious homogeneous preference model?&lt;/h3&gt;
&lt;p&gt;A: The model has only three estimated preference parameters (δ, σ, k), yet it endogenously replicates empirical heterogeneity. Conditional on participating, workers in their 20s are approximately 20 percentage points more likely to stay at the 3% default than workers in their late 50s and early 60s; the model attributes this to the option value of waiting: young workers can compensate for current non-saving by contributing more later, so the cost of opting out is effectively smaller for them. The lowest-income workers are approximately 40 percentage points more likely to remain at the default than the highest-paid; the model explains this primarily because the fixed opt-out cost of $254 represents a larger share of earnings for low-income individuals (and secondarily because high-income workers have more to gain from active contribution decisions due to higher marginal tax rates and a lower Social Security replacement rate). All model-predicted coefficients fall within the 95% confidence intervals of the empirical estimates.&lt;/p&gt;
&lt;h3 id="q10-what-does-the-paper-conclude-about-the-broader-relevance-of-the-dynamic-opt-out-cost-framework-beyond-retirement-saving"&gt;Q10. What does the paper conclude about the broader relevance of the &amp;ldquo;dynamic opt-out cost&amp;rdquo; framework beyond retirement saving?&lt;/h3&gt;
&lt;p&gt;A: The paper argues that wherever individuals can compensate for present inaction with future actions — as in retirement saving — the observed inertia at a default understates the freedom of choice preserved by the nudge, and short-run effects overstate long-term consequences. In contrast, in domains such as healthcare plan choice or school selection, future actions cannot easily offset present inertia; opt-out costs are likely to remain large; and the distinction between a nudge and a hard mandate collapses. The paper therefore argues that the appeal of &amp;ldquo;libertarian paternalism&amp;rdquo; (Thaler and Sunstein 2003) is domain-specific and is strongest precisely where intertemporal adjustment is possible.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Opt-out cost (k).&lt;/strong&gt; In this paper, a utility cost — estimated at $254 per contribution-rate change — that individuals must pay every time they choose a retirement contribution rate different from the current default. The cost is modeled as a consumption reduction and captures both real transaction costs (form-filling, adviser fees) and behavioral costs (cognitive cost of attention and optimal-choice search). It is fixed and homogeneous across individuals, and applies symmetrically in any direction of deviation from the default.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Auto-enrollment default contribution rate.&lt;/strong&gt; The positive contribution rate at which new hires are automatically enrolled in a defined-contribution plan, with the option to opt out by incurring the opt-out cost. In the paper&amp;rsquo;s estimation sample, this is 3% of salary. The default is exogenous at the start of each new job but endogenous thereafter: once established, the default for subsequent periods equals the worker&amp;rsquo;s contribution rate in the previous period.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Default eﬀect.&lt;/strong&gt; The empirically observed tendency of workers to remain at the default contribution rate rather than actively choosing a different rate. In this paper, the default effect is explained by opt-out costs rather than loss aversion or psychological anchoring — a distinction identified through the novel prediction that raising the default from a positive rate to a higher positive rate reduces overall participation (the &amp;ldquo;drop-out&amp;rdquo; effect), a pattern consistent only with opt-out costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Drop-out eﬀect.&lt;/strong&gt; The paper&amp;rsquo;s term (following Caplin and Martin 2017) for the empirical finding that increasing the auto-enrollment default contribution rate causes some workers to stop contributing altogether or to contribute at rates strictly below the initial default. This effect is used as a discriminating test between competing theories of the default effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dynamic opt-out cost framework.&lt;/strong&gt; The paper&amp;rsquo;s core modeling insight: that opt-out costs must be estimated in a fully dynamic lifecycle model that allows workers to adjust contributions over time, to hold liquid assets and unsecured debt, and to face labor market risk. In a static or short-horizon model, the opportunity cost of initial non-participation appears large (because the worker permanently forgoes match and tax benefits), requiring large opt-out costs. In the dynamic model, the ability to compensate later shrinks the implied opportunity cost and hence the opt-out cost required to rationalize observed inertia.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Crowd-out of liquid savings.&lt;/strong&gt; The extent to which higher DC retirement contributions induced by auto-enrollment reduce liquid asset holdings (or increase unsecured borrowing), rather than increasing total wealth. The paper estimates limited crowd-out (89% pass-through to total wealth for bottom-quintile workers, 62% for middle-quintile workers), attributable to the different roles of liquid assets (precautionary motive) and DC accounts (lifecycle motive) in the model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy incidence.&lt;/strong&gt; The channel through which employers balance their budget in response to higher matching costs created by auto-enrollment. The paper considers three scenarios: employers absorb costs through reduced profits; employers reduce the match rate; employers reduce wages. Optimal policy rankings and welfare magnitudes differ across these scenarios, but the qualitative conclusions — utilitarian policymaker prefers opt-in; paternalistic or inequality-averse policymaker prefers AE at 6% — are robust across incidence assumptions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption-equivalent variation (γ).&lt;/strong&gt; The welfare metric used in the paper: the proportional increase in consumption in every period and every state of the world that would make the policymaker indifferent between an auto-enrollment policy at default d and the opt-in regime. A 6% default increases welfare by 0.3% in consumption-equivalent for the bottom income decile under a utilitarian policymaker when incidence is on employers.&lt;/p&gt;</description></item><item><title>Designing Disability Insurance Reforms: Tightening Eligibility Rules or Reducing Benefits?</title><link>https://macropaperwarehouse.com/papers/designing-disability-insurance-reforms-tightening-eligibility-rules-or-reducing-benefits/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/designing-disability-insurance-reforms-tightening-eligibility-rules-or-reducing-benefits/</guid><description>&lt;p&gt;This paper develops a sufficient statistics framework for the welfare analysis of disability insurance (DI) policy reforms and applies it to two reform episodes in Austria. The framework derives social optimality conditions for the two main DI policy instruments — eligibility rules and benefit levels — expressed in terms of estimable reduced-form objects (fiscal multipliers and insurance losses). The fiscal multiplier of a DI policy instrument is defined as the ratio of total fiscal cost savings to the mechanical (counterfactual-behavior-held-fixed) fiscal cost savings; it measures how much the program shrinks per dollar mechanically removed, and values above 1 indicate behavioral crowd-out of DI enrollment. The paper then evaluates two Austrian reforms: (1) a 2013 increase in the Rehabilitation Stricter Assessment (RSA) age threshold from 57 to 58 (and separately to 59), which tightened eligibility for DI applicants aged 57 by requiring them to demonstrate inability to be retrained for alternative work; and (2) a 2003 reform that reduced DI benefit generosity for workers aged 30–60 as a side effect of a pension reform. Using difference-in-differences with cohorts just above and below the relevant thresholds, the paper finds that the RSA reform generated a fiscal multiplier of 2.50 (RSA to 58) and 2.05 (RSA to 59), while the benefit reduction generated a multiplier of only 1.41 (ages 57–60) and 1.36 (ages 30–56). The large gap implies that for a given mechanical cost saving, tighter eligibility rules generate 1.8 times more total fiscal savings than benefit cuts. The paper further provides empirical evidence that the insurance losses associated with stricter eligibility rules are, in all likelihood, smaller than those from benefit reductions, strengthening the dominance of eligibility tightening over benefit cuts as a DI reform instrument.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-sufficient-statistics-framework-and-what-does-it-deliver"&gt;Q1. What is the sufficient statistics framework and what does it deliver?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper derives social optimality conditions for two DI policy instruments — tighter eligibility rules and lower benefits — in terms of two sufficient statistics: the fiscal multiplier (total fiscal savings / mechanical fiscal savings) and the insurance loss (marginal utility of consumption of the affected recipients).&lt;/strong&gt; An eligibility reform that tightens the threshold θ from θ* to θ* + dθ is welfare-improving if and only if the fiscal multiplier exceeds the social value of one dollar in the hands of the marginally excluded applicant; a benefit cut from b to b − db is welfare-improving if the multiplier exceeds the social value of one dollar held by the average current DI recipient. Because the fiscal multiplier is estimable from reduced-form variation and the insurance loss gives the welfare benchmark, the framework converts the welfare question into: &amp;ldquo;Is the multiplier large enough relative to the insurance value?&amp;rdquo;&lt;/p&gt;
&lt;h3 id="q2-how-is-the-fiscal-multiplier-decomposed-and-why-does-this-decomposition-matter"&gt;Q2. How is the fiscal multiplier decomposed, and why does this decomposition matter?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The fiscal multiplier equals 1 + B/M where B is the behavioral fiscal effect (savings from deterred applications and enrollment) and M is the mechanical fiscal effect (savings from unchanged behavior on the affected population); the multiplier exceeds 1 whenever the behavioral response amplifies the direct savings.&lt;/strong&gt; The decomposition matters because the behavioral effect operates through marginal applicants (who apply only under lenient rules) while the mechanical effect operates through always-applicants (who apply regardless). These groups have different characteristics: in the Austrian data, marginal applicants are more likely to be employed at age 56 (73% vs. 60% for always-applicants) and more likely to be blue-collar workers with musculoskeletal impairments, while always-applicants are more likely to be on sick leave — a proxy for genuine disability. Confusing the two groups would misidentify who bears the insurance loss.&lt;/p&gt;
&lt;h3 id="q3-how-is-the-mechanical-fiscal-effect-identified-when-marginal-and-always-applicants-cannot-be-directly-observed"&gt;Q3. How is the mechanical fiscal effect identified when marginal and always-applicants cannot be directly observed?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper exploits previously-rejected DI applicants (those who filed applications between ages 50–56 and were rejected) as a proxy group for always-applicants: these individuals qualify as always-applicants by revealed preference (they applied under strict rules), and the paper shows that their DI benefit receipt and net fiscal expenditures after a simulated age-57 application are statistically indistinguishable from those of all-age-57 applicants in the whole population.&lt;/strong&gt; The mechanical fiscal effect per always-applicant in the whole population is estimated as M = 5,585 Euro × 0.070 = 391 Euro per capita (the product of the mechanical effect among pre-57 re-applicants and the share of always-applicants in the population, πAA = 0.070). The behavioral fiscal effect is then computed as the residual: B = total fiscal savings – M = 976 − 391 = 585 Euro, yielding the multiplier of 2.50.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-reduced-form-effects-of-the-rsa-reform-on-di-enrollment-and-employment"&gt;Q4. What are the reduced-form effects of the RSA reform on DI enrollment and employment?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A one-year increase in the RSA from age 57 to 58 reduces DI inflow by approximately 21 percentage points at age 57 (relative to a control-group mean), increases employment by roughly 15 percentage points, increases other benefit receipt (unemployment insurance and social assistance) by about 16 percentage points, and generates net fiscal cost savings of approximately 976 Euro per person per year in the two years after the reform.&lt;/strong&gt; The pattern of employment and benefit substitution shows that the behavioral response is substantial: a large share of those deterred from DI enrollment at 57 transition to employment or other social benefits rather than remaining without any income support, which is why the fiscal multiplier of 2.50 substantially exceeds 1.&lt;/p&gt;
&lt;h3 id="q5-what-are-the-effects-of-the-2003-di-benefit-generosity-reduction"&gt;Q5. What are the effects of the 2003 DI benefit generosity reduction?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A 1-percentage-point reduction in DI benefit generosity for the ages 57–60 cohort produces a behavioral fiscal effect of 18.69 Euro per year (through reduced DI inflow and application), a mechanical fiscal effect of 45.16 Euro per year (1% of the pre-reform mean benefit expenditure of 4,516 Euro among those aged 57–60), and a total fiscal effect of 63.85 Euro — yielding a multiplier of 1.41.&lt;/strong&gt; For the younger cohort (ages 30–56), the multiplier is 1.36, with a behavioral effect of 1.18 Euro and mechanical effect of 3.24 Euro per year. The lower multipliers for benefit cuts relative to eligibility tightening reflect the fact that benefit reductions affect all current recipients uniformly (generating large mechanical savings) rather than targeting a group with a strong behavioral response at the margin.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-paper-compare-the-insurance-losses-of-the-two-di-instruments"&gt;Q6. How does the paper compare the insurance losses of the two DI instruments?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper derives a sufficient condition under which the insurance loss from tighter eligibility rules is smaller than the insurance loss from benefit cuts: it requires that the per-dollar income loss borne by the marginally excluded applicant (upper-bounded by their DI benefit minus available social welfare benefits) is weakly smaller than the per-dollar income loss of current recipients (lower-bounded by the benefit reduction itself), evaluated within each income quintile.&lt;/strong&gt; Implementing this condition empirically using the Austrian income data, the paper finds that the income losses borne by marginally excluded applicants fall short of those borne by current recipients at all income quintile comparisons — meaning tighter eligibility rules both generate higher fiscal multipliers and impose smaller insurance losses than benefit cuts, making eligibility tightening the dominant instrument when the goal is to reduce DI program costs.&lt;/p&gt;
&lt;h3 id="q7-what-welfare-conclusion-follows-from-combining-the-fiscal-multipliers-with-the-insurance-benchmark"&gt;Q7. What welfare conclusion follows from combining the fiscal multipliers with the insurance benchmark?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Combining the multiplier estimates with hand-to-mouth CRRA assumptions as an upper-bound calculation, the RSA increase to 58 is welfare-improving if the coefficient of relative risk aversion of affected DI recipients is below 2.8; the RSA increase to 59 is welfare-improving if risk aversion is below 2.2.&lt;/strong&gt; The corresponding critical risk aversion level for the benefit cut (ages 57–60) is 1.1 — below the range typically estimated in the literature for low-income individuals — suggesting the benefit cut was likely welfare-reducing while the eligibility reform was likely welfare-improving. For a given dollar of mechanical budget reduction, stricter eligibility rules generate 1.8 times the total fiscal savings (= 2.50 / 1.41) relative to benefit cuts.&lt;/p&gt;
&lt;h3 id="q8-what-is-the-complier-analysis-and-what-does-it-reveal-about-who-is-affected-by-each-instrument"&gt;Q8. What is the complier analysis and what does it reveal about who is affected by each instrument?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Using the complier analysis method adapted for difference-in-differences settings, the paper estimates that the RSA-58 reform affects three types of individuals in the age-57 population: marginal applicants (πMA = 0.014, who apply only under lenient rules), always-applicants (πAA = 0.070, who apply regardless), and never-applicants (πNA = 0.916, who never apply).&lt;/strong&gt; Marginal applicants differ from always-applicants in that they are more likely to be employed at age 56 (73% vs. 60%) and less likely to be on sick leave; they are more likely to apply with musculoskeletal impairments than with mental impairments — consistent with these workers facing the largest relaxation in disability eligibility when reaching the RSA and being on the borderline of eligibility under strict rules.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;fiscal multiplier of a DI instrument&lt;/strong&gt; : total fiscal cost savings divided by mechanical fiscal cost savings from that instrument; equals 1 + B/M where B is the behavioral savings (from deterred applications) and M is the mechanical savings (from unchanged behavior); values above 1 indicate behavioral crowd-out and are the policy-relevant benchmark against which insurance losses must be compared.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;mechanical fiscal effect (M)&lt;/strong&gt; : the fiscal cost savings that would accrue if DI application and enrollment behavior were held fixed at pre-reform levels; for eligibility tightening, this equals the DI benefits that would have been paid to always-applicants who are now rejected; identified using the subpopulation of previously-rejected DI applicants as a proxy for always-applicants.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;behavioral fiscal effect (B)&lt;/strong&gt; : the additional fiscal savings generated by deterred applications and enrollment that result from the reform; equals total fiscal savings minus the mechanical fiscal effect; operates through marginal applicants who adjust their application behavior in response to stricter rules or lower benefits.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;always-applicants&lt;/strong&gt; : individuals who apply for DI regardless of whether eligibility rules are strict or lenient; they bear the mechanical cost of eligibility tightening (being rejected under stricter rules); in the Austrian data, their population share at age 57 is estimated at 7.0%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;rehabilitation stricter assessment (RSA) age&lt;/strong&gt; : the Austrian policy threshold above which DI applicants are evaluated under more lenient standards that do not require demonstration that the applicant can be retrained for alternative work; increasing the RSA age from 57 to 58 subjects the age-57 cohort to stricter evaluation criteria.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;insurance loss&lt;/strong&gt; : the welfare cost to DI recipients or excluded applicants from the income reduction caused by a DI reform; the right-hand side of the social optimality condition; the paper bounds it using income losses by income quintile rather than requiring utility function assumptions.&lt;/p&gt;</description></item><item><title>Digital Distractions with Peer Influence</title><link>https://macropaperwarehouse.com/papers/digital-distractions-with-peer-influence/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/digital-distractions-with-peer-influence/</guid><description>&lt;p&gt;This paper estimates the causal effects of mobile app usage on college students&amp;rsquo; academic performance, physical health, and labor market outcomes, while separately identifying behavioral (endogenous) and contextual (exogenous) peer effects in app usage — the first study to do so within a unified empirical framework. The analysis draws on administrative data for three freshman cohorts (2018–2020) at a mid-tier Chinese university, linked to individual-level mobile phone usage records from a major telecommunications carrier covering 6,430 students over four years (excluding COVID semester). High-frequency GPS data, hourly app usage records for the 2020 cohort, and two waves of university surveys supplement the main dataset.&lt;/p&gt;
&lt;p&gt;The identification strategy addresses three challenges: endogeneity of own app usage, endogeneity of peer group formation, and the reflection problem in peer effects. For own usage, two instrumental variables are used: (1) a shift-share instrument interacting the September 2020 launch of the blockbuster game Yuanshen with students&amp;rsquo; pre-college app usage intensity; and (2) China&amp;rsquo;s October 2019 minors&amp;rsquo; game restriction policy (prohibiting under-18s from playing online games 10 p.m.–8 a.m. and capping weekday gaming at 90 minutes/day) interacted with the evolving number of underage pre-college friends. For peer effects, the university&amp;rsquo;s random dormitory assignment within gender-class units provides exogenous peer variation; behavioral peer effects are further isolated using the minors&amp;rsquo; restriction policy interacted with roommates&amp;rsquo; pre-college underage friend networks, an instrument that affects roommates but not the focal student. Contextual peer effects are recovered by subtracting the estimated behavioral component from reduced-form estimates.&lt;/p&gt;
&lt;p&gt;The main findings are as follows. First, app usage is contagious: a one standard deviation (s.d.) increase in roommates&amp;rsquo; in-college total app usage raises a student&amp;rsquo;s own usage by 5.8% (IV). Behavioral peer effects dominate: contextual peer effects are small and statistically insignificant. Second, own app usage severely harms academic performance: a one s.d. increase in total app usage reduces GPA for required courses by 36.2% of a within-cohort-major s.d. (IV), and a one s.d. increase in game app usage alone reduces GPA by 56.6% of a within-cohort-major s.d. The direct disruption effect of roommates&amp;rsquo; app usage reduces GPA by a further 20.6% of a within-cohort-major s.d.; combining the indirect channel (behavioral contagion), the total roommate effect reaches 22.7% of a within-cohort-major s.d., more than 60% of the own-usage effect. Third, the effect on physical education scores is roughly four times larger than on required-course GPA: a one s.d. increase in own app usage reduces PE scores by 2.74 points, while roommates&amp;rsquo; app usage has no direct effect on PE. Fourth, a one s.d. increase in own in-college app usage reduces initial wages upon graduation by 2.3% (12.1% of within-cohort-major wage s.d.); a one s.d. increase in roommates&amp;rsquo; usage reduces wages by 0.9% directly, with a total effect (including the contagion channel) of approximately 1.0% (5.3% of within-cohort-major s.d.). Controlling for cumulative GPA reduces the gaming-to-wage coefficient by roughly one-third, indicating that academic performance is an important but partial mediator.&lt;/p&gt;
&lt;p&gt;A back-of-the-envelope policy simulation extending the minors&amp;rsquo; gaming cap (3 hours/week) to college students — binding for 34.3% of student-month observations — projects an average wage increase of 0.9% at graduation, approximately half the wage premium from one additional year of work experience in developing countries.&lt;/p&gt;
&lt;p&gt;Mechanism evidence from GPS data shows that Yuanshen&amp;rsquo;s launch caused students to arrive at study halls 18.2 minutes later and leave 23.4 minutes earlier per day. High-frequency sleep data show that a one s.d. increase in nighttime app usage reduces sleep duration by approximately 30 minutes and raises the probability of sleeping late by 34 percentage points. Survey evidence indicates that heavy app users recognize the addictive nature of gaming, pointing to self-control problems rather than lack of awareness.&lt;/p&gt;
&lt;p&gt;The scope conditions are: single mid-tier Chinese university; 2018–2020 cohorts; outcomes through initial job placement only; peer group restricted to dormitory roommates; findings rely on IV exclusion restrictions conditional on student and time fixed effects.&lt;/p&gt;
&lt;p&gt;Q: What is the core research question?
A: The paper asks how individual and peer mobile app usage affect college students&amp;rsquo; academic performance, physical health, and early labor market outcomes, and it separately identifies the behavioral (endogenous) versus contextual (exogenous) components of peer influence in app usage. This is claimed as the first study to disentangle these two types of peer effects within a unified empirical framework.&lt;/p&gt;
&lt;p&gt;Q: What data does the paper use?
A: Administrative records for 7,479 undergraduates across three freshman cohorts (2018–2020) at a medium-sized mid-tier Chinese university are linked to monthly mobile app usage records from a telecommunications provider covering 75% of the provincial population; 6,430 students are matched. The dataset also includes GPS location data at 5-minute intervals, hourly app usage for the 2020 cohort (used to infer sleep), and two waves of voluntary annual surveys with 1,798 respondents (24% response rate). Labor market outcomes — employment status, wages, post-graduate admissions — are available for the 2018 and 2019 cohorts.&lt;/p&gt;
&lt;p&gt;Q: How does the paper address the endogeneity of own app usage?
A: Two sets of instruments are used. The first interacts the September 2020 launch of Yuanshen (the most popular game in China, with over 13 million Chinese users by 2021, the majority under age 25) with students&amp;rsquo; pre-college app usage, forming a shift-share instrument under the assumption that the game launch is orthogonal to unobserved GPA determinants conditional on student fixed effects. The second interacts China&amp;rsquo;s October 2019 minors&amp;rsquo; game restriction policy with the evolving count of a student&amp;rsquo;s underage pre-college friends; event studies confirm no pre-trends and a sharp, transitory drop in app usage post-policy that dissipates as friends age out of the restricted group.&lt;/p&gt;
&lt;p&gt;Q: How does the paper solve the reflection problem and separate behavioral from contextual peer effects?
A: Three-step procedure: (1) random dormitory assignment within gender-class units yields reduced-form peer effect estimates using roommates&amp;rsquo; pre-college app usage as the exogenous peer shifter; (2) behavioral peer effects are isolated via an IV using the minors&amp;rsquo; restriction policy interacted with roommates&amp;rsquo; (not the focal student&amp;rsquo;s) underage pre-college friend networks — an instrument that shifts roommates&amp;rsquo; app usage but is orthogonal to the focal student&amp;rsquo;s outcomes; (3) contextual peer effects are recovered as the residual from subtracting the estimated behavioral effect from the reduced-form estimate.&lt;/p&gt;
&lt;p&gt;Q: How large and significant are the behavioral versus contextual peer effects in app usage?
A: A one s.d. increase in roommates&amp;rsquo; in-college total app usage raises own usage by 5.8% (IV estimate, significant). For game apps alone the behavioral spillover is 10.7%, and for games plus video it is 6.5%. Contextual peer effects (identified from roommates&amp;rsquo; pre-college characteristics) are much smaller and statistically insignificant, indicating that peer influence operates primarily through the direct imitation of peers&amp;rsquo; actions rather than their background traits.&lt;/p&gt;
&lt;p&gt;Q: What is the effect of own app usage on GPA?
A: The IV estimate shows a one s.d. increase in total in-college app usage reduces GPA for required courses by 0.716 points, equivalent to 36.2% of a within-cohort-major GPA s.d. (significant at 1%). For game apps alone, a one s.d. increase reduces GPA by 1.119 points, or 56.6% of a within-cohort-major s.d. OLS estimates are biased toward zero, likely because negative health shocks reduce both GPA and app usage simultaneously.&lt;/p&gt;
&lt;p&gt;Q: How large is the total peer effect of roommates&amp;rsquo; app usage on a student&amp;rsquo;s GPA?
A: Roommates&amp;rsquo; app usage directly lowers GPA by 0.408 points (20.6% of within-cohort-major s.d.) through disruption of the dormitory study environment or crowding out of group study. The behavioral contagion channel (5.8% increase in own usage per s.d. of roommates&amp;rsquo; usage) adds an additional 0.042 points, bringing the total effect to approximately 0.450 points, or 22.7% of a within-cohort-major s.d. — over 60% of the own-usage effect.&lt;/p&gt;
&lt;p&gt;Q: What is the effect on physical education (PE) scores, and why do roommates&amp;rsquo; app usage not matter there?
A: A one s.d. increase in own total app usage reduces PE scores by 2.74 points (IV), approximately four times the magnitude of the effect on required-course GPA, consistent with health literature on excessive screen time. Roommates&amp;rsquo; app usage has no statistically significant direct effect on PE, which the authors attribute to the irrelevance of dormitory noise and study disruptions for outdoor physical activity.&lt;/p&gt;
&lt;p&gt;Q: What are the effects of app usage on wages at graduation?
A: Doubling total app usage during college reduces initial wages by approximately 2% (IV). A one s.d. increase in own usage reduces wages by 2.3%, or 12.1% of a within-cohort-major wage s.d. A one s.d. increase in roommates&amp;rsquo; usage directly reduces wages by 0.9% (4.8% of within-cohort-major s.d.); including the behavioral contagion channel, the total roommate effect is approximately 1.0% (5.3% of within-cohort-major s.d.). Controlling for cumulative GPA reduces the game-usage-to-wage coefficient by about one-third, implying GPA is a partial but not complete mediator.&lt;/p&gt;
&lt;p&gt;Q: What does the policy simulation of the gaming cap say?
A: Extending the minors&amp;rsquo; game restriction (3 hours/week cap) to college students would bind for 34.3% of student-month observations, reducing average monthly gaming from 12.1 hours to 8 hours (a one-third decrease). Incorporating the behavioral peer multiplier for gaming (0.078), average gaming further converges to approximately 7.65 hours in steady state. The implied wage gain at graduation is 0.9%, approximately half the wage premium from one additional year of work experience in developing countries (Lagakos et al., 2019 estimate).&lt;/p&gt;
&lt;p&gt;Q: What does the GPS evidence show about time allocation?
A: Following Yuanshen&amp;rsquo;s launch, the average student arrives at the study hall 18.2 minutes later and returns to the dormitory 23.4 minutes earlier per day. The minors&amp;rsquo; restriction reverses this: students with the average number of minor friends arrive at study halls 17.4 minutes earlier and return to the dorm 19.8 minutes later. Both game shocks also shift tardiness and absence rates for major-required courses in the expected directions, and the effects intensify over time with Yuanshen&amp;rsquo;s growing popularity.&lt;/p&gt;
&lt;p&gt;Q: What do the sleep data show?
A: A one s.d. increase in nighttime app usage (9 p.m.–3 a.m.) is associated with roughly 30 minutes less sleep (7% of the mean), a 34 percentage point higher probability of sleeping late, and a 4.5 percentage point higher probability of waking up late. Daytime app usage (8 a.m.–9 p.m.) is also associated with 7.2 fewer minutes of sleep (1.8% of mean) and a 3.7 percentage point higher probability of late wake-up. These results are descriptive (from the 2020 cohort hourly data) rather than IV-based.&lt;/p&gt;
&lt;p&gt;Q: What does the survey evidence show about mechanisms and self-awareness?
A: Heavier app users report worse physical health and higher stress, are less likely to have obtained professional certifications by graduation, submit fewer job applications, and express lower satisfaction with job offers. Notably, heavier users are more likely to acknowledge the addictive nature of apps and games, suggesting a self-control problem rather than informational deficiency. They also report better relationships with roommates and greater likelihood of following roommates&amp;rsquo; advice on post-graduation choices, a potential direct channel for peer labor market effects.&lt;/p&gt;
&lt;p&gt;Q: How representative is the sample, and what are the key scope conditions?
A: The university is a mid-tier institution in southern China with students predominantly from the 30th–80th CEE score percentile among provincial college-admitted applicants; it is less female (42% vs. 53% nationally) and more rural (40% vs. 27% nationally). Survey respondents oversample less advantaged backgrounds and are re-weighted. Findings pertain to dormitory roommates as the peer group; all labor market outcomes are initial wages upon graduation; the sample covers 2018–2021 with COVID semester excluded. The peer effects estimates rest on random dormitory assignment, which the authors verify by showing no within-dorm correlation in pre-college characteristics.&lt;/p&gt;
&lt;p&gt;Behavioral (endogenous) peer effects: The mechanism by which a peer&amp;rsquo;s actual behavior — here, contemporaneous app usage — directly influences a focal individual&amp;rsquo;s own behavior. In this paper, identified via IV using the minors&amp;rsquo; game restriction policy interacted with roommates&amp;rsquo; underage pre-college friend networks, which shifts roommates&amp;rsquo; usage but not the focal student&amp;rsquo;s characteristics.&lt;/p&gt;
&lt;p&gt;Contextual (exogenous) peer effects: The influence of peers&amp;rsquo; pre-determined background characteristics (e.g., pre-college app usage, reflecting motivation, study habits, attitudes toward academics) on a focal individual&amp;rsquo;s outcomes, independent of peers&amp;rsquo; actual in-college behavior. Recovered as the residual after subtracting estimated behavioral peer effects from reduced-form estimates; found to be small and insignificant in this setting.&lt;/p&gt;
&lt;p&gt;Shift-share instrument (Yuanshen): A quasi-experimental instrument constructed by interacting the mid-sample launch date of the blockbuster game Yuanshen (September 2020) with students&amp;rsquo; pre-college app usage intensity, under the assumption that pre-college usage predicts differential susceptibility to the shock while the launch itself is orthogonal to the university&amp;rsquo;s academic environment.&lt;/p&gt;
&lt;p&gt;Minors&amp;rsquo; game restriction policy: China&amp;rsquo;s October 2019 policy prohibiting individuals under 18 from playing online games between 10 p.m. and 8 a.m. and capping weekday gaming at 90 minutes per day (tightened to 3 hours/week in September 2021). Used both as an instrument for own app usage (via underage pre-college friends) and as an instrument for roommates&amp;rsquo; usage (via roommates&amp;rsquo; underage friends) to isolate behavioral peer effects.&lt;/p&gt;
&lt;p&gt;Reflection problem: The identification challenge first articulated by Manski (1993) arising because an individual&amp;rsquo;s behavior both affects and is affected by peers simultaneously, making it impossible to separately identify the direction of influence from observational data without exogenous variation in peer behavior.&lt;/p&gt;
&lt;p&gt;Source text origin: The paper&amp;rsquo;s own data provenance category distinguishing whether summaries are based on full working paper text (pdf or oa-html) versus abstract only — a distinction the paper itself does not use but that is relevant to the review pipeline running this analysis.&lt;/p&gt;
&lt;p&gt;Within-cohort-major GPA standard deviation: The unit used to scale all GPA effect sizes, defined as the standard deviation of GPA within students of the same graduation cohort and declared major. This normalization accounts for systematic differences in grading across fields and years, making effect magnitudes comparable across specifications.&lt;/p&gt;</description></item><item><title>Disaggregated Economic Accounts</title><link>https://macropaperwarehouse.com/papers/disaggregated-economic-accounts/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/disaggregated-economic-accounts/</guid><description>&lt;p&gt;This paper develops and implements a &lt;strong&gt;system of disaggregated economic accounts&lt;/strong&gt; that breaks down national accounting positions into bilateral flows between small groups of consumers, producers, the government, and the rest of the world. Standard national accounts document aggregate income and production plus input-output trade between producer industries; they contain no comprehensive data on which consumers buy from which producers or which producers pay income to which consumers. The paper fills this gap by measuring, for Denmark, all 36 positions in the UN System of National Accounts (SNA) — consumer spending, labor compensation, profit income, intermediates trade, government transfers and taxes, and foreign trade — as bilateral cell-to-cell flows, satisfying all national accounting identities at the level of individual cells and at the aggregate level. The data reveal systematic stylized facts about domestic spending shares, gravity of spending, urban bias, and assortative matching between consumer and producer characteristics. Combining the disaggregated accounts with a general equilibrium model with nominal wage rigidities, the paper shows that &lt;strong&gt;fiscal transfer multipliers vary substantially across consumer cells&lt;/strong&gt; — from below 1 to above 2 — depending on the &lt;strong&gt;spending intensity&lt;/strong&gt; of recipient cells on the slack (unemployed) portion of the economy. Applying the framework to a hypothetical U.S. tariff shock on Denmark (calibrated to July 2025 effective tariff levels on China), the paper demonstrates that the cells generating the highest multipliers are not those directly exposed to the shock or even those made slack, but those whose spending intensity on slack cells is high. The disaggregated accounts allow the government to select more effective fiscal policies: choosing transfers targeting high-spending-intensity cells saves approximately &lt;strong&gt;0.4–0.7% of Danish GDP&lt;/strong&gt; relative to programs targeting low-intensity cells, for the same GDP stimulus.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Measurement framework&lt;/strong&gt; (Section II): The paper assigns every Danish adult to one of approximately &lt;strong&gt;2,744 consumer cells&lt;/strong&gt;, defined by the interaction of 98 municipalities (regions) and 28 industries (industry of main employment). Every production establishment is assigned to one of approximately &lt;strong&gt;2,646 producer cells&lt;/strong&gt; by region and industry. Median consumer cell contains &lt;strong&gt;658 adults&lt;/strong&gt;; median producer cell contains &lt;strong&gt;47 establishments&lt;/strong&gt;. The circular flow includes: (i) consumer spending on domestic and foreign producers; (ii) labor compensation paid by producer cells to consumer cells; (iii) profit income (dividends, mixed income, owner-occupied housing surplus) from producers to consumers; (iv) intermediates trade between domestic producers; (v) foreign trade; (vi) government taxes, transfers, and spending. A &amp;ldquo;bottom-up&amp;rdquo; approach uses microdata — geocoded transaction records from Danske Bank (largest Danish bank) and administrative government registers — to directly measure bilateral flows; a &amp;ldquo;top-down&amp;rdquo; approach distributes aggregate flows using assignment algorithms. Year: 2018. Data available at disaggregatedaccounts.com.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stylized facts&lt;/strong&gt; (Section IV):&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;1. Domestic spending shares (§IV.B)&lt;/strong&gt;: The share of a consumer cell&amp;rsquo;s spending going to domestic rather than foreign producers ranges from &lt;strong&gt;75% to almost 100%&lt;/strong&gt; (average 92%). Rural (small-population) cells, older cells, and less college-educated cells have higher domestic spending shares. Population size, average age, and college share jointly explain about half of the cross-cell variation in domestic shares; the patterns hold within industry and within region. The majority of foreign spending goes to travel-related and specialized retail categories (hotels, airlines, food away from home, clothing).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;2. Gravity (§IV.C)&lt;/strong&gt;: Consumer spending declines with distance (log-log gradient = &lt;strong&gt;−1.33&lt;/strong&gt;, column 1 of Table II). On average, roughly &lt;strong&gt;50%&lt;/strong&gt; of spending stays in the home region and an additional &lt;strong&gt;10%&lt;/strong&gt; goes to regions within 25 km. The distance gradient is steeper for groceries and fuel (local, in-person purchases) and shallower for telecommunications, insurance, and hotels. Rural, older, and less college-educated consumers spend more locally (stronger distance gradient, consistent with higher domestic shares).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;3. Urban bias (§IV.E)&lt;/strong&gt;: Consumer spending flows disproportionately toward large cities. The 15 largest regions receive &lt;strong&gt;34%&lt;/strong&gt; of national consumer spending while accounting for only &lt;strong&gt;27%&lt;/strong&gt; of consumers. Urban bias is absent for everyday purchases (groceries) and strong for irregular or remote purchases (telecommunications, specialized retail). Rural consumers also visit urban regions in person, so urban bias is present in card payments too.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;4. Assortative spending (§IV.D)&lt;/strong&gt;: Consumers tend to spend on producer cells employing workers with similar characteristics. Age of consumers and average age of workers in receiving cells are positively correlated (β = 0.178); college share similarly (β = 0.120); domestic spending share similarly (β = 0.203). The slopes are well below 1 (consumers purchase from many cells), but mild assortative spending reinforces first-order domestic spending patterns through higher-order connections.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;5. Triangular flows (§IV.F)&lt;/strong&gt;: A distinctive cross-regional pattern: consumer spending and intermediates trade flow on net from rural to urban regions (urban regions run a net internal trade surplus); rural regions run a net external surplus (rural manufacturers export; e.g., Novo Nordisk in Kalundborg, Vestas in Nakskov); urban regions import relatively more from abroad. This triangular flow arises from urban consumption amenities and urban business service concentration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Spending intensity (§IV.G)&lt;/strong&gt;: The paper constructs a reduced-form measure capturing, for each consumer cell i, how much its spending contributes to the income of a target group of cells — accounting for all higher-order connections (the infinite sum over indirect spending chains). The &lt;strong&gt;domestic spending intensity&lt;/strong&gt; of cell i is defined recursively as the sum over all domestic producer cells j of (spending share αji × domestic spending intensity of producer cell j). Values range from roughly 0.4 to 0.9. The measure is strictly greater than the direct domestic spending share because the recursive formula incorporates second- and higher-order domestic connections. Domestic spending intensity is higher for rural, older, and less college-educated cells (consistent with the stylized facts). A &lt;strong&gt;spending intensity on slack cells&lt;/strong&gt; can be constructed in the same way by replacing the target group with cells experiencing demand-driven unemployment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;General equilibrium model&lt;/strong&gt; (Sections V–VI): The model is a static small open economy with many consumer and producer cells. Consumer utility is Cobb-Douglas over goods from all producer cells and foreign goods. Each producer cell&amp;rsquo;s production function is Cobb-Douglas with decreasing returns to scale (equivalent to a fixed factor). The key friction is &lt;strong&gt;downward nominal wage rigidity&lt;/strong&gt;: Wi ≥ (1−δ)W̄i. When demand for a cell&amp;rsquo;s labor falls sufficiently (more than fraction δ), the wage rigidity binds and some workers in that cell become &lt;strong&gt;slack&lt;/strong&gt; (unemployed demand-determined). A fiscal transfer to consumer cell i raises its income, which stimulates spending, which flows through the disaggregated network to raise labor demand across cells. The multiplier is higher when recipient spending flows disproportionately to slack cells, generating additional employment. The model is calibrated using the measured disaggregated accounts: spending shares αji, profit shares κij, labor shares λij, intermediates shares ωjj′, and tax rates are all taken directly from the disaggregated data. Baseline elasticity of substitution = 1 (Cobb-Douglas); robustness checks use short-run elasticities (&amp;lt; 1) and long-run elasticities (&amp;gt; 1), with no material change in conclusions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Analytical result&lt;/strong&gt; (Proposition 1): In an economy-wide recession (all cells slack), the vector of transfer multipliers is µ = ϕ′ · (I − M)⁻¹ · M · D((1 − τ̄ᵢ)⁻¹), where M is a transformed Leontief-style spending matrix incorporating the disaggregated accounts and τ̄ᵢ are fiscal externalities. The key insight is that the multiplier of cell i&amp;rsquo;s transfer is closely linked to its &lt;strong&gt;spending intensity&lt;/strong&gt; on all other domestic cells, with all higher-order connections captured by the (I − M)⁻¹ M term. A cell&amp;rsquo;s multiplier is high when: (i) it spends domestically rather than on imports; (ii) it spends on producers that in turn employ domestic workers in slack cells; and (iii) these higher-order effects amplify through the circular flow.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Economy-wide recession: quantitative multipliers&lt;/strong&gt; (Table III):&lt;/p&gt;
&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Transfer policy&lt;/th&gt;
&lt;th&gt;Multiplier&lt;/th&gt;
&lt;th&gt;Cost to raise GDP by 5% (bn DKK)&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Uniform (all adults)&lt;/td&gt;
&lt;td&gt;1.04&lt;/td&gt;
&lt;td&gt;96.08&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Top 10% domestic spending intensity&lt;/td&gt;
&lt;td&gt;1.21&lt;/td&gt;
&lt;td&gt;81.99&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2018 child tax credit&lt;/td&gt;
&lt;td&gt;1.02&lt;/td&gt;
&lt;td&gt;97.85&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2022 inflation relief to elderly&lt;/td&gt;
&lt;td&gt;1.13&lt;/td&gt;
&lt;td&gt;88.11&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2023 housing rent inflation support&lt;/td&gt;
&lt;td&gt;1.03&lt;/td&gt;
&lt;td&gt;96.45&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Construction worker support&lt;/td&gt;
&lt;td&gt;1.23&lt;/td&gt;
&lt;td&gt;81.16&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Consulting/IT worker support&lt;/td&gt;
&lt;td&gt;0.95&lt;/td&gt;
&lt;td&gt;105.22&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;High-multiplier policies (construction workers, 2022 elderly relief) target rural, older, less college-educated cells with high domestic spending intensity. Low-multiplier policies (consulting/IT workers, 2023 housing relief, 2018 child tax credit) target urban, young, or college-educated cells with lower domestic intensity. The gap between the best and worst policies amounts to savings of roughly 15 bn DKK (≈ 2.4 bn USD), or &lt;strong&gt;0.4–0.7% of Danish GDP&lt;/strong&gt;, for the same aggregate GDP impact.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;U.S. tariff shock application&lt;/strong&gt; (Section VII): The paper analyzes a hypothetical U.S. tariff increase to 41.4% (the July 2025 effective U.S. tariff on China) on Danish exports, motivated by Greenland tensions. The shock reduces export revenue by 41.4% for each producer cell, with direct exposure varying by region: Billund (Lego headquarters), Kalundborg (pharmaceuticals), and a Copenhagen manufacturing hinterland face the largest direct declines — up to &lt;strong&gt;8% of total regional sales&lt;/strong&gt;. The shock propagates through the disaggregated network; cells whose income falls by more than 4% become slack. Key findings:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Regional slackness follows direct exposure but is also shaped by proximity to other exposed regions (urban bias propagates the shock to cities) and isolation (Billund has high direct exposure but low slackness relative to exposure because it is geographically isolated from other high-exposure cells)&lt;/li&gt;
&lt;li&gt;Transfer multipliers for this heterogeneous recession (Proposition 2) depend on &lt;strong&gt;spending intensity on slack cells&lt;/strong&gt;, not on direct exposure or own slackness&lt;/li&gt;
&lt;li&gt;Table IV (R² for multiplier): slack cell indicator alone explains R² = 0.015; direct spending share on slack raises R² to 0.366; spending intensity on slack cells raises R² to &lt;strong&gt;0.769&lt;/strong&gt; (column 3); adding both spending share and spending intensity on slack reaches R² = 0.840 (column 4)&lt;/li&gt;
&lt;li&gt;Billund, despite high exposure, has low multiplier because its spending (often local to a low-exposure vicinity) does not create labor demand for slack cells elsewhere&lt;/li&gt;
&lt;li&gt;Some of the highest-multiplier regions are themselves non-slack but are surrounded by many slack cells, so their spending effectively employs slack workers&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Dynamic model&lt;/strong&gt; (Section VIII): The paper extends to a dynamic OLG (Blanchard-Yaari) model with heterogeneous marginal propensities to consume (MPCs) calibrated from a 2009 Danish fiscal policy. Key result: static and year-4 dynamic multipliers are closely correlated (slope ≈ 0.898). Long-run cumulative multipliers exactly equal static multipliers (formally proved in Appendix V.F): in the long run, all transfers are fully spent. MPCs and domestic spending intensity are &lt;strong&gt;complementary&lt;/strong&gt; determinants of dynamic multipliers — targeting high-MPC cells amplifies short-horizon (year 0–2) multipliers, while targeting high-spending-intensity cells shapes both short- and long-run multipliers. The paper&amp;rsquo;s main mechanism (spending intensity on slack cells) is robust at all horizons.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Robustness&lt;/strong&gt; (Section IX): (i) Counterfactual accounts with reversed stylized patterns (e.g., rural cells spending like urban cells) lead to substantially different multipliers — the specific measured patterns drive the results. (ii) Imposing standard simplifying assumptions (consumer spending flows only to local producers; spending flows across regions in proportion to intermediate trade) misses most of the multiplier variation. (iii) The mechanism is similarly important in less open economies. (iv) Low short-run and high long-run substitution elasticities (from the trade literature) produce similar multiplier rankings across cells.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions&lt;/strong&gt;: The implementation is a proof of concept for Denmark, using existing micro data from a single large bank and government registers; full coverage of all banks and complete data on within-firm flows would strengthen measurement. Capital-related transactions (saving, investment, financial assets) are aggregated into a single capital accumulation cell — disaggregating these would require different data. The model is intentionally static (with a dynamic extension), abstracting from price adjustment dynamics beyond the NK wage rigidity. The analysis is a partial equilibrium in the sense that monetary policy response is not modeled; the fixed exchange rate assumption is realistic for Denmark (pegged to the Euro) but may not transfer to economies with flexible rates. The proof of concept suggests that national statistical agencies could benefit substantially from measuring disaggregated flows through refined surveys.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-missing-from-standard-national-accounts-that-this-papers-system-provides"&gt;Q1. What is missing from standard national accounts that this paper&amp;rsquo;s system provides?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Standard national accounts measure aggregate consumer spending, income, and output, plus intermediates trade among producer industries (input-output tables); what they do not measure is which specific consumer groups buy from which specific producer groups, or which specific producer groups pay labor and profit income to which specific consumer groups.&lt;/strong&gt; This means that propagation of a shock through the circular flow — e.g., a tariff shock that reduces exports by rural manufacturers, which reduces income for rural workers, who then reduce spending on urban services, which reduces urban workers&amp;rsquo; income — cannot be traced without simplifying assumptions (like &amp;ldquo;spending flows only to local producers&amp;rdquo;) that the disaggregated data shows to be empirically inaccurate. The paper provides a proof of concept demonstrating that measuring these bilateral consumer-to-producer and producer-to-consumer flows, while satisfying all national accounting identities, is feasible with existing micro data and yields policy-relevant variation in fiscal multipliers.&lt;/p&gt;
&lt;h3 id="q2-why-do-rural-older-and-less-college-educated-consumer-cells-have-higher-fiscal-multipliers-during-an-economy-wide-recession"&gt;Q2. Why do rural, older, and less college-educated consumer cells have higher fiscal multipliers during an economy-wide recession?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;These groups have higher domestic spending intensity — a higher fraction of their spending reaches domestic consumers rather than leaking abroad — because they spend less on international tourism, less on imported goods accessed through online retail or urban services, and more on local goods purchased in person.&lt;/strong&gt; The gravity patterns (stronger distance gradient) and direct domestic spending shares document this directly: rural consumers allocate ~92–100% of spending to domestic producers versus ~75–80% for urban young college-educated consumers. When all cells are slack, a transfer to a high-domestic-intensity cell circulates more within the country, generating more rounds of domestic income and employment before leaking to imports. The mild assortative spending pattern further reinforces the first-order effect: spending by rural older consumers flows toward producer cells employing workers with similar characteristics, who also spend domestically, so higher-order connections amplify rather than dilute the domestic spending effect.&lt;/p&gt;
&lt;h3 id="q3-why-does-targeting-directly-exposed-or-slack-cells-not-guarantee-a-high-transfer-multiplier-after-the-us-tariff-shock"&gt;Q3. Why does targeting directly exposed or slack cells not guarantee a high transfer multiplier after the U.S. tariff shock?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A transfer raises GDP by increasing spending, which creates labor demand for other consumer cells; a transfer to a slack cell only generates a high multiplier if that cell&amp;rsquo;s spending flows toward other slack cells (directly or through indirect chains) — not if it flows toward non-slack cells or abroad.&lt;/strong&gt; The tariff shock creates isolated pockets of slackness in rural manufacturing regions (e.g., Billund for Lego) that are geographically far from other slack regions; Billund consumers spend locally (gravity) and their locality is not itself a center of other slack cells. In contrast, regions near Copenhagen with moderate direct exposure may have high multipliers if they are close to many other slack manufacturing cells — their spending generates employment across the slack network. The R² decomposition confirms this: knowing a cell is slack explains only 1.5% of multiplier variation (R² = 0.015), while knowing its spending intensity on slack cells explains 76.9% (R² = 0.769).&lt;/p&gt;
&lt;h3 id="q4-how-does-the-paper-ensure-that-the-disaggregated-flows-satisfy-national-accounting-identities"&gt;Q4. How does the paper ensure that the disaggregated flows satisfy national accounting identities?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The system is designed so that every cell&amp;rsquo;s total inflows equal total outflows (a cell-level balance sheet constraint), and the sum of all cell-level flows equals the corresponding national aggregate from the SNA — both conditions are imposed by construction, not just approximated.&lt;/strong&gt; For most positions, a bottom-up approach uses observed bilateral microdata (e.g., card payments from Danske Bank directly measure consumer spending by consumer cell i at producer cell j); for positions without direct microdata, a top-down algorithm distributes an aggregate total across cells using assignment rules grounded in the microdata. This dual approach ensures national comprehensiveness (the sum of disaggregated flows equals aggregate national accounts) and individual consistency (cell-level identities hold), unlike existing regional accounts or social accounting matrices that satisfy only one of these constraints.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-relationship-between-spending-intensity-and-the-standard-fiscal-multiplier-formula"&gt;Q5. What is the relationship between spending intensity and the standard fiscal multiplier formula?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The cell-level multiplier (dGDP/dTi) in Proposition 1 equals approximately the cell&amp;rsquo;s spending intensity on domestic cells, corrected for fiscal externalities and price effects of the fixed factor.&lt;/strong&gt; The formal difference is that the model multiplier involves the matrix (I − M)⁻¹M where M incorporates both spending and production shares (through which price changes for the fixed factor enter), while the reduced-form spending intensity uses only the spending matrix. Despite this difference, the two measures are highly correlated empirically: the regression of cell-level multipliers on domestic spending intensity has a slope of approximately 1.66 for static multipliers. The spending intensity can thus be calculated directly from the disaggregated accounts without solving the full general equilibrium model, making it a practical statistic for policy guidance.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-dynamic-model-reconcile-the-fact-that-rural-older-and-less-college-educated-cells-have-high-spending-intensities-but-typically-lower-mpcs"&gt;Q6. How does the dynamic model reconcile the fact that rural, older, and less college-educated cells have high spending intensities but typically lower MPCs?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;MPCs and spending intensities are complementary but distinct determinants of dynamic multipliers at short horizons: high-MPC cells spend the transfer quickly (year 0–1), generating a large immediate impact, while high-spending-intensity cells ensure that spending, whenever it occurs, circulates domestically and reaches slack labor markets.&lt;/strong&gt; At long horizons (year 4+) the two effects converge because all cells eventually spend their full transfer (long-run MPC = 1) and the multiplier converges to the static model&amp;rsquo;s value, which depends only on spending intensity. The practical implication is that policies targeting rural/older/less-educated cells (high intensity, lower MPC) may have lower immediate multipliers than policies targeting high-MPC urban consumers, but converge to higher long-run multipliers. The year-4 cumulative multipliers from the dynamic model closely resemble the static model, suggesting a 3–5 year business cycle horizon is well captured by the static analysis.&lt;/p&gt;
&lt;h3 id="q7-what-does-the-triangular-flow-pattern-imply-for-understanding-regional-inequality-and-fiscal-redistribution"&gt;Q7. What does the triangular flow pattern imply for understanding regional inequality and fiscal redistribution?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The triangular flow — rural regions receive net income from foreign exports; rural consumers spend net inflows toward urban regions; urban consumers spend net toward abroad — means that rural regions&amp;rsquo; incomes depend on export competitiveness while urban regions&amp;rsquo; incomes depend on domestic consumption demand; fiscal transfers to rural consumers thus have high domestic multipliers because their spending boosts urban income (via the rural-to-urban spending flow), which then circulates domestically before leaking abroad.&lt;/strong&gt; This pattern is also consistent with the political economy finding that high-multiplier cells (rural, older, less educated) are more likely to vote for right-wing populists and feel politically disenfranchised — they are the &amp;ldquo;left behind&amp;rdquo; groups that economic research associates with exposure to globalization and automation, but whose spending patterns happen to generate large domestic multipliers during recessions.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;disaggregated economic accounts&lt;/strong&gt; : a system that breaks down all national accounting positions — consumer spending, labor and profit income, intermediates trade, government transactions, foreign trade — into bilateral flows between consistently defined region-by-industry consumer cells and producer cells, satisfying national accounting identities both at the cell level and in aggregate; the paper&amp;rsquo;s proof of concept is implemented for Denmark using 2,744 consumer cells and 2,646 producer cells in 2018.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;spending intensity&lt;/strong&gt; : a cell-level, reduced-form statistic capturing how much a consumer cell&amp;rsquo;s spending contributes to the income of a target group of cells (e.g., all domestic cells or all slack cells), accounting for all indirect higher-order connections through the circular flow; formally defined as a recursive sum that incorporates the full disaggregated network structure; ranges from 0.4 to 0.9 for domestic spending intensity and is systematically higher for rural, older, and less college-educated cells.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;slack cell&lt;/strong&gt; : in the paper&amp;rsquo;s NK model, a consumer cell for which demand-driven unemployment occurs because the nominal wage rigidity binds — labor supply exceeds demand when the cell&amp;rsquo;s income declines by more than a threshold δ due to a negative demand shock; fiscal transfers with high multipliers are those whose spending reaches slack cells (directly or through higher-order network connections).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;triangular flows&lt;/strong&gt; : the cross-regional spending pattern documented for Denmark in which net consumption spending flows from rural regions to urban regions (urban bias), net foreign export revenue flows to rural regions (rural manufacturing), and net foreign import spending flows from urban regions; implies that rural-to-urban spending flows act as an important transmission channel for fiscal stimulus targeted at rural consumers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;bottom-up vs top-down disaggregation&lt;/strong&gt; : the two methodological approaches for constructing bilateral cell-to-cell flows; the bottom-up approach uses individual-level microdata (e.g., bank transaction records) to directly observe cell-to-cell payment flows; the top-down approach allocates an aggregate national accounting position across cells using assignment algorithms informed by microdata; both approaches are designed so that the resulting disaggregated flows sum to the corresponding SNA aggregate.&lt;/p&gt;</description></item><item><title>Disincentive effects of unemployment insurance benefits</title><link>https://macropaperwarehouse.com/papers/disincentive-effects-of-unemployment-insurance-benefits/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/disincentive-effects-of-unemployment-insurance-benefits/</guid><description>&lt;p&gt;This paper isolates the disincentive effects of pandemic unemployment insurance (UI) benefits on employment recovery, separating them from the simultaneously operating stimulative (demand) effects that previous studies conflate. The authors study the largest UI expansion in U.S. history — the CARES Act of March 2020 — which introduced three simultaneous provisions: a $600 weekly income supplement (FPUC) through end of July 2020, a 13-week extension of maximum benefit duration (PEUC), and expanded eligibility to workers previously ineligible for UI (PUA), together raising the median replacement rate to 145% and more than doubling the number of UI recipients.&lt;/p&gt;
&lt;p&gt;The empirical strategy uses high-frequency establishment-level data from Homebase (HB), a scheduling and payroll provider covering approximately 140,000 small U.S. businesses — predominantly restaurants and retailers — matched to Yelp price-tier data and Safegraph foot-traffic and spending data. The final estimation sample is 4,595 businesses within 1,195 local-industry cells, observed at weekly frequency from January 2019 to December 2020.&lt;/p&gt;
&lt;p&gt;The identification rests on comparing employment recovery of low-wage versus high-wage businesses within the same narrow local labor market (four-digit zip code), industry (two-digit NAICS), and price tier. Because neighboring businesses largely share the local demand stimulus from UI, differencing within local-industry cells removes common demand effects. The key variation is the expiration of the $600 supplement, which differentially compresses the replacement-rate gap between low- and high-wage businesses depending on local average wages — labor markets where the gap falls more sharply are the treated group.&lt;/p&gt;
&lt;p&gt;The main empirical finding is that a 100 percentage point decline in the replacement rate gap is associated with a 5.7 percentage point rise in low-wage business employment recovery relative to high-wage business employment recovery at 12 weeks after the $600 expiration. For the average labor market, the expiration of the $600 supplement decreased the replacement rate gap by 46 percentage points, implying a 2.6 percentage point closing of the low-versus-high-wage employment gap within 12 weeks. Importantly, hours per employee and hourly wages grew faster in low-wage businesses over the same period, consistent with a labor supply rather than a demand mechanism. When the comparison is conducted at the U.S. state level rather than within local-industry cells — as in Finamor and Scott (2021) — the effect disappears and reverses sign, illustrating how local demand effects obscure disincentive effects at broader geographic aggregations.&lt;/p&gt;
&lt;p&gt;To quantify the aggregate employment impact, the authors build and calibrate a McCall-style labor search model with heterogeneous firm wages, a UI-eligible and non-UI unemployed pool, and equilibrium reservation wages. The model is extended to include a probability (calibrated at 16.5%) that workers lose UI eligibility upon refusing a job offer, which reconciles the model with the empirical estimates; without this feature the baseline model substantially overstates the differential employment effect of the $600 expiration.&lt;/p&gt;
&lt;p&gt;The full model-implied aggregate employment loss from all CARES Act UI provisions combined is 3.4 percentage points on average between April and December 2020, representing approximately 20% of the average employment shortfall in the Leisure and Hospitality sector over that period. When each provision is implemented in isolation, the effects are modest ($600 supplement: 0.2 pp; extended duration: 0.2 pp; expanded eligibility: 1.0 pp), but their interaction generates the large combined effect. Expanded eligibility is identified as the most disruptive provision, particularly for low-wage businesses, because it depletes the pool of non-UI unemployed who are the primary source of hires for these firms. The unemployment duration elasticities implied by the model are modest and in line with the low-to-middle range of pre-pandemic estimates.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s scope is restricted to the disincentive channel and deliberately excludes the stimulative effects of UI; it studies small, in-person service sector businesses and the April–December 2020 recovery period only.&lt;/p&gt;
&lt;p&gt;Q: What is the core identification challenge this paper addresses?
A: Prior empirical studies find only modest net effects of pandemic UI on employment, but it is unclear whether this reflects small disincentive effects or the near-cancellation of two opposing forces — UI suppressing labor supply while simultaneously stimulating local consumer demand. Identifying the disincentive effect alone requires a design that neutralizes the demand channel. The authors accomplish this by comparing low-wage and high-wage businesses within the same narrow local market, industry, and price tier, so that common local demand shifts from UI are differenced out.&lt;/p&gt;
&lt;p&gt;Q: What data does the empirical analysis use, and how is the sample constructed?
A: The primary data source is Homebase, covering approximately 140,000 small U.S. businesses with daily employment, hourly wages, and hours worked. The estimation sample is restricted to 4,595 businesses present throughout 2019, matched to Yelp price-tier classification and Safegraph weekly foot traffic and credit-card spending. Businesses are grouped into 1,195 local-industry cells defined by four-digit zip code, two-digit NAICS industry, and Yelp price tier (inexpensive vs. expensive). Within each cell, businesses are classified as low-wage or high-wage, with high-wage businesses paying on average $1.80 per hour more — about 8% above the average hourly wage of $10.90.&lt;/p&gt;
&lt;p&gt;Q: How is the replacement rate defined in the empirical framework?
A: The business-specific replacement rate is the ratio of average UI receipts (state benefit plus the pandemic supplement, converted to hourly units) to the pre-pandemic average hourly wage of that business. Because the supplement is uniform across workers, businesses with lower pre-pandemic wages face higher replacement rates; the replacement rate gap between low- and high-wage businesses within a local market is therefore a function of both state benefit levels and the local wage dispersion.&lt;/p&gt;
&lt;p&gt;Q: What does the event-study analysis around the $600 expiration show?
A: The event study exploits cross-labor-market variation in how much the replacement rate gap between low- and high-wage businesses declined when the $600 FPUC supplement expired at end of July 2020. Labor markets with a larger decline in the gap see faster relative recovery in low-wage business employment after expiration. A 100 percentage point decline in the replacement rate gap is associated with a 5.7 percentage point rise in the low-versus-high-wage employment recovery gap at 12 weeks post-expiration. For the average labor market, the $600 expiration reduced the replacement rate gap by 46 percentage points, implying a 2.6 percentage point narrowing of the employment recovery gap.&lt;/p&gt;
&lt;p&gt;Q: Why does the estimated effect disappear when broader geographic aggregations are used?
A: When businesses are compared within U.S. state borders rather than within local-industry cells, the estimated coefficient on the replacement rate gap turns positive and statistically insignificant. This occurs because at the state level, low-wage areas benefit disproportionately from the purchasing power increase that generous UI provides to local unemployed workers, so demand effects swamp and reverse the supply-side disincentive. This finding explains why Finamor and Scott (2021), using Homebase data with state fixed effects, find no negative association between replacement rates and labor market re-entry.&lt;/p&gt;
&lt;p&gt;Q: What evidence supports a labor supply rather than demand interpretation of the differential recovery?
A: During the period of the $600 supplement, hours per employee and hourly wages grew faster in low-wage businesses than in high-wage businesses, even as low-wage businesses lagged in employment levels. If the differential recovery reflected demand deficiencies at low-wage businesses, hours per employee and wages should have grown faster at high-wage businesses instead. The observed pattern is consistent with labor supply shortfalls at low-wage firms.&lt;/p&gt;
&lt;p&gt;Q: What is the structure of the quantitative labor search model?
A: The model features a unit measure of workers and a fixed measure of firms, each posting a constant idiosyncratic wage drawn from an exogenous distribution. Unemployed workers receive job offers at a rate determined by labor market tightness and accept offers above their reservation wage. Reservation wages are equilibrium objects because UI benefits depend on the worker&amp;rsquo;s previous wage. The unemployed are split into UI-eligible and non-UI pools; the non-UI pool accepts jobs from lower in the wage distribution and is the primary supply source for low-wage firms. The model is calibrated to pre-pandemic U.S. service sector averages, with a pre-pandemic UI replacement rate of 0.51, a UI recipiency probability of 14%, and a non-UI replacement rate of 0.15.&lt;/p&gt;
&lt;p&gt;Q: Why does the baseline model overstate the empirical effect, and how is this reconciled?
A: The baseline model dramatically overstates the differential employment impact of the $600 expiration because the CARES Act&amp;rsquo;s expanded eligibility (modeled as a rise in the recipiency probability from 14% to 70%) nearly empties the non-UI unemployed pool, which is the dominant labor supply source for low-wage firms. In the data, the share of unemployed receiving UI nearly tripled for in-person leisure and hospitality workers, but not to the degree that the model&amp;rsquo;s implied employment collapse would require. The model is reconciled by introducing a 16.5% probability that a worker loses UI eligibility upon refusing a suitable job offer — consistent with UI law — which reduces the effective outside option and raises acceptance rates for low-wage firms.&lt;/p&gt;
&lt;p&gt;Q: What are the aggregate employment losses implied by the model?
A: When all three CARES Act provisions are implemented jointly, the model estimates that the disincentive effects held back aggregate employment recovery by 3.4 percentage points on average between April and December 2020 — approximately 20% of the average employment shortfall in the Leisure and Hospitality sector. Implemented in isolation, each provision generates only modest losses: the $600 supplement alone accounts for 0.2 percentage points, extended duration for 0.2 percentage points, and expanded eligibility for 1.0 percentage points. The large combined effect arises from the interaction of all three provisions, not from any single one.&lt;/p&gt;
&lt;p&gt;Q: What are the conditional (interaction) effects of each provision when the other two are in place?
A: Conditional on the other two provisions being active, the income supplement holds back employment recovery by 1.6 percentage points, the extended duration by 1.5 percentage points, and expanded eligibility by 2.9 percentage points. This interaction effect is the central quantitative finding: individually modest provisions combine to produce effects far exceeding their sum when implemented simultaneously.&lt;/p&gt;
&lt;p&gt;Q: What are the implied unemployment duration elasticities, and how do they compare to the literature?
A: The $600 supplement alone raises average unemployment duration by 8% against a 343% rise in the replacement rate, implying an elasticity of 0.02. Extended duration alone raises unemployment duration by 6% against a 150% increase in potential benefit duration, implying an elasticity of 0.03. Expanded eligibility alone raises unemployment duration by 19%, implying an elasticity of 0.04. When each provision is activated on top of the other two, the implied elasticities rise substantially: 0.24 for the $600 supplement, 0.43 for extended duration, and 0.28 for expanded eligibility. These are in the low-to-middle range of pre-pandemic estimates (Katz and Meyer, 1990: 0.3–0.5; Johnston and Mas, 2018: 0.4–0.8; Rothstein, 2011: 0.06; Farber and Valletta, 2015: 0.15).&lt;/p&gt;
&lt;p&gt;Q: What is the role of expanded eligibility specifically?
A: Expanded eligibility is identified as the most disruptive CARES Act provision, accounting for 1.0 percentage points of employment loss alone and 2.9 percentage points conditional on the other provisions. Mechanically, expanded eligibility converts non-UI unemployed workers into UI-eligible workers, draining the pool of workers willing to accept low-wage job offers. Because low-wage firms depend disproportionately on the non-UI pool for hiring, this provision disproportionately depresses their employment. Using CPS data, the authors document that the share of unemployed workers receiving UI in the in-person leisure and hospitality sector nearly tripled in 2020 relative to the pre-pandemic period.&lt;/p&gt;
&lt;p&gt;Q: What are the scope conditions and limitations of the analysis?
A: The empirical analysis is restricted to small, in-person service sector businesses (restaurants and retailers) in the Homebase sample, which may not be representative of the broader labor market. The quantitative model is explicitly focused on disincentive effects only and does not capture the stimulative or demand effects of UI. The model also abstracts from re-opening restrictions and other pandemic-specific confounders. The analysis covers April to December 2020; the 2021 pandemic UI extensions are not studied. The job-refusal probability (chi = 16.5%) is a reduced-form calibration target rather than a structurally identified parameter.&lt;/p&gt;
&lt;p&gt;Replacement rate gap: The difference in business-specific UI replacement rates between low-wage and high-wage businesses within the same local labor market; defined as UI benefits (state benefit plus supplement) divided by the business&amp;rsquo;s pre-pandemic average hourly wage. Larger gaps indicate greater relative disincentive for workers to accept jobs at low-wage firms.&lt;/p&gt;
&lt;p&gt;Disincentive effect: The negative impact of higher UI replacement rates on workers&amp;rsquo; willingness to accept job offers and thus on business employment recovery, isolated from the simultaneous stimulative demand effect of UI spending.&lt;/p&gt;
&lt;p&gt;Non-UI unemployed pool: Workers who are ineligible for or have exhausted UI benefits and therefore receive only social benefits at a lower replacement rate (calibrated at 0.15 in the model). This group has a lower reservation wage and constitutes the primary labor supply source for low-wage firms.&lt;/p&gt;
&lt;p&gt;Local-industry cell: The paper&amp;rsquo;s unit of comparison — businesses sharing the same four-digit zip code (covering on average four neighboring zip codes), two-digit NAICS industry, and Yelp price tier. Within-cell differencing is the mechanism that removes common local demand effects.&lt;/p&gt;
&lt;p&gt;Benefit recipiency probability: The probability that a newly separated worker enters the UI-eligible unemployed pool, combining UI eligibility and takeup. Pre-pandemic this is calibrated at 14%; under the CARES Act it rises to 70%, targeting the observed near-tripling of UI recipients in the CPS data.&lt;/p&gt;
&lt;p&gt;Job-refusal eligibility loss: A probability (calibrated at 16.5%) that a UI-eligible worker who rejects a job offer loses UI status and transitions to the non-UI pool. Motivated by UI law prohibiting refusal of suitable work; reduces the effective outside option and reconciles the model&amp;rsquo;s predicted employment gap with the empirical estimate.&lt;/p&gt;
&lt;p&gt;Equilibrium residual wage dispersion: The wage dispersion observed in equilibrium conditional on worker observables. The model generates realistic dispersion by calibrating the non-UI replacement rate to match the lower half of the wage distribution and the firm wage offer variance to match the upper half; the presence of the non-UI state substantially increases residual dispersion relative to standard search models.&lt;/p&gt;</description></item><item><title>Distorted prices and targeted taxes in the New Keynesian Network model</title><link>https://macropaperwarehouse.com/papers/distorted-prices-and-targeted-taxes-in-the-new-keynesian-network-model/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/distorted-prices-and-targeted-taxes-in-the-new-keynesian-network-model/</guid><description>&lt;p&gt;This paper asks how governments should optimally adjust sector-specific taxes in response to sectoral shocks when monetary policy cannot be tailored to individual sectors. The authors work within a variant of Rubbo&amp;rsquo;s (2023) New Keynesian Network (NKN) model, augmented to include time-varying sectoral sales taxes and production subsidies. The model features N sectors connected through input-output linkages, with Calvo-type price rigidity that is heterogeneous across sectors, and encompasses both sectoral productivity (supply) shocks and demand shocks.&lt;/p&gt;
&lt;p&gt;The central finding, stated as Proposition 1, is that the first-best tax policy requires exactly 2N instruments—one sales tax and one production subsidy per sector—not just instruments in the shocked sector. The mechanism turns on a twofold distortion created by sticky prices. Because only a fraction of firms adjust prices at any time, relative prices are distorted both within sectors (price dispersion among firms) and across sectors (misalignment of relative prices). The production subsidy offsets the effect of shocks on marginal costs, incentivizing price-adjusting firms to leave seller prices unchanged and thereby eliminating within-sector dispersion. The sales tax—which applies to both household purchases and intermediate goods trade—steers demand across sectors so that market prices move as if fully flexible, closing sectoral output gaps even as seller prices remain constant. The optimal sales tax moves exactly one-for-one with the vector of natural prices. Crucially, budget neutrality holds to first order: the sales tax revenues fund the production subsidies.&lt;/p&gt;
&lt;p&gt;The strength of each instrument&amp;rsquo;s response depends on network proximity rather than price rigidity. For supply shocks, adjustment propagates downstream (governed by the Leontief inverse), so sectors that intensively use inputs from the shocked sector require larger responses. For demand shocks, adjustment propagates upstream first and then back downstream, so upstream suppliers to the shocked sector face the largest responses.&lt;/p&gt;
&lt;p&gt;Because the first-best policy requires observing sectoral shocks directly, the authors propose a simple 2N rule (Proposition 2) that responds only to observable sectoral seller-price inflation, with rule strength parameter ϕ_i per sector. As ϕ_i → ∞ the simple rule converges to the first-best. Crucially, the rule can be implemented by observing inflation only in the shocked sector and adjusting taxes and subsidies in other sectors proportionally to their input-output distance from that sector.&lt;/p&gt;
&lt;p&gt;The quantitative assessment calibrates the model to the U.S. economy using BEA 2017 input-output accounts with N = 373 sectors at the 6-digit classification. Sectoral price flexibility is drawn from Antonova (2025), ranging from 0.052 to 0.989 with a median of 0.277 (implying a median price duration of roughly 4.3 months). Shocks follow AR(1) processes with persistence ρ = 0.97. Supply shocks hit 10 energy-related sectors (roughly 10% of total sales); demand shocks hit 22 service-related sectors (roughly 7% of total sales). The key quantitative finding is that the simple 2N policy—both subsidy and tax together—delivers substantially greater welfare improvement than a subsidy-only policy (N instruments), particularly for supply shocks. When the subsidy is not accompanied by the corresponding sales tax, welfare gains are much smaller.&lt;/p&gt;
&lt;p&gt;The paper extends to an open economy with import-price shocks that act simultaneously as supply and demand shocks. Applied to the 2022 Ukraine war energy crisis: a 24% world-energy-price increase (IMF Global Energy Price index, 2022M1–2022M4) is used, with high-dependence Europe (energy import share γ_EU = 0.63, substitution elasticity η_EU = 1) contrasted against low-dependence U.S. (γ_US = 0.17, η_US = 4). In Europe, adverse supply effects dominate so the domestic energy sector contracts; in the U.S., demand substitution effects dominate so domestic energy expands. Simple 2N rules correlate 0.89 with the optimal policy across sectors for Europe and 0.94 for the U.S. A notable normative implication: the optimal policy raises sales taxes on energy to discourage consumption, in contrast to the actual European policy of subsidizing energy consumption during the 2022 crisis.&lt;/p&gt;
&lt;p&gt;Q: Why can monetary policy not achieve the first-best allocation in the NKN model?&lt;/p&gt;
&lt;p&gt;A: Monetary policy sets a single nominal interest rate that applies uniformly across all sectors, but sectoral shocks generate heterogeneous natural rates. Even if monetary policy stabilizes aggregate output, it cannot simultaneously close all sectoral output gaps and eliminate within-sector price dispersion. Rubbo (2023) shows that optimal monetary policy improves welfare but leaves a significant welfare loss remaining.&lt;/p&gt;
&lt;p&gt;Q: What is the core tradeoff in each sector that motivates the 2N result?&lt;/p&gt;
&lt;p&gt;A: With Calvo-type staggered pricing, adjusting a sector&amp;rsquo;s relative price to close its output gap creates price dispersion within the sector because not all firms adjust simultaneously; but holding seller prices constant to avoid dispersion leaves output gaps open due to the absence of relative price adjustment. Two instruments—production subsidy and sales tax—are required to address both sides of this distortion simultaneously, in keeping with the Tinbergen principle.&lt;/p&gt;
&lt;p&gt;Q: How exactly do the production subsidy and sales tax each work under the optimal policy?&lt;/p&gt;
&lt;p&gt;A: The production subsidy is paid to producers and affects the optimal seller price for a given marginal cost, incentivizing firms that can adjust prices to leave them unchanged. The sales tax is levied on buyers (households and downstream firms) and, because it is applied to both household consumption and intermediate goods trade, it steers demand across sectors to replicate the efficient allocation of expenditure. Under the optimal policy, seller prices are fully stabilized (ps_t = 0) while buyer (market) prices move as pt = τs_t = pn_t, mimicking flexible-price outcomes.&lt;/p&gt;
&lt;p&gt;Q: What determines which sectors receive larger optimal tax and subsidy responses?&lt;/p&gt;
&lt;p&gt;A: For supply (productivity) shocks, responses are governed by the matrix L̄ = XL, where L is the Leontief inverse measuring downstream proximity; sectors that are more intensive downstream users of the shocked sector require larger responses. For demand shocks, the relevant matrix measures upstream proximity, so sectors that supply inputs to the shocked sector face stronger responses. Critically, the level of the policy response is independent of sector-specific price rigidity; only the network structure matters.&lt;/p&gt;
&lt;p&gt;Q: Is the optimal 2N policy budget-neutral, and why only approximately?&lt;/p&gt;
&lt;p&gt;A: Budget neutrality holds to first order around the zero-profit steady state. The production subsidy applies to costs while the sales tax applies to sales; at the steady state these coincide, so the subsidy is exactly funded by the tax revenue. The approximation breaks down away from the zero-profit steady state because costs and sales diverge.&lt;/p&gt;
&lt;p&gt;Q: What is the simple 2N rule and how does it relate to the first-best?&lt;/p&gt;
&lt;p&gt;A: The simple rule sets sp_t = Iϕ · πs_t and τs_t = sp_t, where Iϕ = diag{ϕ_i} is a diagonal matrix of response coefficients for each sector&amp;rsquo;s seller-price inflation. As ϕ_i → ∞ for all i, the allocation converges to first-best; larger ϕ_i produces a stronger commitment to stabilize sectoral inflation, resulting in muted inflation rather than large tax and subsidy levels. In practice, the rule can be implemented by observing inflation only in the shocked sector and scaling responses in other sectors by their input-output distance from that sector.&lt;/p&gt;
&lt;p&gt;Q: What does the three-sector example (Energy, Manufacturing, Services) illustrate about supply vs. demand shocks?&lt;/p&gt;
&lt;p&gt;A: Under an adverse energy productivity shock, the optimal policy subsidizes Energy and Manufacturing (proportional to energy use in manufacturing) but not Services, since Services are not energy-intensive and thus not closely connected downstream. Under a positive manufacturing demand shock, the optimal policy subsidizes both Manufacturing and upstream Energy equally, reflecting that demand shocks propagate upstream first.&lt;/p&gt;
&lt;p&gt;Q: What does the calibrated quantitative exercise show about the welfare gains from using both instruments versus one?&lt;/p&gt;
&lt;p&gt;A: For both supply and demand shock scenarios, the simple 2N policy (subsidy plus tax) delivers substantially greater welfare improvement than using only monetary policy. When the subsidy is not accompanied by the corresponding sales tax, welfare gains are much smaller, confirming that both instruments together—not subsidies alone—are essential. This is identified as a key quantitative finding of the paper.&lt;/p&gt;
&lt;p&gt;Q: How robust are results to decreasing returns to scale in production?&lt;/p&gt;
&lt;p&gt;A: Under decreasing returns to scale, the optimal policy response is highly similar to the baseline: correlations between the two are 0.98 for supply shocks and 0.99 for demand shocks across sectors. The simple 2N rule continues to deliver significant welfare improvements. One difference is that demand shocks generate relatively higher welfare losses under decreasing returns, while productivity shocks lead to lower losses.&lt;/p&gt;
&lt;p&gt;Q: How does the open-economy extension change the analysis for import-price shocks?&lt;/p&gt;
&lt;p&gt;A: Import-price shocks enter the model as both supply shocks (raising input costs) and demand shocks (shifting expenditures toward domestic substitutes), so they require a policy response that accounts for both propagation channels simultaneously. The optimal open-economy policy is formally isomorphic to the closed-economy counterpart but with redefined upstream and downstream matrices and shock vectors. The relative importance of the supply versus demand channel depends on the economy&amp;rsquo;s import dependence and substitution elasticity.&lt;/p&gt;
&lt;p&gt;Q: How does the 2022 energy crisis illustrate the difference between the optimal policy and actual European policy?&lt;/p&gt;
&lt;p&gt;A: Using a 24% world-energy-price increase (IMF Global Energy Price index, 2022M1–2022M4), the model implies that with high European energy dependence (γ_EU = 0.63, η_EU = 1), adverse supply effects dominate and the optimal policy raises sales taxes on energy to discourage consumption and subsidizes domestic energy users proportional to downstream proximity. Actual European policy partly subsidized energy consumption, which the model identifies as welfare-reducing relative to the optimal response. For the low-dependence U.S. (γ_US = 0.17, η_US = 4), demand substitution toward domestic energy dominates, requiring additional subsidies to domestic energy producers.&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to the Diamond-Mirrlees result on intermediate good taxation?&lt;/p&gt;
&lt;p&gt;A: Diamond-Mirrlees (1971) recommends against taxing intermediate goods in an otherwise efficient economy to avoid introducing additional distortions. This paper considers an economy already subject to pricing frictions (Calvo staggered pricing), and shows that taxing intermediate goods through the sales tax—which applies to intermediate goods trade—is part of the optimal policy precisely because it corrects the pre-existing distortions. The paper thus does not contradict Diamond-Mirrlees but operates in a different setting where frictions are already present.&lt;/p&gt;
&lt;p&gt;New Keynesian Network (NKN) model: A multi-sector general equilibrium framework with N sectors connected through input-output linkages, Calvo-type staggered price setting that is heterogeneous across sectors, and monopolistically competitive firms; provides the canonical system of sectoral IS curves and Phillips curves used in this paper.&lt;/p&gt;
&lt;p&gt;2N policy: The paper&amp;rsquo;s central result that the first-best tax policy requires exactly two instruments per sector—one production subsidy and one sales tax—for a total of 2N instruments; characterized in Proposition 1 and named for this instrument count.&lt;/p&gt;
&lt;p&gt;Production subsidy (sp_t,i): A sector-specific transfer paid to producers that affects the optimal seller price for a given marginal cost; under the optimal policy it offsets the effect of shocks on marginal costs, incentivizing price-adjusting firms to leave seller prices unchanged and thereby eliminating within-sector price dispersion.&lt;/p&gt;
&lt;p&gt;Sales tax (τs_t,i): A sector-specific tax levied on buyers—both households and downstream firms purchasing intermediate goods—such that the buyer (market) price equals (1 + τs_t,i) times the seller price; under the optimal policy it replicates the efficient allocation of expenditure across sectors even when seller prices are fully stabilized.&lt;/p&gt;
&lt;p&gt;Downstream proximity (Leontief inverse L̄ = XL): A measure of the total direct and indirect use of a sector&amp;rsquo;s output by other sectors, governing the propagation and optimal policy response to supply (productivity) shocks; the ij-th element of L̄ captures how strongly a shock in sector j affects policy in sector i through downstream input-output linkages.&lt;/p&gt;
&lt;p&gt;Upstream proximity: A measure of how closely a sector supplies inputs to another sector, governing the propagation of demand shocks; demand shocks propagate first upstream (to input suppliers) before feeding back downstream.&lt;/p&gt;
&lt;p&gt;Budget neutrality: The property that the optimal 2N policy is self-financing to first order—sales tax revenues exactly fund the production subsidies around the zero-profit steady state—so the fiscal intervention does not require net government expenditure.&lt;/p&gt;
&lt;p&gt;Simple 2N rule: A practically implementable approximation to the first-best policy that sets subsidies and taxes proportional to observed sectoral seller-price inflation with response coefficients ϕ_i; converges to the first-best as ϕ_i → ∞ and can be implemented using only the inflation rate of the shocked sector plus network-distance weights from the input-output table.&lt;/p&gt;</description></item><item><title>Distributional Growth Accounting: Education and the Reduction of Global Poverty, 1980–2019</title><link>https://macropaperwarehouse.com/papers/distributional-growth-accounting-education-and-the-reduction-of-global-poverty-19802019/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/distributional-growth-accounting-education-and-the-reduction-of-global-poverty-19802019/</guid><description>&lt;h2 id="layer-1--core-argument"&gt;Layer 1 — Core Argument&lt;/h2&gt;
&lt;p&gt;This paper constructs the first estimates of the aggregate and distributional effects of worldwide educational expansion since 1980 by developing a &amp;ldquo;distributional growth accounting&amp;rdquo; framework that isolates the contribution of schooling to economic growth by income group. The framework integrates the canonical labor supply-and-demand model of education and the wage structure (à la Goldin and Katz 2007) with standard growth accounting tools, applied to a new microdatabase covering household surveys in 150 countries and representative of approximately 95% of the world&amp;rsquo;s population, alongside new country-specific estimates of private returns to primary, secondary, and tertiary schooling. Under conservative assumptions — relying on standard Mincerian returns, assuming capital income is unaffected by schooling, and abstracting from human capital externalities — education can account for approximately 50% of global economic growth, 70% of income gains among the world&amp;rsquo;s poorest 20% of individuals, and 40% of extreme poverty reduction since 1980; it also explains over 50% of improvements in the share of labor income accruing to women. A key mechanism is imperfect substitutability between skill groups: as educational expansion raises the supply of skilled workers, their relative wage falls, redistributing income toward low-skilled workers and amplifying education&amp;rsquo;s equalizing effect at the bottom of the distribution — a channel that canonical cross-country growth accounting misses, causing it to underestimate education&amp;rsquo;s contribution to poverty reduction by a factor of approximately three. Combining these indirect investment benefits from education with direct government redistribution (from a companion paper) brings the total contribution of public policies to extreme poverty reduction to at least 50%.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-q-what-is-distributional-growth-accounting-and-how-does-it-differ-from-standard-growth-accounting"&gt;Q1. Q: What is distributional growth accounting and how does it differ from standard growth accounting?&lt;/h3&gt;
&lt;p&gt;A: Standard growth accounting (as in Barro and Lee 2015) combines cross-country data on average years of schooling with a uniform return to derive a counterfactual average income absent educational progress. Distributional growth accounting instead starts from microdata on the joint distribution of income and education within 150 countries, constructs income-group-specific counterfactuals, and accounts for both direct wage effects on individuals whose education changed and general equilibrium supply effects that alter relative wages across all workers. The standard approach is found to underestimate education&amp;rsquo;s contribution to the poorest 20%&amp;rsquo;s income growth by a factor of roughly three (23% vs. 71% in the benchmark specification), because cross-country averages cannot accurately locate the world&amp;rsquo;s poorest individuals and because two key channels — labor income shares being greater at the bottom, and supply-side wage redistribution — are omitted.&lt;/p&gt;
&lt;h3 id="q2-q-how-is-the-counterfactual-world-income-distribution-constructed"&gt;Q2. Q: How is the counterfactual world income distribution constructed?&lt;/h3&gt;
&lt;p&gt;A: In five steps applied to the 150-country microdata. First, education levels are downgraded within each survey until matching the 1980 distribution of educational attainment (using the Barro–Lee database), prioritizing individuals closest to the target level. Second, the earnings of downgraded workers are reduced using the &amp;ldquo;true&amp;rdquo; return to schooling, which lies between the initial return (prevailing before expansion, computed from the CES production function using the 2019 elasticity) and the final return observed in 2019 — for plausible parameterizations, the true return weights initial returns at 50–70%. Third, relative wages are adjusted to reflect supply effects: the increase in skilled-worker supply lowers their relative wage by 1/σ log points per log-point increase in relative supply. Fourth, counterfactual labor income is combined with unchanged capital income to yield counterfactual total income. Fifth, the share of actual income growth attributable to education is computed as the gap between the actual and counterfactual growth rates, expressed as a fraction of actual growth.&lt;/p&gt;
&lt;h3 id="q3-q-what-role-does-imperfect-skill-substitution-play-and-how-is-σ-calibrated"&gt;Q3. Q: What role does imperfect skill substitution play, and how is σ calibrated?&lt;/h3&gt;
&lt;p&gt;A: Imperfect substitution between skill groups (elasticity σ in a CES production function) is the mechanism through which educational expansion redistributes income. When skilled-worker supply rises, their relative wage falls and low-skilled workers&amp;rsquo; relative wage rises, so the income gains from education are shared more broadly than individual returns alone would suggest. With perfect substitutes (σ → ∞), supply effects vanish and education&amp;rsquo;s distributional impact is determined entirely by who directly received schooling. The elasticity is calibrated from the recent macroeconomics literature; in sensitivity analysis, the paper bounds the contribution of education to the poorest 20%&amp;rsquo;s income growth between 60% and 90% across plausible values of σ and private returns.&lt;/p&gt;
&lt;h3 id="q4-q-why-are-the-estimates-described-as-conservative"&gt;Q4. Q: Why are the estimates described as conservative?&lt;/h3&gt;
&lt;p&gt;A: Three reasons, each biasing the estimates downward. First, standard Mincerian returns are used, which are systematically lower than causal estimates from natural experiments — a meta-analysis of 15 papers and the paper&amp;rsquo;s own quasi-experimental validation (India, Indonesia, United States) confirm this; if anything, the framework underestimates schooling&amp;rsquo;s benefits in those settings. Second, capital income is assumed unaffected by schooling, abstracting from potential effects on capital accumulation and returns. Third, human capital externalities — for which there is now substantial empirical evidence — are ignored entirely. These conservative choices are deliberate; relaxing them would increase all headline estimates.&lt;/p&gt;
&lt;h3 id="q5-q-how-does-skill-biased-technical-change-interact-with-the-education-contribution"&gt;Q5. Q: How does skill-biased technical change interact with the education contribution?&lt;/h3&gt;
&lt;p&gt;A: In the CES model, the return to schooling is increasing in the skill bias of technology (AH/AL): a higher skill bias raises the marginal product of skilled workers relative to unskilled, making schooling more profitable. The benchmark counterfactual holds technology fixed at its 2019 value and reduces education to its 1980 level. An alternative counterfactual would hold technology at its 1980 value and increase education to its 2019 level; the difference between these two exercises identifies the contribution of skill-biased technical change in amplifying the benefits of schooling. Because 1980 microdata on the world income distribution are unavailable, this decomposition can only be performed for the subsample of 33 countries with surveys around 2000; for that sample, skill-biased technical change accounts for 20–30% of the income benefits of schooling, meaning education would still have yielded large gains even absent technological progress.&lt;/p&gt;
&lt;h3 id="q6-q-what-do-the-quasi-experimental-validations-in-india-indonesia-and-the-united-states-show"&gt;Q6. Q: What do the quasi-experimental validations in India, Indonesia, and the United States show?&lt;/h3&gt;
&lt;p&gt;A: Three large-scale schooling policy interventions — a school construction program in India (studied in Khanna 2023), Indonesia&amp;rsquo;s INPRES program (Duflo 2001 and 2004), and US compulsory schooling laws (Acemoglu and Angrist 2000) — are used to externally validate the framework. Using regional variation in exposure to each program and rich microdata on the income distribution, the paper documents two findings: (1) educational expansion had large causal effects on aggregate regional incomes comparable in magnitude to individual returns estimated in the same contexts; and (2) all three policies disproportionately benefited low-income earners, substantially reducing inequality. The distributional growth accounting framework reproduces both findings with &amp;ldquo;a remarkable degree of accuracy,&amp;rdquo; and if anything underestimates the benefits of schooling, providing validation of the methodological foundation.&lt;/p&gt;
&lt;h3 id="q7-q-how-does-the-paper-quantify-educations-role-in-gender-inequality-reduction"&gt;Q7. Q: How does the paper quantify education&amp;rsquo;s role in gender inequality reduction?&lt;/h3&gt;
&lt;p&gt;A: The framework is extended to gender by constructing a counterfactual for how large gender labor income gaps would be absent educational improvement since the early 1990s (the period for which female labor income share data are available). The counterfactual accounts for three gender-specific channels: differential educational expansion between men and women, heterogeneous returns to schooling by gender, and differential effects of schooling on female labor force participation. Comparing the counterfactual to actual trends in female labor income shares, education can explain 50–80% of the observed reductions in gender inequality, depending on specification and world region.&lt;/p&gt;
&lt;h3 id="q8-q-how-do-public-policies-as-a-whole-contribute-to-extreme-poverty-reduction"&gt;Q8. Q: How do public policies as a whole contribute to extreme poverty reduction?&lt;/h3&gt;
&lt;p&gt;A: The paper&amp;rsquo;s estimate of education&amp;rsquo;s indirect investment benefits (40% of extreme poverty reduction) is combined with a companion paper&amp;rsquo;s (Gethin 2023) estimates of direct government redistribution — cash and in-kind transfers together accounting for approximately 30% of global poverty reduction since 1980, with in-kind transfers alone accounting for approximately 20%. Because the two contributions overlap (e.g., public education spending is both an indirect investment benefit and an in-kind transfer), the combined lower bound is reported as &amp;ldquo;at least 50%&amp;rdquo; of extreme poverty reduction attributable to public policies.&lt;/p&gt;
&lt;h3 id="q9-q-why-does-the-distributional-approach-yield-such-different-results-from-the-standard-approach-for-the-poorest-20"&gt;Q9. Q: Why does the distributional approach yield such different results from the standard approach for the poorest 20%?&lt;/h3&gt;
&lt;p&gt;A: Two main reasons. First, cross-country data cannot accurately measure the incomes of the world&amp;rsquo;s poorest, because the poorest individuals are not all concentrated in the poorest countries — distributional accounting within countries is necessary to locate them precisely. Second, the standard approach misses two progressive channels: (a) labor income shares are higher at the bottom of the income distribution than average, so gains from schooling translate into larger income increases for the poor; and (b) supply effects redistribute schooling gains from high-skilled to low-skilled workers, a mechanism that is entirely absent from cross-country averages but directly captured in the microdata-based counterfactual.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Distributional growth accounting:&lt;/strong&gt; A framework, introduced in this paper, that combines a model of education and the wage structure with household microdata to construct income-group-specific counterfactuals, isolating the contribution of human capital accumulation to growth at each point of the income distribution rather than at the national-average level.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;True return to schooling (r&lt;/em&gt;):&lt;/em&gt;* In the CES framework with imperfect skill substitution, the &amp;ldquo;true&amp;rdquo; aggregate return to schooling used in the counterfactual lies strictly between the initial return (prevailing before educational expansion, counterfactually higher because skilled-worker supply was lower) and the final return (observed after expansion, lower due to skill-supply pressure). The true return is the return that equates the model&amp;rsquo;s predicted output loss to the actual output loss from reducing education; for plausible parameters it weights initial returns at 50–70%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Supply effects (general equilibrium effects of schooling):&lt;/strong&gt; When the supply of skilled workers rises, their relative wage falls and the relative wage of unskilled workers rises. These wage adjustments are not captured by individual-level Mincerian returns but are modeled via the CES elasticity of substitution σ. Supply effects are central to education&amp;rsquo;s progressive distributional impact: they compress the skill premium and raise earnings at the bottom of the distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Imperfect substitution between skill groups:&lt;/strong&gt; The CES production specification in which skilled (H) and unskilled (L) labor are combined with elasticity σ &amp;lt; ∞. This governs the magnitude of general equilibrium wage effects: a lower σ means a larger wage compression per unit of skilled-supply increase, amplifying the redistributive role of education. The paper calibrates σ from the macroeconomics literature and bounds results over plausible ranges.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Skill-biased technical change (SBTC):&lt;/strong&gt; Technology that raises the marginal product of skilled workers relative to unskilled (captured by the ratio AH/AL in the CES production function). SBTC amplifies returns to schooling; in the subsample of 33 countries with around-2000 surveys, SBTC accounts for 20–30% of schooling&amp;rsquo;s income benefits, but education would still have generated substantial income gains absent SBTC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Conservative assumptions (scope condition):&lt;/strong&gt; All headline quantitative results (50% of aggregate growth, 70% of poorest-20% income gains, 40% of extreme poverty reduction, &amp;gt;50% of gender inequality reduction) are explicitly conditioned on conservative assumptions: Mincerian rather than causal returns, no effect on capital income, and no human capital externalities. The paper argues these assumptions bias all estimates downward.&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&lt;em&gt;Summary based on HAL working paper (halshs-04423765v1, Working Paper 2023/25, November 2023). Period covered in working paper text: 1980–2022. AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;</description></item><item><title>Diversifying Society's Leaders? Determinants and Causal Effects of Admission</title><link>https://macropaperwarehouse.com/papers/diversifying-societys-leaders-determinants-and-causal-effects-of-admission/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/diversifying-societys-leaders-determinants-and-causal-effects-of-admission/</guid><description>&lt;p&gt;This paper studies why children from high-income families are more likely to attend Ivy-Plus colleges (Ivy League, Stanford, MIT, Duke, Chicago — 12 colleges total) and whether attending these colleges causally improves post-college outcomes. The authors construct a de-identified panel dataset linking federal income tax records, Department of Education college attendance data, College Board and ACT test scores, and application and admissions records from several Ivy-Plus and flagship public colleges covering approximately 2.4 million students across entering classes from 1998–2015.&lt;/p&gt;
&lt;p&gt;The central finding on the input side is that students from families in the top 1% of the income distribution (income above $611,000) are 2.3 times more likely to attend an Ivy-Plus college than middle-class students (defined as the 70th–80th percentiles of the national parental income distribution, approximately $91,000–$114,000) with comparable SAT/ACT scores. Two-thirds of this gap is attributable to higher admissions rates at Ivy-Plus colleges for high-income applicants; conditional on SAT/ACT scores, top-1% applicants are 58% more likely to be admitted than middle-class applicants. The remaining third splits between differences in application rates (roughly 20% of the total attendance gap) and matriculation rates (roughly 12%). In contrast, admissions rates at flagship public colleges are essentially uncorrelated with parental income conditional on test scores.&lt;/p&gt;
&lt;p&gt;Three admissions practices drive the high-income admissions advantage at Ivy-Plus colleges. First, legacy preferences: legacy applicants from the top 1% are admitted at more than five times the rate of non-legacy applicants with comparable test scores, demographics, and admissions ratings; children of alumni of a given Ivy-Plus college are not more likely to be admitted to other Ivy-Plus colleges, confirming that legacy status is not merely a proxy for unobservable credentials. Legacy preferences account for 52 of the estimated 168 &amp;ldquo;extra&amp;rdquo; top-1% students per average Ivy-Plus class (enrollment ~1,650). Second, non-academic ratings: students from the top 1% have markedly stronger non-academic credentials (extracurricular activities, leadership ratings) partly because they disproportionately attend private high schools whose students receive higher non-academic ratings despite no higher academic ratings; this accounts for 35 additional extra top-1% students. Third, athletic recruitment: the share of recruited athletes rises from 5% among admitted students from the bottom 60% to 13% among those from the top 1%, accounting for 27 additional extra top-1% students.&lt;/p&gt;
&lt;p&gt;On the output side, the authors estimate causal effects of attending an Ivy-Plus college using a new research design based on waitlisted applicants. The key identification assumption is that idiosyncratic variation in admissions decisions across waitlisted applicants at one Ivy-Plus college is uncorrelated with admissions decisions at other Ivy-Plus colleges — which the authors verify empirically. Under this assumption, comparisons of admitted vs. rejected waitlisted applicants identify causal effects for marginal students. The marginal student who attends an Ivy-Plus college instead of the average flagship public is approximately 50% more likely to reach the top 1% of the earnings distribution at age 33, nearly twice as likely to attend a highly-ranked graduate school, and 2.5 times as likely to work at a prestigious firm. Attending an Ivy-Plus college increases mean earnings by $101,000 at age 33 relative to a counterfactual mean of $143,000 at state flagships. Effects are concentrated in the upper tail of earnings — the impact on reaching the top quartile is small and statistically insignificant, while impacts on reaching the top 1% far exceed what a constant percentage treatment effect would predict. Effects are larger for students with weaker fallback options (i.e., whose home-state colleges channel fewer students to the top 1%).&lt;/p&gt;
&lt;p&gt;Critically, the three credentials driving the high-income admissions advantage — legacy status, athletic recruitment, and high non-academic ratings — are uncorrelated with or negatively correlated with post-college success once the college attended is held constant. Academic credentials (SAT/ACT scores, academic ratings) remain highly predictive of outcomes.&lt;/p&gt;
&lt;p&gt;Counterfactual simulations show that eliminating all three high-income admissions preferences and replacing those slots with students having the same test score distribution would increase enrollment from the bottom 95% of the parental income distribution by 8.8 percentage points — comparable in magnitude to the effect of race-based affirmative action on Black and Hispanic enrollment shares. Such a policy would have small effects on monetary leadership outcomes (e.g., Fortune 500 CEO share from bottom-95% families rises by only 0.4 pp, because Ivy-Plus graduates are a small fraction of all top earners) but larger effects on non-monetary leadership positions: the share of senators from the bottom 95% would rise by 1.7 pp and the share of Supreme Court justices by 5.4 pp. With need-affirmative policies (giving low-income students preferences comparable to those currently given to legacy applicants), the share of Supreme Court justices from families in the bottom 60% would rise by 17.5 pp. These predictions assume that the causal share of Ivy-Plus attendance in explaining observational differences in leadership outcomes is the same as that estimated for early-career outcomes, and they ignore general equilibrium effects.&lt;/p&gt;
&lt;p&gt;Q: How much more likely are top-1% students to attend an Ivy-Plus college than middle-class students with the same test scores?
A: Students from families in the top 1% (income above $611,000) are 2.3 times more likely to attend an Ivy-Plus college than students from the 70th–80th percentile of the parental income distribution (approximately $91,000–$114,000) with comparable SAT/ACT scores. This &amp;ldquo;missing middle&amp;rdquo; pattern is stable across entering classes from 1998 to 2018 and persists after controlling for race and ethnicity.&lt;/p&gt;
&lt;p&gt;Q: How is the overall attendance gap decomposed into application, admissions, and matriculation?
A: Differences in admissions rates explain two-thirds of the gap in Ivy-Plus attendance between top-1% and middle-class students conditional on test scores. Of the estimated 168 &amp;ldquo;extra&amp;rdquo; top-1% students per average Ivy-Plus class, 87 come from higher admissions rates for non-recruited athletes, 27 from athletic recruitment, and the remaining slack from application rate differences (accounting for roughly 20% of the overall attendance gap) and matriculation differences (roughly 12%).&lt;/p&gt;
&lt;p&gt;Q: How large is the admissions advantage for top-1% applicants at Ivy-Plus colleges?
A: Conditional on SAT/ACT scores, applicants from the top 1% are 58% more likely to be admitted to Ivy-Plus colleges than middle-class applicants. Students from the top 0.1% are 2.5 times more likely to be admitted than middle-class applicants with comparable test scores. At flagship public colleges, admissions rates are essentially constant across the income distribution conditional on test scores.&lt;/p&gt;
&lt;p&gt;Q: What is the magnitude of legacy preferences and how is it established that legacy is not just a proxy for other credentials?
A: Legacy applicants from the top 1% are admitted at more than five times the rate of otherwise comparable non-legacy applicants at the college their parents attended. The paper isolates the legacy effect by showing that children of alumni at a given Ivy-Plus college are only slightly more likely to be admitted at other Ivy-Plus colleges — and the predicted counterfactual admissions rate for legacy students at other colleges closely matches their actual admissions rate — confirming that legacy status is not merely a proxy for other unobservable credentials. Legacy applicants constitute 2.5% of the overall applicant pool but over 9% of top-1% applicants.&lt;/p&gt;
&lt;p&gt;Q: How do non-academic credentials differ by parental income, and what drives the difference?
A: Top-1% applicants have markedly stronger non-academic ratings (measuring extracurricular participation and leadership traits) compared with other applicants, while the share achieving high academic ratings is essentially constant across the income distribution. Students from the top 1% are much more likely to have attended private high schools, whose applicants receive substantially higher non-academic ratings than students from public high schools with the same SAT/ACT scores. Non-academic ratings account for 35 of the estimated 168 extra top-1% students per Ivy-Plus class.&lt;/p&gt;
&lt;p&gt;Q: What is the research design for estimating causal effects, and what is the key identification assumption?
A: The authors focus on applicants who are waitlisted at a given Ivy-Plus college and compare those ultimately admitted versus rejected from the waitlist. The key identification assumption is that if different colleges&amp;rsquo; admissions committees make correlated assessments of underlying student merit but uncorrelated idiosyncratic admissions errors, then residual variation in admissions outcomes for waitlisted applicants at one college is orthogonal to students&amp;rsquo; long-run potential. The authors validate this empirically by showing that waitlist admission at one Ivy-Plus college is uncorrelated with admissions decisions and internal ratings at other Ivy-Plus colleges.&lt;/p&gt;
&lt;p&gt;Q: What are the causal effects of attending an Ivy-Plus college on post-college outcomes?
A: For the marginal student (one who attends an Ivy-Plus college instead of the average flagship public), attending an Ivy-Plus college increases the probability of reaching the top 1% of the earnings distribution at age 33 by approximately 50%, nearly doubles the probability of attending an elite graduate school, and increases the probability of working at a prestigious firm by approximately 2.5 times. Mean earnings at age 33 increase by $101,000 (relative to a counterfactual mean of $143,000 at state flagships). Effects on reaching the top quartile of earnings are small and statistically insignificant, while effects at the very top tail are disproportionately large.&lt;/p&gt;
&lt;p&gt;Q: Why do the findings differ from Dale and Krueger (2002) and related studies finding little effect of selective college attendance on earnings?
A: The authors replicate the matriculation design of Dale and Krueger (comparing outcomes conditional on the set of colleges to which students were admitted) and obtain estimates statistically indistinguishable from their waitlist design — the research designs are not the source of disagreement. Instead, the differences arise because (1) the authors have direct college fixed effects rather than relying on average test scores as a proxy for college quality, and (2) the authors focus on upper-tail outcomes (top 1% earnings, elite graduate schools, prestigious firms) rather than log mean earnings, where Ivy-Plus colleges have their largest effects.&lt;/p&gt;
&lt;p&gt;Q: Are the credentials that drive the high-income admissions advantage — legacy, athlete status, high non-academic ratings — predictive of better post-college outcomes?
A: No. Recruited athletes, students with higher non-academic ratings, and legacy students have equivalent or lower chances of reaching the upper tail of the income distribution, attending an elite graduate school, or working at a prestigious firm than comparable Ivy-Plus applicants once the college attended is held constant. By contrast, SAT/ACT scores and academic ratings are highly positively predictive of all three post-college outcome measures.&lt;/p&gt;
&lt;p&gt;Q: How much could changing admissions practices diversify Ivy-Plus enrollment and subsequently society&amp;rsquo;s leadership?
A: Eliminating legacy preferences, non-academic rating weights, and the differential recruitment of high-income athletes — and filling those slots with students having the same test score distribution as the current class — would increase enrollment from families in the bottom 95% of the parental income distribution by 8.8 percentage points, a magnitude comparable to race-based affirmative action&amp;rsquo;s effect on Black and Hispanic enrollment shares. For leadership positions, predicted effects are small for monetary outcomes (Fortune 500 CEOs from the bottom 95% would increase by only 0.4 pp) but larger for positions where Ivy-Plus graduates are a larger share: senators from the bottom 95% would increase by 1.7 pp and Supreme Court justices by 5.4 pp. A stronger need-affirmative policy (giving low-income students preferences equivalent to current legacy preferences) would increase the share of Supreme Court justices from the bottom 60% by 17.5 pp.&lt;/p&gt;
&lt;p&gt;Q: How are &amp;ldquo;elite&amp;rdquo; and &amp;ldquo;prestigious&amp;rdquo; employers defined in this study?
A: Elite firms are defined as those that disproportionately employ Ivy-Plus graduates relative to flagship public graduates, pulling firms from the top of that ratio ranking until 25% of Ivy-Plus attendee employment is accounted for. Prestigious employers are defined by the residual of that ratio after controlling for the firm&amp;rsquo;s predicted top-1% income probability — they are firms that disproportionately employ Ivy-Plus graduates conditional on their salaries, capturing high-status jobs that do not necessarily lead to the highest earnings. The paper validates this algorithmic approach against external rankings (Vault.com for law and consulting firms; Scimagoir for hospitals), finding substantial overlap.&lt;/p&gt;
&lt;p&gt;Q: How are treatment effect estimates adjusted for heterogeneity in students&amp;rsquo; fallback options?
A: Causal effects of Ivy-Plus attendance are much larger for students with weaker fallback options — specifically, students whose home-state flagship colleges channel fewer students to the top 1% of earnings. The authors exploit this heterogeneity to estimate the treatment effect for the marginal student who actually switches from a flagship public to an Ivy-Plus college. This heterogeneity also implies that the average causal effect across all admitted students may differ from the effect for the marginal admitted student.&lt;/p&gt;
&lt;p&gt;Q: What share of the overrepresentation of top-1% families at Ivy-Plus colleges is attributable to pre-application factors versus admissions practices?
A: Of the 245 &amp;ldquo;extra&amp;rdquo; top-1% students in an average Ivy-Plus class relative to an unconditionally income-neutral benchmark, 77 (31%) are attributable to the higher test scores of top-1% students (a pre-application factor). The remaining 168 (69%) reflect higher attendance rates conditional on test scores, of which the large majority is attributable to admissions practices (legacy, non-academic ratings, athletic recruitment) rather than application or matriculation rate differences.&lt;/p&gt;
&lt;p&gt;Ivy-Plus colleges: The twelve highly selective private colleges comprising the eight Ivy League institutions plus Stanford, MIT, Duke, and the University of Chicago — the focus group of the study, which together account for more than 10% of Fortune 500 CEOs, a quarter of U.S. senators, and three-fourths of Supreme Court justices appointed in the last half century despite enrolling less than 0.5% of Americans.&lt;/p&gt;
&lt;p&gt;Missing middle: The pattern by which attendance rates at Ivy-Plus colleges conditional on SAT/ACT scores are lowest for students from the middle class (70th–80th percentile of the parental income distribution, approximately $91,000–$114,000) — lower than both the top 1% and, slightly, the bottom 40% — producing a non-monotone income gradient in attendance.&lt;/p&gt;
&lt;p&gt;Legacy preference: An admissions advantage given to applicants whose parent(s) obtained an undergraduate degree from the college to which the student is applying. In the paper&amp;rsquo;s data, legacy applicants from the top 1% are admitted at more than five times the rate of non-legacy applicants with comparable test scores, demographics, and admissions ratings; the preference is college-specific (children of alumni are only slightly more likely to be admitted at other Ivy-Plus colleges).&lt;/p&gt;
&lt;p&gt;Waitlist research design: The paper&amp;rsquo;s primary identification strategy for causal effects, which exploits idiosyncratic variation in admissions decisions among waitlisted applicants. The design&amp;rsquo;s validity rests on the empirical finding that waitlist admissions at one Ivy-Plus college are uncorrelated with admissions decisions and internal ratings at other Ivy-Plus colleges, implying that residual variation conditional on being on the waitlist is orthogonal to students&amp;rsquo; long-run potential outcomes.&lt;/p&gt;
&lt;p&gt;Prestigious employers: Firms defined by the paper&amp;rsquo;s algorithm as disproportionately employing Ivy-Plus graduates conditional on those firms&amp;rsquo; predicted top-1% income probability — capturing high-status employment that does not necessarily lead to the highest earnings (e.g., prominent law firms, consulting firms, elite hospitals). Validated against external rankings (Vault.com, Scimagoir).&lt;/p&gt;
&lt;p&gt;Non-academic ratings: Numerical scores assigned by admissions officers measuring aspects of an application outside academic achievement, such as extracurricular activities and leadership traits. In the paper&amp;rsquo;s data, non-academic ratings differ substantially by parental income — particularly because top-1% applicants disproportionately attend private high schools whose students receive higher non-academic ratings — while academic ratings do not differ across the income distribution.&lt;/p&gt;
&lt;p&gt;Surrogate index: A prediction of later earnings outcomes (specifically, probability of reaching the top 1% at age 33 and mean income rank) constructed from individuals&amp;rsquo; graduate school attendance and employer fixed effects at ages 22–25, used to extend the outcome window for cohorts observed only early in their careers. The approach follows the terminology and methodology of Athey et al. (2019).&lt;/p&gt;</description></item><item><title>Do Credit Conditions Move House Prices?</title><link>https://macropaperwarehouse.com/papers/do-credit-conditions-move-house-prices/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/do-credit-conditions-move-house-prices/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; To what extent did an expansion and contraction of credit drive the 2000s housing boom and bust? The existing literature offers sharply divergent answers — ranging from credit explaining virtually none of the boom (Kaplan, Mitman, and Violante 2020) to credit explaining the majority of it (Favilukis, Ludvigson, and Van Nieuwerburgh 2017, who find credit alone explains 60% of the rise in price-to-rent ratios). Greenwald and Guren argue that the source of these divergent findings is a single structural assumption: the degree to which credit-insensitive agents (landlords and unconstrained savers) can absorb credit-driven demand for housing, which in turn depends on the degree of segmentation between the owner-occupied and rental housing markets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key Mechanism.&lt;/strong&gt; The paper organizes the literature around a &amp;ldquo;tenure supply&amp;rdquo; curve, defined in price-rent ratio versus homeownership rate space. A perfectly inelastic (vertical) supply curve — corresponding to perfect segmentation, in which housing cannot move between the owner-occupied and rental sectors — implies that credit expansion bids up house prices with no change in the homeownership rate. A perfectly elastic (horizontal) supply curve — corresponding to a frictionless rental market with deep-pocketed landlords who price at the present value of rents — implies that credit expansion raises the homeownership rate but not the price-rent ratio, because landlord reservation prices are unaffected by credit. Intermediate degrees of segmentation produce intermediate outcomes: credit raises both the price-rent ratio and the homeownership rate, with the relative magnitudes determined by the slope of the tenure supply curve.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Strategy.&lt;/strong&gt; To measure where reality falls on this spectrum, the authors estimate the relative elasticity of the price-rent ratio to an identified credit supply shock, compared to the elasticity of the homeownership rate to the same shock. This ratio is a sufficient statistic for the slope of the tenure supply curve. They use three distinct identification strategies from prior literature — (1) Loutskina and Strahan (2015), instrumenting for local credit supply using differential city-level exposure to changes in the conforming loan limit (CLL); (2) Di Maggio and Kermani (2017), exploiting the 2004 OCC preemption of state anti-predatory-lending laws for national banks; and (3) Mian and Sufi (2019), using differential city-level exposure to the 2003 private label securitization (PLS) expansion through bank funding composition. Regressions are estimated on annual CBSA-level panels using local projection IV (LP-IV) or event-study reduced-form methods. Key data include the CoreLogic repeat-sales house price index, the CBRE Torto-Wheaton same-store rent index (a repeat-rent index for multi-unit apartment buildings, constructed from newly-leased units), and Census Housing Vacancy Survey homeownership rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Empirical Findings.&lt;/strong&gt; All three instruments consistently find that credit supply shocks generate a significant increase in house prices and the price-rent ratio but a much smaller, rarely statistically significant, effect on the homeownership rate. Under the LS LP-IV, the price-rent ratio peaks at an increase of 0.471, while the homeownership rate response reaches only 0.037 at the 2-year horizon and peaks at 0.101 after 5 years. The ratio of price-rent to homeownership responses ranges from 3 to infinity across the three instruments and horizons. These estimates imply a substantial degree of segmentation — the no-segmentation model falls far outside the 95% confidence intervals at all horizons.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Structural Model and Calibration.&lt;/strong&gt; The authors construct a general equilibrium model featuring a representative borrower, landlord, and saver, with long-term fixed-rate mortgages subject to loan-to-value (LTV) and payment-to-income (PTI) limits following Greenwald (2018). The key modeling innovation is within-type heterogeneity in the benefit of owning versus renting, captured by logistic distributions for both borrowers and landlords. The dispersion parameter of the landlord distribution (σω,L) governs the slope of the tenure supply curve and is calibrated to minimize weighted distance to the LS empirical impulse responses. The resulting benchmark calibration yields σω,L = 2.877, with the benchmark model&amp;rsquo;s price-rent-to-homeownership ratio between 6.98 and 9.31 depending on the horizon — consistent with the empirical estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative Results on the 2000s Boom.&lt;/strong&gt; The paper then uses the calibrated model to simulate a credit standard relaxation (LTV limits relaxed from 85% to 99%, PTI limits from 36% to 65%) from 1998 Q1 through 2007 Q1, with a reversion at the start of the bust. This credit relaxation alone explains 34% of the peak rise in price-rent ratios observed in the boom, with a lower bound of 26% accounting for parameter uncertainty. In contrast, the no-segmentation model explains -1%, while the full segmentation model explains 38%. Adding a 2 percentage point permanent decline in mortgage spreads alongside the credit standard relaxation allows the benchmark model to explain 72% of the observed rise in price-rent ratios and 80% of the rise in loan-to-income ratios, compared to only 4% in the no-segmentation model. In a &amp;ldquo;full boom&amp;rdquo; scenario where additional demand and supply shocks are added to match the entire boom in price-rent ratios and homeownership, removing the credit relaxation reduces the rise in price-rent ratios by 55% in the benchmark economy — larger than the 34% explained in isolation due to nonlinear interactions — compared to only 5% in the no-segmentation economy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions and Extensions.&lt;/strong&gt; These results apply to the benchmark calibration in which landlords do not use credit and saver housing demand is fixed. When landlords are allowed to use credit (LTV limit of 65% relaxed to 85% during the boom), the role of credit is strengthened: the recalibrated model explains 80% of the rise in price-rent ratios from combined credit and rate changes, suggesting the benchmark is a lower bound. When savers are allowed to frictionlessly trade housing with borrowers, credit explains 54% of the rise in price-rent ratios even after recalibration — a roughly 25% reduction relative to the benchmark 72%, representing what the authors characterize as an extreme lower bound given that saver housing markets are in practice substantially segmented due to indivisibility, quality, and location differences.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy Implications.&lt;/strong&gt; The findings imply that macroprudential policies tightening LTV and PTI ratios can be effective at restraining house price growth, but only in the presence of the significant rental market segmentation found in the benchmark economy. In the no-segmentation economy, removing the credit relaxation from the full boom reduces price-rent ratio growth by only 5%.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-theoretical-insight-that-reconciles-the-divergent-findings-in-the-prior-literature-on-credit-and-house-prices"&gt;Q1. What is the core theoretical insight that reconciles the divergent findings in the prior literature on credit and house prices?&lt;/h3&gt;
&lt;p&gt;The key difference is the degree to which credit-insensitive agents — specifically landlords and unconstrained savers — can absorb credit-driven demand for housing. Models with perfectly segmented rental markets (no rental sector or fixed homeownership rate) feature borrowers competing only with each other for a fixed stock, so credit expansion bids up prices. Models with frictionless rental markets feature deep-pocketed landlords who supply housing at a price equal to the present value of rents, which is unaffected by credit; credit expansion then raises the homeownership rate rather than prices. Intermediate degrees of frictions produce intermediate outcomes. This mechanism had not been recognized as the source of the literature&amp;rsquo;s divergence before this paper.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-tenure-supply-curve-and-why-is-its-slope-the-key-empirical-object"&gt;Q2. What is the &amp;ldquo;tenure supply curve&amp;rdquo; and why is its slope the key empirical object?&lt;/h3&gt;
&lt;p&gt;The tenure supply curve describes the menu of price-rent ratios at which landlords are willing to supply varying amounts of owner-occupied housing (given total housing stock), traced out in price-rent ratio versus homeownership rate space. Its slope determines how the equilibrium responds to a credit-induced demand shift: a steep (inelastic) supply curve translates credit expansion primarily into price-rent ratio increases; a flat (elastic) supply curve translates it primarily into homeownership rate increases. Identifying this slope empirically is therefore sufficient to discipline any macro-housing model&amp;rsquo;s predictions about the role of credit in price dynamics, for arbitrary underlying shocks.&lt;/p&gt;
&lt;h3 id="q3-how-do-the-authors-identify-the-slope-of-the-tenure-supply-curve-empirically"&gt;Q3. How do the authors identify the slope of the tenure supply curve empirically?&lt;/h3&gt;
&lt;p&gt;They estimate the slope as the ratio of the causal elasticity of the price-rent ratio to that of the homeownership rate, with respect to an identified credit supply shock. Three instruments are used: (1) the Loutskina-Strahan shift-share instrument based on differential exposure to changes in the conforming loan limit, estimated by LP-IV on an unbalanced panel of 62 CBSAs from 1992 to 2016; (2) the Di Maggio-Kermani event study based on the 2004 OCC preemption of state anti-predatory-lending laws, covering 262 CBSAs for house prices and 82 CBSAs for homeownership from 2001 to 2010; and (3) the Mian-Sufi event study based on differential exposure to the 2003 PLS expansion via non-core deposit share, covering 245 CBSAs using ACS and FHFA data. In practice, they estimate the inverse slope (ratio of homeownership to price-rent response) because the first stage is far stronger using price-rent ratios as the endogenous variable.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-empirical-results-on-the-relative-price-rent-and-homeownership-responses"&gt;Q4. What are the empirical results on the relative price-rent and homeownership responses?&lt;/h3&gt;
&lt;p&gt;Across all three instruments, credit supply shocks significantly raise the price-rent ratio but have a much smaller, rarely statistically significant effect on the homeownership rate. Under the LS LP-IV, the price-rent ratio peaks at 0.471 after 2 years, while the homeownership rate reaches only 0.037 at 2 years and peaks at 0.101 at 5 years. The naive point-estimate ratios range from 2.93 to 12.83 at horizons 2 through 5, with the 4-year estimate negative (implying an infinite slope). The directly estimated inverse slope coefficients are small (0.05 to 0.24) and never statistically different from zero. The DK instrument yields slopes of 6.72 in 2005, 3.67 in 2006, and 3.40 in 2007. The MS instrument yields a slope of approximately 4.49 in both 2006 and 2007. The lower bound of the 95% confidence intervals corresponds to slopes of at least 1.8 to 8.4.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-key-modeling-contribution-on-the-structural-side"&gt;Q5. What is the key modeling contribution on the structural side?&lt;/h3&gt;
&lt;p&gt;The key innovation is the introduction of within-type heterogeneity in ownership preferences for both borrowers and landlords, modeled as logistic distributions. This heterogeneity allows the model to generate a fractional and time-varying homeownership rate — a feature absent from most prior macro-housing models — and maps directly into the slopes of the demand and tenure supply curves. The dispersion in landlord ownership costs (σω,L) governs the supply curve slope and is calibrated to match the empirical impulse responses. Without this heterogeneity, the model would produce corner solutions with all housing owned by one type.&lt;/p&gt;
&lt;h3 id="q6-how-is-the-landlord-dispersion-parameter-σωl-calibrated-and-what-is-the-estimated-value"&gt;Q6. How is the landlord dispersion parameter σω,L calibrated, and what is the estimated value?&lt;/h3&gt;
&lt;p&gt;The calibration minimizes a weighted sum of squared deviations between model and data impulse responses for the price-rent ratio and homeownership rate, using the LS LP-IV estimates. Deviations are weighted by the inverse of empirical standard errors. Because model impulse responses jump on impact while empirical responses are hump-shaped (due to search frictions), the calibration uses only horizons 2 through 5 years. The minimum-distance estimate yields σω,L = 2.877, alongside a mortgage spread shock persistence of 0.965 and a shock size of -0.041 (corresponding to an annualized CLL subsidy of approximately 17 basis points, within the 10-24bp range found in prior literature). The benchmark model&amp;rsquo;s implied price-rent-to-homeownership response ratio ranges from 6.98 to 9.31, consistent with the empirical estimates.&lt;/p&gt;
&lt;h3 id="q7-what-lower-bound-does-the-paper-derive-for-σωl-and-how-does-the-no-segmentation-model-compare"&gt;Q7. What lower bound does the paper derive for σω,L, and how does the no-segmentation model compare?&lt;/h3&gt;
&lt;p&gt;A credible set for σω,L is derived by targeting the upper and lower bounds of the 95% confidence interval for the estimated inverse slope. The lower bound for σω,L (targeting the top of the confidence interval) is 0.810; the lower bound targets the bottom of the confidence interval but is best matched by the full segmentation case (σω,L → ∞). The no-segmentation economy (σω,L = 0) produces inverse ratios between 4 and 32 times the empirical upper bound, placing it far outside the credible set.&lt;/p&gt;
&lt;h3 id="q8-what-is-the-models-quantitative-finding-on-the-role-of-credit-standard-relaxation-in-isolation"&gt;Q8. What is the model&amp;rsquo;s quantitative finding on the role of credit standard relaxation in isolation?&lt;/h3&gt;
&lt;p&gt;A credit standard relaxation (LTV from 85% to 99%, PTI from 36% to 65%) implemented from 1998 Q1 to 2007 Q1 and then reverted explains 34% of the peak rise in price-rent ratios in the benchmark model, with a lower bound of 26% conditional on parameter uncertainty. In the full segmentation model, the same relaxation explains 38%, while in the no-segmentation model it explains -1%. Credit standard relaxation also explains 51% of the rise in loan-to-income ratios in the benchmark, compared to 31% in the no-segmentation model.&lt;/p&gt;
&lt;h3 id="q9-what-does-adding-a-decline-in-mortgage-rates-contribute"&gt;Q9. What does adding a decline in mortgage rates contribute?&lt;/h3&gt;
&lt;p&gt;Adding a permanent 2 percentage point decline in mortgage spreads alongside the credit standard relaxation increases the benchmark model&amp;rsquo;s explained share of the price-rent ratio boom from 34% to 72%, and the loan-to-income ratio share from 51% to 80%. The no-segmentation model explains only 4% of the price-rent ratio boom and 38% of the loan-to-income ratio boom under the same combined experiment.&lt;/p&gt;
&lt;h3 id="q10-how-does-the-full-boom-counterfactual-estimate-the-marginal-contribution-of-credit"&gt;Q10. How does the &amp;ldquo;full boom&amp;rdquo; counterfactual estimate the marginal contribution of credit?&lt;/h3&gt;
&lt;p&gt;The full boom experiment adds exogenous demand shocks (shifts to µω,B) and supply shocks (shifts to µω,L) on top of the credit relaxation and rate decline, calibrated to exactly reproduce the observed peak increase in both the price-rent ratio and the homeownership rate during the boom. Removing the credit relaxation from this full boom scenario reduces the rise in price-rent ratios by 55% and the rise in loan-to-income ratios by 74% in the benchmark economy. This exceeds the 34% figure from the credit-alone experiment due to strong nonlinear interactions: without the credit relaxation, binding PTI limits constrain households&amp;rsquo; ability to finance properties even when ownership preferences rise, dampening both price and credit growth. In the no-segmentation economy, removing the credit relaxation reduces price-rent ratio growth by only 5%.&lt;/p&gt;
&lt;h3 id="q11-what-are-the-implications-of-allowing-landlords-to-use-credit"&gt;Q11. What are the implications of allowing landlords to use credit?&lt;/h3&gt;
&lt;p&gt;When landlords face an LTV limit of 65% relaxed to 85% during the boom, the credit expansion also shifts the tenure supply curve upward (as in Panel (d) of the supply-demand framework), leading to a larger price-rent ratio response and a smaller homeownership rate response than in the baseline. Without recalibration, this model explains 81% of the price-rent ratio rise. After recalibration of σω,L (which is required because landlord credit changes the mapping from empirical moments to structural parameters), the model explains 80% of the price-rent ratio rise. This implies the benchmark results are a lower bound on the role of credit in driving house prices.&lt;/p&gt;
&lt;h3 id="q12-what-are-the-implications-of-allowing-savers-to-frictionlessly-trade-housing-with-borrowers"&gt;Q12. What are the implications of allowing savers to frictionlessly trade housing with borrowers?&lt;/h3&gt;
&lt;p&gt;When savers are allowed to frictionlessly adjust their housing demand (purchasing housing from or selling to borrowers as credit conditions change), the price-rent ratio response is dampened because savers absorb excess borrower demand. After recalibrating σω,L, the combined credit-and-rate experiment explains 54% of the price-rent ratio boom — roughly 25% less than the benchmark 72%. The authors regard this as an extreme lower bound because in practice saver and borrower housing markets are substantially segmented due to indivisibility, location, and quality differences.&lt;/p&gt;
&lt;h3 id="q13-what-are-the-implications-for-macroprudential-policy"&gt;Q13. What are the implications for macroprudential policy?&lt;/h3&gt;
&lt;p&gt;Macroprudential policies that tighten LTV and PTI limits are effective at slowing house price growth in the benchmark economy, where rental market frictions are substantial. In the full boom counterfactual, tightening credit standards reduces the rise in price-rent ratios by 55%. However, in the no-segmentation economy, the same tightening reduces price-rent ratio growth by only 5%, because landlords readily absorb credit-driven demand and pin prices to the present value of rents. The effectiveness of macroprudential policies is therefore deeply dependent on the degree of rental market segmentation.&lt;/p&gt;
&lt;h3 id="q14-why-do-the-authors-prefer-the-cbre-torto-wheaton-rent-index-over-typical-rent-measures"&gt;Q14. Why do the authors prefer the CBRE Torto-Wheaton rent index over typical rent measures?&lt;/h3&gt;
&lt;p&gt;The TW index uses a repeat-rent methodology on newly-leased multi-unit apartments, which better captures current market conditions than median rent measures, which are biased by composition changes and are sticky due to long-term lease contracts. Since the price-rent ratio is meant to capture the rent a unit could command if leased instead of sold, newly-leased apartment rents are more appropriate for constructing this ratio. The TW index is available for 53 CBSAs from 1989 and 62 CBSAs from 1994.&lt;/p&gt;
&lt;h3 id="q15-why-do-the-authors-estimate-the-inverse-slope-rather-than-the-slope-directly"&gt;Q15. Why do the authors estimate the inverse slope rather than the slope directly?&lt;/h3&gt;
&lt;p&gt;The first stage for the homeownership rate response is very weak — the estimated coefficients are small and imprecise, so using the homeownership rate as an endogenous variable would suffer severe weak instrument problems. Instead, the authors use the price-rent ratio as the endogenous variable (with a much stronger first stage) and the homeownership rate as the outcome, obtaining the inverse slope (homeownership response per unit price-rent ratio response). The upper bounds of the 95% confidence intervals for the inverse slope range from 0.12 to 0.56 across horizons, corresponding to lower bounds on the slope of 1.8 to 8.4.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Tenure Supply Curve.&lt;/strong&gt; The menu of price-rent ratios at which landlords are willing to supply varying quantities of owner-occupied housing (i.e., sell rental units to potential homeowners) at a given total housing stock. Defined in price-rent ratio versus homeownership rate space. Distinct from the absolute supply of housing via the construction sector; shifts in the construction margin affect absolute quantities and prices but not necessarily the price-rent ratio or the ownership share. The slope of this curve — not the level — is the central empirical and structural object of the paper.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Market Segmentation (in the paper&amp;rsquo;s sense).&lt;/strong&gt; The degree to which credit-insensitive agents (landlords, unconstrained savers) cannot absorb credit-driven demand from constrained borrowers. Perfect segmentation means owner-occupied and rental housing are entirely non-fungible, so all credit-driven demand falls on a fixed supply of owned units. Zero segmentation means landlords (or savers) can frictionlessly convert between owned and rented housing at a price tied to present discounted rents. In this paper, segmentation is measured continuously by the slope of the tenure supply curve.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sufficient Statistic (for segmentation).&lt;/strong&gt; The ratio of the causal elasticity of the price-rent ratio to the causal elasticity of the homeownership rate, both with respect to the same identified credit supply shock. This ratio identifies the slope of the tenure supply curve and is sufficient to calibrate a structural model to recover the role of credit in driving house prices for arbitrary combinations of shocks, even when those shocks differ from the identifying variation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ownership Benefit Heterogeneity.&lt;/strong&gt; An additional idiosyncratic utility flow (positive or negative) that borrowers or landlords receive from owning versus renting a given unit, modeled as a logistic distribution. This within-type heterogeneity generates a fractional and time-varying homeownership rate in the model and maps directly into the slope of the demand and tenure supply curves. The dispersion parameter σω,L for landlords governs the slope of the tenure supply curve; higher dispersion implies a steeper (more segmented) supply curve and larger price-rent ratio responses to credit shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Marginal Collateral Value (CB,t).&lt;/strong&gt; The shadow value to borrowers of the additional credit that can be collateralized by an additional dollar of housing value, equal to µB,t × FLTV × θLTV in the model. A relaxation of credit standards (raising θLTV or θPTI) or a decline in credit costs raises CB,t, increasing borrower reservation prices and shifting the housing demand curve outward. This is the channel through which credit conditions enter house price dynamics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Local Projection IV (LP-IV).&lt;/strong&gt; A generalization of Jordà (2005) local projections to instrumental variables settings, as in Ramey (2016) and Ramey and Zubairy (2018), extended to a panel context with CBSA and time fixed effects. Used to estimate impulse responses of price-rent ratios, house prices, and homeownership rates to credit supply shocks at horizons 0 through 5 years, instrumenting for endogenous credit growth using the conforming loan limit shift-share instrument.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Conforming Loan Limit (CLL) Instrument.&lt;/strong&gt; A shift-share instrument for local credit supply constructed by interacting the share of mortgage originations in the prior year falling within 5% of the current year&amp;rsquo;s CLL with the percentage change in the national CLL. Cities where a larger fraction of loans cluster near the CLL threshold experience a larger credit supply shock when the CLL increases, because more loans shift from unsubsidized to GSE-subsidized rates. The instrument is constructed using the change in the national CLL only to avoid endogeneity from high-cost area adjustments.&lt;/p&gt;</description></item><item><title>Do Financial Concerns Make Workers Less Productive?</title><link>https://macropaperwarehouse.com/papers/do-financial-concerns-make-workers-less-productive/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/do-financial-concerns-make-workers-less-productive/</guid><description>&lt;h2 id="do-financial-concerns-make-workers-less-productive"&gt;Do Financial Concerns Make Workers Less Productive?&lt;/h2&gt;
&lt;h3 id="research-question"&gt;Research Question&lt;/h3&gt;
&lt;p&gt;The paper tests whether financial concerns distract workers sufficiently to meaningfully reduce their productivity, and whether receiving cash — by alleviating those concerns — can raise output even when total compensation is held fixed.&lt;/p&gt;
&lt;h3 id="setting-and-sample"&gt;Setting and Sample&lt;/h3&gt;
&lt;p&gt;The experiment involves 408 low-income male agricultural casual laborers in rural Odisha, India, recruited from 47 villages across five worksites in four districts. The study takes place during the lean agricultural season (March–June 2017 and 2018), when formal employment is scarce (workers found paid wage work on only 1.9 days per week on average). During this period, 86% of workers reported being &amp;ldquo;worried&amp;rdquo; or &amp;ldquo;very worried&amp;rdquo; about their finances, 68–71% carried outstanding loans, and 64–66% said they would have difficulty coming up with Rs. 1,000 (roughly four days of wages) in an emergency. Workers bring these burdens to the job: on a given day, approximately one in two workers reported thinking about financial worries while working.&lt;/p&gt;
&lt;h3 id="experimental-design"&gt;Experimental Design&lt;/h3&gt;
&lt;p&gt;Workers were employed for twelve days in a piece-rate manufacturing task — stitching sal tree leaves into disposable plates for restaurants. The payment-timing manipulation is the core of the identification strategy. Control workers received all accrued earnings as a lump sum on the final day (day 12). Treatment workers received their earnings in two installments: an interim payment of earnings to date on day 8 or 9 (randomly staggered across waves), with the balance paid on day 12. Total compensation was held constant across groups; only the timing of receipt differed. On day 5 (the &amp;ldquo;announcement day&amp;rdquo;), each worker learned his payment schedule individually. The design thus separates the announcement period (days 5 through the interim payment day, when workers know their schedule but have not yet received cash) from the post-pay period (days after the interim payment until the contract end). This enables the authors to test whether productivity effects arise from information about impending cash, or only once cash is physically in hand.&lt;/p&gt;
&lt;h3 id="first-stage-effects-on-financial-strain"&gt;First Stage: Effects on Financial Strain&lt;/h3&gt;
&lt;p&gt;Within three days of receiving the interim payment, treated workers increased loan repayments by Rs. 271, a 287% increase relative to the control group mean (p &amp;lt; 0.001), and were 40 percentage points (222%) more likely to repay any loan (p &amp;lt; 0.001). The majority of repayments occurred on the same evening as the cash disbursement — a 746% single-day increase in loan payments. Household expenditures on food, clothing, and essentials rose by 40% (Rs. 150) over three days (p &amp;lt; 0.001). Treatment workers also reported feeling more focused on the work task (11.5 percentage points more likely, p = 0.032) and were less likely to report thinking about financial worries while making plates (13.7 percentage points, p = 0.044).&lt;/p&gt;
&lt;h3 id="main-productivity-results"&gt;Main Productivity Results&lt;/h3&gt;
&lt;p&gt;In the post-pay period, treated workers increased output by 0.109 SD (6.9%) relative to the control group (p = 0.020). No treatment effect emerged during the announcement period (0.014 SD, p = 0.685); the post-pay and announcement-period effects are statistically distinguishable (p = 0.008). Because work hours are fixed and daily attendance is 98.3% with no treatment effect on attendance, these gains reflect improvements in how quickly workers produce plates per hour of work.&lt;/p&gt;
&lt;p&gt;Effects are concentrated among workers with below-median baseline wealth (fewer assets, less liquidity): for this subgroup, the interim payment increases output by 0.204 SD (13.0%, p = 0.003). For workers with above-median wealth, the effect is close to zero and statistically insignificant (p = 0.819).&lt;/p&gt;
&lt;h3 id="attentiveness-results"&gt;Attentiveness Results&lt;/h3&gt;
&lt;p&gt;Beyond total output, the authors measure attentiveness through three markers embedded in the finished plates: the number of &amp;ldquo;double holes&amp;rdquo; (paired stitching holes indicating a removed mistaken stitch), the number of leaves used, and the number of stitches used. These measures are collected unbeknownst to workers and combined into an &amp;ldquo;attentiveness index.&amp;rdquo; After receiving the interim payment, treated workers&amp;rsquo; attentiveness index increased by 0.077 SD across all workers (p = 0.092); among poorer workers, attentiveness increased by 0.17 SD (p = 0.041). This improvement occurred simultaneously with higher output speed — workers were producing plates faster while also making fewer mistakes, suggesting improved cognitive engagement rather than mere effort intensification.&lt;/p&gt;
&lt;h3 id="piece-rate-comparison"&gt;Piece-Rate Comparison&lt;/h3&gt;
&lt;p&gt;In separate supplementary rounds with 150 experienced workers, the authors varied piece rates (Rs. 2, 3, or 4) while holding overall earnings constant. Each one-rupee increase in the piece rate raised output by 0.020 SD (p = 0.042). Critically, piece-rate increases produced no detectable change in the attentiveness index (point estimate negative, statistically insignificant), and the piece-rate effect on output differs significantly from the attentiveness effect (p = 0.001). This indicates that consciou effort and automatic attentiveness can move independently: higher incentives increase pace but do not reduce attentional lapses, whereas financial relief increases both pace and attentiveness.&lt;/p&gt;
&lt;h3 id="alternative-explanations-ruled-out"&gt;Alternative Explanations Ruled Out&lt;/h3&gt;
&lt;p&gt;The authors systematically address gift exchange/fairness, trust, nutrition, and sleep. Fairness and gift-exchange stories are inconsistent with: (i) no detectable announcement-period effect; (ii) no decline in control-worker effort when treatment workers are paid before them; (iii) the pattern of effects being concentrated among poorer workers; and (iv) attentiveness being affected when it is not a sanctioned quality dimension for payment. Nutritional channels are inconsistent with overnight effect onset (nutritional stock changes are too slow biologically), no treatment effect on breakfast consumption patterns, and productivity effects persisting through the end of each workday. Sleep channels are inconsistent with no treatment effect on hours or quality of sleep.&lt;/p&gt;
&lt;h3 id="scope-conditions-and-implications"&gt;Scope Conditions and Implications&lt;/h3&gt;
&lt;p&gt;The effect operates through the actual arrival of cash, not its anticipation, consistent with a model in which automatic cognitive inputs — unlike consciously chosen effort — respond to current financial strain rather than expected future income. Effects are concentrated among more financially constrained workers within an already-poor sample. The authors do not identify the specific psychological mechanism (worry, anxiety, affect, or rumination) but interpret results as evidence that financial strain, at least partly through psychological channels, reduces earnings exactly when money is most needed.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-why-does-the-experiment-focus-on-payment-timing-rather-than-an-outright-transfer-of-additional-money"&gt;Q1. Why does the experiment focus on payment timing rather than an outright transfer of additional money?&lt;/h3&gt;
&lt;p&gt;Varying only payment timing — not total pay — holds constant both the piece-rate incentive and total wealth across treatment and control. An outright cash transfer would raise total lifetime income, potentially reducing effort through a neoclassical income effect (more lifetime wealth lowers the marginal utility of current consumption). By holding total compensation fixed and only shifting when it arrives, the design isolates the effect of financial strain per se, separable from any wealth or incentive effect.&lt;/p&gt;
&lt;h3 id="q2-why-is-there-no-treatment-effect-during-the-announcement-period-and-why-does-this-matter"&gt;Q2. Why is there no treatment effect during the announcement period, and why does this matter?&lt;/h3&gt;
&lt;p&gt;Between day 5 (when workers learn their payment schedule) and the interim payment date, treated workers know cash is coming but have not yet received it. Output in this window shows no treatment effect (0.014 SD, p = 0.685), and the announcement effect is significantly smaller than the post-pay effect (p = 0.008). This matters because it rules out mechanisms that should operate on information alone — including gift exchange, trust updating, or effort responses to higher discounted expected income — and is consistent with a model in which financial strain falls only when cash is physically received (e.g., moneylenders do not relent until the loan is actually repaid).&lt;/p&gt;
&lt;h3 id="q3-what-is-the-attentiveness-index-and-how-was-it-constructed"&gt;Q3. What is the attentiveness index and how was it constructed?&lt;/h3&gt;
&lt;p&gt;The attentiveness index averages three plate-level markers: (i) number of &amp;ldquo;double holes&amp;rdquo; — pairs of stitching holes indicating a mistaken stitch was removed; (ii) number of leaves used; and (iii) number of stitches used. Each component was normalized using the control group&amp;rsquo;s post-pay mean and standard deviation, then averaged and reverse-coded so that higher values denote better attentiveness (fewer mistakes, fewer leaves, fewer stitches). Workers were unaware these dimensions were being measured. The index thus captures the number of unforced steps a worker took to complete a plate — a behavioral trace of cognitive lapses.&lt;/p&gt;
&lt;h3 id="q4-how-do-the-piece-rate-rounds-demonstrate-that-effort-and-attentiveness-are-separable"&gt;Q4. How do the piece-rate rounds demonstrate that effort and attentiveness are separable?&lt;/h3&gt;
&lt;p&gt;In supplementary rounds (150 workers, 2019), piece rates were experimentally varied among Rs. 2, 3, and 4 per plate with the base wage adjusted to hold total earnings constant, so financial strain was unchanged. A one-rupee increase in the piece rate raises output by 0.020 SD (p = 0.042), consistent with a standard effort response. The same increase produces no discernible change in the attentiveness index (point estimate: negative but not significant), and the output and attentiveness effects are significantly different from each other (p = 0.001). This shows that workers can speed up via conscious effort without reducing attentional lapses, whereas the cash infusion raises both pace and attentiveness simultaneously — a pattern inconsistent with pure motivation as the mechanism.&lt;/p&gt;
&lt;h3 id="q5-what-does-the-staggered-timing-within-the-treatment-group-wave-a-vs-wave-b-contribute-to-identification"&gt;Q5. What does the staggered timing within the treatment group (Wave A vs. Wave B) contribute to identification?&lt;/h3&gt;
&lt;p&gt;Treatment workers were randomized to receive their interim payment on day 8 (Wave A) or day 9 (Wave B). On day 9, Wave B workers have not yet been paid while Wave A workers have. If fairness concerns drove control workers to reduce effort upon seeing colleagues paid first, control workers on day 9 — having observed Wave A payments the evening before — should work less hard relative to Wave B treatment workers (who have also not yet been paid). The authors find no such pattern: the triple interaction (Cash × Payment Day × Wave B) is close to zero and insignificant, ruling out effort reductions from seeing peers paid earlier.&lt;/p&gt;
&lt;h3 id="q6-what-are-the-magnitudes-and-timing-of-the-spending-response-to-the-cash-infusion"&gt;Q6. What are the magnitudes and timing of the spending response to the cash infusion?&lt;/h3&gt;
&lt;p&gt;Within three days of the interim payment, treatment workers spent Rs. 900 in total — roughly two-thirds of the average interim payment of over Rs. 1,400. On the day of the payment itself, loan repayments rose by Rs. 169 (746% increase), and household expenditures rose by Rs. 70 (68% increase). Over three days, loan repayments increased by Rs. 271 (287%), the probability of repaying any loan rose by 40 percentage points (222%), and total household spending rose by 65% (Rs. 371). These patterns indicate that the two main sources of financial stress cited by workers — outstanding debt and inability to meet household essentials — were directly addressed, suggesting a meaningful reduction in financial strain.&lt;/p&gt;
&lt;h3 id="q7-why-are-the-productivity-effects-concentrated-among-poorer-workers-and-what-are-the-two-interpretations"&gt;Q7. Why are the productivity effects concentrated among poorer workers, and what are the two interpretations?&lt;/h3&gt;
&lt;p&gt;Workers with below-median baseline wealth (fewer assets, lower liquidity) show a 0.204 SD (13.0%) productivity gain, while workers above the median wealth threshold show essentially no effect. The authors offer two interpretations. First, poorer workers may start from a higher level of financial strain, giving the intervention more scope to reduce it. Second, since all workers in the sample are objectively poor and report similar baseline financial worries and loan levels, the more likely explanation is that the interim payment is larger relative to the wealth and income buffer of poorer workers, making the same nominal cash infusion more meaningful for them. Both richer and poorer workers in the sample use the interim payment to repay loans and cover household needs.&lt;/p&gt;
&lt;h3 id="q8-how-do-the-authors-rule-out-nutritional-channels"&gt;Q8. How do the authors rule out nutritional channels?&lt;/h3&gt;
&lt;p&gt;Two tests address nutrition. First, workers were not at subsistence — 94% reported missing no meals the prior week — and increased food spending cannot change the nutritional stock overnight (the medical literature indicates nutritional-stock effects on cognition operate over longer time horizons). Second, and more precisely, all food consumed at the worksite during the workday was provided by the researchers, so differential pre-worksite breakfast consumption is the only plausible same-day biological channel. The authors find no treatment effect on breakfast consumption (whether workers had breakfast, how much, or what they ate). Further, if blood sugar or satiety drove effects, they should attenuate over the workday as all workers are given the same afternoon meal; instead, treatment effects persist and if anything increase through the final hours of the workday.&lt;/p&gt;
&lt;h3 id="q9-what-does-the-self-report-evidence-on-focus-and-worry-show-and-why-is-it-treated-as-suggestive-rather-than-primary"&gt;Q9. What does the self-report evidence on focus and worry show, and why is it treated as suggestive rather than primary?&lt;/h3&gt;
&lt;p&gt;Two days after the interim payment, workers were asked an open-ended question about what they were thinking about while working. Treatment workers were 11.5 percentage points (15.5%) more likely to report feeling focused on the task (p = 0.032) and 13.7 percentage points (32.7%) less likely to report thinking about financial worries (p = 0.044). A supplementary test showed treated workers were 10 percentage points (31%) more likely to generate explanations for a low-income person&amp;rsquo;s negative affect that were unrelated to financial concerns (p &amp;lt; 0.05), suggesting a broadening of cognitive scope. These measures are treated as suggestive because they were collected only at a single point and are self-reported; the primary evidence rests on objective production data because it is more objective and collected at fine hourly resolution throughout the post-pay period.&lt;/p&gt;
&lt;h3 id="q10-what-does-the-paper-say-about-optimal-payment-frequency-as-a-policy-implication"&gt;Q10. What does the paper say about optimal payment frequency as a policy implication?&lt;/h3&gt;
&lt;p&gt;The authors are cautious in drawing a direct policy inference about paying workers more frequently. While the positive productivity effect of early payment points toward more frequent paydays reducing financial strain, this must be weighed against workers&amp;rsquo; self-control problems in consumption. In settings where workers face lumpy expenditure needs (e.g., monthly rent), more frequent payments could cause under-saving and worsen strain at the time of lumpy bills. The authors suggest payment frequency or size that matches expenditure needs, or more generally financial products that allow workers to time income receipts to coincide with expenses, as potentially more robust solutions — noting that such products appear largely absent in these markets.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Financial strain (as used in the paper):&lt;/strong&gt; A psychological burden arising from pressing present needs for resources — defined in the authors&amp;rsquo; model as increasing in both the current marginal utility of consumption (i.e., how valuable an additional rupee would be today) and the level of outstanding debt (including lender harassment pressure). Strain is present-oriented: it responds to current cash-on-hand and debt levels, not to expected future income, which is why anticipating a payment does not fully relieve it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Automatic input (a):&lt;/strong&gt; In the authors&amp;rsquo; behavioral model, one of two inputs into production. Unlike &amp;ldquo;effortful&amp;rdquo; input (e), which the worker consciously controls (speed of hands, consciously directed attention), the automatic input captures cognitive functions that are beyond the worker&amp;rsquo;s full control — background attentional processes that can be degraded by financial strain even when a worker is motivated and exerting high effort. The key behavioral assumption is that a falls when financial strain is high, independently of chosen effort.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Attentiveness index:&lt;/strong&gt; A composite measure constructed from three unincentivized physical markers embedded in completed leaf plates: (i) number of double holes (pairs indicating a stitch was removed to correct a mistake); (ii) number of leaves used; (iii) number of stitches used. The index is normalized to the control group&amp;rsquo;s post-pay distribution and reverse-coded so higher values denote better attentiveness. Workers were unaware these dimensions were measured. The index captures attentional lapses — unforced errors that increase the number of steps and time needed to complete each plate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Announcement period:&lt;/strong&gt; The days between when workers are individually informed of their payment schedule (day 5) and when the interim payment is actually disbursed (day 8 or 9). This window serves as a within-experiment control: if effects arose from information about impending cash (e.g., through discounting, gift exchange, or trust), they should appear here. The consistent absence of treatment effects during this period is a key identification result.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Post-pay period:&lt;/strong&gt; The days from the interim payment until the contract end (day 12). The main productivity and attentiveness treatment effects are estimated in this window, comparing treatment workers (who have received cash) to control workers (who have not yet been paid).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Lean season:&lt;/strong&gt; The months outside the peak agricultural planting and harvesting periods (roughly six to eight months per year in the study area) during which agricultural workers seek intermittent casual employment in manufacturing, construction, and other sectors. Employment rates are low (1.9 paid days per week on average), income is low and variable, and financial strain is correspondingly high. The experiment is intentionally conducted during this period to maximize baseline levels of financial concern.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Piece-rate elasticity of effort:&lt;/strong&gt; The responsiveness of output to changes in the marginal return per unit produced (the piece rate), holding financial strain constant. In the supplementary rounds, a one-rupee increase in the piece rate raises output by 0.020 SD. The authors interpret this as the upper bound on how much pure motivational effort can move output in this task, and use it to benchmark the cash infusion effects, which are roughly five times larger per unit of treatment variation and additionally move attentiveness (which piece-rate changes do not).&lt;/p&gt;</description></item><item><title>Dollar Dominance and the Transmission of Monetary Policy</title><link>https://macropaperwarehouse.com/papers/dollar-dominance-and-the-transmission-of-monetary-policy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/dollar-dominance-and-the-transmission-of-monetary-policy/</guid><description>&lt;h2 id="layer-1--summary"&gt;Layer 1 — Summary&lt;/h2&gt;
&lt;p&gt;An emerging view in international macroeconomics contends that dollar invoicing of exports renders monetary policy ineffective for non-U.S. countries: because export prices are allegedly sticky in dollars, exchange rate depreciations cannot shift expenditure toward domestic goods, muting the classical Mundell-Fleming channel. McLeay and Tenreyro argue that this view rests on empirical assumptions that are not borne out by the data: goods priced in dollars tend to have more flexible prices and higher elasticities of substitution, not the monopoly power and sticky dollar prices assumed in dominant currency pricing (DCP) models. They propose a mixed currency pricing (MCP) framework that incorporates heterogeneous price flexibility and intra-sector international competition, and show that even with dollar pricing, depreciating the currency by loosening monetary policy can still boost exports and activity materially. The limit to any expansion is not demand, but supply capacity: after a depreciation, domestic dollar costs fall, flexible-price exporters lower prices slightly and gain large market share due to high demand elasticities, and the expansion runs until rising marginal costs offset the initial depreciation — producing limited reduced-form dollar pass-through as an equilibrium result rather than evidence of nominal stickiness. Empirical tests using monetary policy shocks in a sample of emerging and developing economies, case studies of Canada and Chile as commodity exporters, and three large devaluation episodes all find significant, material increases in exports and aggregate activity following exchange-rate depreciations, consistent with the MCP model&amp;rsquo;s predictions.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-specific-empirical-claim-that-dcp-models-rest-on-and-how-do-mcleay-and-tenreyro-challenge-it"&gt;Q1. What is the specific empirical claim that DCP models rest on, and how do McLeay and Tenreyro challenge it?&lt;/h3&gt;
&lt;p&gt;DCP models (e.g., Gopinath et al. 2020) posit that exporters invoicing in dollars have monopoly power and face nominal rigidities that keep their dollar export prices sticky. The observable implication used to motivate this assumption was limited exchange rate pass-through to dollar export prices. McLeay and Tenreyro show that low pass-through is equally consistent with a flexible-price, high-elasticity equilibrium. When demand elasticities are high, firms optimally absorb exchange rate changes through quantities rather than prices; the reduced-form pass-through coefficient is small even without any nominal friction. Low pass-through is therefore not informative about the degree of nominal rigidities, and using it to calibrate sticky-price DCP models and draw normative conclusions about exchange rate policy is unwarranted.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-three-empirical-facts-that-motivate-the-mcp-frameworks-assumptions"&gt;Q2. What are the three empirical facts that motivate the MCP framework&amp;rsquo;s assumptions?&lt;/h3&gt;
&lt;p&gt;Fact 1: Homogeneous products (commodities and commodity-like goods traded on organized exchanges or reference-priced, following Rauch 1999) represent a large share of goods exports, exceeding 70% for developing economies, around 60% for emerging economies, and around 35% for advanced economies; Sub-Saharan Africa, Latin America, and the Middle East all have shares above 50%. Fact 2: Homogeneous and more competitively produced goods have more flexible prices, documented across multiple countries — for instance, Nakamura and Steinsson (2008) find a median monthly price-change frequency of 10.8% for finished-good producer prices but 98.9% for crude materials. Fact 3: Dollar (vehicle currency) invoicing is most prevalent precisely in these homogeneous, competitive-good sectors; classical work by McKinnon (1979) and Magee and Rao (1980) emphasized that vehicle-currency invoicing facilitates continuous price comparability in competitive markets, and panel regressions corroborate a positive relationship between the share of exports invoiced in dollars and the homogeneous-goods share of exports.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-mechanism-through-which-depreciation-boosts-exports-in-the-mcp-model-and-why-does-this-generate-low-observed-pass-through"&gt;Q3. What is the mechanism through which depreciation boosts exports in the MCP model, and why does this generate low observed pass-through?&lt;/h3&gt;
&lt;p&gt;With sticky wages (representing non-tradable input price stickiness more broadly), a monetary policy-induced depreciation lowers the domestic cost of production when expressed in dollars. For competitive exporters facing highly elastic demand, even a small reduction in the dollar price translates into a substantial gain in export quantities. Firms therefore lower their dollar prices slightly, trading some profit margin for a large increase in market share. As exports expand, domestic marginal costs rise (firms move up the upward-sloping marginal cost curve), partially offsetting the depreciation&amp;rsquo;s effect on dollar costs. In equilibrium, the net dollar price movement is small — producing the observed limited pass-through — but the quantity response is large. In the perfectly competitive limit (relevant for commodity exporters), the dollar price is unchanged by the world market, and the entire adjustment is through an expansion of export volumes until rising domestic marginal costs absorb the depreciation. The implied observation is identical to a sticky-price model for prices, but &amp;ldquo;the implications for export quantities are diametrically opposed.&amp;rdquo;&lt;/p&gt;
&lt;h3 id="q4-how-does-the-mcp-model-nest-existing-frameworks-and-what-does-it-add-relative-to-the-dcp-and-pcp-benchmarks"&gt;Q4. How does the MCP model nest existing frameworks, and what does it add relative to the DCP and PCP benchmarks?&lt;/h3&gt;
&lt;p&gt;The MCP (mixed currency pricing) framework nests sticky-price DCP as a special case (by setting demand elasticities low and allowing full price stickiness) and produces behavior close to PCP (producer currency pricing) in the flexible-price, high-elasticity limit — restoring the allocative properties of the exchange rate from Obstfeld and Rogoff (1995). The distinctive addition is intra-sector international competition: domestic exporters face competition from international competitors producing highly substitutable varieties of the same good, so substitution elasticities can be high at the variety level even when macro-level elasticities between goods remain low. This follows a bottom-up approach to elasticities as in Feenstra et al. (2018). The model also allows heterogeneous nominal rigidities across producers, with exporters of dollar-invoiced homogeneous goods having flexible prices while non-tradable input prices (wages) remain sticky — the source of monetary non-neutrality and the mechanism for real exchange rate effects.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-role-of-supply-capacity-and-why-is-it-the-limit-rather-than-demand"&gt;Q5. What is the role of supply capacity, and why is it &amp;ldquo;the limit&amp;rdquo; rather than demand?&lt;/h3&gt;
&lt;p&gt;In the sticky-price DCP model, the constraint on the export response is on the demand side: dollar prices do not move, so demand is unchanged, and there is no export response at all. In the MCP model, demand responds immediately to the cost reduction — the constraint that eventually stops the expansion is supply capacity, captured by the slope of the marginal cost curve and macroeconomic constraints on non-tradable inputs. With a flat marginal cost curve (plentiful supply capacity), exports expand materially; with a steep curve or hard capacity constraints, the increase in marginal cost fully offsets the depreciation before much quantity adjustment occurs. This supply-side framing reorients the policy question: the limiting factor for monetary policy&amp;rsquo;s external effectiveness is not whether dollar prices can move, but whether the domestic economy has the productive capacity to expand tradable output. This also connects the paper to the Salter-Swan two-good framework and to Schmitt-Grohé and Uribe (2021).&lt;/p&gt;
&lt;h3 id="q6-what-do-the-macroeconomic-empirical-tests-find-and-how-do-they-distinguish-the-mcp-from-sticky-price-dcp"&gt;Q6. What do the macroeconomic empirical tests find, and how do they distinguish the MCP from sticky-price DCP?&lt;/h3&gt;
&lt;p&gt;The paper uses three empirical exercises. First, using a sample of developing and emerging economies, monetary policy expansions that generate exchange rate depreciations cause significant increases in both exports and aggregate economic activity — consistent with the MCP model&amp;rsquo;s material export response and inconsistent with the DCP prediction of no export channel. Second, focusing on Canada and Chile as commodity exporters where the MCP assumptions (competitive markets, flexible export prices) are especially applicable, the aggregate results are corroborated and sectoral evidence provides additional support. Third, three case studies of large devaluations in the sample document that they are followed by material increases in exports relative to trend. In all exercises, the direction and magnitude of export and output responses are consistent with a functioning expenditure-switching channel, even where exports are priced in dollars.&lt;/p&gt;
&lt;h3 id="q7-how-does-the-paper-reinterpret-the-pass-through-evidence-that-motivated-sticky-price-dcp-models-and-what-does-this-imply-for-normative-conclusions"&gt;Q7. How does the paper reinterpret the pass-through evidence that motivated sticky-price DCP models, and what does this imply for normative conclusions?&lt;/h3&gt;
&lt;p&gt;Standard reduced-form pass-through regressions relate the change in dollar export prices to changes in the exchange rate. These regressions typically omit or fail to fully capture movements in marginal cost. In the MCP model, flexible-price firms fully pass through changes in marginal cost; the observed limited pass-through to export prices is an equilibrium result of the offsetting rise in marginal costs as export volumes expand, not evidence of a nominal friction. Because the standard regressions omit marginal cost dynamics, they risk attributing the equilibrium quantity-driven equilibrium to a pricing friction. This has direct normative implications: the case made by the IMF (2019, 2020) that dollar invoicing worsens the cost-benefit calculation for flexible exchange rates — and may bolster the case for capital controls — rests on interpreting low pass-through as evidence of stickiness. If low pass-through instead reflects high demand elasticities and supply-side adjustment, the normative argument for constraining exchange rate flexibility is weakened.&lt;/p&gt;
&lt;h3 id="q8-how-does-the-paper-relate-to-the-purchasing-power-parity-puzzle-and-the-mussa-puzzle"&gt;Q8. How does the paper relate to the purchasing power parity puzzle and the Mussa puzzle?&lt;/h3&gt;
&lt;p&gt;The MCP framework offers explanations for two classic international macro puzzles without assuming nominal rigidities in export prices. On the PPP puzzle (the volatility and persistence of the real exchange rate, Rogoff 1996): in the MCP model, exporters&amp;rsquo; optimal reset prices move very little after exchange rate changes — not because of stickiness, but because demand is elastic and marginal costs rise quickly. This predicts limited movement in relative export prices, consistent with empirical evidence in Blanco and Cravino (2020) and Itskhoki and Mukhin (2025). On the Mussa puzzle (the large jump in nominal and real exchange rate volatility after the Bretton Woods collapse): the model&amp;rsquo;s mechanism via sticky wages is consistent with evidence that depreciations produce slow adjustment of non-tradable prices (Burstein, Eichenbaum, and Rebelo 2005), generating real exchange rate movements despite limited response in traded-good dollar prices.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Dominant currency pricing (DCP):&lt;/strong&gt; A framework in which non-U.S. exporters set and maintain prices in U.S. dollars, with sticky dollar prices. As formulated by Gopinath et al. (2020), DCP predicts that exchange rate depreciations by non-U.S. countries do not reduce dollar export prices and therefore do not stimulate export demand — muting the expenditure-switching channel of monetary policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mixed currency pricing (MCP):&lt;/strong&gt; The framework introduced in this paper. It allows heterogeneous price flexibility and market structure across export sectors, nesting both sticky-price DCP and flexible-price PCP as special cases. Dollar-priced exports face elastic demand from international competition, have flexible prices, and respond to depreciations through quantities rather than prices. Non-traded inputs (wages) remain sticky, providing the source of monetary non-neutrality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Expenditure-switching channel:&lt;/strong&gt; The mechanism by which exchange rate depreciations redirect spending toward domestically produced goods, boosting exports and aggregate demand. In PCP models, this works through a fall in relative export prices. In the MCP model, it works through an expansion in export quantities even when dollar prices change little.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exchange rate pass-through (to export prices):&lt;/strong&gt; The elasticity of dollar export prices with respect to the nominal exchange rate. In sticky-price DCP models, low pass-through reflects a nominal friction (prices cannot adjust). In the MCP model, low pass-through reflects high demand elasticities and offsetting marginal cost increases: it is an equilibrium outcome, not a friction, and therefore does not imply that export volumes are unresponsive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intra-sector international competition:&lt;/strong&gt; The market structure feature central to the MCP framework. Domestic exporters of a given good compete with foreign suppliers of highly substitutable varieties, making their demand elastic at the variety level even if aggregate elasticities across different goods categories are low. This follows Armington (1969) as implemented by Feenstra et al. (2018).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Supply capacity constraint:&lt;/strong&gt; In the MCP model, the binding constraint on how much a depreciation can boost exports. With high demand elasticities, demand for domestic exports expands freely; the limit is set by how quickly rising domestic marginal costs absorb the improvement in export profitability. The supply constraint replaces the demand constraint that operates (mechanically, via zero price response) in sticky-price DCP models.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Homogeneous goods (Rauch 1999 classification):&lt;/strong&gt; Goods traded on organized commodity exchanges or reference-priced in trade publications, as opposed to differentiated goods. McLeay and Tenreyro use this classification to establish that dollar-invoiced exports are disproportionately homogeneous, competitive, and flexible-priced — contrary to the DCP assumption of monopoly power and price stickiness.&lt;/p&gt;
&lt;hr&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on published open-access version. AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>Downward Rigidity in the Wage for New Hires</title><link>https://macropaperwarehouse.com/papers/downward-rigidity-in-the-wage-for-new-hires/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/downward-rigidity-in-the-wage-for-new-hires/</guid><description>&lt;h2 id="layer-1--summary"&gt;Layer 1 — Summary&lt;/h2&gt;
&lt;p&gt;Hazell and Taska use wages posted on online job vacancies — matched to job titles and establishment identifiers from Burning Glass Technologies — to measure the wage for new hires at the job level (same job title and establishment) over 2010Q1–2020Q2. They find that this measure of the wage for new hires is rigid downward and flexible upward. At the job level, the nominal posted wage changes infrequently — on average once every 5–6 quarters — and conditional on changing, is four times more likely to rise than to fall. In the cyclical dimension, job-level posted wages rise strongly when state unemployment falls but do not fall when state unemployment rises; real wages exhibit the same asymmetric pattern. These results do not appear in the average wage for new hires (which aggregates across all job types), because time-varying job composition inflates the variance of average wages and raises standard errors roughly twentyfold relative to job-level regressions — explaining why prior work using worker-level survey data found no evidence of downward rigidity. A Heckman (1979) selection correction for firms&amp;rsquo; selection into vacancy posting suggests that selection bias in the job-level regression is moderate. The findings provide direct empirical support for models in which downward wage rigidity for new hires — specifically at the job level — amplifies unemployment fluctuations and generates asymmetric unemployment dynamics.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-q-what-is-the-central-empirical-claim-of-the-paper"&gt;Q1. Q: What is the central empirical claim of the paper?&lt;/h3&gt;
&lt;p&gt;A: At the job level — defined as the same job title within the same establishment — the wage posted for new hires is rigid downward and flexible upward. It changes infrequently and, conditional on changing, rises far more often than it falls; and it responds to falls in unemployment but not to rises in unemployment.&lt;/p&gt;
&lt;h3 id="q2-q-what-data-does-the-paper-use-and-what-defines-a-job"&gt;Q2. Q: What data does the paper use, and what defines a &amp;ldquo;job&amp;rdquo;?&lt;/h3&gt;
&lt;p&gt;A: The paper uses the Burning Glass Technologies dataset of wages posted on online vacancies, covering January 2010 to June 2020. A &amp;ldquo;job&amp;rdquo; is a job title within an establishment whose wages are paid at a given frequency (e.g., hourly or annual). The data come from the near-universe of online job postings — roughly 40,000 sources — and the main regression sample consists of jobs that post wages, have job title and establishment information, and post vacancies in multiple quarters, yielding approximately 3.05 million vacancies, representing about 0.8% of total US vacancies.&lt;/p&gt;
&lt;h3 id="q3-q-how-do-the-authors-validate-that-posted-wages-measure-the-wage-for-new-hires"&gt;Q3. Q: How do the authors validate that posted wages measure the wage for new hires?&lt;/h3&gt;
&lt;p&gt;A: They construct a measure of the wage for new hires from the Current Population Survey (CPS) — workers switching jobs or entering from unemployment — at the state, industry, and occupation level. Regressing log CPS wages on log Burning Glass wages (using an IV split-sample procedure to correct for attenuation bias) yields a coefficient close to 1 across specifications and levels of aggregation, indicating that average posted wages move roughly one-for-one with average wages for new hires in representative survey data.&lt;/p&gt;
&lt;h3 id="q4-q-how-is-the-frequency-of-wage-change-estimated"&gt;Q4. Q: How is the frequency of wage change estimated?&lt;/h3&gt;
&lt;p&gt;A: Because wages are not observed in quarters without a vacancy posting, the authors adapt a constant-hazard model from the price-setting literature (following Nakamura–Steinsson and Klenow–Kryvtsov). The latent wage evolves stochastically between postings; the observed wage is treated as a draw from this process. The quarterly probability of wage change is estimated at 0.17–0.19 across specifications, implying implied durations of unchanged wages of 4–5 quarters.&lt;/p&gt;
&lt;h3 id="q5-q-what-is-the-asymmetry-in-the-direction-of-wage-changes"&gt;Q5. Q: What is the asymmetry in the direction of wage changes?&lt;/h3&gt;
&lt;p&gt;A: In the unweighted baseline, the quarterly probability of a wage decrease is 0.04, whereas the probability of a wage increase is 0.12 — roughly a three-to-one ratio in probabilities, summarized in the paper&amp;rsquo;s abstract as wages being &amp;ldquo;four times more likely to rise than to fall.&amp;rdquo; The distribution of non-zero wage changes also shows a pronounced pile-up of small positive changes relative to small negative changes, consistent with a downward constraint on wage setting.&lt;/p&gt;
&lt;h3 id="q6-q-what-is-the-first-piece-of-cyclical-evidence-for-downward-rigidity"&gt;Q6. Q: What is the first piece of cyclical evidence for downward rigidity?&lt;/h3&gt;
&lt;p&gt;A: A binned scatterplot (Figure 1) of job-level wage growth against state-level quarterly changes in unemployment shows a strong, roughly linear relationship when unemployment is falling — wages rise with falls in unemployment, both for small and large declines. When unemployment rises, however, wages do not fall — neither for small nor for large increases in unemployment. This asymmetry is robust to regression-based analysis and to identified labor demand shocks.&lt;/p&gt;
&lt;h3 id="q7-q-are-real-wages-also-rigid-downward"&gt;Q7. Q: Are real wages also rigid downward?&lt;/h3&gt;
&lt;p&gt;A: Yes. The paper reports that real wages (nominal posted wages deflated) are also rigid downward and flexible upward, mirroring the pattern for nominal wages.&lt;/p&gt;
&lt;h3 id="q8-q-what-is-the-job-composition-problem-and-why-does-it-matter"&gt;Q8. Q: What is the job-composition problem, and why does it matter?&lt;/h3&gt;
&lt;p&gt;A: The average wage for new hires — the object measured in most prior work — aggregates across all job types that are actively hiring. If the composition of jobs hiring shifts over the business cycle (e.g., the share of lower-wage jobs rises in recessions), then average wages can fall even if no individual job cuts its wage, and can stay flat or rise even if every job cuts its wage. Job composition therefore confounds cyclicality estimates based on average wages. By tracking the same job title at the same establishment across successive vacancies, the authors purge wage changes driven by shifting composition.&lt;/p&gt;
&lt;h3 id="q9-q-why-did-prior-work-find-no-evidence-of-downward-rigidity-for-new-hires"&gt;Q9. Q: Why did prior work find no evidence of downward rigidity for new hires?&lt;/h3&gt;
&lt;p&gt;A: Prior work used worker-level survey data (e.g., Bils 1985; Pissarides 2009 survey) that controls for worker characteristics but averages across jobs — the average wage for new hires. The volatility of job composition inflates the variance of this average measure. In the Burning Glass data, standard errors from regressions using average wages are roughly twenty times larger than those from job-level regressions, making it impossible to detect downward rigidity even if it exists. Point estimates in prior work suggested procyclicality but were too imprecise to exclude downward rigidity.&lt;/p&gt;
&lt;h3 id="q10-q-how-does-this-paper-relate-to-gertler-huckfeldt-and-trigari-2020-and-grigsby-hurst-and-yildirmaz-2021"&gt;Q10. Q: How does this paper relate to Gertler, Huckfeldt, and Trigari (2020) and Grigsby, Hurst, and Yildirmaz (2021)?&lt;/h3&gt;
&lt;p&gt;A: Both papers attempt to control for job composition at the worker level. Gertler et al. focus on wages of workers hired from unemployment (less affected by composition than all new hires) and find weakly procyclical wages. Grigsby et al. use rich payroll data and worker-level matching to control for composition and also find weakly procyclical wages. The present paper complements these by using job-level data that directly purges composition without relying on worker characteristics, and adds evidence on the asymmetry of rigidity (not just average procyclicality).&lt;/p&gt;
&lt;h3 id="q11-q-what-is-the-role-of-the-heckman-selection-correction"&gt;Q11. Q: What is the role of the Heckman selection correction?&lt;/h3&gt;
&lt;p&gt;A: If firms select into vacancy posting depending on business-cycle conditions, the sample of observed posted wages may be non-random, biasing job-level wage-cyclicality estimates. The authors implement a standard Heckman (1979) two-step selection correction. The correction suggests that selection bias in the job-level regression is moderate — it does not overturn the finding of downward rigidity.&lt;/p&gt;
&lt;h3 id="q12-q-what-are-the-four-main-caveats-the-authors-acknowledge"&gt;Q12. Q: What are the four main caveats the authors acknowledge?&lt;/h3&gt;
&lt;p&gt;A: (1) The main sample is small — 0.8% of US vacancies — though the authors show it is broadly representative on observables and that wages track representative survey data. (2) The paper measures rigidity only for jobs that post wages; jobs that do not post wages might be more flexible, though the share of vacancies posting wages does not decline during contractions. (3) Posted wages may differ from realized (bargained) wages; however, wages are rigid even in occupations where bargaining is uncommon. (4) The Pandemic Recession is the main contractionary episode in the sample, and it involved labor supply shocks as well as demand shocks; the authors address this through identified labor demand shock regressions and by ending the sample in June 2020.&lt;/p&gt;
&lt;h3 id="q13-q-what-are-the-implications-for-models-of-unemployment-fluctuations"&gt;Q13. Q: What are the implications for models of unemployment fluctuations?&lt;/h3&gt;
&lt;p&gt;A: In the Diamond–Mortensen–Pissarides search model, Pissarides (2009) emphasizes that the wage for newly hired workers — not continuing workers — is the relevant margin for unemployment fluctuations. Shimer (2005) showed the standard calibration produces too-small unemployment fluctuations; wage rigidity for new hires can resolve this. The paper&amp;rsquo;s finding of downward-but-not-upward rigidity additionally supports models (e.g., Dupraz, Nakamura, and Steinsson, 2020) in which this asymmetry generates asymmetric unemployment dynamics — unemployment rises sharply in contractions but falls more slowly in expansions.&lt;/p&gt;
&lt;h3 id="q14-q-how-do-wages-for-new-hires-compare-with-wages-for-continuing-workers-in-terms-of-rigidity"&gt;Q14. Q: How do wages for new hires compare with wages for continuing workers in terms of rigidity?&lt;/h3&gt;
&lt;p&gt;A: The paper finds approximate parity. The implied duration of unchanged wages from the job-level posted wage data (4–5 quarters) is similar to estimates for continuing workers in the prior literature. This is perhaps surprising because wages could in principle be more flexible for new hires than continuing workers — firms might cut wages for new hires even while insuring continuing workers (Beaudry and DiNardo, 1991). The results instead suggest that internal equity concerns (Bewley, 2002) or other forces produce similar rigidity for both groups.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Job level wage&lt;/strong&gt;: The wage across successive vacancies posted by the same job title at the same establishment. This is the unit of observation in the paper&amp;rsquo;s main analysis and the object for which downward rigidity is documented. Distinct from the average wage for new hires (which aggregates across all job types).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Downward rigidity (as used in this paper)&lt;/strong&gt;: An empirical pattern in which wages at the job level do not fall during contractions — they do not respond to rising unemployment — while rising during expansions in response to falling unemployment. The claim is descriptive: the data show wages do not fall; the paper does not structurally identify the mechanism enforcing this floor.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Job composition problem&lt;/strong&gt;: The bias introduced when measuring cyclicality of the average wage for new hires using data that aggregates across different types of jobs. If the mix of job types hiring shifts with the business cycle, average wages can change even when no individual job changes its wage, and can mask individual-job wage changes. Job-level data resolve this by holding the job fixed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Burning Glass Technologies dataset&lt;/strong&gt;: A database of wages posted on online job vacancies, drawn from approximately 40,000 online sources (job boards and company websites), covering the near-universe of US online vacancies. The paper&amp;rsquo;s main regression sample uses the subset with posted wages, job title, establishment identifiers, and multiple quarters of postings, spanning January 2010 to June 2020.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Constant hazard model (wage change frequency)&lt;/strong&gt;: An estimation procedure adapted from the price-setting literature to recover the quarterly probability of wage change from a dataset in which wages are only observed when a vacancy is posted. The latent wage evolves with a constant hazard of change between observations; observed wage changes identify the hazard rates for increases and decreases separately.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Average wage for new hires&lt;/strong&gt;: The mean wage across all workers newly entering employment (or across all new-hire jobs), used in prior work (Bils 1985 and related). Does not control for job composition. Shown in this paper to exhibit no detectable downward rigidity, with standard errors roughly twenty times larger than in job-level specifications — because job composition variance inflates the residual variance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Heckman selection correction&lt;/strong&gt;: A two-step procedure (Heckman 1979) to correct for the possibility that firms that post vacancies — and post wages — are a selected sample that differs systematically across the business cycle. The paper applies this to assess whether selection into vacancy posting biases the job-level wage-cyclicality estimates; the correction suggests bias is moderate.&lt;/p&gt;
&lt;hr&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on LSE Research Online accepted version (accepted manuscript, covers full paper including introduction, data, and Section 3; extraction terminated at line 595 before Sections 4–5). AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>Dynamic Concern for Misspecification</title><link>https://macropaperwarehouse.com/papers/dynamic-concern-for-misspecification/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/dynamic-concern-for-misspecification/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks how an agent who fears that none of their probabilistic models is the correct description of the data-generating process (DGP) should update that fear as evidence accumulates, and what long-run behavior such an agent exhibits. The central contribution is making the concern for misspecification &lt;em&gt;endogenous&lt;/em&gt;: the better the agent&amp;rsquo;s structured models explain past observations, the less concerned the agent becomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Decision Criterion&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The agent posits a finite-dimensional parametric set of structured models Θ, holds a prior µ over Θ, and evaluates each action according to an &lt;em&gt;average robust control criterion&lt;/em&gt;. This criterion takes a weighted average (over models) of robust control assessments, where each assessment penalizes expected utility for probability distributions that deviate from the structured model in terms of relative entropy, scaled by a misspecification concern parameter λ &amp;gt; 0. A standard subjective expected utility maximizer is the limiting case as λ → 0 (no concern), and a maxmin agent is approached as λ → ∞.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous Misspecification Concern&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The concern parameter λ is updated each period as a function of the likelihood ratio test (LRT) statistic of the structured models against unstructured alternatives, scaled by a time-normalizing sequence βₜ: λ(hₜ) = LRT(hₜ, Θ) / (2βₜ). The sequence βₜ determines how demanding the agent is in evaluating model fit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Taxonomy of Agent Types&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three types emerge based on the speed of βₜ:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Statistician type&lt;/strong&gt; (βₜ = ct, linear): applies a time scaling that keeps the LRT asymptotically informative about the degree of misspecification. This is the unique type satisfying both &lt;em&gt;safety&lt;/em&gt; (long-run average payoff at least ε-close to the maxmin guarantee, almost surely) and &lt;em&gt;consistency under almost correct specification&lt;/em&gt; (no ε-regret when misspecification is small).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Lenient type&lt;/strong&gt; (t = o(βₜ)): attributes unexplained evidence to sampling variability; corresponds to the Law of Large Numbers intuition.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Demanding type&lt;/strong&gt; (βₜ = o(t)): overly penalizes small discrepancies, analogous to the Law of Small Numbers fallacy (Tversky and Kahneman, 1971).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Standard SEU maximization fails safety; robust control with an invariant λ (Hansen and Sargent, 2001; 2022) fails consistency under almost correct specification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Long-Run Convergence Results (Theorem 1)&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For a misspecified agent (no θ ∈ Θ with qθ_{a*} = p*_{a*}), the nature of the limit action a* depends on the agent type:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;em&gt;Lenient type&lt;/em&gt;: a* is a &lt;strong&gt;Berk-Nash equilibrium&lt;/strong&gt; — an SEU best reply to beliefs supported on the models with minimum relative entropy from the true DGP.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Demanding type&lt;/em&gt;: a* is a &lt;strong&gt;maxmin equilibrium&lt;/strong&gt; — a worst-case best reply to all models absolutely continuous with respect to the true DGP.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Statistician type&lt;/em&gt;: if behavior converges, a* is a &lt;strong&gt;c-robust equilibrium&lt;/strong&gt; — a robust control best reply to beliefs on the relative entropy minimizers, with the concern for misspecification endogenously set at minθ R(p*&lt;em&gt;{a*} || qθ&lt;/em&gt;{a*}) / c.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;For a correctly specified agent (Proposition 2), every limit action is a &lt;strong&gt;self-confirming equilibrium&lt;/strong&gt;, regardless of the agent type.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cycles and Limit Frequency (Section 4, Theorem 2)&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The statistician type&amp;rsquo;s behavior need not converge. In natural settings, the agent cycles between actions: playing a &amp;ldquo;safe&amp;rdquo; action whose consequences are well-explained by Θ reduces concern for misspecification, eventually leading to a riskier action whose poorly-explained consequences raise concern again, inducing a return to the safe action. The paper proves that every limit &lt;em&gt;frequency&lt;/em&gt; (empirical distribution over actions) is a &lt;strong&gt;mixed c-robust equilibrium&lt;/strong&gt; — a generalization that allows mixing while tying the concern for misspecification to the frequency-weighted average relative entropy of each action.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Applications&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Monetary policy cycles&lt;/em&gt; (Sargent 1999, 2008): In a central bank model where the true DGP includes increased inflation variability under aggressive policy (a feature absent from the bank&amp;rsquo;s structured models), no pure c-robust equilibrium exists for small c. The model predicts persistent cycles between conservative and aggressive policy. The frequency of the conservative policy is increasing in the strength of the exploitable inflation-unemployment trade-off (θ&lt;em&gt;₁π + θ&lt;/em&gt;₁a).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Labor supply under complex tax schedules&lt;/em&gt; (Rees-Jones and Taubinsky, 2020): Agents with a &amp;ldquo;schmeduling&amp;rdquo; heuristic (linearizing the tax schedule) are misspecified. Berk-Nash equilibrium predicts these agents exert excess effort, with the bias increasing in the complexity (convexity) of the tax code. The c-robust equilibrium attenuates this bias: conditional on the equilibrium, minθ R(p*_a || qθ_a) &amp;gt; 0, so agents maintain positive concern for misspecification and pull back from the biased recommendation. The paper rationalizes the empirical finding that approximately 40% of agents hold the schmeduling belief but only about 20% fewer agents act on it — consistent with endogenous concern reducing the behavioral impact of the biased model.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Axiomatization (Section 5)&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper axiomatizes the static average robust control criterion (Theorem 3) using: a Variational Axiom (from Maccheroni, Marinacci, and Rustichini, 2006a), a Structured Savage axiom (Sure-Thing Principle for bets on the model identity), an Intramodel Sure-Thing Principle (STP for bets conditional on the model), and Uniform Misspecification Concern (the agent is equally concerned about misspecification regardless of which model is identified as best-fitting). Three additional dynamic axioms characterize preference evolution: Constant Preference Invariance (utility index stable over time), Dynamic Consistency over Models (Bayesian updating over structured models), and Q-Likelihood (misspecification concern increases in the LRT). A novel Asymptotic Frequentism axiom characterizes the statistician type: preferences must become arbitrarily similar (in a precise quantitative sense) after sufficiently long histories with the same outcome frequency.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-average-robust-control-criterion-and-how-does-it-generalize-prior-decision-criteria"&gt;Q1. What is the average robust control criterion and how does it generalize prior decision criteria?&lt;/h3&gt;
&lt;p&gt;A: An agent evaluates action a by averaging over structured models θ a robust control assessment: for each θ, minimize expected utility over probability distributions within relative entropy distance (penalized by 1/λ) of qθ_a, then integrate over θ with prior µ. This nests SEU (λ → 0, perfect trust in models), standard robust control of Hansen and Sargent (2001) (µ is Dirac, single benchmark model), and maxmin expected utility of Gilboa and Schmeidler (λ → ∞). The key extension is allowing µ to be nondegenerate, so the agent is simultaneously uncertain about the best-fitting model and about whether any model is exact.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-role-of-the-likelihood-ratio-test-statistic-in-driving-misspecification-concern"&gt;Q2. What is the role of the likelihood ratio test statistic in driving misspecification concern?&lt;/h3&gt;
&lt;p&gt;A: The LRT statistic compares the maximum likelihood of the structured models against the best unstructured alternative. It diverges almost surely when the agent is misspecified, regardless of how close the structured models are to the true DGP. The concern parameter λ(hₜ) = LRT(hₜ, Θ) / (2βₜ) uses a time-scaling sequence βₜ to keep this statistic interpretable. Without scaling, a misspecified agent&amp;rsquo;s concern would always explode to infinity.&lt;/p&gt;
&lt;h3 id="q3-why-does-linear-time-scaling-βₜ--ct-uniquely-characterize-the-statistician-type-as-rational"&gt;Q3. Why does linear time scaling (βₜ = ct) uniquely characterize the statistician type as rational?&lt;/h3&gt;
&lt;p&gt;A: Proposition 1 establishes two properties: (1) ε-safety — every βₜ = ct-optimal policy achieves average payoff at least ε below the maxmin guarantee, almost surely; (2) ε-consistency under almost correct specification — for DGPs sufficiently close to Θ, the agent avoids long-run regret. Part 2 of Proposition 1 shows that no βₜ with βₜ = o(t) or t = o(βₜ) satisfies both properties simultaneously. SEU fails safety; invariant-λ robust control fails consistency.&lt;/p&gt;
&lt;h3 id="q4-what-is-a-c-robust-equilibrium-and-how-does-it-differ-from-a-berk-nash-equilibrium"&gt;Q4. What is a c-robust equilibrium and how does it differ from a Berk-Nash equilibrium?&lt;/h3&gt;
&lt;p&gt;A: A Berk-Nash equilibrium (Esponda and Pouzo, 2016) requires the action to be an SEU best reply to beliefs supported on the relative entropy minimizers of the true DGP. A c-robust equilibrium requires the same support condition but with the best reply taken under the average robust control criterion, where the concern for misspecification λ equals minθ R(p*&lt;em&gt;{a*} || qθ&lt;/em&gt;{a*}) / c — that is, the minimum relative entropy scaled by 1/c. The endogenous λ is positive whenever the agent is misspecified, so the agent does not fully trust even the best-fitting model.&lt;/p&gt;
&lt;h3 id="q5-how-does-the-paper-explain-that-misspecified-lenient-types-converge-to-berk-nash-while-demanding-types-converge-to-maxmin"&gt;Q5. How does the paper explain that misspecified lenient types converge to Berk-Nash while demanding types converge to maxmin?&lt;/h3&gt;
&lt;p&gt;A: For the lenient type (t = o(βₜ)), the time scaling makes the concern for misspecification converge to 0 (the LRT grows slower than βₜ relative to t), so the agent effectively behaves as an SEU maximizer with beliefs on the KL-minimizing models — the Berk-Nash condition. For the demanding type (βₜ = o(t)), the LRT diverges relative to βₜ, so λ → ∞ and the agent&amp;rsquo;s preferences converge to worst-case evaluation over all models absolutely continuous with the true DGP — the maxmin condition. These are Theorem 1, parts 1 and 2.&lt;/p&gt;
&lt;h3 id="q6-why-does-the-statistician-type-exhibit-cycles-rather-than-convergence"&gt;Q6. Why does the statistician type exhibit cycles rather than convergence?&lt;/h3&gt;
&lt;p&gt;A: Section 4 and Corollary 1 show in the monetary policy application that no pure c-robust equilibrium exists for small c. Intuitively, the conservative policy (a=0) is a best reply to a high misspecification concern, but it produces outcomes well-explained by Θ, which drives concern down. The aggressive policy (a=1) is a best reply to a low concern, but it generates increased inflation variability not captured in Θ, which drives concern up sharply. There is no fixed point that is self-sustaining, so the agent cycles. Theorem 2 shows that the empirical frequency of actions still converges to a mixed c-robust equilibrium.&lt;/p&gt;
&lt;h3 id="q7-what-are-the-quantitative-comparative-statics-for-the-monetary-policy-cycles"&gt;Q7. What are the quantitative comparative statics for the monetary policy cycles?&lt;/h3&gt;
&lt;p&gt;A: Corollary 1 establishes that there exists a threshold c̄ &amp;gt; 0 such that for all c ≤ c̄: (1) no pure c-robust equilibrium exists; (2) a mixed c-robust equilibrium exists; and (3) in the maximal and minimal equilibria, the frequency of the conservative policy α*(0) is increasing in θ&lt;em&gt;₁π + θ&lt;/em&gt;₁a — a larger exploitable trade-off between inflation and unemployment implies more time spent on the aggressive policy.&lt;/p&gt;
&lt;h3 id="q8-how-does-the-model-rationalize-the-rees-jones-and-taubinsky-2020-labor-supply-finding"&gt;Q8. How does the model rationalize the Rees-Jones and Taubinsky (2020) labor supply finding?&lt;/h3&gt;
&lt;p&gt;A: Rees-Jones and Taubinsky (2020) find that approximately 40% of agents have incentive-compatible beliefs consistent with the schmeduling heuristic (linearizing a convex tax schedule), but approximately 20% fewer agents act according to that heuristic. In a Berk-Nash equilibrium, the schmeduling agent exerts excess effort relative to the optimum; the more convex the tax code, the larger the excess. In a c-robust equilibrium, the agent retains a positive misspecification concern proportional to the deviation between the convex tax schedule and the linear approximation. Higher effort levels are more exposed to uncertainty in the marginal rate (the misspecified term θ+ε multiplies a higher average income z), so the concern for misspecification provides a natural force that reduces effort below the Berk-Nash prediction. The paper notes this finding is also consistent with an alternative interpretation in Rees-Jones and Taubinsky where all agents hold schmeduling beliefs but under-respond behaviorally.&lt;/p&gt;
&lt;h3 id="q9-what-is-the-mixed-c-robust-equilibrium-and-why-does-it-always-exist"&gt;Q9. What is the mixed c-robust equilibrium and why does it always exist?&lt;/h3&gt;
&lt;p&gt;A: A mixed c-robust equilibrium is a mixed action α* ∈ Δ(A) such that beliefs ν are supported on the relative entropy minimizers Θ(α*) — computed as the parameter minimizing the α*-weighted average relative entropy across actions — and every action in the support of α* is a best reply under the average robust control criterion with λ = minθ Σ_a α*(a) R(p*_a || qθ_a) / c. Proposition 3 proves existence by mapping this fixed-point condition to a Nash equilibrium in an auxiliary game between the agent and two adversarial Nature players, then invoking Reny (1999) on that game. A pure c-robust equilibrium need not exist, but mixing over actions allows the concern for misspecification to be calibrated to the frequency of poorly-explained actions.&lt;/p&gt;
&lt;h3 id="q10-how-does-theorem-2-formally-connect-cycles-to-mixed-c-robust-equilibria"&gt;Q10. How does Theorem 2 formally connect cycles to mixed c-robust equilibria?&lt;/h3&gt;
&lt;p&gt;A: Theorem 2 states that if βₜ = ct for all t and α* is a βₜ-limit frequency (i.e., the empirical action distribution converges to α* with positive probability under some optimal policy), then α* is a mixed c-robust equilibrium. The intuition is that when α* places weight on both a well-explained action and a poorly-explained action, the time-averaged relative entropy stabilizes at a fixed level, producing a stable endogenous concern for misspecification that makes the agent asymptotically indifferent between the actions in the support — sharply reducing the incentive to break the cycle.&lt;/p&gt;
&lt;h3 id="q11-what-does-the-axiomatization-contribute-beyond-the-learning-results"&gt;Q11. What does the axiomatization contribute beyond the learning results?&lt;/h3&gt;
&lt;p&gt;A: The axiomatization (Section 5, Theorem 3) provides behavioral foundations observable from choices, without assuming the internal LRT mechanism. Two primary axioms pin down the average robust control criterion within the variational class: Structured Savage (Sure-Thing Principle for bets over model identity) and Uniform Misspecification Concern (equal concern for misspecification regardless of which model is revealed as best-fitting). Dynamic Consistency over Models pins down Bayesian updating. Q-Likelihood axiomatizes that the concern for misspecification is ordinally increasing in the LRT. The novel Asymptotic Frequentism axiom (Axiom 9) pins down the &lt;em&gt;quantitative speed&lt;/em&gt; of adjustment: long histories with the same empirical frequency must induce asymptotically similar preferences, and Proposition 5 shows this implies λ_{hₜ} / (LRT(hₜ, Q) / (2tₙ)) converges to a finite limit — exactly the statistician type&amp;rsquo;s linear scaling.&lt;/p&gt;
&lt;h3 id="q12-what-is-the-correlation-between-behavioral-biases-that-the-model-predicts"&gt;Q12. What is the correlation between behavioral biases that the model predicts?&lt;/h3&gt;
&lt;p&gt;A: The paper derives three novel empirical predictions about the cross-sectional and time-series correlation of uncertainty attitudes: (1) long-run uncertainty aversion positively correlates with initial misspecification and with belief in the Law of Small Numbers; (2) these correlations are causal — repeated model failures and overly demanding evaluation induce a shift toward cautious behavior; (3) even holding misspecification and probability reasoning fixed, limit uncertainty attitudes are stochastic, depending on whether the limit action&amp;rsquo;s outcomes are well-explained by the structured models.&lt;/p&gt;
&lt;h3 id="q13-how-does-example-2-correlation-neglect-show-that-endogenous-concern-can-amplify-rather-than-attenuate-biases"&gt;Q13. How does Example 2 (Correlation Neglect) show that endogenous concern can amplify rather than attenuate biases?&lt;/h3&gt;
&lt;p&gt;A: In a double auction, a buyer who mistakenly treats their own valuation and the ask price as independent (Correlation Neglect, Esponda, 2008) bids below the optimum in Berk-Nash equilibrium. In a c-robust equilibrium, the positive correlation between valuations and prices produces a strictly positive minθ R(p*&lt;em&gt;{a*} || qθ&lt;/em&gt;{a*}), so the agent maintains misspecification concern. Since lower bids are accepted with lower probability (and thus are less sensitive to model misspecification), the endogenous concern drives the agent to bid even lower — amplifying the bias rather than attenuating it. This example illustrates that the direction of the correction depends on the geometry of how the misspecification interacts with the payoff structure.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Average Robust Control Criterion&lt;/strong&gt;: The decision criterion proposed in the paper. An agent evaluates action a by taking the expectation over structured models θ (with prior µ) of min_{p_a ∈ Δ(Y)} [E_{p_a}[u(a,y)] + (1/λ) R(p_a || qθ_a)]. This is a weighted average of robust control assessments, each penalizing distributions that deviate from a structured model in relative entropy. The parameter λ &amp;gt; 0 governs the intensity of misspecification concern, with SEU as the limit at λ → 0 and maxmin at λ → ∞.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous Misspecification Concern&lt;/strong&gt;: Unlike prior robust control models where λ is fixed or set externally, here λ(hₜ) = LRT(hₜ, Θ) / (2βₜ) is a function of how well the structured models explain the observed history hₜ via the likelihood ratio test statistic. The better the models explain past data, the smaller λ becomes and the less the agent hedges.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Statistician Type&lt;/strong&gt;: An agent who scales the likelihood ratio test statistic with a linear time sequence βₜ = ct for some c &amp;gt; 0. This is the unique agent type satisfying both ε-safety (guaranteed long-run average payoff above the maxmin guarantee minus ε) and ε-consistency under almost correct specification (no long-run regret when misspecification is small). The statistician type&amp;rsquo;s linear scaling is the only one for which the LRT statistic retains asymptotic informativeness about the degree of misspecification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;c-Robust Equilibrium&lt;/strong&gt;: A fixed-point concept for the long-run behavior of the statistician type. Action a* is a c-robust equilibrium if it is an average robust control best reply to beliefs supported on Θ(a*) = argmin_θ R(p*&lt;em&gt;{a*} || qθ&lt;/em&gt;{a*}), with misspecification concern λ = minθ R(p*&lt;em&gt;{a*} || qθ&lt;/em&gt;{a*}) / c. This generalizes Berk-Nash equilibrium by incorporating an endogenous hedging motive proportional to the minimum relative entropy between the true DGP and the best structured model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mixed c-Robust Equilibrium&lt;/strong&gt;: A generalization of c-robust equilibrium to mixed actions α* ∈ Δ(A) for environments where no pure equilibrium exists. The beliefs are supported on the models minimizing the α*-weighted average relative entropy, and the misspecification concern is tied to that average entropy. Every βₜ-limit frequency is a mixed c-robust equilibrium (Theorem 2). This concept characterizes the long-run time-average behavior when the statistician type cycles.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Law of Small Numbers (LSN) Type / Demanding Type&lt;/strong&gt;: An agent for whom βₜ = o(t), meaning the time scaling grows sub-linearly. This agent is excessively sensitive to early model failures (analogously to the Law of Small Numbers fallacy of Tversky and Kahneman, 1971, where short-run frequencies are treated as the long-run norm). The long-run behavior of such a type converges to maxmin behavior rather than robust control.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Asymptotic Frequentism (Axiom 9)&lt;/strong&gt;: A novel axiom requiring that conditional preferences after sufficiently long histories with the same empirical outcome frequency must be arbitrarily similar (in a quantitative sense defined by measuring rods x, y, E) to a limiting preference. This axiom axiomatically pins down the statistician type&amp;rsquo;s linear time scaling: it implies that the ratio λ_{hₜ} / (LRT(hₜ, Q) / (2t)) converges to a finite limit c, exactly characterizing βₜ = ct.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Berk-Nash Equilibrium&lt;/strong&gt;: The equilibrium concept (Esponda and Pouzo, 2016) that describes the long-run behavior of lenient (SEU) agents learning under misspecification. An action a* is a Berk-Nash equilibrium if it is an SEU best reply to beliefs supported on Θ(a*) — the KL-minimizing models — without any additional hedging against misspecification. The current paper shows that lenient types converge to Berk-Nash equilibria, while statistician types converge to c-robust equilibria that differ by incorporating a positive misspecification concern.&lt;/p&gt;</description></item><item><title>Dynamics of the Long-Term Housing Yield: Evidence from Natural Experiments</title><link>https://macropaperwarehouse.com/papers/dynamics-of-the-long-term-housing-yield-evidence-from-natural-experiments/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/dynamics-of-the-long-term-housing-yield-evidence-from-natural-experiments/</guid><description>&lt;p&gt;Each month a fraction of UK property leases are extended by 90 years or more, creating thousands of natural experiments in which the same property&amp;rsquo;s rent and capital value are revealed simultaneously. This paper uses these lease extensions — and Massachusetts and Cambridge rent-control removals as a second identification strategy — to estimate the expected long-term housing yield (annual rent-to-price ratio) and decompose its dynamics into rent-growth expectations and discount-rate components. The central finding is that housing yield movements are dominated by discount-rate shocks: variation in required returns on housing explains the overwhelming majority of yield variance, while expected rent growth contributes less than 10 percent. Housing booms are therefore primarily driven by falling required returns, not by rational expectations of higher future rents. The yield responds to real long-term interest rates with a slope significantly below one, consistent with a non-pecuniary convenience yield on housing that is not fully displaced by interest rate changes.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-do-the-natural-experiments-identify"&gt;Q1. What do the natural experiments identify?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Lease extensions reveal the market&amp;rsquo;s valuation of the same physical dwelling at two points — just before and just after the 90-year extension — with the extension itself creating a clean variation in the remaining lease term (and hence in the present value of ownership) without changing the property&amp;rsquo;s rent-generating characteristics.&lt;/strong&gt; This design separates the rent and price components of the yield at the property level, allowing identification of discount-rate and rent-growth contributions free of compositional differences across properties.&lt;/p&gt;
&lt;h3 id="q2-why-do-discount-rates-dominate-yield-variation"&gt;Q2. Why do discount rates dominate yield variation?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A present-value decomposition of the housing yield into expected rent growth and the discount rate assigns more than 90 percent of variance to the discount rate component, implying that periods of low housing yields (high prices relative to rent) reflect primarily that investors demand a lower return on housing — not that they expect rents to rise faster.&lt;/strong&gt; This result mirrors Campbell-Shiller findings for equity markets but is especially striking for housing, where naive narratives often attribute booms to expected rent appreciation.&lt;/p&gt;
&lt;h3 id="q3-what-does-the-convenience-yield-interpretation-imply"&gt;Q3. What does the convenience yield interpretation imply?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Housing yields respond less than one-for-one to real interest rate movements — a slope well below one in the yield-rate regression — implying that housing carries a non-pecuniary convenience yield (liquidity, collateral value, direct utility of ownership) that buffers the required return on housing against interest rate changes.&lt;/strong&gt; When real rates rise, housing yields rise by less, so price-to-rent ratios decline by less than a frictionless model would predict.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;housing yield&lt;/strong&gt; : the annual rent-to-price ratio on residential property; the paper&amp;rsquo;s central object, decomposed into discount-rate and rent-growth components.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;discount-rate channel&lt;/strong&gt; : the dominant source of housing yield variation in this paper; movements in investors&amp;rsquo; required return on housing, not expected rent growth, drive the observed yield dynamics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;convenience yield&lt;/strong&gt; : the non-pecuniary value of housing ownership (liquidity, collateral, direct utility) that drives a wedge between the housing yield and the risk-free real interest rate; explains the less-than-one slope in the yield-rate relationship.&lt;/p&gt;</description></item><item><title>Efficiency Criteria, Income Taxation, and Heterogeneous Elasticities</title><link>https://macropaperwarehouse.com/papers/efficiency-criteria-income-taxation-and-heterogeneous-elasticities/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/efficiency-criteria-income-taxation-and-heterogeneous-elasticities/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Can income tax schedules be justified as utilitarian-optimal without adopting extreme normative assumptions about how household welfare should be measured? The paper proposes a welfare criterion strictly stronger than Pareto efficiency—called &lt;em&gt;rationalizability with bounded curvature&lt;/em&gt;—and asks whether observed US income taxes satisfy it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Starting Point.&lt;/strong&gt; Any Pareto-efficient nonlinear income tax schedule can, in principle, be rationalized as utilitarian-optimal under &lt;em&gt;some&lt;/em&gt; cardinalization of household utilities (i.e., some choice of how to measure the cardinal scale of each household&amp;rsquo;s well-being). However, the paper shows that rationalizing Pareto-efficient taxes in this way often requires cardinalizations under which there is &lt;em&gt;no&lt;/em&gt; population upper bound on the curvature of utility with respect to consumption. Equivalently, a utilitarian planner&amp;rsquo;s marginal willingness to transfer resources to households must fall arbitrarily quickly with the size of those transfers—an extreme form of status quo bias violated by virtually all quantitative optimal-tax exercises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The Proposed Criterion.&lt;/strong&gt; The authors restrict attention to cardinalizations with &lt;em&gt;locally bounded curvature&lt;/em&gt;: there exists a finite (though potentially arbitrarily large) upper bound on the coefficient of relative risk aversion across the population. This admits two interpretations: (i) ex post, it requires that the social value of transfers not change arbitrarily quickly with transfer size; (ii) ex ante, it corresponds to a decision-maker behind a veil of ignorance with bounded risk aversion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Theoretical Result.&lt;/strong&gt; Within a standard Mirrlees model of nonlinear income taxation with arbitrary preference heterogeneity and intensive-margin labor supply, the paper proves that a tax schedule can be rationalized with bounded curvature if and only if government revenues are both &lt;em&gt;decreasing and concave&lt;/em&gt; (not merely decreasing) with respect to a class of narrowly targeted &amp;ldquo;two-bracket&amp;rdquo; reforms—reforms that raise retention by $1 local to some income level $z$ and zero elsewhere. This contrasts with Pareto efficiency, which requires only that revenues be decreasing in these reforms (Bierbrauer, Boyer, and Hansen 2023). The additional requirement of revenue concavity is what distinguishes the bounded-curvature criterion from pure Pareto efficiency.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sufficient Statistics.&lt;/strong&gt; The paper derives explicit sufficient-statistics expressions for the first- and second-order derivatives of tax revenue with respect to these targeted reforms. The second derivative depends on higher moments of the elasticity distribution, specifically the &lt;em&gt;income-conditional variance&lt;/em&gt; of compensated elasticities of taxable income (ETIs). Revenue convexity—which causes the second-order condition to fail—arises when income-conditional ETI variance is sufficiently high, even holding the mean ETI fixed. The economic mechanism is a &amp;ldquo;sort-and-extort&amp;rdquo; dynamic: a small tax reform sorts higher-elasticity households into income brackets where marginal taxes fall and lower-elasticity households into brackets where marginal taxes rise; repeating the reform then exploits this sorting by differentially taxing households by elasticity, as if applying group-specific tax schedules within a uniform income tax.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Findings.&lt;/strong&gt; Using the NBER panel of US tax returns from 1979 to 1990, the paper estimates income-conditional mean ETIs of approximately 0.2–0.3 at most income levels. Crucially, it estimates a &lt;em&gt;lower bound&lt;/em&gt; on income-conditional ETI variance by comparing elasticities of light versus heavy itemizers (defined by whether a household claims above or below the mean value of deductions in its income bracket). The low-elasticity group has an ETI of approximately zero and the high-elasticity group has an ETI of approximately one, implying a lower bound on ETI variance of roughly 0.2 at most incomes and approximately 0.25 at the top of the distribution. This lower bound is close to—and under plausible assumptions above—the threshold required for the second-order condition to fail. The authors conclude that the US income tax schedule in 1990 was likely Pareto efficient but likely &lt;em&gt;not&lt;/em&gt; rationalizable with bounded curvature.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative Welfare Gains.&lt;/strong&gt; In a calibrated model with a 50% top marginal tax rate, Pareto-tail shape of 2.5, mean ETI of 0.3, and ETI standard deviation of 0.75 (50% above the estimated lower bound), the planner gains significant welfare from either raising or lowering top marginal taxes. The welfare-maximizing top rate below the baseline is 13.3%, generating social value equivalent to a transfer of $1,966 per top earner. The welfare-maximizing top rate above the baseline is 71.2%, generating social value equivalent to a transfer of $972 per top earner. The revenue-maximizing rate is 80.9% under the baseline calibration, ranging from 74.6% to 86.8% as ETI standard deviation varies by ±25% of the lower bound.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; The theoretical analysis is restricted to intensive-margin labor supply (abstracting from extensive-margin decisions); the empirical application focuses on top incomes where extensive-margin effects are likely small. The empirical period is 1979–1990, covering major federal and state tax reforms. Results concern local efficiency of the tax schedule, not global optimization.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-exactly-is-rationalizability-with-bounded-curvature-and-how-does-it-differ-from-pareto-efficiency"&gt;Q1. What exactly is &amp;ldquo;rationalizability with bounded curvature&amp;rdquo; and how does it differ from Pareto efficiency?&lt;/h3&gt;
&lt;p&gt;A: Pareto efficiency requires that no small reform makes someone better off without making anyone worse off. Rationalizability (with &lt;em&gt;any&lt;/em&gt; cardinalization) is equivalent to Pareto efficiency in this setting. Rationalizability with bounded curvature additionally restricts the cardinalization: there must exist a finite upper bound on the coefficient of relative risk aversion (or equivalently, on the curvature of utility with respect to consumption) across the population. This is a strictly stronger criterion than Pareto efficiency. A schedule can be Pareto efficient but not rationalizable with bounded curvature if the only cardinalizations that rationalize it require unbounded consumption utility curvature.&lt;/p&gt;
&lt;h3 id="q2-why-do-extreme-cardinalizations-with-unbounded-curvature-arise-when-rationalizing-pareto-efficient-taxes"&gt;Q2. Why do &amp;ldquo;extreme&amp;rdquo; cardinalizations with unbounded curvature arise when rationalizing Pareto-efficient taxes?&lt;/h3&gt;
&lt;p&gt;A: When a Pareto-efficient schedule is rationalized as utilitarian, the cardinalization must make the set of feasible, recardinalized utilities convex so it can be separated from the set of Pareto-improving allocations. The paper constructs such a cardinalization explicitly: it takes the form of a function whose second derivative approaches negative infinity as utility approaches its baseline value. This implies the planner&amp;rsquo;s marginal value of transfers to a household falls precipitously as the household is made even slightly better off—an extreme status quo bias. Theorem 2.b establishes that &lt;em&gt;all&lt;/em&gt; cardinalizations rationalizing a schedule with convex revenues must share this pathology.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-sort-and-extort-mechanism-and-how-does-it-generate-revenue-convexity"&gt;Q3. What is the &amp;ldquo;sort-and-extort&amp;rdquo; mechanism and how does it generate revenue convexity?&lt;/h3&gt;
&lt;p&gt;A: When elasticities of taxable income (ETIs) are heterogeneous within an income level and the income density is declining steeply, a reform that lowers marginal taxes around income $z$ brings more households into the local bracket (because there are more households just below $z$ than above). Crucially, it disproportionately attracts households with &lt;em&gt;higher&lt;/em&gt; ETIs, since they respond more strongly to the marginal tax cut and relocate from further away, where the density differs more. Repeating the reform therefore faces a higher-elasticity composition at $z$, generating larger positive behavioral effects—making revenues convex in the size of the reform. The second step (&amp;ldquo;extort&amp;rdquo;) involves raising taxes on the now-concentrated low-elasticity households at adjacent brackets, achieving as-if group-specific taxation within a single income tax schedule.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-precise-relationship-between-revenue-convexity-and-eti-variance"&gt;Q4. What is the precise relationship between revenue convexity and ETI variance?&lt;/h3&gt;
&lt;p&gt;A: The paper shows (Theorem 4) that the second-order revenue derivative with respect to a narrow two-bracket reform around income $z$ equals a positive function of the income density times the expression $-[1-R&amp;rsquo;_0(z)]\varepsilon(z) + [1-R&amp;rsquo;_0(z)]\alpha(z)[\varepsilon^2(z) + \text{var}_h[\varepsilon^h | z^h_0=z]]$. The first term is always negative (pushing toward revenue concavity). The second term, which includes the income-conditional variance of ETIs, can dominate and create revenue convexity when ETI variance is sufficiently large. In the benchmark case with a single household type at each income (no within-income heterogeneity), the variance term vanishes and revenues are always concave whenever decreasing.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-sufficient-statistics-test-for-rationalizability-at-the-top-of-the-income-distribution"&gt;Q5. What is the sufficient statistics test for rationalizability at the top of the income distribution?&lt;/h3&gt;
&lt;p&gt;A: At top incomes (assuming no income effects, no super-elasticities, and CES preferences), taxes are Pareto efficient if and only if $\tau_\text{top} &amp;lt; \frac{1}{1+\alpha_\text{top}\varepsilon_\text{top}}$, and they are rationalizable with bounded curvature if and only if additionally $\tau_\text{top} &amp;lt; \frac{2}{1+\alpha_\text{top}(\varepsilon_\text{top} + \sigma^2_\text{top}/\varepsilon_\text{top})}$, where $\tau_\text{top}$ is the top marginal tax rate, $\alpha_\text{top}$ is the Pareto tail shape, $\varepsilon_\text{top}$ is the mean ETI at the top, and $\sigma^2_\text{top}$ is the income-conditional ETI variance at the top.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-paper-estimate-a-lower-bound-on-income-conditional-eti-variance"&gt;Q6. How does the paper estimate a lower bound on income-conditional ETI variance?&lt;/h3&gt;
&lt;p&gt;A: The authors divide households at each income level into &amp;ldquo;heavy&amp;rdquo; and &amp;ldquo;light&amp;rdquo; itemizers based on whether their total deductions exceed the local income-bracket mean. They then estimate group-specific ETIs using local polynomial regressions of log income changes on log marginal retention changes, interacting tax changes with heavy-itemizer indicators. The within-year difference in elasticities between groups provides a lower bound on within-income ETI variance, since the two-group decomposition captures only a fraction of true variance. The interaction coefficient is allowed to vary by year to isolate within-year, within-income variation in elasticities rather than between-year compositional changes.&lt;/p&gt;
&lt;h3 id="q7-what-are-the-estimated-magnitudes-of-mean-and-variance-of-etis"&gt;Q7. What are the estimated magnitudes of mean and variance of ETIs?&lt;/h3&gt;
&lt;p&gt;A: Income-conditional average ETIs are estimated at between 0.2 and 0.3 at most income levels, consistent with but somewhat below prior literature estimates. The low-elasticity group (light itemizers) has an ETI of approximately zero, while the high-elasticity group (heavy itemizers) has an ETI of approximately one. Given roughly equal group sizes, this implies a lower bound on ETI variance of approximately 0.2 at most incomes and approximately 0.25 at the ninety-fifth percentile. Subdividing the high-elasticity group into two, three, and four subgroups yields a lower bound of approximately 0.25 for variance at the top.&lt;/p&gt;
&lt;h3 id="q8-how-does-the-back-of-the-envelope-calculation-work-to-assess-whether-the-second-order-test-fails"&gt;Q8. How does the back-of-the-envelope calculation work to assess whether the second-order test fails?&lt;/h3&gt;
&lt;p&gt;A: With $\tau_\text{top} \approx 0.5$, $\alpha_\text{top} \approx 2.5$, and $\varepsilon_\text{top} \approx 0.3$ (from prior literature), the second-order condition fails if and only if ETI variance exceeds approximately 0.27. The authors&amp;rsquo; lower bound estimate of ETI variance is already approximately 0.25 (standard deviation approximately 0.5), just below this threshold. The authors note that if the true standard deviation exceeds the lower bound by more than 4%, the second-order condition fails, making it empirically likely that the 1990 US tax schedule was not rationalizable with bounded curvature.&lt;/p&gt;
&lt;h3 id="q9-why-does-the-paper-focus-on-the-top-of-the-income-distribution-for-the-empirical-test"&gt;Q9. Why does the paper focus on the top of the income distribution for the empirical test?&lt;/h3&gt;
&lt;p&gt;A: The second-order condition is most likely to fail at high incomes for three reasons simultaneously: (i) the marginal tax rate is highest, (ii) ETI means are somewhat higher there, and (iii) the Pareto parameter $\alpha(z)$ is largest (income density falls steeply), which amplifies the sort-and-extort mechanism. The authors also note that extensive-margin labor supply responses—which are abstracted away in the theory—are likely small at high incomes.&lt;/p&gt;
&lt;h3 id="q10-what-does-the-calibrated-quantitative-application-reveal-about-optimal-top-tax-policy"&gt;Q10. What does the calibrated quantitative application reveal about optimal top tax policy?&lt;/h3&gt;
&lt;p&gt;A: Calibrated with a 50% initial top marginal tax rate, Pareto tail shape of 2.5, mean ETI of 0.3, and ETI standard deviation of 0.75 (50% above the estimated lower bound), the model finds welfare gains in both directions of reform. The welfare-maximizing rate &lt;em&gt;below&lt;/em&gt; the baseline is 13.3%, yielding equivalent welfare gains of $1,966 per top earner. The welfare-maximizing rate &lt;em&gt;above&lt;/em&gt; the baseline is 71.2%, yielding equivalent gains of $972 per top earner. The revenue-maximizing rate is 80.9%, ranging from 74.6% to 86.8% when ETI standard deviation varies by ±25% of the lower bound. This sensitivity highlights that the optimal direction and magnitude of reform depend substantially on the uncertain degree of ETI heterogeneity.&lt;/p&gt;
&lt;h3 id="q11-how-does-the-paper-relate-to-the-inverse-optimum-literature"&gt;Q11. How does the paper relate to the &amp;ldquo;inverse optimum&amp;rdquo; literature?&lt;/h3&gt;
&lt;p&gt;A: The inverse optimum approach (Bourguignon and Spadaro 2012; Hendren 2020) infers the first-order welfare trade-offs implicit in an observed tax schedule. This paper goes further by inferring from second-order empirical moments—specifically the income-conditional ETI variance—whether taxes are consistent with &lt;em&gt;minimal&lt;/em&gt; requirements on how sensitive the planner&amp;rsquo;s trade-offs are to household welfare levels. Rather than assuming a welfare function, it tests whether &lt;em&gt;any&lt;/em&gt; welfare function with bounded curvature can rationalize the observed schedule.&lt;/p&gt;
&lt;h3 id="q12-is-revenue-convexity-possible-without-within-income-heterogeneity-in-preferences"&gt;Q12. Is revenue convexity possible without within-income heterogeneity in preferences?&lt;/h3&gt;
&lt;p&gt;A: Yes, but only under more specific conditions. The paper provides two supplemental examples. In the first, all households have constant-elasticity labor disutility but differ in both productivity and elasticity across income levels; when lower-income households have higher elasticities, a reform reducing marginal taxes at $z$ attracts higher-elasticity households and raises the average elasticity, leading to convex revenues. In the second, all households have the same initial elasticity but individual elasticities change in response to reforms. However, with the standard additively separable CES preferences and no within-income heterogeneity, revenues are always concave when decreasing—consistent with Werning&amp;rsquo;s (2007) observation that the Pareto planner&amp;rsquo;s problem is convex in this case.&lt;/p&gt;
&lt;h3 id="q13-what-is-the-role-of-random-tax-reforms-in-the-papers-logic"&gt;Q13. What is the role of random tax reforms in the paper&amp;rsquo;s logic?&lt;/h3&gt;
&lt;p&gt;A: Random tax reforms serve as an expository bridge. The paper shows that if the second-order revenue effect of a two-bracket reform is positive at some income $z$, then a &amp;ldquo;randomized&amp;rdquo; reform that applies the reform with equal probability in positive and negative directions generates an expected Pareto improvement—because the convexity of revenues implies expected revenues rise, while for any household with bounded risk aversion the reform&amp;rsquo;s second-order utility effect is also positive when the reform is sufficiently narrow. This establishes that revenue convexity implies random Pareto inefficiency under bounded risk aversion, and then the paper shows the analogous deterministic result for rationalizability.&lt;/p&gt;
&lt;h3 id="q14-what-scope-conditions-attach-to-the-sufficient-conditions-for-rationalizability-theorem-3"&gt;Q14. What scope conditions attach to the sufficient conditions for rationalizability (Theorem 3)?&lt;/h3&gt;
&lt;p&gt;A: Theorem 3 requires Assumptions 1 and 3 plus two boundary conditions: the ratio $\delta\text{Rev}(z)/(zg(z))$ must remain bounded away from zero as income approaches 0 or infinity, and at all incomes there must exist households with low enough compensated elasticities. Assumption 1 requires that average and marginal taxes have upper bounds below one, that marginal taxes have a lower bound, and that $zg(z)$ converges to zero at the boundaries. Assumption 3 is a regularity condition on how conditional moments of the elasticity distribution vary with income. These conditions ensure that the narrow, self-financing reforms considered in the necessity proof cannot generate welfare improvements once revenues are both decreasing and concave.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Rationalizability with Bounded Curvature.&lt;/strong&gt; The property that a tax schedule is utilitarian-optimal under some cardinalization of household utilities in which there exists a finite (though potentially arbitrarily large) upper bound on the curvature of utility with respect to consumption across the population. Formally, there exists a continuous function $\bar{\rho}$ such that, for all households, the absolute value of $[w_h \circ u_h]_{cc} / [w_h \circ u_h]_c$ is bounded by $\bar{\rho}$ evaluated at the household&amp;rsquo;s income. This criterion is strictly stronger than Pareto efficiency and strictly weaker than utilitarian optimality under a fixed cardinalization.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Two-Bracket Reform.&lt;/strong&gt; A targeted tax reform that increases retention (post-tax income) by $1 at incomes local to some level $z$ over a small bracket of width $\ell$, and zero elsewhere (smoothed at the edges). As $\ell \to 0$, this becomes an infinitesimally narrow reform. The first- and second-order revenue effects of these reforms—denoted $\delta\text{Rev}(z)$ and $\delta^2\text{Rev}(z)$—are the paper&amp;rsquo;s key objects: Pareto efficiency requires $\delta\text{Rev}(z) &amp;lt; 0$ for all $z$, and rationalizability with bounded curvature additionally requires $\delta^2\text{Rev}(z) \leq 0$ for all $z$.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Income-Conditional ETI Variance.&lt;/strong&gt; The variance of compensated elasticities of taxable income (ETIs) among households with the same income level, $\text{var}_h[\varepsilon^h | z^h_0 = z]$. This is the paper&amp;rsquo;s primary empirical object of interest and the key determinant of whether revenues are convex or concave in the size of targeted reforms. Unlike the literature&amp;rsquo;s focus on mean ETIs by income bracket, this within-income variance captures heterogeneity among households sharing the same pre-reform income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sort-and-Extort Mechanism.&lt;/strong&gt; The two-step economic mechanism underlying revenue convexity from ETI heterogeneity. In the first step (&amp;ldquo;sort&amp;rdquo;), a marginal tax cut around income $z$ disproportionately attracts higher-ETI households from lower incomes (because they respond more strongly and relocate from further away), shifting the elasticity composition at $z$ upward. In the second step (&amp;ldquo;extort&amp;rdquo;), repeating the reform finds higher-elasticity households concentrated where marginal taxes fall and lower-elasticity households where taxes rise, effectively applying differential tax treatment by elasticity within a single income tax schedule.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Local Pareto Parameter $\alpha(z)$.&lt;/strong&gt; Defined as $-d\log(zg(z))/d\log z$, where $g(z)$ is the income density. This captures the rate at which the income density is falling in income locally at $z$, and governs the strength of the sort-and-extort mechanism. High $\alpha(z)$ at top incomes (reflecting a steeply declining Pareto-type density) amplifies revenue convexity from ETI heterogeneity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Super-Elasticity.&lt;/strong&gt; A concept that captures how a household&amp;rsquo;s compensated ETI would change if its income were different, holding preferences fixed. Formally, it is the derivative of the household&amp;rsquo;s elasticity with respect to its log income, decomposing into effects from changes in preference curvature and changes in the local curvature of the tax schedule. Super-elasticities are zero in the benchmark case of additively CES preferences and locally CES retention schedules but contribute additional terms to the second-order revenue expression in the general case.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cardinalizing Function.&lt;/strong&gt; A strictly increasing function $w_h$ that maps household $h$&amp;rsquo;s indirect utility $V_h$ to a cardinalized utility level $w_h(V_h)$. The social planner maximizes the expectation of cardinalized utilities. Different choices of ${w_h}_h$ correspond to different stances on interpersonal comparisons, including unbounded curvature (rationalizing any Pareto-efficient schedule) or bounded curvature (the paper&amp;rsquo;s proposed restriction). Rawlsian social welfare is a limit of utilitarian welfare with increasingly concave cardinalizing functions.&lt;/p&gt;</description></item><item><title>Endogenous Production Networks Under Supply Chain Uncertainty</title><link>https://macropaperwarehouse.com/papers/endogenous-production-networks-under-supply-chain-uncertainty/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/endogenous-production-networks-under-supply-chain-uncertainty/</guid><description>&lt;p&gt;This paper studies how firms&amp;rsquo; optimal technique choices under productivity uncertainty endogenously shape the structure of production networks and aggregate macroeconomic outcomes. Each sector chooses input shares (production techniques) before observing sector-specific TFP realizations. Techniques are selected to maximize a risk-adjusted expected log GDP measure — expected log GDP minus a risk-aversion-scaled variance term — with endogenous productivity shifters that favor balanced use of inputs. When uncertainty about sector TFP rises, firms shift toward suppliers with lower expected productivity but lower variance — a &amp;ldquo;flight to safety&amp;rdquo; in input sourcing. The key aggregation result is that the contribution of each sector to aggregate welfare depends on its endogenous Domar weight (expenditure share times adjustment factor), which itself responds to changes in beliefs. The paper establishes propositions characterizing how Domar weights respond to changes in mean (μ) and variance (Σ) of TFP beliefs: higher mean raises a sector&amp;rsquo;s Domar weight; higher variance lowers it when inputs are gross substitutes, but can lower it even with complementary inputs through belief adjustment. A basic calibration to 37 US BEA sectors (1948–2020) finds that the flexible-network economy has expected log GDP 2.1% higher than a fixed-network alternative. During the Great Recession, elevated uncertainty caused firms to shift toward safer, lower-productivity suppliers, reducing expected log GDP by 0.25% but reducing GDP variance by 2.4% and improving actual realized GDP outcomes by 2.7% relative to a &amp;ldquo;no uncertainty&amp;rdquo; benchmark.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-models-core-setup-and-how-does-technique-choice-generate-an-endogenous-network"&gt;Q1. What is the model&amp;rsquo;s core setup and how does technique choice generate an endogenous network?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Each of n sectors chooses input shares αi = (αi0, αi1, &amp;hellip;, αin) — where αi0 is the labor share — before observing sectoral TFP realizations εt, subject to a convex cost function Ai(αi) that penalizes deviation from fixed-proportion baseline techniques; the equilibrium network α&lt;/em&gt; is then the solution to a social planner&amp;rsquo;s problem that maximizes expected welfare W = E[y] − (ρ/2)V[y], where y is log GDP and ρ is the coefficient of relative risk aversion.&lt;/em&gt;* Because cost functions Ai are jointly determined by the input shares chosen and by Hessian matrices Hi that govern substitutability/complementarity of inputs, sectors can substitute or complement in the production of any given good, and the equilibrium network balances expected log GDP gains from choosing more productive suppliers against the variance reduction from choosing safer suppliers.&lt;/p&gt;
&lt;h3 id="q2-what-role-do-domar-weights-play-and-how-do-they-generalize-to-the-endogenous-network-case"&gt;Q2. What role do Domar weights play, and how do they generalize to the endogenous-network case?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;In the standard fixed-network case (Hulten&amp;rsquo;s theorem), each sector i&amp;rsquo;s contribution to aggregate log GDP equals its Domar weight ωi = (expenditure on sector i&amp;rsquo;s output)/(total GDP) — a sufficient statistic for first-order productivity effects. The paper extends this: with an endogenous network, the social planner&amp;rsquo;s optimality conditions imply that equilibrium Domar weights equal the shadow value of relaxing each sector&amp;rsquo;s resource constraint, and these shadow values respond to changes in beliefs (μ, Σ) through the induced changes in α&lt;/em&gt;.&lt;/em&gt;* Lemmas 3–5 characterize these responses: a sector&amp;rsquo;s Domar weight increases in its expected log TFP (μi), and changes in variance Σij propagate through the network via the adjustment terms in the first-order conditions, so that a single sector&amp;rsquo;s volatility change affects the Domar weights of all connected sectors.&lt;/p&gt;
&lt;h3 id="q3-what-are-the-key-propositions-about-how-beliefs-affect-aggregate-welfare-and-gdp"&gt;Q3. What are the key propositions about how beliefs affect aggregate welfare and GDP?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Proposition 6 (monotone welfare response to mean beliefs): welfare W is increasing in each sector&amp;rsquo;s mean log TFP μi, and the marginal effect equals the sector&amp;rsquo;s Domar weight; this holds even though expected log GDP E[y] may non-monotonically respond to μi when inputs are gross substitutes, because the variance-reduction benefit of adjusting away from the now-more-productive but higher-variance sector can temporarily dominate.&lt;/strong&gt; Proposition 7 (variance increases hurt expected log GDP): for substitutable inputs, a rise in Σii decreases E[y] because firms shift away from the more volatile sector toward less productive alternatives; for complementary inputs, the same shift also reduces E[y] because complementary inputs move together. Corollary 4 shows that welfare W always falls when uncertainty rises, combining these effects.&lt;/p&gt;
&lt;h3 id="q4-how-does-the-flight-to-safety-mechanism-work-in-a-multi-sector-economy"&gt;Q4. How does the flight-to-safety mechanism work in a multi-sector economy?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;When uncertainty about sector i&amp;rsquo;s productivity rises, the optimal technique response is to reduce αji for all downstream sectors j that use sector i as an input substitute, and increase labor shares or shares in less volatile inputs; since sectors with lower μ but lower Σ become relatively more attractive on a risk-adjusted basis, the network reconfigures toward &amp;ldquo;safer&amp;rdquo; but typically less productive suppliers.&lt;/strong&gt; The cascading link-destruction example (Section 7) illustrates this: when an industry&amp;rsquo;s production becomes uncertain, the endogenous deletion of risky links propagates across the network as complementary and substitute linkages amplify or dampen the flight to safety, with the direction depending on whether inputs are gross complements or substitutes in the Hessian Hi.&lt;/p&gt;
&lt;h3 id="q5-what-does-the-calibration-to-us-data-find-about-the-quantitative-importance-of-the-endogenous-network"&gt;Q5. What does the calibration to US data find about the quantitative importance of the endogenous network?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The calibrated model with 37 BEA sectors (1948–2020) achieves a cross-sectional correlation between model and data Domar weights of 0.96 (though the model average Domar weight of 0.03 is below the data&amp;rsquo;s 0.05) and matches the data correlations Corr(ωjt, μjt) = 0.1 and Corr(ωjt, Σjjt) = −0.4 closely (model delivers 0.1 and −0.3 respectively).&lt;/strong&gt; Comparing the flexible-network baseline to a fixed-network alternative, expected log GDP is 2.1% lower in the fixed-network economy, and welfare differs by a similar 2.1%. This suggests the endogenous reallocation of input shares over the sample period — as some sectors became persistently more productive — delivered substantial gains relative to a static network.&lt;/p&gt;
&lt;h3 id="q6-what-happens-during-high-uncertainty-episodes-such-as-the-great-recession"&gt;Q6. What happens during high-uncertainty episodes such as the Great Recession?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;During the Great Recession (2007–2009), the estimated uncertainty measure Σt spiked sharply; firms responded by shifting techniques toward safer suppliers, resulting in expected log GDP that is about 0.25% lower in the baseline than in a &amp;ldquo;no uncertainty&amp;rdquo; (Σ = 0) economy and GDP variance that is about 2.4% lower.&lt;/strong&gt; The insurance paid off in terms of realized outcomes: realized log GDP in the baseline economy was approximately 2.7% higher than in the &amp;ldquo;as-if Σ = 0&amp;rdquo; economy in 2009, because firms had taken out insurance against exactly the kind of bad TFP draws that materialized during the crisis. The perfect-foresight economy (where εt is known before technique choice) outperforms the baseline by up to 3% in realized GDP during the Great Recession — the maximum value of uncertainty resolution.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-key-distinction-between-the-effects-of-mean-and-variance-changes-for-welfare-vs-expected-gdp"&gt;Q7. What is the key distinction between the effects of mean and variance changes for welfare vs. expected GDP?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Changes in mean beliefs μi and welfare W are co-monotone (Proposition 6), but changes in μi and expected log GDP E[y] can be non-monotone when inputs are substitutes: a small increase in μi for a less productive sector can actually lower E[y] in the short run because firms shift toward that sector at the expense of more productive alternatives, even though this shift reduces variance and raises welfare.&lt;/strong&gt; The divergence between E[y] and W is the key mechanism: when ρ &amp;gt; 0 (risk-averse households), reducing variance has positive welfare value even when it lowers the level of expected GDP, so the production network adjusts in directions that appear sub-optimal for average productivity but are optimal for welfare. The calibrated relative risk aversion parameter ρ̂ = 4.3 indicates meaningful risk aversion that makes these variance-mean trade-offs quantitatively relevant.&lt;/p&gt;
&lt;h3 id="q8-what-is-the-role-of-input-complementarity-versus-substitutability-in-determining-network-responses"&gt;Q8. What is the role of input complementarity versus substitutability in determining network responses?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;When inputs i and j are gross substitutes (Hessian element [Hi]ij &amp;lt; 0), an increase in sector j&amp;rsquo;s uncertainty Σjj induces sectors that use both i and j to shift away from j and toward i, reducing j&amp;rsquo;s Domar weight and increasing i&amp;rsquo;s — the network becomes more concentrated in safer inputs.&lt;/strong&gt; When inputs are gross complements ([Hi]ij &amp;gt; 0), an increase in Σjj also reduces the demand for the complementary input i, because both inputs must be used together and the safe input i becomes jointly less attractive when paired with volatile j; this can cause both E[y] and V[y] to fall simultaneously, resulting in an ambiguous welfare effect that depends on the magnitude of ρ relative to the E[y]-V[y] trade-off (Corollary 4 ensures welfare falls, but the split across E[y] and V[y] depends on complementarity structure).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;technique choice&lt;/strong&gt; : a sector&amp;rsquo;s endogenous selection of input shares αij prior to observing TFP realizations; the key margin of adjustment in the model through which uncertainty shapes the production network; characterized by convex cost functions Ai that favor balanced input use around baseline shares α°.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;endogenous Domar weight&lt;/strong&gt; : the share of total expenditure on a sector&amp;rsquo;s output in aggregate nominal GDP, computed in the model&amp;rsquo;s equilibrium; equals the shadow value of the sector&amp;rsquo;s resource constraint and responds to changes in beliefs (μ, Σ); in the fixed-network case reduces to the standard Hulten-theorem Domar weight.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;flight to safety&lt;/strong&gt; : the equilibrium response in which firms shift their input shares away from high-mean, high-variance suppliers toward lower-mean, lower-variance alternatives when aggregate uncertainty rises; generates the prediction that network restructuring during recessions reduces GDP volatility while raising expected production costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;risk-adjusted expected welfare (W)&lt;/strong&gt; : the social planner&amp;rsquo;s objective, defined as E[y] − (ρ/2)V[y] where y is log GDP and ρ is the coefficient of relative risk aversion; this non-separable objective function generates the trade-off between expected productivity and risk reduction that drives endogenous network formation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;cascading link destruction&lt;/strong&gt; : the propagation of reduced sectoral linkages through the network when one sector&amp;rsquo;s uncertainty rises; in examples with complementary inputs, the reduced demand for a volatile sector also reduces demand for its complements, potentially amplifying the flight to safety beyond the directly affected sector.&lt;/p&gt;</description></item><item><title>Enlightenment Ideals and Belief in Progress in the Run-up to the Industrial Revolution</title><link>https://macropaperwarehouse.com/papers/enlightenment-ideals-and-belief-in-progress-in-the-run-up-to-the-industrial-revolution/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/enlightenment-ideals-and-belief-in-progress-in-the-run-up-to-the-industrial-revolution/</guid><description>&lt;p&gt;This paper tests Joel Mokyr&amp;rsquo;s claim that Britain&amp;rsquo;s industrialization was preceded and enabled by a cultural shift — specifically, that Enlightenment ideals produced a &amp;ldquo;progress-oriented&amp;rdquo; view of science that diffused to artisans and craftsmen. The central research question is whether and when the language of science became more progress-oriented in the build-up to the Industrial Revolution, and whether this shift was concentrated in volumes directly linked to industrial production.&lt;/p&gt;
&lt;p&gt;The authors assemble 173,031 unique volumes printed in England and written in English between 1500 and 1900, drawn from the Hathitrust Digital Library. Because copyright law prohibits downloading full text, they use HDL&amp;rsquo;s Extracted-Features &amp;ldquo;bag of words&amp;rdquo; dataset. After removing duplicates and Latin-language volumes from an initial set of 420,081, they apply Latent Dirichlet Allocation (LDA) with cross-validated perplexity minimization to identify an optimal T=60 topics. Topic-pair co-occurrence analysis identifies three categories — science, religion, and political economy — each anchored by three defining topics. Volume-level category weights are derived by multiplying each topic&amp;rsquo;s weight by its category coefficient. The resulting classification yields 50,090 science volumes, 102,565 political economy volumes, and 14,124 religion volumes.&lt;/p&gt;
&lt;p&gt;Progressive sentiment is measured using a seven-word dictionary (progress, improvement, stride, betterment, advance, rise, amelioration) assembled from thesaurus synonyms for &amp;ldquo;progress,&amp;rdquo; manually vetted by all four authors, and restricted to words attested in the Oxford English Dictionary before 1643 (Newton&amp;rsquo;s birth year). Sentiment for each volume equals the count of progress-dictionary words divided by total word count. An analogous optimism-sentiment placebo dictionary is constructed separately.&lt;/p&gt;
&lt;p&gt;Industrial relevance is scored using the digitized indexes of all five volumes of Appleby&amp;rsquo;s Illustrated Handbook of Machinery (1877–1903); the top industrial root words are crane (weight 51), electr (42), weight (37), rope (27), and cost (27). Each volume receives an industry score equal to the weighted occurrence of industrial root words normalized by volume length.&lt;/p&gt;
&lt;p&gt;Three main findings emerge. First, the language of science and religion showed little overlap beginning in the 17th century — that is, the secularization of science predates the onset of industrialization. Science volumes shifted from approximately 40 percent religious content around 1700 to only about 10 percent by 1850, with scientific content rising correspondingly from roughly 40 percent to over 60 percent. This trend was stable from 1650 through 1900.&lt;/p&gt;
&lt;p&gt;Second, while scientific volumes became more progress-oriented during the Enlightenment, this progressive shift was concentrated in volumes at the nexus of science and political economy. Volumes of &amp;ldquo;pure&amp;rdquo; science were largely neutral with respect to progress sentiment, and those at the science-religion nexus had on average negative progress sentiment. The marginal effect of scientific content on progress sentiment was greatest for volumes mixing science and political economy, and most of the increase in predicted sentiment at that nexus occurred during the 18th century, remaining stable thereafter. A placebo test using optimism sentiment finds the opposite pattern: volumes at the science-political economy nexus were among the least optimistic, while the most optimistic language appeared at the religion-political economy nexus. This rules out the interpretation that the measured shift reflects a general increase in positive affect rather than specifically progress-oriented language.&lt;/p&gt;
&lt;p&gt;Third, volumes employing industrial terminology that also sat at the science-political economy nexus were distinctively progressive beginning in the mid-18th century. At the 90th percentile of industry score, predicted progress sentiment at the science-political economy nexus was positive throughout the sample; at zero industry score, it was negative until the mid-18th century. Volumes at the religion-political economy nexus showed modestly positive and time-stable progress sentiment regardless of industry score.&lt;/p&gt;
&lt;p&gt;The paper concludes that it was the pragmatic, applied volumes — those bridging science and political economy, written for artisans and a broader literate public rather than for the human-capital elite alone — that embodied the cultural values Mokyr identifies as central to Britain&amp;rsquo;s industrialization.&lt;/p&gt;
&lt;p&gt;Q: What gap in the existing literature does this paper address?&lt;/p&gt;
&lt;p&gt;A: Prior work on the cultural deep roots of economic growth rarely tracks how culture changes over time, relying instead on cross-sectional variation or qualitative case studies. Quantitative evidence that the language of science itself became more progress-oriented — and that this change reached beyond elite thinkers to artisans and craftsmen — had not been marshaled before. The paper provides inaugural quantitative support by analyzing 173,031 volumes spanning four centuries.&lt;/p&gt;
&lt;p&gt;Q: Why does the paper restrict the progress-sentiment dictionary to words attested before 1643?&lt;/p&gt;
&lt;p&gt;A: Words that entered English only after 1643 (Newton&amp;rsquo;s birth year) could not have appeared in volumes from the early Enlightenment, so including them would bias sentiment scores toward the later part of the sample. The restriction ensures the dictionary is applicable and unbiased across the full 1500–1900 period. The final retained words are: progress, improvement, stride, betterment, advance, rise, amelioration.&lt;/p&gt;
&lt;p&gt;Q: How does LDA classify volumes, and how is T=60 selected?&lt;/p&gt;
&lt;p&gt;A: LDA treats each volume as a bag of words and derives a Dirichlet distribution such that observed documents are generated by repeated topic sampling. The number of topics T is selected by minimizing perplexity on held-out data via 4-fold cross-validation, rotating training and test sets across folds; this procedure yields T=60 as optimal. Each volume is then represented as a mixture over those 60 topics.&lt;/p&gt;
&lt;p&gt;Q: What are the three categories and their anchor topics?&lt;/p&gt;
&lt;p&gt;A: Political Economy is anchored by topics on law/public opinion, governance/parliament, and trade/price/labour. Religion is anchored by topics on church/Christian doctrine, God/faith/sin, and virtue/fame/religion. Science is anchored by topics on engineering/steam/electricity, chemistry/acid/heat, and geometry/equations/trigonometry. These three sets of topics were selected for high corpus-wide importance and mutual independence.&lt;/p&gt;
&lt;p&gt;Q: What does the finding on science-religion separation imply for timing?&lt;/p&gt;
&lt;p&gt;A: The separation of scientific and religious language was already visible by 1600 and firmly established by the mid-17th century, well before the Industrial Revolution conventionally dated to the mid-18th century. This supports Mokyr&amp;rsquo;s argument that the secularization of science was an Enlightenment-era precursor to industrialization rather than a product of it. The trend remained stable from 1650 through 1900.&lt;/p&gt;
&lt;p&gt;Q: How does the progressive sentiment differ between pure science and the science-political economy nexus?&lt;/p&gt;
&lt;p&gt;A: Volumes of pure science were largely neutral with respect to progress-oriented language and in some periods showed slightly negative predicted progress sentiment. The science-religion nexus showed consistently negative progress sentiment. By contrast, volumes at the science-political economy nexus showed the highest level of progressive sentiment beginning in the mid-18th century, and most of this growth in predicted sentiment occurred during the 18th century, after which it remained stable.&lt;/p&gt;
&lt;p&gt;Q: What does the placebo optimism test show?&lt;/p&gt;
&lt;p&gt;A: The optimism sentiment scores are nearly the mirror opposite of the progress scores: the most optimistic language appears at the religion-political economy nexus, while volumes at the science-political economy nexus are among the least optimistic. This dissociation rules out the interpretation that the measured progress-sentiment rise reflects a general shift toward positive language rather than a specific cultural embrace of science as a tool for improving human welfare.&lt;/p&gt;
&lt;p&gt;Q: How is the industrial score constructed and what are the most heavily weighted terms?&lt;/p&gt;
&lt;p&gt;A: The authors digitized the detailed indexes of all five volumes of Appleby&amp;rsquo;s Illustrated Handbook of Machinery (1877–1903), restricted to words attested before 1643, and weighted each industrial root word by its index frequency. Each corpus volume&amp;rsquo;s industry score equals the sum of (word count × index weight) across all industrial words, normalized by volume length, yielding a score between 0 and 1. The top-weighted terms are crane (51), electr (42), weight (37), rope (27), and cost (27).&lt;/p&gt;
&lt;p&gt;Q: What is the key result linking industrial scores to progressive sentiment?&lt;/p&gt;
&lt;p&gt;A: At the science-political economy nexus, volumes with industry scores at the 90th percentile had persistently positive predicted progress sentiment throughout the sample, while volumes at that nexus with zero industry score had negative predicted sentiment until the mid-18th century. The shift to positive sentiment for high-industry volumes at this nexus occurred in the mid-18th century — roughly coinciding with the onset of Britain&amp;rsquo;s industrialization — and those volumes remained the most progress-oriented in the corpus thereafter.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s interpretation of the science-political economy nexus finding in relation to Mokyr?&lt;/p&gt;
&lt;p&gt;A: The authors interpret volumes at the science-political economy nexus as pragmatic, applied works aimed at a broader literate audience including artisans and craftsmen, not exclusively the human-capital elite. These are precisely the volumes Mokyr&amp;rsquo;s &amp;ldquo;Industrial Enlightenment&amp;rdquo; thesis predicts would carry progress-oriented cultural values into the mechanical and artisanal pursuits that drove industrialization. The finding that pure-science volumes were not especially progressive, while applied volumes bridging science and political economy were, is consistent with Mokyr&amp;rsquo;s argument that it was the diffusion of Enlightenment ideals to skilled practitioners — not just to elite scientists — that mattered.&lt;/p&gt;
&lt;p&gt;Q: What qualitative examples support the quantitative findings?&lt;/p&gt;
&lt;p&gt;A: Martin Clare&amp;rsquo;s The Motion of Fluids (1735) explicitly addresses &amp;ldquo;the Unlearned&amp;rdquo; and states in its preface that the work is meant to be &amp;ldquo;of singular Use and Benefit to Mankind&amp;rdquo; — a direct expression of the progress-oriented language the algorithm detects. George Stephenson&amp;rsquo;s 1831 railway report argues that rail infrastructure would allow Ireland to &amp;ldquo;reciprocate with England and with other nations, the products of industry,&amp;rdquo; exemplifying how progress-oriented language pervaded industrial writing by the early 19th century. These examples confirm that the high progress-sentiment scores for industrial volumes at the science-political economy nexus reflect genuine rhetorical content, not measurement artifacts.&lt;/p&gt;
&lt;p&gt;Q: What are the paper&amp;rsquo;s limitations regarding early sample periods?&lt;/p&gt;
&lt;p&gt;A: The corpus is thin in earlier eras, particularly around 1550, so results from the earliest decades must be interpreted with caution. The HDL data derive from digitized scans with OCR output of very old books, introducing errors such as the &amp;ldquo;long-S&amp;rdquo; misread (e.g., &amp;ldquo;juftice&amp;rdquo; for &amp;ldquo;justice&amp;rdquo;) that require manual correction. Additionally, the bag-of-words model discards word order, which may obscure some semantic distinctions.&lt;/p&gt;
&lt;p&gt;Q: What future research directions do the authors identify?&lt;/p&gt;
&lt;p&gt;A: The authors propose applying the same textual analysis techniques to test whether English-language volumes began reflecting greater freedom of expression in the run-up to Britain&amp;rsquo;s economic takeoff, connecting to the literature on European political fragmentation and the marketplace of ideas. They also suggest applying the approach to corpora in other languages — Dutch (following McCloskey&amp;rsquo;s argument about bourgeois values) and Spanish (to examine whether the Counter-Reformation and Spain&amp;rsquo;s economic lag are reflected in cultural attitudes toward progress and science).&lt;/p&gt;
&lt;p&gt;LDA (Latent Dirichlet Allocation): An unsupervised generative statistical model that treats each document as a bag of words and extracts latent topics as multinomial distributions over vocabulary; used here to reduce 173,031 volumes to mixtures of 60 topics without imposing prior scholarly interpretations.&lt;/p&gt;
&lt;p&gt;Progressive Sentiment Score: The fraction of words in a volume belonging to a seven-word dictionary of progress synonyms (progress, improvement, stride, betterment, advance, rise, amelioration), normalized by total word count; measures the cultural orientation toward the betterment of humankind as embedded in text.&lt;/p&gt;
&lt;p&gt;Industrial Score: A volume-level measure equal to the weighted count of industrial root words — derived from the indexes of Appleby&amp;rsquo;s Illustrated Handbook of Machinery (1877–1903) — normalized by volume length; captures the degree to which a volume&amp;rsquo;s vocabulary overlaps with industrial production terminology.&lt;/p&gt;
&lt;p&gt;Science-Political Economy Nexus: The region of the topic simplex where volumes carry substantial weight in both the science and political economy categories but low weight in religion; the paper finds this is where progress-oriented language was most concentrated from the mid-18th century onward, interpreted as applied science aimed at artisans and a broader literate public.&lt;/p&gt;
&lt;p&gt;Industrial Enlightenment: Joel Mokyr&amp;rsquo;s (2009) concept describing the diffusion of Enlightenment ideals about the practical utility of science into the mechanical and artisanal pursuits that drove Britain&amp;rsquo;s industrialization; the paper provides quantitative support for this thesis by showing that industrial volumes at the science-political economy nexus were distinctively progress-oriented.&lt;/p&gt;
&lt;p&gt;Culture of Growth: Mokyr&amp;rsquo;s (2016) broader argument that a pan-European network of elite intellectuals fostered a progress-oriented view of science — the idea that scientific understanding could improve the human condition — and that this cultural norm, in combination with Britain&amp;rsquo;s stock of skilled craftsmen, made industrialization possible.&lt;/p&gt;
&lt;p&gt;Bag of Words: A representation of text that records only word frequencies within a document, discarding word order; used here both because HDL copyright restrictions prevent full-text download and because it is the input format required by LDA.&lt;/p&gt;</description></item><item><title>Equal Pay for Similar Work</title><link>https://macropaperwarehouse.com/papers/equal-pay-for-similar-work/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/equal-pay-for-similar-work/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper studies the labor market effects of &amp;ldquo;Equal Pay for Similar Work&amp;rdquo; (EPSW) policies — laws that require firms to pay equal wages to workers of different protected-class identities (e.g., different genders) who perform &amp;ldquo;similar&amp;rdquo; work within a firm. EPSW has become increasingly prevalent: as of January 2023, more of the U.S. workforce falls under state EPSW laws than state &amp;ldquo;Equal Pay for Equal Work&amp;rdquo; (EPEW) laws. Despite this spread, the equilibrium consequences of EPSW were previously unknown.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Theoretical Framework&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors develop two theoretical models. The first is a static cooperative game (whose outcomes coincide with the Nash equilibria of a non-cooperative simultaneous-wage-offer game). Homogeneous firms with constant-returns-to-scale production compete for a continuum of heterogeneous workers. Workers belong to one of two groups A or B (e.g., men and women), with group A constituting a β ≥ 1 majority. Each worker&amp;rsquo;s productivity v is drawn from a group-specific distribution (FA or FB); firms&amp;rsquo; willingness to pay equals each worker&amp;rsquo;s productivity, but can embed taste-based discrimination. The analysis is framed as applying &amp;ldquo;within job&amp;rdquo; in a local labor market — only workers performing &amp;ldquo;similar&amp;rdquo; work in the eyes of the law.&lt;/p&gt;
&lt;p&gt;The second model is a dynamic search-and-bargaining framework with an arbitrary number of firms, search frictions, reallocation frictions, and Nash-in-Nash bargaining. EPSW is introduced as a surprise, and constrained firms choose whether to segregate for one group or remain desegregated (paying a common wage to all workers).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Theoretical Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Without EPSW, Bertrand competition among firms drives every worker&amp;rsquo;s wage to equal her productivity; any wage gap between groups A and B exactly reflects the difference in average productivities (EA(v) − EB(v)), whether or not those productivity differences stem from discrimination.&lt;/p&gt;
&lt;p&gt;With EPSW, the equilibrium is qualitatively transformed. In the static model (Proposition 2), firms generically fully segregate their workforces: one firm hires all A-group workers and the other hires all B-group workers. EPSW functions as an enforcement mechanism for this segregation analogous to location choices in Hotelling&amp;rsquo;s model — poaching a worker from the competing firm is costly because EPSW then requires the poaching firm to pay equal wages to all workers it employs. In the core with EPSW (Proposition 3), the wage gap moves in favor of the majority group (A-group, β &amp;gt; 1) in the sense that all core outcomes except one strictly increase the A-group wage advantage. Moreover, firm profits and the magnitude of the wage gap co-move: firms benefit from selecting equilibria with larger wage gaps. The directional conclusion — EPSW benefits the majority group — holds regardless of the distributions of the two groups&amp;rsquo; productivities, conditional only on β &amp;gt; 1 for the wage gap; for the log wage gap the additional regularity condition βEA[v] &amp;gt; EB[v] is required.&lt;/p&gt;
&lt;p&gt;In the dynamic search model (Proposition 4), all firms eventually segregate under any equilibrium, with the long-run wage ratio moving in favor of the group toward which more firms segregate. Under equitable search and sufficiently low reallocation frictions (Proposition 5), more firms segregate toward the majority group when βEA[v] &amp;gt; EB[v]. Firms that are nearly segregated at the time of EPSW enactment segregate sooner than others (Proposition 6).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Setting and Design&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors test these predictions using Chile&amp;rsquo;s 2009 EPSW (Law 20.348), the country&amp;rsquo;s first equal pay law, which prohibited paying women less than men (or vice versa) for similar work. Firms with 10 or more long-term workers at the time of announcement (June 2009) face formal grievance procedures and financial penalties (69–1,384 USD per worker-month of violation); firms below this threshold face no financial penalty, providing a clean threshold-based treatment assignment.&lt;/p&gt;
&lt;p&gt;The data are matched employer-employee administrative records from the Chilean unemployment insurance system covering January 2005 – December 2013, a random sample of approximately 4% of all firms stratified by size. The main estimation sample restricts to firms with 6–13 total workers at announcement (41% of active firms), and the design is a difference-in-differences (event study) comparing treated (≥ 10 long-term workers) to control (&amp;lt; 10 long-term workers) firms. The identifying assumption is parallel trends between similarly sized firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Empirical Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;First, EPSW increases full gender segregation across firms. The share of fully gender-segregated firms increases by 4.4 percentage points (baseline: 34.3% of firms were fully segregated at announcement). Simultaneously, the share of nearly-but-not-fully segregated firms (majority gender share ∈ [0.8, 1)) declines by 4.0 percentage points — a &amp;ldquo;missing mass&amp;rdquo; of near-segregated firms consistent with the search model&amp;rsquo;s prediction that firms on the margin of full segregation segregate most readily (e.g., by separating the sole worker of the &amp;ldquo;wrong&amp;rdquo; gender). Moreover, firms that are nearly segregated at announcement experience an 8.7 percentage point increase in full segregation post-EPSW, compared to 2.8 percentage points for firms not nearly segregated at announcement.&lt;/p&gt;
&lt;p&gt;Second, EPSW shifts the gender wage gap in favor of the local labor market majority group. In male-majority local labor markets (defined by industry × county), EPSW increases the gender wage gap in favor of men by 4.3 percentage points. In female-majority local labor markets, EPSW decreases the gender wage gap (i.e., in favor of women) by 6.2 percentage points. The wage gap change is primarily driven by reductions in minority-group wages: women&amp;rsquo;s average wages in male-majority markets fall by 3.3 percentage points, and men&amp;rsquo;s average wages in female-majority markets fall by 4.5 percentage points; there are no statistically significant changes in majority-group wages. Because men dominate Chile&amp;rsquo;s overall labor market (approximately 5/6 of all workers are employed in majority-male local labor markets), the overall effect of EPSW is to increase the gender wage gap (in favor of men) by 2.7 percentage points. Pre-treatment coefficients are statistically indistinguishable from zero across all specifications, supporting the parallel trends assumption. These findings are robust across six alternative specifications covering different samples, fixed-effect structures, and controls.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Theoretical results apply within a set of &amp;ldquo;similar&amp;rdquo; workers in a given local labor market — the paper does not predict differential effects across job types within a firm (e.g., custodians vs. lawyers) that do not perform similar work. Empirical results are identified for firms with 6–13 workers and pertain to Chile&amp;rsquo;s formal sector (informal labor share ~25% in 2009). Predictions on the wage ratio (log wage gap) require the additional regularity condition βEA[v] &amp;gt; EB[v], which is consistent with the Chilean data.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-mechanism-by-which-epsw-leads-firms-to-fully-segregate-in-the-static-model"&gt;Q1. What is the core mechanism by which EPSW leads firms to fully segregate in the static model?&lt;/h3&gt;
&lt;p&gt;A: EPSW makes cross-group poaching prohibitively costly. If a firm that hires only A-group workers were to hire even a positive measure of B-group workers, EPSW would — by transitivity — require it to pay the same wage to all workers. This eliminates the firm&amp;rsquo;s ability to exploit productivity heterogeneity across workers; it would have to raise all wages to match the highest worker, destroying profit. As a result, firms segregate in equilibrium to avoid the bite of EPSW entirely: each firm caters to one group, and the within-group wage schedule remains unconstrained. The mechanism is analogous to Hotelling&amp;rsquo;s location model: segregation serves as the enforcement device for avoiding the equal-pay constraint.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-equal-profit-condition-generate-a-wage-gap-in-favor-of-the-majority-group"&gt;Q2. How does the equal profit condition generate a wage gap in favor of the majority group?&lt;/h3&gt;
&lt;p&gt;A: In any core outcome under EPSW (Proposition 3), the Equal Profit Condition requires both firms to earn the same total profit. When there are β &amp;gt; 1 A-group workers (more than B-group workers), the firm serving A-group workers must pay higher average wages per worker to extract the same total profit from a larger pool, relative to the firm serving a smaller B-group. This mechanically raises A-group average wages relative to B-group average wages. Crucially, this directional conclusion — EPSW widens the majority-group wage advantage — holds regardless of the shapes of FA and FB, meaning it is robust to any underlying discriminatory or non-discriminatory productivity differences.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-baseline-without-epsw-wage-gap-and-how-does-epsw-change-it"&gt;Q3. What is the baseline (without-EPSW) wage gap, and how does EPSW change it?&lt;/h3&gt;
&lt;p&gt;A: Without EPSW, Proposition 1 establishes that every worker is paid exactly her productivity in any core outcome (full employment, wages = productivity). Therefore, the wage gap equals EA(v) − EB(v) and the wage ratio equals EA(v)/EB(v): any gap reflects only productivity differences (including discrimination embedded in willingness to pay). Under EPSW, Proposition 3 shows that all core outcomes except a single (measure-zero) one strictly widen the wage gap beyond this level. The wage ratio result (Proposition 3, Part 4) requires the additional condition βEA[v] &amp;gt; EB[v] — that the majority group is not sufficiently less productive or more discriminated against to reverse the direction.&lt;/p&gt;
&lt;h3 id="q4-how-does-the-dynamic-search-model-modify-the-static-predictions"&gt;Q4. How does the dynamic search model modify the static predictions?&lt;/h3&gt;
&lt;p&gt;A: In the dynamic model (Proposition 4), full segregation is achieved in finite time T in any equilibrium, not instantaneously. Prior to T, firms make sequential segregation decisions; workers displaced by firm desegregation choices are replaced at rate ρ ∈ [0,1]. The long-run wage ratio is determined by the ratio nA/nB — the number of firms segregating toward group A versus B. If nA &amp;gt; nB, the long-run wage ratio moves in favor of A; if nA = nB, the policy has no long-run effect on the wage ratio. The key departure from the static model is that this outcome depends not only on the majority group size but also on search intensities and reallocation frictions (high firm tenure/low d can make segregating toward the majority costly if the firm already employs many minority-group workers).&lt;/p&gt;
&lt;h3 id="q5-under-what-conditions-does-the-dynamic-model-predict-that-more-firms-segregate-toward-the-majority-group"&gt;Q5. Under what conditions does the dynamic model predict that more firms segregate toward the majority group?&lt;/h3&gt;
&lt;p&gt;A: Proposition 5 states that for sufficiently large d (fast worker turnover / low reallocation frictions) and equitable search (equal search intensity across firms within a group), the number of firms segregating toward A satisfies nA ∈ [xA−1, xA+1], where xA is defined by an equal-profit condition. Moreover, if βEA[v] &amp;gt; EB[v] (the majority group is collectively more valuable), then nA ≥ nB. Without equitable search, the conclusion holds under more stringent conditions: for any search intensity vector r, there exist d* and β* such that for d &amp;gt; d* and β &amp;gt; β*, any equilibrium yields nA &amp;gt; nB. Empirically, 94% of local-labor-market-by-month units in Chile exhibit more firms segregating toward the majority gender post-EPSW, consistent with these conditions being met.&lt;/p&gt;
&lt;h3 id="q6-why-do-firms-that-are-nearly-segregated-at-announcement-respond-most-strongly-to-epsw"&gt;Q6. Why do firms that are nearly segregated at announcement respond most strongly to EPSW?&lt;/h3&gt;
&lt;p&gt;A: Proposition 6 establishes that firms with a low ratio of minority-group to majority-group search intensity (i.e., nearly segregated in employment) segregate earliest, provided the discount rate is sufficiently low. The intuition is that for a nearly segregated firm, the cost of segregating — separating the few minority-group workers — is small relative to the costs of remaining desegregated (paying a common wage that compresses profit, and being unable to poach new workers). Empirically, firms nearly segregated at announcement (majority gender share ∈ [0.8,1) at announcement) show an 8.7 percentage point increase in full segregation post-EPSW, roughly three times larger than the 2.8 percentage point effect for firms not nearly segregated at announcement. This &amp;ldquo;missing mass&amp;rdquo; pattern (decline in near-segregation matched by increase in full segregation) is also consistent with Proposition 6.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-heterogeneous-effect-of-epsw-on-the-wage-gap-by-local-labor-market-type"&gt;Q7. What is the heterogeneous effect of EPSW on the wage gap by local labor market type?&lt;/h3&gt;
&lt;p&gt;A: The empirical design allows the wage gap effect to differ by local labor market (LLM) majority type (male vs. female). In male-majority LLMs (firm industry × county pairs where males comprise more than 50% of workers in June 2009), EPSW increases the gender wage gap in favor of men by 4.3 percentage points (SE = 0.0116). In female-majority LLMs, EPSW decreases the gender wage gap (in favor of women) by 6.2 percentage points (SE = 0.0234). These findings precisely match the theoretical prediction that EPSW benefits whichever group is in the majority of the local labor market. The dynamic event studies show no pre-trends in either subsample; effects begin at announcement (τ = 0) and grow over time.&lt;/p&gt;
&lt;h3 id="q8-what-drives-the-wage-gap-change--majority-wages-rising-or-minority-wages-falling"&gt;Q8. What drives the wage gap change — majority wages rising or minority wages falling?&lt;/h3&gt;
&lt;p&gt;A: The change is primarily driven by a reduction in the minority group&amp;rsquo;s average wages, not an increase in majority wages. Women&amp;rsquo;s average wages in male-majority labor markets fall by 3.29 percentage points (SE = 0.0111) in treated versus control firms post-EPSW. Men&amp;rsquo;s average wages in female-majority labor markets fall by 4.45 percentage points (SE = 0.0178) in treated versus control firms post-EPSW. There are no statistically significant changes in the average wages of the majority group of workers within any LLM type. This is consistent with the model&amp;rsquo;s mechanism: segregation reduces competition for minority-group workers (fewer firms competing for them), depressing their wages.&lt;/p&gt;
&lt;h3 id="q9-what-is-the-aggregate-economy-wide-effect-of-epsw-on-the-gender-wage-gap-in-chile"&gt;Q9. What is the aggregate (economy-wide) effect of EPSW on the gender wage gap in Chile?&lt;/h3&gt;
&lt;p&gt;A: Because approximately 5/6 of all Chilean workers are employed in male-majority local labor markets (men have higher labor force participation, with female labor force participation at roughly 30% in 2009), the overall effect of EPSW is to increase the gender wage gap in favor of men by 2.74 percentage points (SE = 0.0102). This is a net effect that averages the positive (pro-male) gap increase in male-majority markets and the negative (pro-female) gap decrease in female-majority markets, weighted by market sizes.&lt;/p&gt;
&lt;h3 id="q10-how-does-the-identification-strategy-deal-with-anticipation-and-compositional-changes"&gt;Q10. How does the identification strategy deal with anticipation and compositional changes?&lt;/h3&gt;
&lt;p&gt;A: Treatment status is assigned based on firm size at the time of policy announcement (June 2009) rather than enactment (November 2009), creating an intent-to-treat framework: some &amp;ldquo;treated&amp;rdquo; firms may fall below the threshold by enactment, and some &amp;ldquo;control&amp;rdquo; firms may rise above it, both attenuating the estimates (implying estimated effects are plausible lower bounds). The no-anticipation assumption is supported by the absence of statistically significant pre-trends in either the segregation or wage-gap specifications. To address compositional changes in worker characteristics across LLMs induced by EPSW itself, the wage regressions include time fixed effects interacted with human capital dimensions (education, contract type, age decade) and firm comparison groups, controlling for observable composition shifts. Placebo tests at alternative firm-size thresholds find no statistically or economically meaningful effects, supporting the causal interpretation.&lt;/p&gt;
&lt;h3 id="q11-how-does-epsw-in-chile-compare-to-epew-theoretically-and-in-the-literature"&gt;Q11. How does EPSW in Chile compare to EPEW theoretically and in the literature?&lt;/h3&gt;
&lt;p&gt;A: EPEW requires equal pay only for workers doing exactly equal work, which creates an easily exploitable loophole: firms can proliferate job titles or marginally differentiate duties to avoid compliance. EPSW closes this by requiring equal pay across a coarser &amp;ldquo;similar work&amp;rdquo; category, making evasion harder. Theoretically, the prior EPEW literature (Bhaskar et al. 2002, Kaas 2009, Lagerlöf 2020, Lanning 2014) generated ambiguous directional predictions — equal pay laws could either increase or decrease wage disparities within the same paper. The authors attribute this ambiguity to EPEW models&amp;rsquo; requirement that workers be exactly equally productive. By contrast, EPSW applies across workers with heterogeneous productivities, and the authors derive unambiguous predictions: full segregation and a wage gap shift toward the majority group, both of which are confirmed empirically.&lt;/p&gt;
&lt;h3 id="q12-what-is-the-analogy-to-best-price-guarantees-in-product-markets"&gt;Q12. What is the analogy to &amp;ldquo;best-price guarantees&amp;rdquo; in product markets?&lt;/h3&gt;
&lt;p&gt;A: The paper draws a methodological parallel to most-favored-customer (MFC) clauses in product markets. MFC clauses commit firms to rebating past consumers if prices fall, which directly equalizes payments across buyers but unintentionally raises firm market power. In the EPSW setting, the policy plays the role of a best-wage guarantee — but because firms compete for workers, the constraint binds off the equilibrium path. Firms segregate so that no firm is ever exposed to the equal-pay constraint in equilibrium, yet the threat of the constraint (if a firm deviates and hires from both groups) effectively differentiates labor costs across groups, driving the unintended wage effects. This is related to &amp;ldquo;artificial&amp;rdquo; switching costs that create local market power in consumer markets (Klemperer, 1987).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Equal Pay for Similar Work (EPSW):&lt;/strong&gt; A legal constraint requiring that within a firm, workers belonging to different protected-class identities (e.g., different genders) who perform &amp;ldquo;similar&amp;rdquo; work receive equal wages. Distinguished from &amp;ldquo;Equal Pay for Equal Work&amp;rdquo; (EPEW) by its coarser similarity standard, which cannot be evaded by minor job-title differentiation. In the model, this constraint is formalized as: a firm cannot hire positive measures of workers from two different groups such that all workers in one group receive strictly higher wages than all workers in the other group; by transitivity, a firm hiring from both groups must pay almost all workers the same wage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Core Outcome:&lt;/strong&gt; The solution concept used in the static model, drawing on cooperative game theory (Shapley–Shubik assignment game). An outcome (specifying which firm hires each worker and at what wage) is in the core if no firm and subset of workers can form a blocking coalition that makes both the firm and each worker in the coalition strictly better off. The paper uses this concept because its pure-strategy Nash equilibrium outcomes (in the associated non-cooperative simultaneous wage-offer game) exactly coincide with the core outcomes under the restriction that firms pay the same wage to all workers of the same type.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Full Segregation:&lt;/strong&gt; A labor market outcome in which each firm employs workers from only one group (all A-group workers at one firm, all B-group workers at the other). The paper proves (Proposition 2) that EPSW generically forces full segregation in equilibrium, because any deviation to hire from both groups exposes the firm to the equal-pay constraint. Empirically measured as a binary indicator for whether all workers at a given firm in a given month are of the same gender.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Near Segregation:&lt;/strong&gt; A firm-level state in which the majority gender constitutes 80–99% of the firm&amp;rsquo;s workforce (the majority gender share is in [0.8, 1)). The paper uses this as a complementary outcome to full segregation; theory (Proposition 6) predicts a decline in near segregation post-EPSW because firms in this state face the lowest cost of transitioning to full segregation. Empirically, the near-segregation share falls by 4.0 percentage points post-EPSW, mirroring the 4.4 percentage point rise in full segregation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Local Labor Market (LLM):&lt;/strong&gt; Defined in the empirical analysis as a firm&amp;rsquo;s geographic county interacted with its industry code, creating 321 × 21 potential cells. The LLM is classified as male-majority or female-majority based on the share of female workers across all firms in the industry-county pair in June 2009. This is the unit at which the &amp;ldquo;majority group&amp;rdquo; for Proposition 3&amp;rsquo;s wage gap prediction is defined, and the level at which the heterogeneous wage effects of EPSW are estimated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Equal Profit Condition:&lt;/strong&gt; A necessary condition of any core outcome (with or without EPSW): both firms must earn the same total profit in equilibrium. Under EPSW with full segregation, this condition determines the relative average wages of the two groups — because firm sizes differ (β A-group workers vs. 1 B-group worker), equal profit requires the firm serving the larger group to pay higher average wages, mechanically moving the wage gap in favor of the majority group.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nash-in-Nash Bargaining:&lt;/strong&gt; The bargaining protocol used in the dynamic search model, following Horn and Wolinsky (1988). Each bilateral worker-firm bargain splits the available surplus in proportion to exogenous bargaining power parameter Δ ∈ (0,1), taking as given the outcome of all other bilateral bargains. A worker&amp;rsquo;s disagreement point is the wage she would receive from bargaining with the next firm in her search order. This generates the result that a worker&amp;rsquo;s realized payoff is increasing in the number of segregated (non-EPSW-constrained) firms competing for her, connecting firm segregation decisions to wage determination.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reallocation Friction:&lt;/strong&gt; In the dynamic search model, represented by a low departure probability d ∈ (0,1) for existing employees. When d is low, firms retain a large fraction of their workforce across periods, making segregation costly because the firm must separate from any existing workers of the &amp;ldquo;wrong&amp;rdquo; group. The paper shows (Proposition 5) that for sufficiently large d (low frictions), the equal-profit condition approximately pins down the number of firms segregating toward each group, and for d above a threshold, the majority group attracts weakly more segregating firms.&lt;/p&gt;</description></item><item><title>Exorbitant Privilege Gained and Lost: Fiscal Implications</title><link>https://macropaperwarehouse.com/papers/exorbitant-privilege-gained-and-lost-fiscal-implications/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/exorbitant-privilege-gained-and-lost-fiscal-implications/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper studies three centuries of U.K. fiscal history to understand the fiscal implications of safe asset supplier status — what the authors call &amp;ldquo;exorbitant privilege&amp;rdquo; — and how it can be gained and lost. Using the discounted cash flow approach to fiscal capacity developed in Jiang, Lustig, Van Nieuwerburgh, and Xiaolan (2019), the paper measures the present discounted value of expected future primary surpluses (inclusive of convenience yield seigniorage) and compares it to the observed market value of outstanding government debt. The central finding is a sharp historical discontinuity: before World War I, when the U.K. was the world&amp;rsquo;s dominant safe asset supplier and its gilts served as the global reserve asset, roughly only three-quarters of U.K. debt was backed by future surpluses even after accounting for convenience yields earned from global safe asset demand. After World War II, when the U.K. lost its safe asset supplier status to the U.S., the U.K.&amp;rsquo;s debt became fully backed by surpluses and fiscal capacity became closely tied to its own macro fundamentals. By contrast, the U.S. after World War II shows a pattern similar to the pre-war U.K. but more extreme: less than one-third of outstanding U.S. Treasury debt is backed by future surpluses according to the paper&amp;rsquo;s estimates, with the gap between debt and estimated fiscal capacity growing sharply over recent decades.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-how-does-the-paper-measure-fiscal-capacity"&gt;Q1. How does the paper measure fiscal capacity?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper follows the Jiang-Lustig-Van Nieuwerburgh-Xiaolan (2019) methodology, expressing the market value of outstanding government debt as the present risk-adjusted discounted value of expected future primary surpluses under the government&amp;rsquo;s intertemporal budget constraint — the no-arbitrage condition that rules out rational debt bubbles.&lt;/strong&gt; The market value of the government debt portfolio equals the present value of tax revenues minus the present value of government spending. A Vector AutoRegression (VAR) imposing cointegration of GDP with tax revenues and government spending is used to forecast the joint dynamics of the surplus. The paper uses the market or output risk premium as the discount rate, imputing the risk properties of GDP to spending and tax revenue claims.&lt;/p&gt;
&lt;p&gt;A key methodological challenge is handling structural breaks: before World War I, U.K. fiscal policy was pre-Keynesian — acyclical spending and taxes (except during wars) — so spending and tax revenue as shares of output inherit the risk properties of output, and the market risk premium is the appropriate discount rate. After World War II, spending becomes counter-cyclical and taxes pro-cyclical in the Keynesian framework; the paper argues that applying the market risk premium in this regime produces an upper bound on the PDV of surpluses. For the U.K., this methodology is validated: the correlation of fiscal capacity with the debt/output ratio is 0.90 in the pre-WW-I sample and remains high after WW-II, despite the fiscal regime change.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-quantitative-findings-for-the-uk"&gt;Q2. What are the quantitative findings for the U.K.?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper finds that before World War I, U.K. fiscal capacity fell systematically short of the observed market value of U.K. debt: the average debt/GDP ratio was 87.06% while the estimated fiscal capacity was only 69.32%, with the ratio of fiscal capacity to debt averaging 74.32% — implying roughly 26% of U.K. debt was not backed by future surpluses even after including convenience yield seigniorage.&lt;/strong&gt; The U.K. earned average long-term convenience yields of approximately 100 basis points per annum from 1873 to 1931 (translating to approximately 0.47% of GDP in annual seigniorage), reflecting its dominant position as the world&amp;rsquo;s safe asset supplier and the quasi-monopoly position of gilts in global securities markets (U.K. national debt accounted for more than half of the world&amp;rsquo;s traded securities around 1815). Despite these convenience yields, the gap between fiscal capacity and debt persisted throughout the 19th and early 20th century.&lt;/p&gt;
&lt;p&gt;After World War II, the picture reverses: the U.K.&amp;rsquo;s average post-war fiscal capacity of 82.03% of GDP exceeds its average debt/GDP ratio of 53.42%, leaving more than 50% of fiscal capacity unborrowed. The correlation with debt dynamics persists but the sign changes — U.K. borrowing is now constrained by own macro fundamentals rather than extended by global coordination.&lt;/p&gt;
&lt;h3 id="q3-what-do-the-authors-find-for-the-united-states"&gt;Q3. What do the authors find for the United States?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The U.S. experience mirrors the pre-war U.K. after World War II but with much larger magnitudes: the paper&amp;rsquo;s estimates indicate that less than one-third (32.20%) of post-war U.S. Treasury debt is backed by future surpluses, with the gap between fiscal capacity and debt growing sharply toward the end of the sample to exceed U.S. GDP.&lt;/strong&gt; Before World War I, the U.S. did not earn convenience yields — it was forced to borrow at higher rates than the U.K. despite having lower debt-to-output ratios — and its fiscal capacity exceeded its debt, with the ratio of capacity to debt averaging 169.36%. After World War II, when the U.S. became the global safe asset supplier under the Bretton-Woods architecture, the relationship inverted: average U.S. fiscal capacity of 13.20% of GDP represents only 32.20% of outstanding debt. The gap is increasingly large in recent decades as U.S. debt has grown while surplus projections have not expanded commensurately.&lt;/p&gt;
&lt;h3 id="q4-why-does-safe-asset-supplier-status-allow-a-country-to-borrow-beyond-its-fiscal-capacity"&gt;Q4. Why does safe asset supplier status allow a country to borrow beyond its fiscal capacity?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper argues that global investors coordinate on a single safe asset issuer based on relative macro fundamentals; this coordination is self-reinforcing because each additional investor holding the asset reduces rollover risk and renders the debt safer for all others, creating strategic complementarities that concentrate global fiscal capacity in one country beyond what its own surpluses would warrant.&lt;/strong&gt; Unlike domestic convenience yields (arising from household demand for safe assets to insure idiosyncratic risks), the global safe asset effect creates a form of extra-fiscal capacity that depends on investors&amp;rsquo; common belief about which country is the hegemon. The measured seigniorage from convenience yields — about 0.47% of U.K. GDP before WW-I and 0.36% per year for the U.S. post-war — does not fully capture this coordination benefit; the remaining gap between fiscal capacity and debt reflects the additional &amp;ldquo;license to borrow&amp;rdquo; that comes with global hegemon status.&lt;/p&gt;
&lt;p&gt;The transition from U.K. to U.S. hegemony illustrates the mechanism: as U.K. macro fundamentals deteriorated relative to U.S. fundamentals after the world wars, investors shifted the concentration of fiscal capacity toward the U.S. The U.K. lost its license to borrow beyond its fundamentals; the U.S. gained it. The paper notes that the U.K. debt/output ratio exceeded 200% after WW-II — a level associated with the loss of hegemony.&lt;/p&gt;
&lt;h3 id="q5-what-historical-data-and-institutional-context-does-the-analysis-use"&gt;Q5. What historical data and institutional context does the analysis use?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper uses annual data for the U.K. from 1729 to 2020 (from the Bank of England&amp;rsquo;s Millennium of Macroeconomics dataset and the Ellison-Scott dataset on individual bond market values from 1694 onward) and for the U.S. from 1791 to 2020 (from Hall-Sargent and CRSP), constructing primary surpluses, tax revenues, spending, GDP, and convenience yields consistently over nearly three centuries.&lt;/strong&gt; U.K. convenience yields before WW-I are measured as the interest rate differential between U.K. government securities and otherwise comparable bonds from countries on the gold standard; the sample average is approximately 147 basis points at the short end and 110 basis points at the long end, with the spread declining at longer maturities (the opposite of what default risk would predict), providing evidence that convenience yield rather than residual default risk drives the differential. U.S. post-war convenience yields are constructed from the spread between the 3-month Treasury yield and the 3-month CD rate (or bankers&amp;rsquo; acceptance rate before 1964), averaging 36 basis points per year from 1947 to 2020.&lt;/p&gt;
&lt;h3 id="q6-what-are-the-implications-for-models-of-fiscal-capacity-and-debt-sustainability"&gt;Q6. What are the implications for models of fiscal capacity and debt sustainability?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The results favor models in which the safe asset supplier&amp;rsquo;s fiscal capacity is determined partly by relative macro fundamentals (which country the global financial system coordinates on) rather than solely by absolute fundamentals (its own surpluses), and challenge models that treat the transversality condition as a binding constraint at all times for all countries.&lt;/strong&gt; The finding that a large fraction of U.S. Treasury debt is not backed by future surpluses — even when the market risk premium is used to discount — has implications for debt sustainability analyses: standard present-value-of-surpluses calculations will understate the true fiscal capacity of the safe asset supplier, while overstating it for others. The paper&amp;rsquo;s framework suggests this extra capacity depends on maintaining relative macro fundamentals and global investor coordination, and can be lost — as the U.K. experience demonstrates — when relative fundamentals deteriorate.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;dl&gt;
&lt;dt&gt;&lt;strong&gt;fiscal capacity&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the present risk-adjusted discounted value of a government&amp;rsquo;s expected future primary surpluses, computed from the government&amp;rsquo;s intertemporal budget constraint under no-arbitrage; in this paper, inclusive of seigniorage revenue from convenience yields earned on government debt.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;exorbitant privilege&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the ability of the safe asset supplier country to borrow at below-market interest rates due to global demand for its government debt as a safe asset; quantified in this paper as the gap between the market value of debt and estimated fiscal capacity from surpluses alone, which exceeds fiscal capacity for the pre-WW-I U.K. and post-WW-II U.S.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;convenience yield&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the yield reduction (below comparable risky borrowing rates) that the safe asset supplier earns from global safe asset demand; measured as approximately 100 bps long-term for the pre-WW-I U.K. and approximately 36 bps on average for the post-WW-II U.S., contributing 0.47% and 0.36% of GDP annually in seigniorage revenue respectively.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;transversality condition (TVC)&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the condition ruling out rational government debt bubbles, requiring the expected discounted value of outstanding debt to approach zero at long horizons; the paper imposes the TVC and finds that for the pre-WW-I U.K. and post-WW-II U.S., the observed debt level exceeds fiscal capacity even under this constraint, interpreted as evidence of the extra borrowing license conferred by safe asset supplier status.&lt;/dd&gt;
&lt;/dl&gt;</description></item><item><title>Expectation-driven term structure of equity and bond yields</title><link>https://macropaperwarehouse.com/papers/expectation-driven-term-structure-of-equity-and-bond-yields/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/expectation-driven-term-structure-of-equity-and-bond-yields/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; What drives the joint historical dynamics of the term structure of equity yields and nominal bond yields — and can a single unified equilibrium model explain the procyclical equity yield slope, the switch in bond-stock correlation from positive to negative after the late 1990s, the maturity-declining predictability of dividend strip returns, and standard aggregate stock market puzzles?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key Departure from Prior Literature.&lt;/strong&gt; Existing equilibrium models (habit formation, long-run risk, disaster risk) rely on time-varying risk premia to explain asset prices. Recent survey evidence challenges this: De La O and Myers (2021) show that most aggregate stock price movements are driven by cash-flow growth expectations rather than return expectations, and Van Binsbergen et al. (2013) show that equity yields are driven mainly by dividend growth expectations. This paper constructs an equilibrium model in which equity (bond) yield variation is attributable to subjective dividend growth (GDP growth) expectations, with a constant subjective risk premium implied by CRRA utility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model Architecture.&lt;/strong&gt; The representative agent has CRRA utility with risk-aversion coefficient γ = 4 and subjective discount factor β = 1.0065 (calibrated to the average 10-year equity yield). The agent departs from rational expectations by having the &amp;ldquo;belief in the law of small numbers&amp;rdquo; (Tversky and Kahneman 1971): she perceives small samples to represent their population as well as large samples, leading to subjective learning gains that differ from the rational Kalman gain. The subjective belief updating rule is a modified Kalman filter in which the likelihood is exaggerated by factor (1+θ), producing a subjective learning gain ν that exceeds the Kalman gain K when overreaction applies and falls below it when underreaction applies.&lt;/p&gt;
&lt;p&gt;The model has three blocks of fundamentals, each decomposed into a stable and a transitory component. (1) Real GDP growth is decomposed into PCE growth (stable, with a random-walk trend state µ_g) and a volatile gap component (stationary state x_g, persistence ρ_g = 0.941). (2) Inflation is decomposed into core inflation (stable, with trend state µ_π) and a volatile gap (persistence ρ_π = 0.932). (3) Real aggregate dividend is decomposed into a long-duration dividend component dl (levered on log real GDP with leverage λ = 3) and the share of long-duration dividend ds (stationary with persistence ρ_d = 0.94). This cross-sectional decomposition uses firm-level long-term earnings growth (LTG) forecasts from IBES as a model-free equity duration measure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Estimation.&lt;/strong&gt; State-space parameters are estimated by maximum likelihood with the Kalman filter on data from NYSE/NASDAQ/AMEX firms (CRSP/Compustat), quarterly, from 1987Q4 to 2019Q4. Subjective learning gains are estimated by minimizing RMSE between model-implied expectations and consensus forecasts: 1-year real GDP growth and inflation from the Survey of Professional Forecasters (SPF, 1981Q3–2019Q4), and 1-year aggregate dividend growth extended from De La O and Myers (2021) to 2019Q4. Equity yield data are from Giglio et al. (2021); bond yields are end-of-quarter zero-coupon nominal yields from Gürkaynak et al. (2007).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Equity Term Structure Dynamics.&lt;/strong&gt; The model&amp;rsquo;s subjective dividend growth expectations drive equity yields. The 1-year model-implied equity yield correlates 0.68 with data; the 10-year correlates 0.79; the 10Y–1Y slope correlates 0.59 with data. Consistent with &amp;ldquo;belief in the law of small numbers,&amp;rdquo; the agent overreacts to dividend news (estimated learning gains νl_d = 0.166 and νs_d = 0.458, both below their Kalman gains, which under the level-to-growth translation implies overreaction to dividend growth news, confirmed by negative CG(2015) regression slope coefficients of −0.69 at 1Y and −0.97 at 5Y).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Procyclical Equity Yield Slope.&lt;/strong&gt; During recessions, the average equity yield slope (10Y–1Y) in the model is −3.77%; during expansions it is +3.96%, matching the data (−5.50% in recessions, +3.93% in expansions). The sign reversal is driven primarily by the dividend-specific component of the decomposition: in recessions, short-run dividend growth expectations fall much more sharply than long-run expectations.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Bond Pricing.&lt;/strong&gt; The model&amp;rsquo;s 1-year and 10-year nominal bond yields achieve correlations of 0.92 and 0.95 with their data counterparts, inheriting the explanatory power of Zhao (2020) for the bond market. The agent underreacts to GDP growth and inflation news (estimated learning gains well below Kalman gains, confirmed by positive CG(2015) slope coefficients of +2.08 at 1Y for GDP growth and +1.01 at 1Y for inflation).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Bond-Stock Correlation Switch.&lt;/strong&gt; In data, 10Y bond vs. dividend strip return correlation (5Y strip) goes from +0.46 before 2000 to −0.49 after 2000. The model produces +0.14 before and −0.56 after (for the 5Y strip). Decomposing the change in bond-stock return covariance: the &amp;ldquo;inflation real effect&amp;rdquo; (correlation between expected inflation and real growth) accounts for approximately 27–31% of total changes (for 5Y to 10Y strips); the &amp;ldquo;real growth correlation&amp;rdquo; channel — stronger co-movement between real GDP and real dividend growth expectations after 2000 — accounts for approximately 89–95% of total changes. The paper identifies this real bond hedging channel as the dominant and previously unexamined driver.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Dividend Strip Return Predictability.&lt;/strong&gt; The price-dividend ratio predicts annual market excess returns with R² of 10.3% (data) vs. 9.0% (model). Strip return predictability is downward-sloping by maturity: in data, the R² is 20.2% for 5-year strips and 14.5% for 10-year strips; the model generates 14.2% and 10.4% respectively. This is decomposed into three sources: bond return predictability (small contribution), dividend forecast error predictability (dominant for short maturities), and forecast revision predictability (negative contribution that offsets). The downward slope occurs because current news has smaller impact on long-term dividend expectations.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Aggregate Market Puzzles.&lt;/strong&gt; The model-implied log dividend-price ratio correlates 0.86 with data, with AR(1) coefficient 0.96 (data: 0.95). Model-implied average market return is 9% (data: 8%); annualized return volatility 12% (data: 16%). The model replicates the switch of the bond-stock aggregate return correlation from +0.13 before 2000 to −0.46 after 2000 (data: +0.39 to −0.64).&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; Results apply to U.S. equity and bond markets over 1987Q4–2019Q4 (with bond learning using data back to 1959Q1). The model assumes a representative agent with CRRA utility and constant subjective risk premium. It is silent on the term structure of expected returns in the statistical sense (which requires identification of latent states under the physical measure). The aggregate market results require a reduced-form specification for stochastic equity duration H_t linked to the value-weighted LTG average.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-psychological-mechanism-generating-subjective-beliefs-and-how-does-it-differ-from-the-diagnostic-expectations-approach"&gt;Q1. What is the core psychological mechanism generating subjective beliefs, and how does it differ from the diagnostic expectations approach?&lt;/h3&gt;
&lt;p&gt;The agent has the &amp;ldquo;belief in the law of small numbers&amp;rdquo; (Tversky and Kahneman 1971): she treats small samples as equally representative of their population as large samples. Formally, this is embedded by exaggerating the likelihood in the Bayesian update: p(x_t|I_t) ∝ p(y_t|x_t)^{1+θ} × p(x_t|I_{t-1}), where θ captures the magnitude of cognitive bias. The resulting subjective learning gain ν = (1+θ)P̃ / [(1+θ)P̃ + σ²_ε] can exceed the Kalman gain K when θ is large (overreaction) or fall below it when θ is small (underreaction). This differs from diagnostic expectations (Bordalo et al. 2019, 2020a,b), which are based on the representativeness heuristic; the paper notes the two notions of news are highly correlated in simulation (Table IA.2) and that both can imply overreaction.&lt;/p&gt;
&lt;h3 id="q2-why-does-the-model-generate-overreaction-to-dividend-growth-news-even-though-the-dividend-level-learning-gains-are-smaller-than-the-kalman-gains"&gt;Q2. Why does the model generate overreaction to dividend growth news even though the dividend-level learning gains are smaller than the Kalman gains?&lt;/h3&gt;
&lt;p&gt;The model separates dividend learning into level and growth. Section 2.2 derives that underreaction to dividend level news (νl_d &amp;lt; Kl_d, νs_d &amp;lt; Ks_d, estimated values 0.166 and 0.458 against Kalman gains 0.19 and 0.49 respectively) translates into overreaction to dividend growth news. This is confirmed by the CG(2015) rationality test: regressing forecast errors on lagged forecast revisions yields slope coefficients of −0.69 (1Y) and −0.97 (5Y) for real dividend growth, both statistically significant (t-statistics −3.63 and −3.22). In contrast, the same test yields positive slope coefficients for GDP growth (2.08 at 1Y) and inflation (1.01 at 1Y), confirming underreaction for these series.&lt;/p&gt;
&lt;h3 id="q3-how-well-does-the-model-match-subjective-dividend-growth-expectations-in-the-survey-data"&gt;Q3. How well does the model match subjective dividend growth expectations in the survey data?&lt;/h3&gt;
&lt;p&gt;The model-implied 1-year subjective dividend growth forecast is estimated by minimizing RMSE against the consensus dividend growth forecast series (extended from De La O and Myers 2021 to 2019Q4, with a replication correlation of 0.92 over the overlapping sample). The unconditional correlation between model-implied and data 1-year forecasts is 0.80. Although only 1-year forecasts are used in estimation, the model also achieves a correlation of 0.80 for 2-year forecasts, providing an out-of-sample validation.&lt;/p&gt;
&lt;h3 id="q4-what-explains-the-higher-volatility-of-short-term-equity-yields-relative-to-long-term-equity-yields"&gt;Q4. What explains the higher volatility of short-term equity yields relative to long-term equity yields?&lt;/h3&gt;
&lt;p&gt;Short-term subjective dividend growth expectations are more volatile because the agent&amp;rsquo;s short-run expectation mean-reverts toward the less volatile long-run (levered) GDP growth expectation. In the model&amp;rsquo;s two-component dividend structure, the transitory dividend-share component xd has persistence ρ_d = 0.94 and its effect on equity yields decays as maturity increases (via the factor (1−ρ^n_d)/n). Similarly, the effect of the transitory GDP growth state x_g decays with maturity. Long-term equity yields are thus anchored by the slower-moving trend components µ_g and µ_d. In the data from Giglio et al. (2021), 1-year yields have a standard deviation of 8.89% annualized vs. 2.70% for 10-year yields; the model generates 8.22% and 1.89% respectively.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-quantitative-importance-of-the-real-growth-correlation-channel-vs-the-inflation-real-effect-channel-in-explaining-the-bond-stock-correlation-switch"&gt;Q5. What is the quantitative importance of the &amp;ldquo;real growth correlation&amp;rdquo; channel vs. the &amp;ldquo;inflation real effect&amp;rdquo; channel in explaining the bond-stock correlation switch?&lt;/h3&gt;
&lt;p&gt;For the switch in bond-stock return correlation (using the 10-year nominal bond and various maturity dividend strips), the decomposition in Table 4 shows that the &amp;ldquo;real growth correlation&amp;rdquo; channel accounts for 89.1% (5Y strip), 92.1% (7Y strip), and 94.8% (10Y strip) of total bond-stock covariance changes, while the &amp;ldquo;inflation real effect&amp;rdquo; (correlation between expected inflation and expected real growth) accounts for 27.3%, 29.3%, and 31.1% respectively. The &amp;ldquo;volatility of shocks to expected inflation and real growth&amp;rdquo; makes a negative contribution (−16.4%, −21.4%, −25.9%), mostly attributable to more volatile beliefs during the 2008 global financial crisis. The real growth correlation channel reflects that after 2000, real bonds provide a better hedge to aggregate real dividend risks because real GDP growth expectations and real dividend growth expectations became more positively correlated.&lt;/p&gt;
&lt;h3 id="q6-does-the-same-real-growth-correlation-story-hold-for-the-fed-model-bond-stock-yield-correlation"&gt;Q6. Does the same real growth correlation story hold for the &amp;ldquo;Fed model&amp;rdquo; (bond-stock yield correlation)?&lt;/h3&gt;
&lt;p&gt;Yes, but with a quantitatively different balance. For yield correlations (Table 5), the &amp;ldquo;real growth correlation&amp;rdquo; channel accounts for 72.4%–80.1% of bond-stock yield covariance changes (5Y to 10Y strip), while the &amp;ldquo;inflation real effect&amp;rdquo; now accounts for 41.2%–43.9%. The inflation real effect is proportionally larger for yield levels because persistent expected inflation correlates strongly with the level of expected real GDP growth — even though inflation expectations do not move fast enough at high frequency to explain return correlation, they co-move strongly with expected growth at low frequency.&lt;/p&gt;
&lt;h3 id="q7-how-does-the-model-generate-a-downward-sloping-term-structure-of-return-predictability"&gt;Q7. How does the model generate a downward-sloping term structure of return predictability?&lt;/h3&gt;
&lt;p&gt;The strip excess return is decomposed into three components (Equation 44): maturity-matched bond excess return (Bond), dividend forecast error within the holding period (FE), and forecast revision regarding dividend growth after the holding period (FR). For short maturities, bond predictability contributes little (R² ≈ 6.7% for 5Y strip), while FE predictability (R² ≈ 31.5%) and FR predictability (R² ≈ 35.6%) dominate. As maturity increases, the current news has smaller impact on long-term dividend expectations, reducing the predictability of FE (R² ≈ 26.6% for 10Y) and FR (R² ≈ 26.5% for 10Y). Taken together, total model-implied strip R² declines from 14.2% (5Y) to 10.4% (10Y), matching the data pattern (20.2% to 14.5%). The paper identifies forecast revision predictability as a new channel not previously documented.&lt;/p&gt;
&lt;h3 id="q8-why-do-forecast-errors-and-forecast-revisions-have-opposite-signs-in-the-predictability-regressions"&gt;Q8. Why do forecast errors and forecast revisions have opposite signs in the predictability regressions?&lt;/h3&gt;
&lt;p&gt;Bad news (high equity yields, i.e., low current stock prices) triggers excessively pessimistic subjective dividend growth expectations because the agent overreacts to dividend news. These overly pessimistic forecasts tend to be disappointed in the future — actual dividend realizations exceed the forecast — producing positive subsequent forecast errors (FE is positively predicted by high yields, with R² ≈ 31.5% for 5Y strips). However, as dividend levels mean-revert, higher subsequent realizations cause the agent to revise down the forecast for dividend growth thereafter, leading to negative forecast revisions (FR is negatively predicted by high yields, with R² ≈ 35.6% for 5Y strips, opposite sign from FE). The net effect on return predictability is thus a combination of positive (FE) and negative (FR) contributions.&lt;/p&gt;
&lt;h3 id="q9-how-does-the-model-handle-the-aggregate-market-dividend-price-ratio-and-its-persistence"&gt;Q9. How does the model handle the aggregate market dividend-price ratio and its persistence?&lt;/h3&gt;
&lt;p&gt;The aggregate stock price is modeled as the sum of dividend strip prices up to a stochastic horizon H_t, which is parameterized as a linear function of the value-weighted average of LTG forecasts: H_t = a + b·LTG_t. Parameters a and b are estimated by minimizing RMSE between model-implied and data log dividend-price ratio. The model-implied ratio achieves a correlation of 0.86 with data, an AR(1) coefficient of 0.96 (data: 0.95), and an annualized volatility of 26% (data: 30%). The time-variation is driven entirely by strip yield variations and exogenous LTG movements.&lt;/p&gt;
&lt;h3 id="q10-is-the-overreaction-to-dividend-news-and-underreaction-to-gdpinflation-news-consistent-in-a-single-framework"&gt;Q10. Is the overreaction to dividend news and underreaction to GDP/inflation news consistent in a single framework?&lt;/h3&gt;
&lt;p&gt;Yes. The model&amp;rsquo;s subjective learning framework (based on &amp;ldquo;belief in the law of small numbers&amp;rdquo;) generates both over- and underreaction depending on the estimated subjective learning gain relative to the Kalman gain. For GDP growth and inflation, the learning gains (ν*_g = 0.012, νgap_g = 0.065; ν*_π = 0.049, νgap_π = 0.228) are below their Kalman gains (0.29 and 0.67 for GDP components; 0.67 and 0.48 for inflation components), producing underreaction. The paper hypothesizes this is related to the Fed&amp;rsquo;s dual mandate: agents rationally assign lower weight to GDP and inflation shocks expecting the Fed will stabilize them. For dividend growth, a level-to-growth translation converts level underreaction into growth overreaction.&lt;/p&gt;
&lt;h3 id="q11-what-are-the-robustness-checks-and-what-do-they-show"&gt;Q11. What are the robustness checks, and what do they show?&lt;/h3&gt;
&lt;p&gt;The paper checks three alternative equity duration measures: those from Dechow et al. (2004), Weber (2018), and Gonçalves (2021b), as well as the book-to-market ratio following Lettau and Wachter (2007). Table IA.1 shows that replacing LTG with these measures still produces model-implied equity yields that replicate key data moments with high time-series correlations. Changing the cross-sectional breakpoint for long-duration dividends from the median LTG to the 40th or 60th percentile leaves results similar. The paper also presents an Internet Appendix extension in which the agent has ambiguity about real GDP and dividend growth (model misspecification fear), yielding equity yields and returns even closer to data.&lt;/p&gt;
&lt;h3 id="q12-what-is-the-papers-contribution-to-the-bond-market-relative-to-zhao-2020"&gt;Q12. What is the paper&amp;rsquo;s contribution to the bond market relative to Zhao (2020)?&lt;/h3&gt;
&lt;p&gt;The bond pricing block closely follows Zhao (2020), inheriting its explanatory power for bond market stylized facts. The model&amp;rsquo;s 1-year and 10-year nominal bond yields achieve correlations of 0.92 and 0.95 with data, respectively. The new contribution is the joint model covering both equity and bond markets simultaneously, enabling the decomposition of bond-stock covariance and the identification of the real growth correlation as the dominant driver of the bond-stock correlation switch — a channel not addressed by Zhao (2020), which focused on bond market puzzles alone.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Equity Yield (Dividend Strip Yield).&lt;/strong&gt; Defined as ey^(n)_t = (1/n)(d$_t − p^(n)_t), where p^(n)_t is the log price of the n-period dividend strip (a claim to the nominal dividend n periods ahead) and d$_t is the log nominal aggregate dividend. It decomposes into the bond yield, a subjective dividend growth component, and a (constant) risk premium component.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Belief in the Law of Small Numbers.&lt;/strong&gt; A cognitive bias (Tversky and Kahneman 1971) in which the agent perceives small samples to represent their population as well as large samples. Modeled by exaggerating the likelihood in Bayesian updating: p(x_t|I_t) ∝ p(y_t|x_t)^{1+θ} × p(x_t|I_{t-1}). This generates a subjective learning gain ν that can exceed the Kalman gain (overreaction) or fall below it (underreaction) depending on θ and the signal-to-noise ratio.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Subjective Learning Gain.&lt;/strong&gt; The coefficient ν in the subjective Kalman filter update ẽ_t x_t = ρẽ_{t-1}x_{t-1} + ν(y_t − ρẽ_{t-1}x_{t-1}). It equals (1+θ)P̃ / [(1+θ)P̃ + σ²_ε], where P̃ is the subjective predictive variance. When ν &amp;gt; K (the rational Kalman gain), the agent overreacts to news; when ν &amp;lt; K, the agent underreacts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Long-Duration Dividend Component.&lt;/strong&gt; The portion of aggregate real dividend (dl_t) attributable to &amp;ldquo;long-duration&amp;rdquo; firms — those with above-median analyst LTG forecasts in CRSP/Compustat/IBES data. Levered on log real GDP with leverage parameter λ = 3, it carries aggregate risk. The complementary short-duration dividend share ds_t is stationary and carries no aggregate risk. The decomposition allows the model to exploit cross-sectional cash-flow duration information when learning about future aggregate dividend growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Real Growth Correlation Channel.&lt;/strong&gt; A bond-stock covariance component defined as Cov(RGDP^(N), RDIV^(n)), where RGDP^(N) is the real GDP growth expectation component of 10-year nominal bond returns and RDIV^(n) is the real dividend growth expectation component of n-period strip returns. This channel captures whether real bonds hedge aggregate real dividend risks. The paper shows this channel accounts for approximately 89–95% of the post-2000 bond-stock covariance change for dividend strips.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Inflation Real Effect.&lt;/strong&gt; The covariance component Cov(INFL^(N)_B, RGDP^(n) + RDIV^(n)), defined as the correlation between shocks to expected inflation (embedded in nominal bond returns) and shocks to expected real growth (in strip returns). In the paper&amp;rsquo;s framework this is distinct from the standard inflation risk premium story, as it concerns the correlation between subjective beliefs rather than realized covariances under the physical measure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Forecast Error (FE) and Forecast Revision (FR) Predictability.&lt;/strong&gt; Two of three components of realized strip excess return (Equation 44). FE = ∆d$&lt;em&gt;{t+1:t+h} − ẽ_t∆d$&lt;/em&gt;{t+1:t+h} is the realized dividend growth forecast error within the holding period; FR = (ẽ_{t+h} − ẽ_t)∆d$_{t+h+1:t+n} is the forecast revision for dividend growth beyond the holding period. Because the agent overreacts to dividend news, bad news triggers overly pessimistic forecasts (positive subsequent FE) and, as dividends mean-revert, downward forecast revisions (negative FR). These two have opposite signs in predictive regressions, generating the downward-sloping term structure of return predictability.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fed Model.&lt;/strong&gt; The empirical positive correlation between equity yields (real) and nominal bond yield levels. The paper shows that this yield-level correlation switched from strongly positive (≈ 0.85 before 2000) to significantly negative (≈ −0.60 to −0.62 after 2000) for 5Y–10Y dividend strips, and that the same real growth correlation and inflation real effect decomposition applies, albeit with the inflation real effect proportionally larger (≈ 40%) for yield levels than for returns (≈ 30%) because persistent inflation expectations co-move with the level of expected real GDP growth.&lt;/p&gt;</description></item><item><title>Explicit consumption functions with borrowing constraints: A continuous-time approach</title><link>https://macropaperwarehouse.com/papers/explicit-consumption-functions-with-borrowing-constraints-a-continuous-time-approach/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/explicit-consumption-functions-with-borrowing-constraints-a-continuous-time-approach/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; The paper asks whether an explicit, global, closed-form solution exists for the consumption function in the standard income fluctuation problem with a borrowing constraint and constant income, a problem that has resisted closed-form solution since at least Schechtman (1976). All prior continuous-time work (Park 2006, Holm 2018, Fischer 2024) produced only &lt;em&gt;implicit&lt;/em&gt; expressions; Achdou et al. (2022) produced explicit expressions valid only locally, near zero assets or as assets diverge to infinity, and only for r &amp;gt; 0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; A single agent with CRRA utility (coefficient of relative risk aversion γ &amp;gt; 0) maximizes discounted utility over an infinite horizon, subject to the flow budget constraint da/dt = ra + y − c, with a borrowing constraint a(t) ≥ 0. The agent receives a constant, deterministic income stream y ≥ 0 and discounts at rate ρ, with the impatience condition ρ &amp;gt; r maintained throughout. The paper takes a continuous-time formulation arrived at by letting the discrete period length Δ → 0, nesting Helpman (1981)&amp;rsquo;s discrete-time analysis as a special case.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key analytical device.&lt;/strong&gt; A one-to-one mapping exists between initial assets a and the time T it takes for the consumer to fully run down her assets. This map, denoted T = h(a; y), is well-defined, strictly increasing, and concave in a (established in Proposition 1 via the Hadamard-Lévy theorem). Expressing the optimal consumption function as c*(a; y) = y · exp(ρh(a;y)/γ) evaluated at t = 0 reduces the problem to explicitly inverting the transcendental equation relating a to T.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main result (r = 0).&lt;/strong&gt; For the case of a zero net real interest rate, the transcendental equation can be solved explicitly using the second branch W₋₁(·) of the Lambert W function. The closed-form consumption function is (Theorem 2 and Corollary 2.1):&lt;/p&gt;
&lt;p&gt;c*(a; y) = y · exp(ρ h(a;y) / γ), where h(a; y) = −(a/y + γ/ρ) − (γ/ρ) W₋₁(f(a;y)), and f(a;y) = −exp(−b(a + γy/ρ)/y), b := ρ/γ.&lt;/p&gt;
&lt;p&gt;This is a &lt;strong&gt;global&lt;/strong&gt; solution (valid for all a ≥ 0), in contrast to the local solutions in prior work. The paper notes that for the illustrative parameter values r = 0.01, γ = 0.5, ρ = 0.08, y = 3 (broadly consistent with average U.S. real interest rates in 2025), there is a visually sizable gap between the constrained and unconstrained consumption functions except as a → ∞, where the two converge (in line with the asymptotic linearity result of Benhabib et al. 2015).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main result (r &amp;gt; 0).&lt;/strong&gt; For positive interest rates, the Lambert W function cannot invert a sum of exponentials with different exponents (an open mathematical problem). The paper instead derives a global closed-form &lt;strong&gt;approximation&lt;/strong&gt; valid for r ∼ 0, by expanding e^(−rT) ≈ 1 − rT to first order and applying the same Lambert W inversion. The approximating consumption function has the same structural form but with modified coefficients b_r, c_r, d_r that collapse to their r = 0 counterparts as r → 0 (Proposition 2). Numerical comparison against the implicit-expression solution of Park (2006) confirms the approximation is close for small r.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Characterization of the MPC and supermodularity (Section 3).&lt;/strong&gt; Leveraging the explicit expression, the paper derives the full Jacobian vector and Hessian matrix of c*(a; y) in closed form (Propositions 3 and 4). Key findings, all proved formally and holding under the impatience condition ρ &amp;gt; r:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Consumption is increasing in both assets and permanent income&lt;/strong&gt; (both entries of the Jacobian are strictly positive — Corollary 2.2). The second result (∂c*/∂y &amp;gt; 0 for all a) is new for the borrowing-constrained setting; Achdou et al. (2022) provided only suggestive evidence for the limiting case a ∼ 0.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Consumption is strictly concave in both assets and permanent income&lt;/strong&gt; (both diagonal entries of the Hessian are strictly negative — Corollary 2.3). Concavity in assets was known (Carroll and Kimball 1996); concavity in permanent income under borrowing constraints is new.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;The consumption function is supermodular&lt;/strong&gt;: the cross-derivative ∂²c*/∂a∂y is strictly positive (Corollary 2.3). This means assets and permanent income are complements in generating consumption. Equivalently, the MPC out of permanent income is strictly increasing in the level of initial assets — a counter-intuitive result, since high MPCs are usually associated with poor (low-asset) agents. An identical result was obtained by Commault (2025) for a life-cycle model &lt;em&gt;without&lt;/em&gt; borrowing constraints; the current paper confirms it holds in the presence of a borrowing constraint. By symmetry of the Hessian, the MPC out of assets is also strictly increasing in permanent income.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Intuition for supermodularity.&lt;/strong&gt; When assets are low, an increase in permanent income produces little additional consumption because the risk of hitting the borrowing constraint is high. When assets are higher, the agent has buffer savings, faces a lower constraint-risk, and can smooth the higher future income stream into current consumption.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; Results are derived under CRRA utility, constant (deterministic) income, no stochastic variation, and the impatience condition ρ &amp;gt; r. The exact closed form applies to r = 0; the approximation is characterized as valid for r ∼ 0 and is not a local expansion in assets.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-longstanding-gap-in-the-literature-that-this-paper-addresses"&gt;Q1. What is the longstanding gap in the literature that this paper addresses?&lt;/h3&gt;
&lt;p&gt;A: Since Zeldes (1989) noted that no closed-form solution exists for the consumption function with stochastic income and CRRA utility, researchers settled for numerical solutions or local analytical approximations. In the constant-income/borrowing-constraint version studied here, Park (2006), Holm (2018), and Fischer (2024) derived only implicit continuous-time expressions. Achdou et al. (2022) gave explicit local solutions valid near a ∼ 0 or a → ∞ under r &amp;gt; 0. No prior work produced an explicit, global closed-form for any case.&lt;/p&gt;
&lt;h3 id="q2-why-does-moving-to-continuous-time-enable-progress-that-discrete-time-did-not"&gt;Q2. Why does moving to continuous time enable progress that discrete time did not?&lt;/h3&gt;
&lt;p&gt;A: In discrete time, the consumption function is piecewise linear (Helpman 1981), with kinks at the sequence of asset thresholds µ(T) for T = 0, Δ, 2Δ, …. As Δ → 0, the piecewise-linear function converges to a smooth function whose governing ODE can be solved analytically. This convergence to smoothness, illustrated in Figure 1, is what enables the application of the Lambert W function to invert the resulting transcendental equation.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-role-of-the-lambert-w-function-specifically-its-second-branch-w"&gt;Q3. What is the role of the Lambert W function, specifically its second branch W₋₁?&lt;/h3&gt;
&lt;p&gt;A: The optimal asset-depletion time T satisfies the transcendental equation e^(bT) = yT + c (for r = 0), which cannot be solved with elementary functions. Via the change of variables z := −bT − bc/y, the equation reduces to ze^z = α, whose solution is z = W(α). The argument α lies in (−1/e, 0) for a ∈ (0, +∞), and it is precisely on this interval that the Lambert W function is double-valued; the relevant branch is W₋₁ (the second, lower branch), which is well-defined and strictly less than −1 on (−1/e, 0). It is the properties of W₋₁ on this domain — specifically that 1 + W₋₁(α) &amp;lt; 0 — that drive the sign conclusions for the Hessian.&lt;/p&gt;
&lt;h3 id="q4-why-does-the-lambert-w-approach-fail-for-r--0-and-what-is-the-approximation-strategy"&gt;Q4. Why does the Lambert W approach fail for r &amp;gt; 0, and what is the approximation strategy?&lt;/h3&gt;
&lt;p&gt;A: For r &amp;gt; 0, Equation (8) contains two exponentials with different exponents — e^((ρ−r)T/γ) and e^(−rT) — and their sum cannot be inverted by the Lambert W function, which handles only a linear-plus-single-exponential structure. Inverting a sum of exponentials with different exponents is stated in the paper to be an open problem. The approximation strategy exploits the fact that for r ∼ 0, e^(−rT) ≈ 1 − rT + o(r), reducing the equation to a single-exponential transcendental form (Equation 15) with modified coefficients b_r, d_r, c_r, all of which converge to their r = 0 analogues as r → 0.&lt;/p&gt;
&lt;h3 id="q5-what-does-proposition-1-establish-and-why-is-it-necessary-before-stating-the-main-theorem"&gt;Q5. What does Proposition 1 establish, and why is it necessary before stating the main theorem?&lt;/h3&gt;
&lt;p&gt;A: Proposition 1 establishes that the mapping µ(T) from depletion time T to initial assets a is smooth (infinitely differentiable), bijective (one-to-one and onto) on ℝ₊, and strictly convex. The Hadamard-Lévy theorem then guarantees that its inverse h(a;y) = µ⁻¹(a) exists, is unique, is strictly increasing, and is strictly concave in a. This is a necessary prerequisite for Theorem 2 because h(a;y) is the central object in the closed-form consumption function; without establishing its existence and uniqueness, Theorem 2 would have no well-defined object.&lt;/p&gt;
&lt;h3 id="q6-what-does-the-jacobian-characterization-proposition-3-and-corollary-22-contribute"&gt;Q6. What does the Jacobian characterization (Proposition 3 and Corollary 2.2) contribute?&lt;/h3&gt;
&lt;p&gt;A: Proposition 3 gives explicit formulas for ∂c*/∂a = (ρ/γ) · w/(1+w) and ∂c*/∂y in terms of w = W₋₁(f(a;y)). Corollary 2.2 proves both are strictly positive using the property w &amp;lt; −1 on (−1/e, 0), which ensures w/(1+w) &amp;gt; 0 and that the bracketed term in the expression for ∂c*/∂y is strictly positive. The contribution is that the positivity of ∂c*/∂y for all a was previously unproven in a borrowing-constrained setting with constant income.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-structure-of-the-hessian-matrix-and-what-signs-do-its-entries-take"&gt;Q7. What is the structure of the Hessian matrix and what signs do its entries take?&lt;/h3&gt;
&lt;p&gt;A: All four entries of Hc are proportional to w/(1+w)³. Since w &amp;lt; −1, we have 1 + w &amp;lt; 0, so (1+w)³ &amp;lt; 0, making w/(1+w)³ &amp;gt; 0. The diagonal elements ∂²c*/∂a² = −(ρ²/γ²y) · w/(1+w)³ and ∂²c*/∂y² = −(ρ²a²/γ²y³) · w/(1+w)³ are both strictly negative (concavity). The off-diagonal elements ∂²c*/∂a∂y = (aρ²/γ²y²) · w/(1+w)³ are strictly positive (supermodularity/complementarity).&lt;/p&gt;
&lt;h3 id="q8-what-is-the-precise-counter-intuitive-implication-of-supermodularity-for-mpc-heterogeneity"&gt;Q8. What is the precise counter-intuitive implication of supermodularity for MPC heterogeneity?&lt;/h3&gt;
&lt;p&gt;A: Supermodularity (∂²c*/∂a∂y &amp;gt; 0) means the MPC out of permanent income — conventionally associated with low-wealth households — is in fact &lt;em&gt;increasing&lt;/em&gt; in the level of initial assets. This contradicts the conventional narrative that high MPCs are a hallmark of poor agents. The paper&amp;rsquo;s intuition is that low-asset agents face high risk of hitting the constraint, suppressing their consumption response to income news, while high-asset agents can freely smooth the increased income stream. The same supermodularity implies, by the symmetry of the Hessian, that the MPC out of assets is also increasing in permanent income.&lt;/p&gt;
&lt;h3 id="q9-how-does-this-result-relate-to-commault-2025"&gt;Q9. How does this result relate to Commault (2025)?&lt;/h3&gt;
&lt;p&gt;A: Commault (2025) proved, in a life-cycle model with a permanent/transitory stochastic income process but &lt;em&gt;without&lt;/em&gt; borrowing constraints, that the MPC out of permanent income is increasing in assets. The current paper obtains the same qualitative finding in the opposite environment — constant income &lt;em&gt;with&lt;/em&gt; a borrowing constraint. The paper treats these as complementary, noting that the result thus appears robust to these different modeling choices.&lt;/p&gt;
&lt;h3 id="q10-what-does-concavity-in-permanent-income-cy--0-add-that-was-not-previously-known"&gt;Q10. What does concavity in permanent income (∂²c*/∂y² &amp;lt; 0) add that was not previously known?&lt;/h3&gt;
&lt;p&gt;A: Carroll and Kimball (1996) established concavity of the consumption function in assets for a broad utility class. Concavity in permanent income — that the marginal consumption response to a windfall increase in y is diminishing — had been proved by Commault (2025) only in the absence of borrowing constraints. The current paper provides the first formal proof of this property in a setting with a borrowing constraint (albeit for constant, deterministic income and CRRA utility in continuous time).&lt;/p&gt;
&lt;h3 id="q11-what-is-the-potential-use-of-these-closed-form-results-for-numerical-methods"&gt;Q11. What is the potential use of these closed-form results for numerical methods?&lt;/h3&gt;
&lt;p&gt;A: The paper notes in the conclusion that the closed-form solutions for r = 0 and the approximation for r ∼ 0 can serve as benchmarks for assessing the reliability of continuous-time numerical methods when computing objects such as the MPC out of assets. Because the exact solution is known analytically, numerical implementations can be compared against it to detect discretization errors or convergence failures.&lt;/p&gt;
&lt;h3 id="q12-what-parameter-values-are-used-to-illustrate-the-consumption-function-and-what-do-they-imply"&gt;Q12. What parameter values are used to illustrate the consumption function, and what do they imply?&lt;/h3&gt;
&lt;p&gt;A: The paper uses r = 0.01, γ = 0.5, ρ = 0.08, y = 3, where r = 0.01 is described as roughly in line with the average real interest rate in the U.S. in 2025. With these values, Figure 1 shows a visually sizable gap between the constrained and unconstrained consumption functions at low to moderate asset levels, with the two converging as a → ∞ as guaranteed by asymptotic linearity (Benhabib et al. 2015).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Income fluctuation problem (with borrowing constraint):&lt;/strong&gt; The standard infinite-horizon single-agent savings problem in which the agent faces a non-negativity constraint on assets (a(t) ≥ 0), so that the agent cannot borrow. In the paper&amp;rsquo;s formulation: maximize ∫ e^(−ρt)u(c(t))dt subject to da/dt = ra + y − c and a(t) ≥ 0, with constant income y and CRRA utility. The borrowing constraint creates the concavity of the consumption function and was the source of intractability in prior closed-form attempts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Lambert W function (second branch W₋₁):&lt;/strong&gt; A special transcendental function defined as the solution to we^w = x. It is double-valued on (−1/e, 0); the second branch W₋₁ takes values strictly less than −1 on this interval. In this paper, the transcendental equation linking initial assets to asset-depletion time is reduced to the form ze^z = α, enabling explicit inversion via W₋₁. The property that 1 + W₋₁(α) &amp;lt; 0 on (−1/e, 0) is the algebraic engine driving all sign results in the Hessian.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Asset-depletion time T = h(a; y):&lt;/strong&gt; The time it takes for the optimal consumer to fully run down her initial assets before settling into perpetual income consumption of y. The paper establishes a bijective mapping from initial assets a to depletion time T (Proposition 1); the closed-form solution is obtained by explicitly inverting this mapping. In the paper&amp;rsquo;s formulation, h(a; y) = µ⁻¹(a) where µ(T) is derived from the ODE governing the consumption path.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Supermodularity of the consumption function:&lt;/strong&gt; The property that the cross-derivative ∂²c*/∂a∂y is strictly positive, meaning assets a and permanent income y act as complements in generating consumption. This is an equilibrium property of the consumption function (not an assumption on the utility function), and the paper identifies it as new to the income fluctuation literature. It implies the MPC out of permanent income is increasing in a, and the MPC out of assets is increasing in y.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;MPC out of permanent income (∂c&lt;/em&gt;/∂y):&lt;/em&gt;* The marginal increase in current consumption per unit increase in the constant income stream y, holding initial assets constant. This object is less studied than the MPC out of a transient asset windfall. In the paper&amp;rsquo;s setting, it is shown to be strictly positive for all a (Corollary 2.2) and, counter-intuitively, strictly increasing in a (supermodularity).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Global vs. local closed-form solution:&lt;/strong&gt; A global solution holds for all values of the state variable (here, all a ≥ 0), while a local solution is valid only in the neighborhood of a particular value (e.g., a ∼ 0 or a → ∞). Achdou et al. (2022) produced local closed-form expressions; the current paper&amp;rsquo;s Theorem 2 (r = 0) is the first global explicit closed-form for this class of problems.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Piecewise-linear consumption function (discrete time):&lt;/strong&gt; In Helpman (1981)&amp;rsquo;s discrete-time formulation with period length Δ = 1, the optimal consumption function is piecewise linear in assets, with slope changes at the asset thresholds µ(T) for integer T. As Δ → 0, this becomes a smooth function, enabling the passage to the continuous-time closed form derived in the paper.&lt;/p&gt;</description></item><item><title>Failing Banks</title><link>https://macropaperwarehouse.com/papers/failing-banks/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/failing-banks/</guid><description>&lt;p&gt;Correia, Luck, and Verner ask a foundational question in banking: why do banks fail? Specifically, they seek to adjudicate between two theoretical views — the solvency view (failures caused by deteriorating asset quality and insolvency) and the bank runs view (failures caused by depositor coordination failure that can bring down otherwise solvent banks) — using the longest micro-level panel of U.S. commercial bank balance sheets assembled to date.&lt;/p&gt;
&lt;p&gt;The authors construct a panel covering approximately 37,000 distinct banks across two samples: a historical sample of all national banks from 1863 to 1941 (sourced from OCC Annual Reports, digitized via OCR) and a modern sample of all commercial banks from 1959 to 2024 (from FFIEC Call Reports merged with the FDIC failure list). More than 5,000 banks fail across the full sample, with 2,887 failures before 1935 and 2,233 after 1959. The sample spans institutional regimes before and after the Federal Reserve (founded 1913) and the FDIC (founded 1933/1934).&lt;/p&gt;
&lt;p&gt;Three sets of findings emerge. First, failing banks are characterized by deteriorating fundamentals well before failure: rising non-performing loans and declining solvency (equity-to-assets falls by 8 percentage points in the five years before failure in the modern sample), increasing reliance on expensive noncore funding (rising by 18% of assets in the decade before modern-era failures), and a boom-bust pattern in real assets (expanding by 34% from ten years to three years before failure before contracting). These patterns are consistent across the pre-FDIC and modern eras.&lt;/p&gt;
&lt;p&gt;Second, bank failures are highly predictable from publicly available accounting data. Using simple regression models with insolvency risk, noncore funding reliance, and asset growth as predictors, the area under the ROC curve (AUC) for predicting failure within one year reaches 86% in the historical sample and 90–95% in the modern sample. Pseudo-out-of-sample performance is nearly as strong as in-sample performance. A bank in the top 5th percentile of both insolvency risk and noncore funding vulnerability faces a three-year failure probability of 27% in both the historical and modern samples, compared to unconditional rates of 2.5% (historical) and 1% (modern) — a 10- to 25-fold increase.&lt;/p&gt;
&lt;p&gt;Third, while large deposit outflows consistent with bank runs were common in pre-FDIC failures — deposits declined on average by 14% immediately before failure in 1880–1934, and by 21% in the period before the banking holiday — failures with runs are as predictable as failures without runs, and they occur in banks with similarly weak fundamentals. Recovery rates on failed banks&amp;rsquo; assets averaged only 52% of book value in pre-FDIC failures. Using a framework comparing recovery rates to leverage, the majority of pre-FDIC failed banks appear to have been fundamentally insolvent. Even under the extreme assumption of zero value destruction from failure, runs on banks that were not fundamentally insolvent account for fewer than 8% of pre-FDIC failures; under an assumption of 20% value destruction from failure, this share rises to 22%.&lt;/p&gt;
&lt;p&gt;OCC bank examiners classified fewer than 2% of pre-FDIC failures as caused by runs or liquidity issues; most were attributed to losses, fraud, or external shocks. The aggregate failure rate is also largely predictable: regressing the actual bank failure rate on predicted aggregate failure risk yields an R-squared of 40%.&lt;/p&gt;
&lt;p&gt;Scope conditions: the historical sample covers only national banks (market share ranging from ~80% in the 1870s to ~45% in the 1930s); the modern sample excludes de novo banks (younger than three years); deposit outflow data for the historical period begin in 1880; and FDIC failure transaction data for the modern period begin in 1993.&lt;/p&gt;
&lt;p&gt;Q: What are the two main theoretical views the paper evaluates, and how does the paper distinguish between them?
A: The solvency view holds that bank failures are caused by deteriorating asset quality and insolvency, with the runnable nature of liabilities playing no essential causal role. The bank runs view holds that the runnable nature of demandable deposits is central, with depositor coordination failure capable of bringing down otherwise solvent banks (Diamond and Dybvig, 1983) or weak-but-solvent banks (Goldstein and Pauzner, 2005). The paper distinguishes between them using three empirical tests: predictability of failures from fundamentals, deposit outflows before failure, and asset recovery rates in failure.&lt;/p&gt;
&lt;p&gt;Q: How predictable are bank failures, and what does predictability imply for the bank runs view?
A: In the historical pre-FDIC sample (1863–1934), the in-sample AUC for predicting failure within one year is 86%; in the modern sample (1959–2024) it is 90–95%. Pseudo-out-of-sample AUC is nearly as strong as in-sample AUC. High predictability is consistent with the solvency view and fundamental-based panic run models, but is inconsistent with non-fundamental self-fulfilling runs (Diamond and Dybvig, 1983), which should strike randomly. Predictability also cuts against the assumption of rational, forward-looking depositors in fundamental-run models, since attentive depositors would act on observable signals and accelerate failure, reducing predictability.&lt;/p&gt;
&lt;p&gt;Q: What is the boom-bust pattern in failing banks&amp;rsquo; assets?
A: In the decade before failure, failing banks&amp;rsquo; real total assets expand by 34% from ten years to three years before failure, then contract over the final two years. The boom-and-bust pattern is present in both the historical and modern samples but is more pronounced in the modern period. The boom is driven primarily by loan growth (particularly real estate lending and C&amp;amp;I lending in the modern sample) rather than by growth in liquid assets, consistent with the view that rapid credit expansion produces future credit losses.&lt;/p&gt;
&lt;p&gt;Q: How does noncore funding behave in failing banks, and why does it matter?
A: In failing banks in the modern sample, noncore funding (time deposits plus wholesale funding) rises by 18% of assets over the decade before failure, while demand deposits decline as a share of assets. In the historical sample, noncore (wholesale) funding also rises gradually. Noncore funding is a signal of failure for multiple reasons: it is more expensive than core deposits, eroding profitability; it can finance risky asset growth; it reflects realized losses being funded at the margin; and it increases funding fragility, making banks more vulnerable to shocks.&lt;/p&gt;
&lt;p&gt;Q: How strong is the joint signal from insolvency and noncore funding?
A: A bank in the top 5th percentile of both insolvency risk and noncore funding vulnerability faces a three-year failure probability of 27% in the historical sample and 27% in the modern sample. The unconditional three-year failure probability is 2.5% in the historical sample and 1% in the modern sample. This amounts to a 10- to 20-fold increase in failure probability, illustrating that the combination of solvency and funding weakness is a powerful joint predictor.&lt;/p&gt;
&lt;p&gt;Q: Were deposit outflows common before the FDIC, and did they decline after its introduction?
A: In the 1880–1934 historical sample, deposits in failing banks declined on average by 14% between the last call report and failure, with 25% of pre-FDIC failures preceded by outflows exceeding 20%; during the period before the banking holiday the average deposit decline was 21%. In contrast, in the modern sample (1993–2024), average pre-failure deposit outflows were only 2.5%, and outflows exceeding 20% occurred in only 3% of failures, consistent with deposit insurance insulating most depositors.&lt;/p&gt;
&lt;p&gt;Q: Are failures with large deposit outflows (runs) less connected to weak fundamentals than other failures?
A: No. The paper finds that failures with large deposit outflows are as predictable as failures without large deposit outflows. The relationship between insolvency risk or noncore funding and three-year failure probability is similar for failures with and without large deposit outflows. This implies that runs did not disproportionately strike banks with otherwise strong fundamentals.&lt;/p&gt;
&lt;p&gt;Q: What do asset recovery rates reveal about the insolvency status of pre-FDIC failed banks?
A: Recovery rates on pre-FDIC failed banks averaged 52% of book value of assets. Under the extreme assumption that receivership destroys zero bank value, runs on non-fundamentally-insolvent (weak but solvent) banks account for fewer than 8% of pre-FDIC failures. Under the equally extreme assumption that failure destroys 20% of bank value, this share rises to 22%. The majority of pre-FDIC failed banks therefore appear to have been fundamentally insolvent.&lt;/p&gt;
&lt;p&gt;Q: What did contemporary OCC bank examiners attribute as the causes of bank failures?
A: OCC bank examiners classified most pre-FDIC failures as caused by losses, fraud, or external economic shocks. Runs and liquidity issues together account for fewer than 2% of OCC-classified failures, notwithstanding the common occurrence of large deposit outflows before many of these failures. This examiner evidence supports the solvency view.&lt;/p&gt;
&lt;p&gt;Q: Can bank-level fundamentals predict systemic banking crises and aggregate failure waves?
A: Yes. The authors aggregate out-of-sample predicted failure probabilities to construct a predicted aggregate bank failure rate. The R-squared from regressing the actual aggregate bank failure rate on this predicted rate is 40%, indicating that spikes in bank failures during systemic crises are substantially accounted for by the prior deterioration of bank-level fundamentals.&lt;/p&gt;
&lt;p&gt;Q: Why is predictability higher in the modern sample than in the historical sample?
A: The authors identify several reasons. Accounting data quality is higher in the modern sample. Historical national banks operated as unit branches with less geographic diversification, making idiosyncratic shocks more important and harder to predict. Modern-era failures are preceded by larger lending booms that produce more predictable downstream losses. Additionally, in the modern context bank failures are largely supervisory decisions, and frictions in the supervisory process may delay closure and thereby increase predictability.&lt;/p&gt;
&lt;p&gt;Q: What role do the authors assign to depositor inattention?
A: The high predictability of failures combined with the finding that many failing banks had high predicted failure probabilities before actually failing suggests that depositors were often slow to react to observable signals of bank weakness. The authors note this points to behavioral frictions such as neglect of downside risk (Gennaioli et al., 2012) and sleepy or inattentive depositors (Hanson et al., 2015; Jiang et al., 2023), rather than the rational, forward-looking depositor assumption embedded in standard bank run models.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s overall interpretive conclusion about the relative importance of solvency versus runs?
A: The primary cause of bank failures is almost always and everywhere a deterioration of bank solvency. Runs were more common in the historical pre-FDIC data as a mechanism triggering failure, but they typically closed banks that were already fundamentally insolvent. Non-fundamental, self-fulfilling runs on otherwise healthy banks appear to be an uncommon cause of bank failures. Under the solvency view, even when runs occur, they are the trigger and final mechanism rather than the root cause.&lt;/p&gt;
&lt;p&gt;Insolvency risk: A bank&amp;rsquo;s proximity to default, proxied in the historical sample by surplus profits relative to equity (capturing profitability and capitalization) and in the modern sample by net income to assets. High insolvency risk reflects declining profitability and eroding capital buffers.&lt;/p&gt;
&lt;p&gt;Noncore funding: Expensive, risk-sensitive funding sources outside core demand deposits, including time deposits, wholesale funding (bills payable, rediscounts), and non-deposit wholesale borrowings. Banks relying heavily on noncore funding face higher funding costs, reduced profitability, and greater fragility to funding shocks.&lt;/p&gt;
&lt;p&gt;Fundamental run: A run triggered when bank fundamentals are so weak (theta at or below the lower threshold in the Goldstein-Pauzner framework) that all depositors have an incentive to withdraw regardless of others&amp;rsquo; actions — the bank is effectively insolvent and failure is inevitable.&lt;/p&gt;
&lt;p&gt;Panic-based run: A run triggered when bank fundamentals are moderately weak (below the threshold equilibrium in Goldstein-Pauzner) but the bank would have been able to pay all creditors absent the run; the run itself destroys value and causes failure.&lt;/p&gt;
&lt;p&gt;Non-fundamental (self-fulfilling) run: A run on an otherwise solvent bank driven purely by depositor coordination failure, as in Diamond and Dybvig (1983); failure arises from one of two equilibria and is not predicted by fundamentals.&lt;/p&gt;
&lt;p&gt;Recovery rate: Funds ultimately collected by the receiver throughout receivership proceedings divided by the book value of assets at suspension; used as a proxy for the degree of fundamental insolvency at failure. Pre-FDIC recovery rates averaged 52% of book value.&lt;/p&gt;
&lt;p&gt;Area Under the ROC Curve (AUC): A measure of binary classification performance used to quantify the predictability of bank failures; an uninformative predictor has AUC of 0.5, while AUC of 1.0 indicates perfect classification. In this paper, AUC ranges from 86% (historical, one-year horizon) to 95% (modern).&lt;/p&gt;
&lt;p&gt;Boom-bust pattern: The systematic tendency of failing banks to experience rapid loan-driven asset growth in the years preceding failure followed by asset contraction in the final two years before failure — present in both the historical and modern samples, more pronounced in the latter, with real assets expanding by 34% from ten to three years before failure.&lt;/p&gt;</description></item><item><title>Financial Frictions: Micro versus Macro Volatility</title><link>https://macropaperwarehouse.com/papers/financial-frictions-micro-versus-macro-volatility/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/financial-frictions-micro-versus-macro-volatility/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; How do consumer credit spreads — the gap between household borrowing rates and deposit rates — affect aggregate business cycle dynamics and the distribution of consumption across the wealth distribution? And what is the welfare trade-off between macroeconomic stabilization and household-level consumption volatility when bank capital requirements are tightened?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Empirical Approach.&lt;/strong&gt; The empirical analysis draws on Danish administrative register data for 2003–2018, combining approximately 15.5 million household-year observations. Income tax return data, which capture housing wealth, portfolio wealth, bank deposits, and bank and mortgage debt, are merged with bank-level reporting of interest rates submitted to Danmarks Nationalbank (MFI data). Household-specific credit spreads are constructed as the difference between the loan rate at a household&amp;rsquo;s primary loan bank and the deposit rate at its primary deposit bank in a given year. Consumption is imputed from household balance sheets following the method of Crawley and Kuchler (2023). The empirical specifications include household and time fixed effects, and quantile regressions are run across bins of the net wealth distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The authors develop a Heterogeneous Agent New Keynesian (HANK) model with explicit banking intermediation. Banks, subject to an agency friction following Gertler and Karadi (2011) — in which bankers can divert a fraction λ = 0.381 of assets — combine household deposits with net worth to invest in corporate equity and consumer loans. This leverage constraint generates an endogenous, countercyclical spread between borrowing and saving rates. Households face idiosyncratic income risk and a kink in their budget constraint at zero net worth due to the spread. The supply side features New Keynesian sticky prices (Rotemberg quadratic adjustment costs) and a Taylor rule. Aggregate shocks include monetary policy surprises, total factor productivity (TFP), and capital quality shocks (affecting bank net worth). The model is solved by first-order perturbation using the method of Bayer and Luetticke (2020) and calibrated to Danish macro and micro moments for 2003–2018.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Empirical Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The average consumer credit spread in Denmark is strongly countercyclical, with a cross-correlation with HP-filtered output of −0.44 in the data (−0.31 in the model).&lt;/li&gt;
&lt;li&gt;Higher credit spreads increase the transition rate into the zero net wealth state for households with moderately positive wealth at the beginning of the year, and reduce the outflow rate for households already at zero net wealth.&lt;/li&gt;
&lt;li&gt;Pooled OLS (with household and time fixed effects) finds that a higher spread is negatively associated with consumption (coefficient −0.266), and the interaction between spread and log income is positive (coefficient 1.366), indicating that higher spreads raise income sensitivity of consumption. For below-median wealth households, the income–consumption link is stronger and the negative spread effect on consumption is larger.&lt;/li&gt;
&lt;li&gt;The consumption-income elasticity derived from quantile regression estimates has a standard deviation of 2.4 percent and a cross-correlation with output of −0.53 when spread variation is incorporated; holding spreads constant roughly halves the volatility (to 1.3 percent) and reduces the countercyclicality (cross-correlation −0.31).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Model Aggregate Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Consumer credit is procyclical (cross-correlation with output 0.56 in data, 0.67 in model) and more than twice as volatile as output (standard deviation ratio 2.11 in data, 1.51 in model).&lt;/li&gt;
&lt;li&gt;Capital quality shocks and monetary policy shocks are amplified at the aggregate level through a financial accelerator working through endogenous spread movements. TFP shocks generate little spread amplification because households&amp;rsquo; labor supply responses partially insulate banks&amp;rsquo; net worth.&lt;/li&gt;
&lt;li&gt;A 1 percentage point contractionary monetary policy shock leads to a sharp, persistent decline in aggregate output and investment, and is amplified relative to a constant-spread HANK benchmark.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Distributional Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;In response to a contractionary monetary policy shock, consumption of households at the 10th percentile of the consumption distribution (who are indebted) falls sharply in the short run, while consumption of the 90th percentile (wealthy households) rises in the short run due to higher returns on savings. The responses converge across the distribution in the medium run as spreads normalize.&lt;/li&gt;
&lt;li&gt;When the consumer credit spread is held constant, consumption paths move in parallel across the wealth distribution, demonstrating that endogenous spread movements are the key driver of distributional effects for monetary policy and capital quality shocks.&lt;/li&gt;
&lt;li&gt;The MPC is countercyclical in the model, with a cross-correlation with output of −0.60 (unconditional), compared with −0.53 for the empirically-estimated consumption-income elasticity. The consumption-income elasticity and MPC are correlated at 90 percent in the model at the annual rate.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Macroprudential Regulation.&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;A tightening of bank capital requirements reducing leverage by 10 percent (diversion parameter λ rising from 0.381 to 0.445) reduces output volatility by 5.5 percent and investment volatility by 10.1 percent, and does so at apparently no long-run aggregate cost in the HANK setting (precautionary savings stimulate output and consumption in the stationary equilibrium).&lt;/li&gt;
&lt;li&gt;However, the regulation increases the annual consumer credit spread by 40 basis points, raises household consumption volatility across the wealth distribution (from about 8 percent to 10 percent for the poorest households under idiosyncratic shocks alone), and generates welfare losses across all deciles equivalent to 0.24–4.28 percent of consumption (with aggregate welfare loss of 0.79 percent).&lt;/li&gt;
&lt;li&gt;When aggregate shocks are included, the lower cyclical sensitivity of spreads partially mitigates welfare losses for the poorest 80 percent of the population, but the overall welfare effect remains negative with an aggregate loss equivalent to 0.58 percent of consumption. The paper thus documents a trade-off between macro volatility (stabilized) and micro volatility (increased).&lt;/li&gt;
&lt;li&gt;Results are robust to the extension of the model to three assets (including illiquid assets), which provides a better fit to micro data without materially changing the welfare conclusions.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-specific-danish-dataset-used-and-how-is-consumption-constructed"&gt;Q1. What is the specific Danish dataset used, and how is consumption constructed?&lt;/h3&gt;
&lt;p&gt;A: The dataset covers 2003–2018 from Statistics Denmark administrative registers, combining income tax return data (which report end-of-year balances on all bank accounts, housing wealth, portfolio wealth, bank deposits, bank loans, and mortgage debt) with bank-level MFI interest rate reporting submitted to Danmarks Nationalbank. The total sample is approximately 15.5 million household-year observations (about 1.76–1.97 million households per year). Consumption is imputed as after-tax labor income plus after-tax financial income minus the change in end-of-year net worth, following Crawley and Kuchler (2023). Households with self-employment, housing transactions in the current or prior year, negative imputed consumption, or in the bottom and top 1 percent of wealth or income distributions are excluded.&lt;/p&gt;
&lt;h3 id="q2-how-are-household-specific-credit-spreads-constructed-from-the-administrative-data"&gt;Q2. How are household-specific credit spreads constructed from the administrative data?&lt;/h3&gt;
&lt;p&gt;A: Each household&amp;rsquo;s primary loan bank is defined as the bank where it holds the largest loan balance at end of calendar year, and the primary deposit bank as the one holding the largest deposit balance. The household-specific spread is the difference between the loan rate applied by the primary loan bank and the deposit rate applied by the primary deposit bank, both measured as averages over the calendar year. If a household has no loans, the loan rate of the primary deposit bank is used. This construction yields a household-level interest rate spread that moves countercyclically at the aggregate level (cross-correlation with HP-filtered output of −0.44).&lt;/p&gt;
&lt;h3 id="q3-what-do-the-empirical-results-say-about-the-relationship-between-spreads-and-the-probability-of-a-household-reaching-zero-net-wealth"&gt;Q3. What do the empirical results say about the relationship between spreads and the probability of a household reaching zero net wealth?&lt;/h3&gt;
&lt;p&gt;A: Equation (2) is estimated as a linear probability model for the transition to zero net wealth (defined as net assets within plus or minus two weeks of 2007 median weekly income). Higher spreads significantly increase the transition rate into zero net wealth for households with moderately positive net wealth at the beginning of the year (those in the third to sixth net wealth bins), and reduce the outflow rate from zero net wealth for households already in that state. Higher spreads also appear to increase debt repayments for indebted households (third to fifth bins), making it more difficult for them to accumulate wealth. Households at the extremes of the wealth distribution (very poor or very wealthy) show essentially no sensitivity of transition rates to spread movements.&lt;/p&gt;
&lt;h3 id="q4-what-do-the-consumption-regressions-in-table-1-find-and-what-is-the-key-identification-caveat"&gt;Q4. What do the consumption regressions in Table 1 find, and what is the key identification caveat?&lt;/h3&gt;
&lt;p&gt;A: The pooled regression (column 1) finds a positive income–consumption coefficient of 0.372, a negative spread coefficient of −0.266, and a positive income–spread interaction of 1.366, all statistically significant with standard errors clustered at the household level (15,610,327 observations, R² = 0.591). When interacted with below-median wealth (column 2), the income coefficient is larger (0.397 versus 0.335 for above-median), the spread effect is more negative for below-median wealth (−0.362 versus −0.101 for above-median), and the income–spread interaction is stronger for below-median wealth (1.640 versus 0.875). The authors explicitly note that these results should not be given a causal interpretation, as income and consumption are likely jointly determined. Institutional features of the Danish mortgage market (covered bonds, competitive market, rates independent of borrower credit situation) minimize confounding from mortgage rate correlation with consumer credit spreads.&lt;/p&gt;
&lt;h3 id="q5-how-do-the-quantile-regression-results-and-the-derived-consumption-income-elasticity-demonstrate-countercyclical-mpc"&gt;Q5. How do the quantile regression results and the derived consumption-income elasticity demonstrate countercyclical MPC?&lt;/h3&gt;
&lt;p&gt;A: Quantile regressions across five-percent bins of the net wealth distribution show that income coefficients decline with wealth (from nearly 0.5 for the poorest to about 0.35 for the wealthiest households), spread coefficients are negative for households with negative, zero, and moderately positive wealth and positive for significantly wealthy households, and the income–spread interaction term is positive for all but the richest households (largest near zero net wealth). The consumption-income elasticity is computed as β₀,ⱼ + β₂,ⱼ × spread at the household level, then averaged cross-sectionally. When only wealth distribution shifts are allowed, the elasticity&amp;rsquo;s standard deviation is 1.3 percent and its cross-correlation with HP-filtered output is −0.31. When spread variation is also incorporated, standard deviation rises to 2.4 percent and the cross-correlation becomes −0.53. This measure is highly correlated (90 percent) with the model MPC, supporting the inference that the MPC is countercyclical.&lt;/p&gt;
&lt;h3 id="q6-what-is-the-structure-of-the-banking-sector-in-the-hank-model-and-how-does-the-agency-friction-generate-a-countercyclical-spread"&gt;Q6. What is the structure of the banking sector in the HANK model, and how does the agency friction generate a countercyclical spread?&lt;/h3&gt;
&lt;p&gt;A: A continuum of banks combines household deposits with net worth to invest in corporate equity and consumer loans. Bankers can divert a fraction λ = 0.381 of assets, and if they do so, depositors can recover only the remaining fraction (1 − λ). This threat of diversion constrains the supply of deposits, resulting in banks needing to earn excess returns — Et(RK,t+1 − RS,t+1) &amp;gt; 0 — on their assets relative to the deposit rate. The leverage ratio is bounded above by ϱt/λ, where ϱt is a value multiplier that depends on current and expected future excess returns. When an adverse shock (capital quality shock or monetary tightening) reduces banking sector net worth, the leverage constraint tightens, banks reduce asset supply, and the spread between the return on capital (and hence the consumer loan rate, which is proportional to RK at markup ωB = 0.0075) and the deposit rate rises. This generates the observed countercyclical credit spread.&lt;/p&gt;
&lt;h3 id="q7-in-the-model-how-do-aggregate-shocks-affect-the-distribution-of-consumption-and-why-is-the-monetary-policy-shock-particularly-distributional"&gt;Q7. In the model, how do aggregate shocks affect the distribution of consumption, and why is the monetary policy shock particularly distributional?&lt;/h3&gt;
&lt;p&gt;A: A one-percent capital quality shock reduces both wages and bank net worth, causing spreads to rise. In the baseline economy, rising borrowing rates lead to a large reduction in consumption for indebted households (10th percentile) while the constant spread model shows near-parallel movements across the distribution. A one-percentage-point monetary policy shock reduces equity returns, depressing bank net worth and (with a lag) raising spreads. Indebted households face both lower labor income and higher borrowing costs, producing a sharp consumption decline at the 10th percentile; wealthy households gain from higher returns on savings, so their consumption rises in the short run. Responses converge as spreads return to normal over the medium run. This matches empirical evidence from Holm, Paul, and Tischbirek (2021) for Norway. For TFP shocks, banks&amp;rsquo; net worth is less affected because households&amp;rsquo; higher labor supply partially offsets the productivity decline, so spreads move little and distributional effects are smaller (driven mainly by wage effects across the distribution).&lt;/p&gt;
&lt;h3 id="q8-how-does-the-financial-accelerator-in-the-hank-model-compare-to-the-rank-version"&gt;Q8. How does the financial accelerator in the HANK model compare to the RANK version?&lt;/h3&gt;
&lt;p&gt;A: In response to capital quality shocks and monetary policy shocks, the HANK model with banking frictions generates amplification relative to a constant-spread HANK benchmark, confirming the presence of a financial accelerator. However, relative to the RANK model, the incomplete markets model implies slightly less amplification of aggregate investment and consumption. This is because, in the HANK model, households facing higher credit spreads increase their labor supply (precautionary motive), which partially stabilizes aggregate income and moderates the financial accelerator. The finding that heterogeneous agent aspects are less important at the aggregate level is consistent with Berger, Bocola, and Dovis (2020). For TFP shocks, the financial accelerator through spreads is largely absent in both HANK and RANK, as spread changes are minor.&lt;/p&gt;
&lt;h3 id="q9-what-are-the-long-run-aggregate-effects-of-tightening-bank-capital-requirements-reducing-leverage-by-10-percent-in-the-hank-versus-rank-model"&gt;Q9. What are the long-run aggregate effects of tightening bank capital requirements (reducing leverage by 10 percent) in the HANK versus RANK model?&lt;/h3&gt;
&lt;p&gt;A: In the RANK model, higher capital requirements increase the annual spread between the return on capital and the deposit rate by 25 basis points, reduce the aggregate capital stock by 2.4 percent, output by 0.5 percent, and aggregate consumption by 0.8 percent. In the HANK model, the spread increases by 40 basis points annually, but the mechanism differs: much of the spread change is absorbed by a reduction in the deposit rate (from 3.81 percent to 3.54 percent annually) rather than an increase in the capital return. Households respond to the lower deposit rate and higher credit costs by increasing precautionary savings and labor supply, so aggregate output and consumption actually rise slightly in the HANK stationary equilibrium. The capital requirements thus appear costless at the aggregate level in the HANK model — but this masks welfare costs that operate through the idiosyncratic risk channel.&lt;/p&gt;
&lt;h3 id="q10-what-are-the-quantitative-welfare-costs-of-macroprudential-regulation-and-how-do-they-vary-across-the-wealth-distribution-and-between-idiosyncratic-and-aggregate-shocks"&gt;Q10. What are the quantitative welfare costs of macroprudential regulation, and how do they vary across the wealth distribution and between idiosyncratic and aggregate shocks?&lt;/h3&gt;
&lt;p&gt;A: Welfare is measured as the fraction of lifetime consumption households are willing to give up to stay in the unregulated baseline. In the face of idiosyncratic shocks only, welfare losses range from 0.24 to 0.43 percent of consumption for the first seven wealth deciles, and reach 4.28 percent for the richest decile (primarily because of the reduction in the return on their savings), with an average welfare loss of 0.79 percent. When aggregate shocks are added, the losses are substantially reduced for the poorest 80 percent (due to lower cyclical sensitivity of spreads), but remain large for the wealthiest decile (4.23 percent) and in aggregate (0.58 percent). These results are robust to the three-asset model extension, where the poorest households are approximately welfare-neutral under the regulation when aggregate shocks are included (0.00 percent), but aggregate welfare losses remain at 0.75 percent.&lt;/p&gt;
&lt;h3 id="q11-how-does-the-three-asset-model-extension-with-illiquid-assets-affect-the-key-results"&gt;Q11. How does the three-asset model extension (with illiquid assets) affect the key results?&lt;/h3&gt;
&lt;p&gt;A: In the three-asset extension, households can hold illiquid capital (calibrated with an adjustment probability of φk = 0.0025 per quarter, targeting the Danish ratio of bank deposits to output of 34 percent), creating wealthy hand-to-mouth households who have illiquid assets but no liquid assets. The consumption impulse responses across the wealth distribution remain very similar to the two-asset baseline: endogenous spread movements generate heterogeneous consumption dynamics in response to capital quality and monetary shocks, while constant-spread models produce near-parallel responses. The three-asset model provides a better fit to the micro data (consumption-spread-income relationship across the wealth distribution), but the welfare conclusions from macroprudential regulation are essentially unchanged: welfare losses across the distribution in the stationary equilibrium, partially mitigated when aggregate shocks are added, with losses concentrated in the richest decile.&lt;/p&gt;
&lt;h3 id="q12-what-robustness-checks-are-reported-for-the-empirical-consumption-regressions"&gt;Q12. What robustness checks are reported for the empirical consumption regressions?&lt;/h3&gt;
&lt;p&gt;A: Three robustness exercises are reported. First, capitalizing car purchases using their official tax value (rather than treating car purchases as current expenditure) yields coefficients similar to the baseline (Table 10). Second, excluding households who purchase a car in the current or prior year (reducing the sample to 13.24 million observations) also leaves results unchanged. Third, first-differenced specifications (equation 42, with and without household fixed effects) produce results similar to the levels specification; the main exception is the spread effect for above-median wealth households when household fixed effects are omitted from the differenced specification (Table 11). The income–spread interaction is consistently positive and significant across all robustness checks.&lt;/p&gt;
&lt;h3 id="q13-what-evidence-does-the-paper-provide-that-the-models-mpc-is-countercyclical-and-that-credit-spreads-are-the-primary-driver"&gt;Q13. What evidence does the paper provide that the model&amp;rsquo;s MPC is countercyclical and that credit spreads are the primary driver?&lt;/h3&gt;
&lt;p&gt;A: Figure 7 shows impulse response functions of the average MPC to each of the three aggregate shocks. In all three cases, the MPC rises in recessions (countercyclical). The key mechanism is that adverse shocks cause spreads to rise, increasing the mass of households at the kink in the budget constraint (zero liquid assets), where MPCs are highest. When the consumer credit spread is held constant, the MPC remains countercyclical but close to constant, indicating that spread movements account for most of the cyclical variation in MPC. Eliminating the spread altogether implies an acyclical MPC (Table 12, Appendix D). The unconditional cross-correlation of the model MPC with output is −0.60, compared with −0.53 for the empirically estimated consumption-income elasticity in the Danish data.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Consumer credit spread (borrowing-saving spread):&lt;/strong&gt; In the paper, this is the difference between the gross real interest rate on consumer loans (RL,t) charged by banks and the gross real return on deposits (RS,t) received by savers. It is not an abstract measure of credit conditions but a household-specific, bank-derived rate gap that moves countercyclically due to banking agency frictions and creates a kink in households&amp;rsquo; budget constraints at zero net worth. Distinct from mortgage spreads (which in Denmark are market-determined and independent of borrower credit conditions).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Kink in the budget constraint:&lt;/strong&gt; The household budget constraint has a kink at zero net assets because borrowers face RL,t &amp;gt; RS,t; households at exactly zero liquid assets (type IV in the paper&amp;rsquo;s taxonomy) face a discrete jump in the cost of additional borrowing. This kink creates a mass point in the wealth distribution at zero net wealth, and households at this kink have higher MPCs than unconstrained savers or borrowers. The size of the mass point increases when the spread rises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Financial accelerator (in the HANK-with-banking context):&lt;/strong&gt; The amplification mechanism in which shocks that reduce banking sector net worth tighten banks&amp;rsquo; leverage constraints, raise credit spreads, reduce asset supply to both the corporate sector and households, and further depress investment and consumption — which in turn reduces bank net worth further. In this paper, the accelerator operates through the consumer credit spread channel in addition to the standard corporate lending channel, and is present for capital quality and monetary policy shocks but not materially for TFP shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Countercyclical MPC:&lt;/strong&gt; The MPC — defined as the response of consumption to a small transitory income shock — rises during recessions and falls during expansions in this model. The mechanism is that recessions are associated with higher consumer credit spreads, which expand the mass of households at or near the zero net wealth kink (high MPC), and contract the mass of unconstrained savers (low MPC). This is a distinct source of MPC cyclicality from the wealth distribution channel alone.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Agency friction (diversion problem):&lt;/strong&gt; Banks can divert a fraction λ of their assets; if they do so, depositors can recover only the fraction (1 − λ) and the bank is liquidated. This threat limits depositors&amp;rsquo; willingness to supply funds, resulting in an incentive-compatibility constraint on bank leverage: assets cannot exceed ϱt/λ (where ϱt is the bank&amp;rsquo;s franchise value multiplier). When ϱt declines (because expected excess returns fall), the constraint binds more tightly and the spread between the return on assets and the deposit rate must be positive to sustain bank participation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Macro versus micro volatility trade-off:&lt;/strong&gt; The paper uses this phrase to describe the finding that tighter bank capital requirements (restricting leverage) reduce the cyclical volatility of aggregate output and investment (macro volatility falls) while simultaneously increasing the volatility of individual household consumption streams due to higher credit spreads and lower deposit returns (micro volatility rises). Welfare costs from increased micro volatility outweigh the aggregate stabilization benefits.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption-income elasticity (d log c / d log y):&lt;/strong&gt; A time-varying cross-sectional average measure derived from quantile regression parameter estimates, equal to β₀,ⱼ + β₂,ⱼ × RSi,t for household i in wealth bin j. It is used in the paper as an empirical proxy for the MPC (not a direct estimate), and is shown to be highly correlated with the model MPC (cross-correlation of 90 percent at the annual rate). Its cyclicality is stronger when spread variation is incorporated (standard deviation 2.4 percent, cross-correlation with output −0.53) than when spreads are held fixed (standard deviation 1.3 percent, cross-correlation −0.31).&lt;/p&gt;</description></item><item><title>Financial shocks and leverage of financial institutions: When do they matter?</title><link>https://macropaperwarehouse.com/papers/financial-shocks-and-leverage-of-financial-institutions-when-do-they-matter/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/financial-shocks-and-leverage-of-financial-institutions-when-do-they-matter/</guid><description>&lt;p&gt;This paper investigates the role of leverage of financial institutions in amplifying the transmission of financial shocks to the macroeconomy, with particular attention to whether that amplification differs across economic regimes. The authors develop a new endogenous regime-switching structural vector autoregression (RS-SVAR) model with time-varying transition probabilities, in which the probability of switching regime depends on the contemporaneous state of the economy (endogenous switching). The model extends the Sims and Zha (2006) and Sims, Waggoner, and Zha (2008) Markov-switching SVAR framework by: (1) incorporating a time-varying transition matrix in which the probability of staying in a regime is a logistic function of lagged endogenous variables; and (2) introducing new identification techniques for RS-SVARs, including non-recursive zero restrictions, sign restrictions, and narrative sign restrictions, which can in some cases uniquely identify structural shocks rather than merely set-identify them.&lt;/p&gt;
&lt;p&gt;The leverage measure is market-based — book assets divided by market equity — constructed from CRSP/Compustat institution-level data covering publicly listed depository institutions, bank holding companies, and nonbank financial institutions. The sample runs monthly from December 1988 to December 2019. The five-variable VAR includes industrial production growth, core CPI inflation, the 2-year Treasury rate, market leverage of financial institutions, and the Chicago Fed&amp;rsquo;s National Financial Conditions Index (NFCI). The authors estimate three model variants that substitute in turn the leverage of: (i) all depository institutions, (ii) Global Systemically Important Banks (GSIBs), and (iii) securities brokers and dealers.&lt;/p&gt;
&lt;p&gt;The model identifies two coefficient regimes — a &amp;ldquo;financial constraint&amp;rdquo; regime and &amp;ldquo;normal times&amp;rdquo; — using the criterion that the first regime has higher smoothed probability during September 2008 to August 2009. The financial constraint regime covers the end of the Savings and Loan crisis, the 1990/91 recession, the Russian debt default, the Global Financial Crisis (GFC), and the European sovereign debt crisis.&lt;/p&gt;
&lt;p&gt;The core finding is that real effects of financial shocks are amplified in the financial constraint regime but not in normal times. In the financial constraint regime, the output response to a financial shock is significantly negative, large, and protracted; GSIB leverage initially rises sharply (as falling asset prices erode equity) and then declines as institutions deleverage. In normal times, the output growth response is negative but non-persistent, and market leverage remains insignificant over the entire horizon.&lt;/p&gt;
&lt;p&gt;The counterfactual experiment holding GSIB market leverage constant as of October 2008 is the sharpest quantitative result: if GSIB leverage had not risen further at the onset of the GFC, the decline in industrial production growth would have been approximately 20 percentage points smaller, with a faster subsequent recovery in output growth and inflation and higher short-term interest rates. The counterfactual probability of staying in the financial constraint regime would have fallen as low as 0.1 for some draws, compared to the actual probability remaining elevated. By contrast, for a system using depository institution leverage, the lower-bound counterfactual probability of staying in the constraint regime does not fall below 0.90, indicating substantially weaker heterogeneity effects for the broader depository sector.&lt;/p&gt;
&lt;p&gt;Securities brokers and dealers show leverage that rises more on impact than other institutions and then declines immediately, consistent with their willingness to expand balance sheets going into the crisis amplifying losses and forcing a sharp post-crisis contraction.&lt;/p&gt;
&lt;p&gt;A separate counterfactual holding the NFCI constant (rather than leverage) shows that the probability of staying in the constraint regime does not decline, confirming that market leverage and the financial conditions index provide distinct characterizations of the financial system and have different implications for shock propagation and regime persistence. Results are robust to substituting the GZ corporate spread for the NFCI and to imposing narrative restrictions for shock identification.&lt;/p&gt;
&lt;p&gt;Q: What is the central research question?
A: The paper asks whether and how the leverage of financial institutions amplifies the transmission of financial shocks to the real economy, and whether this amplification differs between a financial constraint regime and normal times. A secondary question concerns heterogeneity: do GSIBs, depository institutions broadly, and nonbank securities dealers transmit shocks differently?&lt;/p&gt;
&lt;p&gt;Q: What is novel about the econometric framework?
A: The RS-SVAR model allows the probability of remaining in a given coefficient regime to vary over time as a logistic function of lagged endogenous variables, so regime switching is endogenous to the state of the economy rather than governed by a fixed transition matrix. The paper also introduces sign restrictions, zero restrictions, and narrative sign restrictions into the RS-SVAR class, enabling identification of both structural shocks and regimes within a single framework; in roughly 20 percent of posterior draws these sign restrictions uniquely identify the financial shock.&lt;/p&gt;
&lt;p&gt;Q: Why does the paper use market leverage rather than book leverage?
A: Market leverage (book assets divided by market equity) is argued to be more timely than book leverage because book equity incorporates losses with a delay, giving institutions time to adjust book leverage to avoid regulatory limits. Market capitalization reflects market participants&amp;rsquo; assessment of an institution&amp;rsquo;s creditworthiness, and low market-to-book ratios signal that institutions are more leveraged than their books indicate. Market leverage is therefore a more informative early-warning indicator of financial fragility and the need for rapid deleveraging.&lt;/p&gt;
&lt;p&gt;Q: How are the two regimes identified?
A: For each estimated regime, the authors count the number of months between September 2008 and August 2009 (inclusive) for which the smoothed probability of being in that regime exceeds 0.70; the regime with the higher count is labeled &amp;ldquo;financial constraint&amp;rdquo; and ordered first. Shock identification uses sign restrictions: in the financial constraint regime, a positive financial shock must have a contemporaneously negative effect on output, inflation, and the short-term interest rate, but positive effects on the financial conditions index and leverage; in normal times, only the financial conditions index is required to respond positively on impact.&lt;/p&gt;
&lt;p&gt;Q: What regimes does the model assign historically?
A: The smoothed probability of the financial constraint regime is elevated during the end of the Savings and Loan crisis, the 1990/91 recession, the Russian debt default, the GFC and associated recession (where the probability reaches 1.0 at end-2008 and beginning-2009 before declining sharply to approximately 0.6 percent in 2009/2010), and the European sovereign debt crisis.&lt;/p&gt;
&lt;p&gt;Q: What do the impulse responses show in the financial constraint regime?
A: In the financial constraint regime, the output response to a positive financial shock (tightening) is significantly negative, large, and protracted. GSIB leverage initially rises due to a sharp decline in asset prices eroding market equity, then falls as GSIBs deleverage in response. The authors interpret this pattern as evidence that deleveraging produces procyclical financial amplification effects with adverse real consequences.&lt;/p&gt;
&lt;p&gt;Q: What do the impulse responses show in normal times?
A: In normal times, the output growth response is large and negative but non-persistent, in contrast to the financial constraint regime. Market leverage remains statistically insignificant across the entire horizon in normal times, indicating that the leverage amplification channel is inactive outside of financial constraint episodes.&lt;/p&gt;
&lt;p&gt;Q: What does the GSIB leverage counterfactual show quantitatively?
A: Holding GSIB market leverage constant as of October 2008 implies a decline in industrial production growth that is approximately 20 percentage points smaller than actually occurred, along with a faster recovery in output growth and inflation and higher short-term interest rates. The counterfactual probability of staying in the financial constraint regime declines to as low as 0.1 for some posterior draws, compared to remaining elevated in the actual data.&lt;/p&gt;
&lt;p&gt;Q: How do depository institutions compare to GSIBs in the counterfactual?
A: For the model using broad depository institution leverage, the lower-bound counterfactual probability of staying in the financial constraint regime does not fall below 0.90, compared to as low as 0.1 for the GSIB specification. This implies that GSIB deleveraging has substantially more detrimental macroeconomic effects and a much larger effect on regime persistence than the broader depository sector.&lt;/p&gt;
&lt;p&gt;Q: What is distinctive about securities brokers and dealers?
A: Broker-dealer market leverage rises more on impact than leverage of other financial institutions following a financial shock, and then immediately declines due to rapid deleveraging. The authors interpret this as reflecting that dealers&amp;rsquo; willingness to expand balance sheets ahead of the crisis amplified growth and losses, followed by a sharp post-crisis contraction — a pattern consistent with the procyclical leverage mechanism described in Adrian and Shin (2014).&lt;/p&gt;
&lt;p&gt;Q: How do the authors distinguish the role of market leverage from the financial conditions index?
A: A counterfactual holding the NFCI constant (rather than leverage) as of October 2008 shows that the probability of staying in the financial constraint regime does not decline, unlike the leverage counterfactual. This demonstrates that market leverage and the NFCI provide distinct characterizations of financial conditions and have different implications for the propagation of shocks and the persistence of the constraint regime.&lt;/p&gt;
&lt;p&gt;Q: How robust are the results?
A: Substituting the GZ corporate bond spread for the NFCI yields very similar results, specifically that the probability of staying in the constraint regime declines much more in the counterfactual than in the actual data, suggesting the findings are not driven by the choice of financial conditions proxy. Imposing narrative restrictions for shock identification (exploiting the known high-stress period around Lehman&amp;rsquo;s failure in September 2008) yields results that are &amp;ldquo;rather robust&amp;rdquo; relative to the baseline sign-restriction identification.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications?
A: The results confirm the leverage ratio as a useful financial stability indicator, with particular emphasis on market leverage as providing timely information for monitoring. The heterogeneity findings suggest that regulatory attention to GSIB leverage is especially warranted, since GSIB deleveraging can have substantially more detrimental macroeconomic effects and a much larger influence on the persistence of financial constraint regimes than deleveraging by the broader depository sector. The leverage ratio is characterized as complementary to the risk-weighted capital ratio as a regulatory tool.&lt;/p&gt;
&lt;p&gt;Market leverage: Measured as book assets divided by market equity (not book equity), constructed from CRSP/Compustat institution-level data at monthly frequency. The paper argues market leverage is more timely than book leverage because market equity immediately reflects losses, preventing institutions from masking fragility through delayed book adjustments.&lt;/p&gt;
&lt;p&gt;Financial constraint regime: One of two identified coefficient regimes in the RS-SVAR, characterized by a significantly negative, large, and protracted output response to financial shocks and by active leverage amplification. Identified empirically as the regime with the highest smoothed probability during September 2008 to August 2009.&lt;/p&gt;
&lt;p&gt;Endogenous regime switching: A modeling approach in which the probability of transitioning between regimes depends on lagged values of the endogenous variables themselves (via a logistic function), rather than being governed by a fixed constant transition matrix. This allows regime dynamics to respond to the state of the economy.&lt;/p&gt;
&lt;p&gt;Time-varying transition probabilities: The diagonal elements of the coefficient-regime transition matrix follow a logistic transformation of a linear function of lagged endogenous variables, so the probability of remaining in any given regime changes each period as a function of current financial and macroeconomic conditions.&lt;/p&gt;
&lt;p&gt;Procyclical financial amplification: The mechanism by which financial institution deleveraging in response to falling asset prices further tightens financial conditions and reduces real output, generating a feedback loop. The paper provides empirical evidence for this channel operating specifically in financial constraint regimes.&lt;/p&gt;
&lt;p&gt;Heterogeneity of financial institutions: The finding that GSIBs, broad depository institutions, and securities brokers and dealers differ substantially in how their leverage affects the transmission of financial shocks. GSIB deleveraging is shown to have much more detrimental macroeconomic effects and a much larger influence on the probability of remaining in the financial constraint regime than depository institution deleveraging more broadly.&lt;/p&gt;
&lt;p&gt;Narrative sign restrictions in RS-SVARs: An identification technique extended from Antolin-Diaz and Rubio-Ramirez (2018) to the regime-switching context, which uses known historical episodes (here, the Lehman failure in September 2008) to impose restrictions on which regime the economy was in or on the sign of structural shocks at particular dates, thereby aiding identification of both shocks and regimes.&lt;/p&gt;</description></item><item><title>Firm Accommodation After Workplace Disability: Labor Market Impacts and Implications for Subsidy Design</title><link>https://macropaperwarehouse.com/papers/firm-accommodation-after-workplace-disability-labor-market-impacts-and-implications-for-subsidy-design/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/firm-accommodation-after-workplace-disability-labor-market-impacts-and-implications-for-subsidy-design/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper studies (1) how firm accommodation decisions respond to financial incentives in the context of workplace disability under workers&amp;rsquo; compensation, (2) what the causal effect of accommodation is on workers&amp;rsquo; subsequent labor market outcomes, and (3) whether the equilibrium level of accommodation is socially efficient, and what the welfare implications of wage subsidies for accommodation are.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Context and Data&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The analysis uses the universe of Oregon workers&amp;rsquo; compensation claims from 2005 through 2017 — over 131,000 disabling claims — linked to longitudinal quarterly earnings records from the Oregon Employment Department. The setting exploits Oregon&amp;rsquo;s Employer at Injury Program (EAIP), which subsidizes employers who provide &amp;ldquo;transitional work&amp;rdquo; accommodations (primarily through wage subsidies) to workers with temporary workplace disabilities. EAIP accounts for roughly 25 percent of claims on average, with the wage subsidy component representing over 96 percent of EAIP expenses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Identification Strategy&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors exploit a policy change in July 2013 that reduced the EAIP wage subsidy rate from 50 percent to 45 percent. They construct a firm-level &amp;ldquo;exposure&amp;rdquo; measure — the fraction of a firm&amp;rsquo;s claims that used EAIP in a baseline period (2005–2009) — and estimate a continuous difference-in-differences specification in which the interaction of exposure and a post-2013 indicator instruments for accommodation. The identifying assumption is strong parallel trends: firms with low baseline exposure are unlikely to respond to the subsidy reduction, while high-exposure firms respond more, generating cross-firm variation in accommodation rates after 2013. An MTE framework (Heckman and Vytlacil 2005) is then used to explore heterogeneous treatment effects along an unobserved resistance-to-treatment dimension.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Empirical Findings&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The subsidy reduction from 50% to 45% decreased accommodation rates by &lt;strong&gt;2.9 percentage points&lt;/strong&gt; (9.3 percent) for claims in firms with average exposure, implying a subsidy elasticity of accommodation of 0.9.&lt;/li&gt;
&lt;li&gt;The policy change led to a &lt;strong&gt;0.95 percentage point decrease in employment&lt;/strong&gt; and a &lt;strong&gt;$120 decrease in quarterly earnings&lt;/strong&gt; four quarters after disability for claims in average-exposure firms (roughly 1.3–1.5 percent declines relative to means), with no significant effect on worker turnover to other firms.&lt;/li&gt;
&lt;li&gt;IV estimates of the effect of accommodation itself (using predicted EAIP as instrument) show &lt;strong&gt;accommodation increases the probability of employment four quarters after disability by 33 percentage points&lt;/strong&gt; and &lt;strong&gt;increases quarterly earnings by approximately $4,100&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;The MTE analysis reveals &lt;strong&gt;negative selection on gains&lt;/strong&gt;: workers with workplace disabilities who are least likely to receive accommodation have the highest potential gains from it, driven largely by severe disabilities with high accommodation costs.&lt;/li&gt;
&lt;li&gt;Descriptive and IV evidence is consistent with accommodation operating primarily as &lt;strong&gt;general human capital investment&lt;/strong&gt;: accommodation has no statistically significant effect on the probability of moving to a new firm, and earnings gains are not systematically lower for workers who change employers after accommodation.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Structural Model and Counterfactual Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A two-period frictional labor market model with risk-averse workers, risk-neutral firms, Nash bargaining, imperfect experience rating in workers&amp;rsquo; compensation, and firm accommodation as human capital investment is developed and estimated. Two inefficiency sources are identified: (1) a human capital externality — because accommodation builds general human capital, firms cannot capture the full surplus when workers separate, reducing accommodation incentives; and (2) a fiscal externality — imperfectly experience-rated firms do not fully internalize the workers&amp;rsquo; compensation cost savings from accommodation, further depressing it below the efficient level. Counterfactual simulations show:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Eliminating wage subsidies (from 50% to 0%) reduces accommodation rates from &lt;strong&gt;33% to 11%&lt;/strong&gt;, leading to a &lt;strong&gt;7% decline in post-disability employment&lt;/strong&gt; and a &lt;strong&gt;15% decline in post-disability quarterly wages&lt;/strong&gt; (roughly $1,358).&lt;/li&gt;
&lt;li&gt;A revenue-neutral reform eliminating wage subsidies reduces average welfare and the welfare of &lt;strong&gt;more than 90% of workers&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Welfare gains from the subsidy are &lt;strong&gt;larger for low-skilled workers&lt;/strong&gt; than high-skilled workers.&lt;/li&gt;
&lt;li&gt;Conditional on experiencing disability, eliminating wage subsidies decreases welfare by about &lt;strong&gt;10%&lt;/strong&gt;, while increasing the subsidy to 100% raises welfare for disabled workers by around &lt;strong&gt;30%&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Firm profit is maximized at a subsidy rate around 80%, after which higher taxes offset accommodation gains.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-employer-at-injury-program-eaip-and-how-does-it-differ-from-standard-workers-compensation"&gt;Q1. What is the Employer at Injury Program (EAIP), and how does it differ from standard workers&amp;rsquo; compensation?&lt;/h3&gt;
&lt;p&gt;A1: EAIP is an optional component of Oregon&amp;rsquo;s workers&amp;rsquo; compensation system that subsidizes employers for the costs of accommodating workers with temporary disabilities during a transitional return-to-work period. Unlike standard workers&amp;rsquo; compensation premiums (which are experience-rated at the firm level), EAIP is funded through a flat payroll tax on all firms that is not experience-rated — meaning firms that use EAIP do not pay higher premiums. The wage subsidy component accounts for over 96 percent of EAIP expenses; other reimbursable costs (worksite modifications up to $5,000, retraining up to $1,000, clothing up to $400) are rarely used. Eligible employers must be the employer at which the disability occurred, and accommodation is limited to a transitional period during which workers cannot simultaneously receive time-loss benefits.&lt;/p&gt;
&lt;h3 id="q2-how-is-firm-level-exposure-constructed-and-what-is-the-rationale-for-using-it-as-an-instrument"&gt;Q2. How is firm-level &amp;ldquo;exposure&amp;rdquo; constructed, and what is the rationale for using it as an instrument?&lt;/h3&gt;
&lt;p&gt;A2: Exposure is the fraction of a firm&amp;rsquo;s workers&amp;rsquo; compensation claims that used EAIP during a five-year baseline period from 2005 to 2009 — a separate historical period chosen to reduce volatility and avoid mean-reversion. The rationale draws on prior work (Aizawa et al., 2022) showing that firm fixed effects account for nearly 25 percent of variation in accommodation, far more than worker or disability characteristics (1 and 3 percent, respectively), suggesting permanent firm-level heterogeneity in the relative benefits and costs of accommodation. Firms with zero historical exposure are unlikely to change accommodation behavior in response to a subsidy reduction, while high-exposure firms respond more, creating differential quasi-experimental variation in accommodation rates after July 2013.&lt;/p&gt;
&lt;h3 id="q3-what-are-the-first-stage-and-reduced-form-results-from-the-did-specification"&gt;Q3. What are the first-stage and reduced-form results from the DID specification?&lt;/h3&gt;
&lt;p&gt;A3: The first-stage DID coefficient shows that a ten-percentage-point increase in exposure is associated with a one-percentage-point decrease in EAIP take-up after 2013, implying a 2.9 percentage point decrease for claims in firms with average exposure (mean 0.27). The corresponding reduced-form results show a 0.35 percentage point decrease in employment four quarters post-disability and a $45 decrease in quarterly earnings for every ten-percentage-point increase in exposure, scaling to 0.95 percentage points and $120 at average exposure. There is no statistically significant effect on the probability of moving to a new firm. Pre-trend tests show parallel accommodation trends across exposure terciles prior to 2013, supporting the identifying assumption.&lt;/p&gt;
&lt;h3 id="q4-what-do-the-iv-estimates-imply-about-the-causal-effect-of-accommodation-on-labor-market-outcomes"&gt;Q4. What do the IV estimates imply about the causal effect of accommodation on labor market outcomes?&lt;/h3&gt;
&lt;p&gt;A4: Under the exclusion restriction that the subsidy change affects labor market outcomes only through accommodation, the IV estimates imply that receipt of accommodation increases the probability of employment four quarters after disability by &lt;strong&gt;33 percentage points&lt;/strong&gt; (against a mean of 72 percent) and increases quarterly earnings by approximately &lt;strong&gt;$4,100&lt;/strong&gt; (against a mean of $7,807). There is no significant effect on the probability of working at a new firm four quarters later. The authors note these large estimates reflect local average treatment effects for compliers — workers whose accommodation status was changed by the instrument — who disproportionately have high unobserved resistance to treatment and high accommodation returns, explaining the magnitude.&lt;/p&gt;
&lt;h3 id="q5-what-does-the-mte-framework-reveal-about-the-distribution-of-accommodation-effects-and-selection"&gt;Q5. What does the MTE framework reveal about the distribution of accommodation effects and selection?&lt;/h3&gt;
&lt;p&gt;A5: The MTE curves show that workers with the highest unobserved resistance to treatment (least likely to receive accommodation) have the highest potential employment and earnings gains from accommodation. This negative selection on gains arises because these workers tend to have worse employment outcomes in the untreated state, consistent with more severe disabilities commanding higher accommodation costs. IV weights are concentrated at high-resistance values, explaining the large IV estimates. Negative selection on gains is also found along observable dimensions: workers in self-insured firms, healthcare support occupations, women, and those with wounds/cuts/burns show larger gains but lower likelihood of receiving accommodation.&lt;/p&gt;
&lt;h3 id="q6-what-evidence-supports-characterizing-firm-accommodation-as-general-rather-than-firm-specific-human-capital-investment"&gt;Q6. What evidence supports characterizing firm accommodation as general rather than firm-specific human capital investment?&lt;/h3&gt;
&lt;p&gt;A6: Three pieces of evidence point toward general human capital. First, the IV estimate shows accommodation has no statistically significant effect on the probability of working at a new firm four quarters after disability. Second, a triple-interaction specification (DID interacted with new-firm indicator) yields suggestive evidence of even larger earnings gains for workers who move to a new firm post-accommodation, though this is not statistically significant — a pattern inconsistent with firm-specific human capital. Third, the subset of claims that receive non-wage EAIP benefits (worksite modifications, retraining) do show lower mobility, but this comprises fewer than 5 percent of the sample, meaning the predominant form of investment in the context is general in nature.&lt;/p&gt;
&lt;h3 id="q7-what-are-the-two-sources-of-market-inefficiency-in-accommodation-identified-in-the-model"&gt;Q7. What are the two sources of market inefficiency in accommodation identified in the model?&lt;/h3&gt;
&lt;p&gt;A7: The first is a human capital externality operating through worker turnover. Because accommodation builds general human capital that workers carry to new employers, a firm accommodating a worker does not capture the portion of future surplus that accrues to future employers upon separation. In a Nash bargaining framework with lack of commitment, this dynamic inefficiency is larger when industry-wide turnover rates are higher — consistent with the descriptive finding that accommodation rates are strongly negatively associated with industry separation rates. The second is a fiscal externality from imperfect experience rating: firms whose workers&amp;rsquo; compensation premiums are not fully linked to their own claim costs do not fully internalize the cost-savings from accommodation (i.e., reduced time-loss benefit payments), leading them to accommodate at inefficiently low rates.&lt;/p&gt;
&lt;h3 id="q8-how-is-heterogeneity-incorporated-in-the-structural-estimation-and-what-do-the-estimated-parameters-show"&gt;Q8. How is heterogeneity incorporated in the structural estimation, and what do the estimated parameters show?&lt;/h3&gt;
&lt;p&gt;A8: The model incorporates observed heterogeneity (firm insurance status, worker skill type — measured by pre-disability wages — firm baseline exposure, and pre/post policy change) and unobserved heterogeneity mapped to the MTE framework&amp;rsquo;s unobserved resistance to treatment. Indirect inference matches cross-sectional accommodation rates, earnings by subgroup, and the DID coefficients. Key findings: net output during the disability period is negative (accommodation is a costly short-run investment), while post-disability output is higher for accommodated workers. Low-skilled workers experience larger productivity gains from accommodation than high-skilled workers. Accommodation cost shock variance is lower for higher unobserved types, meaning high-gain workers are also more sensitive to subsidy changes, consistent with the large IV estimates. The model fits the DID coefficients for accommodation, employment, and wages well.&lt;/p&gt;
&lt;h3 id="q9-what-do-the-counterfactual-simulations-show-about-the-welfare-effects-of-varying-the-subsidy-rate"&gt;Q9. What do the counterfactual simulations show about the welfare effects of varying the subsidy rate?&lt;/h3&gt;
&lt;p&gt;A9: Eliminating wage subsidies from the current 50% rate reduces the accommodation rate from 33% to 11% and lowers post-disability employment by 7 percentage points and post-disability quarterly wages by 15% ($1,358). From a welfare perspective, eliminating subsidies in a revenue-neutral reform reduces average ex-ante worker welfare and lowers welfare for more than 90% of workers. Conditional on experiencing disability, eliminating subsidies reduces welfare by about 10% while raising the subsidy to 100% increases welfare of disabled workers by around 30%. Firm profit is increasing in the subsidy rate up to about 80%, then decreases. Ex-ante worker welfare gains from the current 50% subsidy relative to no subsidy are modest in consumption-equivalent terms (at most 0.6% increase in consumption), partly because the disability probability is low (2.2%) and because unaccommodated workers still receive two-thirds wage replacement through time-loss benefits.&lt;/p&gt;
&lt;h3 id="q10-what-distributional-implications-do-wage-subsidies-have-across-worker-and-firm-types"&gt;Q10. What distributional implications do wage subsidies have across worker and firm types?&lt;/h3&gt;
&lt;p&gt;A10: Welfare gains from higher wage subsidies are larger for low-skilled workers than high-skilled workers, so the subsidy has a redistributive dimension beyond efficiency correction. Welfare gains are also larger for workers in imperfectly experience-rated firms, where the fiscal externality creates the greater wedge from the efficient level. Self-insured firms, which already internalize workers&amp;rsquo; compensation cost savings and thus accommodate closer to the optimal rate, benefit less from the subsidy and can even be made worse off if subsidies are set very high (since they bear higher flat payroll taxes with smaller marginal accommodation gains). The fraction of worker-firm matches experiencing welfare gains exceeds 90% under the benchmark subsidy level, indicating broad rather than narrowly concentrated gains.&lt;/p&gt;
&lt;h3 id="q11-how-do-the-experience-rating-channel-and-the-worker-turnover-channel-interact-in-comparative-statics"&gt;Q11. How do the experience-rating channel and the worker-turnover channel interact in comparative statics?&lt;/h3&gt;
&lt;p&gt;A11: Model comparative statics show that reducing the job-to-job transition rate of workers with disabilities to one-quarter of its estimated value substantially raises accommodation rates, and this effect is more pronounced for imperfectly experience-rated firms than for self-insured firms. This occurs because self-insured firms already have a strong incentive to accommodate (to reduce workers&amp;rsquo; compensation premiums), so turnover is less marginal for them. Forcing all firms to be self-insured (perfect experience rating) would substantially increase accommodation rates in currently imperfectly rated firms. Lowering the accommodation cost during the disability period (increasing net output during the disability period) also raises accommodation rates for both firm types.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Firm Accommodation (EAIP):&lt;/strong&gt; In this paper&amp;rsquo;s specific sense, accommodation refers to a firm&amp;rsquo;s decision to offer a worker with a temporary workplace disability &amp;ldquo;transitional work&amp;rdquo; — alternative tasks, modified duties, or flexible arrangements — during their recovery period, funded in part through Oregon&amp;rsquo;s Employer at Injury Program wage subsidy. Accommodation is distinct from simple early return to work; it functions as a form of human capital investment by potentially providing skill development opportunities and preventing human capital depreciation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exposure (Instrument):&lt;/strong&gt; A firm-level continuous measure defined as the fraction of a firm&amp;rsquo;s workers&amp;rsquo; compensation claims that used EAIP during a five-year baseline period (2005–2009). Exposure captures permanent, time-invariant firm-level propensity to accommodate, and is used to construct a difference-in-differences instrument for the causal effect of accommodation by interacting exposure with a post-2013 indicator (when the subsidy rate was cut from 50% to 45%).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Imperfect Experience Rating:&lt;/strong&gt; The degree to which a firm&amp;rsquo;s workers&amp;rsquo; compensation insurance premium adjusts to reflect that firm&amp;rsquo;s own claims costs, rather than being set at an industry average. Fully experience-rated (self-insured) firms internalize 100% of claim costs and thus have strong incentives to accommodate. Partially experience-rated firms face a fiscal externality: because their premiums do not fully reflect their own time-loss benefit expenditures, they do not capture all the cost savings from accommodating workers, leading to under-accommodation relative to the social optimum.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Human Capital Externality (Dynamic Inefficiency in Accommodation):&lt;/strong&gt; The mechanism — analogous to Acemoglu and Pischke (1999) and Fang and Gavazza (2011) — by which worker turnover reduces firms&amp;rsquo; incentives to invest in general human capital (here, accommodation). When accommodation raises workers&amp;rsquo; general productivity, part of the future surplus from this investment accrues to future employers upon job-to-job separation. With Nash bargaining and lack of commitment (re-bargaining in the second period), the accommodating firm cannot capture this surplus, creating a dynamic inefficiency that is more severe in high-turnover industries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Negative Selection on Gains:&lt;/strong&gt; The empirical finding, established via the MTE framework, that workers with workplace disabilities who are least likely to receive accommodation (highest unobserved resistance to treatment) have the largest potential employment and earnings gains from accommodation. This pattern arises because workers with more severe disabilities have high accommodation costs (making firms unwilling to accommodate them) but also face far worse counterfactual labor market outcomes without accommodation, creating large potential gains.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Marginal Treatment Effect (MTE):&lt;/strong&gt; Following Heckman and Vytlacil (2005), the treatment effect of accommodation evaluated at a specific quantile of unobserved resistance to treatment — defined here as the propensity score value at which a worker is indifferent between treatment and non-treatment. The MTE curve maps out the full distribution of treatment effects and reveals who benefits (and by how much), how IV estimates are weighted averages over this distribution, and which compliers drive the large IV estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;General vs. Firm-Specific Human Capital (in Accommodation Context):&lt;/strong&gt; Accommodation is characterized as general human capital investment if the productivity and earnings gains it produces are transferable across employers — i.e., if accommodated workers who move to new firms retain their wage gains. It is firm-specific if gains are tied to the current match. In this paper, general human capital is supported by the null effect of accommodation on new-firm employment probability, suggestive evidence of non-lower (possibly larger) earnings gains for new-firm movers, and the observation that fewer than 5% of claims use non-wage EAIP benefits associated with firm-specific investment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Revenue-Neutral Counterfactual:&lt;/strong&gt; A counterfactual policy experiment in which the wage subsidy rate for accommodation is varied while imposing that both the time-loss benefit program and the EAIP wage subsidy program remain budget-balanced. Higher subsidy rates raise firm accommodation, reduce time-loss benefit payouts (lowering base premiums for imperfectly experience-rated firms), but require a higher flat EAIP payroll tax on all firms, some of which is passed through to workers via lower first-period wages.&lt;/p&gt;</description></item><item><title>Firm dynamics and random search over the business cycle</title><link>https://macropaperwarehouse.com/papers/firm-dynamics-and-random-search-over-the-business-cycle/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/firm-dynamics-and-random-search-over-the-business-cycle/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;How do aggregate economic fluctuations reallocate workers across the firm productivity distribution over the business cycle? In particular, to what extent do recessions impede workers&amp;rsquo; movement up the job ladder toward more productive firms?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper develops a tractable random search model combining three features that had not previously been integrated in a single quantitative framework: (i) firm dynamics driven by idiosyncratic productivity shocks, with endogenous entry and exit; (ii) on-the-job search, generating a job ladder in which workers gradually move toward more productive firms; and (iii) aggregate productivity shocks. Multi-worker firms post employment contracts, choose hiring rates, and decide whether to continue or exit. The key tractability result — called &amp;ldquo;size-independence&amp;rdquo; (Result 1) — shows that, under a constant-returns hiring cost technology, firms&amp;rsquo; optimal policies (contract value, hiring rate, exit decision) are all independent of firm size, so the relevant state space reduces from the full joint distribution of firm productivity and size to the employment-weighted distribution of firm productivity alone. A further result (&amp;ldquo;rank-monotonic equilibrium,&amp;rdquo; Result 2) guarantees, under a sufficient convexity condition on hiring costs (hc&amp;rsquo;&amp;rsquo;(h)/c&amp;rsquo;(h) ≥ 1), that the optimal employment contract is increasing in firm productivity, so the job ladder maps one-for-one onto the firm productivity ladder. The optimal wage contract then admits a closed-form solution.&lt;/p&gt;
&lt;p&gt;The model is calibrated to British data for 1997–2018. Worker-level transition rates (unemployment-to-employment, employment-to-unemployment, and job-to-job) are drawn from the British Household Panel Survey (BHPS). Firm-level data on labor productivity (value added per worker) and employment costs per worker come from the Annual Respondents Database (ARD) and Annual Business Survey (ABS), merged with the Business Structure Database (BSD). The numerical solution adapts ideas from Krusell and Smith (1998), approximating the employment-weighted productivity distribution by a small set of moments and parameterizing value functions as polynomials in the aggregate state; standard linearization methods are inapplicable because endogenous firm entry and exit introduces a discontinuity in value functions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Model validation via the OP decomposition.&lt;/em&gt; The paper&amp;rsquo;s central validation exercise uses the Olley-Pakes (OP) decomposition of a labor productivity index constructed from firm-level data. The aggregate employment-weighted labor productivity index is decomposed into (a) the unweighted average firm productivity and (b) an interaction term (the &amp;ldquo;OP term&amp;rdquo;), which captures the covariance between employment shares and productivity — i.e., how well workers are allocated to productive firms. In the British firm-level data, approximately 20 percent of the variance of the aggregate labor productivity index is accounted for by this interaction (OP) term, with the remaining ~80 percent attributable to the unweighted average of firm productivity. The baseline model, with this moment untargeted, successfully replicates this 80/20 split. By contrast, the leading benchmark model of Moscarini and Postel-Vinay (2016) (MPV2016), calibrated to the same British data, attributes nearly all of the variance of labor productivity to the OP/worker reallocation term, grossly overstating the importance of job-ladder dynamics.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Structural decomposition of labor productivity.&lt;/em&gt; Using the calibrated baseline model to decompose the variance of aggregate labor productivity over the post-war British business cycle (&amp;ldquo;GDP shocks&amp;rdquo; going back to 1955), the baseline model attributes approximately 30 percent to the direct effect of the aggregate productivity shock, approximately 50 percent to changes in the distribution of active firms (the &amp;ldquo;firm ladder&amp;rdquo; or firm selection component), and approximately 20 percent to the worker reallocation component (the OP interaction term). This result is robust to an alternative calibration with a lower curvature of the hiring cost function (c1 = 1).&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Persistence and mechanisms.&lt;/em&gt; The impact of recessions on the job ladder is persistent: while the aggregate productivity shock is typically close to its pre-recession value four years after a typical recession onset, the overall allocation of workers to firms remains clearly worse relative to the pre-recession level at that same horizon. The Great Recession, viewed through the lens of the model, is a large but not unusually large recession.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Firm selection with multiple aggregate shocks.&lt;/em&gt; An unexpected finding concerns the direction of firm selection. With a single aggregate productivity shock, the model generates a standard &amp;ldquo;cleansing&amp;rdquo; mechanism: negative shocks raise the firm exit threshold, so surviving firms are on average more productive. However, when additional shocks to the exogenous separation rate (δ) and hiring cost scale (c0) are included — as required to match the volatility of labor market flows — firm selection instead amplifies the decline in labor productivity. The mechanism is a general equilibrium one: a higher separation rate lowers the optimal wage contract (since greater separation risk is passed on to workers), which in turn lowers the entry-exit threshold. Less productive firms become viable because their employees face higher unemployment risk and therefore accept lower wages; moreover, a larger pool of unemployed workers makes it easier for low-productivity firms to recruit.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Wage flexibility tension.&lt;/em&gt; The model implies a pass-through elasticity of wages to productivity shocks of approximately 0.7, well above the 0.05–0.2 range typically found empirically.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;All calibration and quantitative results pertain to Britain for the period 1997–2018 (firm-level data) and 1955–2018 (GDP-based aggregate shocks). The model abstracts from decreasing returns to scale in production and from nominal rigidities. The tractability results rely on specific assumptions about the hiring cost function; the rank-monotonicity condition requires sufficient convexity (hc&amp;rsquo;&amp;rsquo;(h)/c&amp;rsquo;(h) ≥ 1).&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-central-tractability-result-and-why-does-it-matter-for-computational-feasibility"&gt;Q1. What is the central tractability result and why does it matter for computational feasibility?&lt;/h3&gt;
&lt;p&gt;A: Result 1 (&amp;ldquo;size-independence&amp;rdquo;) shows that, because both the production technology and the hiring cost function are constant returns to scale, the firm&amp;rsquo;s present discounted value of profits is linear in employment. As a result, per-worker profits are independent of firm size, and optimal firm policies — the hiring rate, the contract value offered to workers, and the continuation/exit decision — all depend only on the firm&amp;rsquo;s current productivity, not on its size. This collapses the state space from the full joint distribution of firm productivity and employment size to the employment-weighted measure of firm productivity Lt(p), a uni-dimensional object. Without this result, the model would require tracking the entire joint firm distribution, making it computationally intractable.&lt;/p&gt;
&lt;h3 id="q2-what-is-a-rank-monotonic-equilibrium-rme-and-what-conditions-guarantee-it"&gt;Q2. What is a rank-monotonic equilibrium (RME) and what conditions guarantee it?&lt;/h3&gt;
&lt;p&gt;A: An RME is a recursive equilibrium in which the optimal contract offered by a firm is weakly increasing in that firm&amp;rsquo;s current productivity realization, for all aggregate states. Result 2 provides sufficient conditions: (i) the Markov process for firm-specific productivity satisfies first-order stochastic dominance (more productive firms today are more likely to be more productive tomorrow), (ii) the distribution of offered contracts is everywhere differentiable (ruling out mass points), and (iii) the hiring cost function satisfies hc&amp;rsquo;&amp;rsquo;(h)/c&amp;rsquo;(h) ≥ 1 — a sufficient convexity condition. The economic interpretation of the convexity condition is that firms must find retention (offering higher wages) sufficiently costly relative to new hiring that more productive firms optimally choose to use the wage margin to limit quits. The baseline calibration yields c1 ≈ 5.9 (so costs are highly convex in the hiring rate), though results are also reported for the minimum permissible c1 = 1.&lt;/p&gt;
&lt;h3 id="q3-what-does-the-optimal-employment-contract-look-like-in-a-rank-monotonic-equilibrium-and-what-does-it-reveal-about-rent-extraction"&gt;Q3. What does the optimal employment contract look like in a rank-monotonic equilibrium, and what does it reveal about rent extraction?&lt;/h3&gt;
&lt;p&gt;A: In an RME, the optimal contract V(p,ω,L) is a weighted average of the value of unemployment U(ω,L) and the firm-workers&amp;rsquo; joint surplus S(p,ω,L), where the weights are determined endogenously by the employment-weighted measure of firm productivity L. Specifically, the contract integrates the surplus of all firms with productivity below p, weighted by the share of employed workers at those firms, and divided by the mass of job seekers willing to accept the contract. As the employed workers&amp;rsquo; relative search intensity s approaches zero, the contract converges to the value of unemployment — workers receive no rents. The endogenous bargaining weight evolves with the aggregate state over the business cycle, unlike standard Nash bargaining models with a fixed exogenous weight.&lt;/p&gt;
&lt;h3 id="q4-what-firm-level-moments-are-used-to-calibrate-the-steady-state-model-and-what-is-the-logic-behind-the-parameter-moment-mapping"&gt;Q4. What firm-level moments are used to calibrate the steady-state model, and what is the logic behind the parameter-moment mapping?&lt;/h3&gt;
&lt;p&gt;A: Eight moments are targeted. From the BHPS worker data: the average UE rate (0.058) pins down the scale of hiring costs c0; the average EU rate (0.003) pins down the exogenous separation rate δ; and the average EE (job-to-job) rate (0.016) pins down the relative search intensity s. From the firm-level ARD/BSD data: average firm size (12.1 employees) pins down the entry probability µ; the share of job destruction from firm exits (0.526) disciplines the flow value of unemployment b; the autocorrelation of firm employment ln(n) (0.949 annually) disciplines the persistence of idiosyncratic productivity ρp; the interquartile range of firm-level labor productivity (1.129 log points) disciplines the volatility of idiosyncratic shocks σp; and the regression coefficient of firm employment growth on lagged labor productivity (0.136) disciplines the curvature of hiring costs c1. The baseline calibration fits all eight moments closely.&lt;/p&gt;
&lt;h3 id="q5-how-does-the-calibrated-model-match-non-targeted-moments-and-what-does-this-establish"&gt;Q5. How does the calibrated model match non-targeted moments, and what does this establish?&lt;/h3&gt;
&lt;p&gt;A: The model generates several realistic features not targeted in calibration. It produces a realistic Pareto tail for the employment-size distribution (Pareto tail exponent of 1.033 in the model vs. 1.066 in the data), which arises from the combination of size-independent growth rates and firm entry and exit — conditions identified in the literature as generating power law distributions. The model also matches the dispersion of employment costs per worker across firms (capturing about 70 percent of the interquartile range of ECi,t), the slope of a regression of employment costs on labor productivity (model: 0.685 vs. data: 0.704), and the slope of a regression of employment growth on employment costs (model: 0.162 vs. data: 0.131). These non-targeted matches provide independent validation of the model&amp;rsquo;s wage-determination mechanism.&lt;/p&gt;
&lt;h3 id="q6-why-is-a-single-aggregate-productivity-shock-insufficient-to-match-labor-market-fluctuations-and-what-additional-shocks-are-needed"&gt;Q6. Why is a single aggregate productivity shock insufficient to match labor market fluctuations, and what additional shocks are needed?&lt;/h3&gt;
&lt;p&gt;A: With a single aggregate productivity shock calibrated to match the autocorrelation and standard deviation of log GDP, the model generates labor market fluctuations that are roughly an order of magnitude smaller than in the data. For example, the standard deviation of the EU transition rate is 4.1×10⁻⁴ in the single-shock model versus 2.3×10⁻³ in the data. Adding a discount rate shock (ω,r) partially helps but still leaves the job-finding rate (UE) more than 50 percent too smooth. Adding a separation rate shock (ω,δ) substantially increases EU and UE volatility but generates insufficient EE (job-to-job) volatility. The combination (ω,δ,c0) — adding a shock to the scale of hiring costs c0 — brings the standard deviations of EU and UE close to the data (2.0×10⁻³ and 4.0×10⁻⁴ vs. data 2.3×10⁻³ and 2.7×10⁻⁴), though the model still generates slightly under half the observed volatility in EE rates. This combination is the baseline for the quantitative analysis.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-op-decomposition-how-is-it-computed-from-the-firm-level-data-and-what-does-it-measure-in-the-model"&gt;Q7. What is the OP decomposition, how is it computed from the firm-level data, and what does it measure in the model?&lt;/h3&gt;
&lt;p&gt;A: The aggregate labor productivity index LPt is constructed from firm-level data as the employment-share-weighted average of log value added per worker across firms. The OP decomposition writes this as LPt = LPt_bar + OPt, where LPt_bar is the unweighted (simple) average of firm-level productivity and OPt is the covariance between employment shares and labor productivity (the &amp;ldquo;interaction term&amp;rdquo;). In the data, OPt increases when workers are disproportionately employed at above-average-productivity firms. In the model, LPt_bar maps onto the average (log) productivity of active firms — the support of the job ladder — while OPt maps onto the difference between the employment-weighted and the unweighted averages of firm productivity, directly measuring how high up the ladder workers are located relative to the set of active firms. Around 20 percent of the variance of LPt in the British data is accounted for by OPt, and the model replicates this.&lt;/p&gt;
&lt;h3 id="q8-how-does-the-great-recession-appear-in-the-op-decomposition-and-does-the-model-fit-the-decomposition-during-this-episode"&gt;Q8. How does the Great Recession appear in the OP decomposition, and does the model fit the decomposition during this episode?&lt;/h3&gt;
&lt;p&gt;A: During the Great Recession (2008q2–2009q3 in the UK), around 20 percent of the overall fall in the labor productivity index is accounted for by the fall in the OP interaction term, with the remaining 80 percent coming from the fall in the unweighted average firm productivity. The model, even though it does not target this decomposition in calibration, successfully matches both the average firm productivity component and the interaction (OP) component during the Great Recession. This matching holds both in the baseline calibration (c1 ≈ 5.9) and in the alternative calibration with c1 = 1. The model also matches the analogous decomposition for employment costs per worker (ECt), an additional non-targeted validation.&lt;/p&gt;
&lt;h3 id="q9-why-does-firm-selection-amplify-rather-than-cleanse-in-the-baseline-multi-shock-calibration"&gt;Q9. Why does firm selection amplify rather than cleanse in the baseline multi-shock calibration?&lt;/h3&gt;
&lt;p&gt;A: In the single-shock (productivity ω only) model, a negative productivity shock lowers surplus at all firms, raising the exit threshold pE and thus selecting out low-productivity firms — the standard &amp;ldquo;cleansing&amp;rdquo; mechanism. In the multi-shock baseline, the additional separation rate shock (δ) generates a less intuitive mechanism. A higher δ lowers the optimal wage contract (since increased separation risk is passed on to workers: ∂V/∂δ ≤ 0), which reduces the value of continued employment. This lowers the joint firm-worker surplus threshold for exit, making it viable for low-productivity firms to remain active. Moreover, the larger pool of unemployed workers (generated by the δ shock) depresses the outside option of workers and makes it easier for low-productivity firms to recruit. As a result, the entry-exit threshold pE,t falls — the set of active firms becomes less productive on average — producing a negative firm selection contribution to labor productivity and a positive (amplifying rather than cleansing) contribution to the variance of LPt.&lt;/p&gt;
&lt;h3 id="q10-what-is-the-structural-variance-decomposition-of-labor-productivity-in-the-baseline-model"&gt;Q10. What is the structural variance decomposition of labor productivity in the baseline model?&lt;/h3&gt;
&lt;p&gt;A: Simulating the baseline model over the post-war British business cycle (1955–2020, GDP shocks), the variance of aggregate labor productivity LPt decomposes into three structural terms: approximately 30 percent (0.296) from the direct effect of the aggregate productivity shock ln(ωt); approximately 50 percent (0.541) from changes in the average productivity of active firms E[KP bar_t(ln p)] — the &amp;ldquo;firm ladder&amp;rdquo; or firm selection component; and approximately 20 percent (0.163) from the worker reallocation component OPt = E[LP bar_t(ln p)] − E[KP bar_t(ln p)]. This decomposition implies that roughly 70 percent of fluctuations in labor productivity are driven by worker reallocation broadly defined (the firm ladder plus the interaction term), with the firm selection component being the largest single driver. The result is robust to the alternative c1 = 1 calibration (30/49/22 percent split).&lt;/p&gt;
&lt;h3 id="q11-how-does-the-baseline-model-compare-to-mpv2016-in-the-variance-decomposition"&gt;Q11. How does the baseline model compare to MPV2016 in the variance decomposition?&lt;/h3&gt;
&lt;p&gt;A: In the multi-shock calibration (ω,δ,c0), the MPV2016 model calibrated to the same British data attributes approximately 97.7 percent (0.977) of the variance of LPt to the worker reallocation (OP) term, with essentially none attributed to a firm selection term (since there is no firm entry and exit in MPV2016). This is nearly five times the 20 percent share attributed to worker reallocation in the data and in the baseline model. In the single-shock (ω) calibration, both models attribute a more modest share to worker reallocation (7.2 percent for the baseline model, 0.1 percent for MPV2016 with c1=5), and the difference narrows considerably. The contrast thus stems from the interaction of firm dynamics with multiple aggregate shocks: allowing for endogenous firm entry and exit is critical to prevent the model from overstating the role of the job ladder.&lt;/p&gt;
&lt;h3 id="q12-how-persistent-is-the-impact-of-recessions-on-the-job-ladder-based-on-the-model-simulations"&gt;Q12. How persistent is the impact of recessions on the job ladder, based on the model simulations?&lt;/h3&gt;
&lt;p&gt;A: The paper simulates the structural decomposition of labor productivity starting from each of seven post-war British recessions (defined by two consecutive quarters of negative GDP growth). On average across these recessions, the aggregate productivity shock ln(ωt) is close to its pre-recession level by four years after the recession onset. However, the overall employment-weighted average productivity E[LP bar_t(ln p)] — reflecting workers&amp;rsquo; position on the job ladder — remains clearly below its pre-recession value at the four-year horizon, indicating persistent misallocation. The OP interaction term accounts for approximately 20 percent of the total drop in the employment-weighted productivity measure three years after a typical recession onset. Through the model&amp;rsquo;s lens, the Great Recession is a large recession but not an outlier relative to the historical distribution.&lt;/p&gt;
&lt;h3 id="q13-what-does-the-counterfactual-with-countercyclical-unemployment-benefits-reveal-about-the-tradeoff-between-firm-selection-and-worker-reallocation"&gt;Q13. What does the counterfactual with countercyclical unemployment benefits reveal about the tradeoff between firm selection and worker reallocation?&lt;/h3&gt;
&lt;p&gt;A: When the flow value of unemployment is made countercyclical (falling in recessions, rising in expansions — mimicking US unemployment insurance extension programs), the model generates a sign reversal in the firm selection (&amp;ldquo;firm ladder&amp;rdquo;) component. With countercyclical b, the unemployment value rises in recessions, which raises the minimum wage firms must offer and raises the exit threshold pE,t: fewer low-productivity firms survive, improving the composition of active firms. However, countercyclical benefits also amplify the slowdown in job-to-job reallocation: the higher value of unemployment reduces workers&amp;rsquo; willingness to accept job offers, and all firms cut recruitment since optimal wage contracts must rise. The OP interaction term therefore falls more sharply than in the baseline model. The counterfactual with ϵb,ω ∈ {−100, −50} finds that the positive &amp;ldquo;firm ladder&amp;rdquo; effect dominates on net, so the overall allocation of workers to firms improves relative to the baseline after a typical recession under countercyclical unemployment benefits.&lt;/p&gt;
&lt;h3 id="q14-what-is-the-numerical-solution-method-and-why-are-standard-linearization-approaches-inapplicable"&gt;Q14. What is the numerical solution method, and why are standard linearization approaches inapplicable?&lt;/h3&gt;
&lt;p&gt;A: The model is solved in two steps. First, aggregate shocks are shut down and the steady-state rank-monotonic equilibrium is solved numerically by discretizing the firm productivity process (401 grid points via Tauchen&amp;rsquo;s method) and iterating on the value function and the employment-weighted productivity measure until convergence. Second, aggregate shocks are reintroduced using a simulation-based approach adapted from Krusell and Smith (1998): the employment-weighted distribution of productivity is summarized by Nm = 2 moments (plus the unemployment rate), and the value functions are parameterized as polynomials in the aggregate state, with coefficients updated by regression until convergence. Standard linearization methods (Reiter 2009) are inapplicable because the endogenous entry-exit decision creates a kink (discontinuity) in value functions at the productivity threshold pE, making first-order approximations around the steady state inaccurate. Accuracy tests based on den Haan (2010) show that the polynomial approximation generates errors of at most 0.065 percent for value functions and at most 1 percentage point for the unemployment rate across simulation paths.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;1. Rank-Monotonic Equilibrium (RME)&lt;/strong&gt;
A recursive equilibrium in which the optimal state-contingent employment contract V(p,ω,L) offered by a firm is weakly increasing in the firm&amp;rsquo;s current productivity realization p, for all aggregate states (ω,L). This property implies that the job ladder maps one-for-one onto the firm productivity ladder: workers always prefer to work at more productive firms. The paper shows this property holds under a sufficient convexity condition on hiring costs (hc&amp;rsquo;&amp;rsquo;(h)/c&amp;rsquo;(h) ≥ 1) and first-order stochastic dominance of the productivity process.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;2. Size-Independence&lt;/strong&gt;
The property that a firm&amp;rsquo;s optimal policies — the hiring rate h(p), the employment contract V(p), and the entry/exit decision χ(p) — are all independent of the firm&amp;rsquo;s current employment size n. This follows from constant returns to scale in production and hiring, which implies that firm profits are linear in employment. Size-independence reduces the model&amp;rsquo;s relevant state space to the employment-weighted distribution of firm productivity, enabling tractability.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;3. Employment-Weighted Distribution of Firm Productivity (L_t(p))&lt;/strong&gt;
The measure recording, for each productivity level p, the total employment at firms with productivity at most p. This is the sufficient statistic for the state of the job ladder at any point in time: combined with the aggregate shock ω, it determines all equilibrium policy functions and value functions. In the model, it replaces the full joint distribution of firm productivity and employment size that would otherwise be required.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;4. OP Decomposition (Olley-Pakes Decomposition)&lt;/strong&gt;
The decomposition of the aggregate employment-weighted labor productivity index LPt into: (a) the unweighted average firm productivity LPt-bar, which summarizes the productivity of active firms (the support of the job ladder); and (b) an interaction term OPt, the covariance between employment shares and firm-level productivity, which measures how well workers are allocated across the productivity distribution (i.e., how high up the ladder workers sit given the set of active firms). In the model, (a) maps to E[KP bar_t(ln p)] and (b) maps to OPt = E[LP bar_t(ln p)] − E[KP bar_t(ln p)].&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;5. Contract Posting&lt;/strong&gt;
The wage-setting protocol in which each firm commits upon entry to a full state-contingent employment contract — a schedule mapping each future realization of aggregate and idiosyncratic productivity to a wage and continuation decision — and is bound by an equal treatment constraint to offer the same contract to all employees. Workers cannot renegotiate based on outside offers. This protocol produces a well-defined closed-form for the optimal contract in an RME and differs from alternating-offer bargaining (Nash bargaining) in that the bargaining weights are endogenous rather than fixed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;6. Firm-Workers&amp;rsquo; Joint Surplus (S_t(p))&lt;/strong&gt;
The total present discounted value accruing to the firm-worker pair: firm profits per worker plus the contract value promised to workers. Because utility is transferable (risk neutrality) and the firm fully commits to its contract, this surplus depends only on the firm&amp;rsquo;s current productivity and the aggregate state — not on the promised contract value V. The surplus S_t(p) is the key object determining firm entry/exit (the firm continues if and only if S_t(p) ≥ U_t) and optimal hiring (the marginal return to an additional hire equals S_t(p) − V(p)).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;7. Cleansing vs. Anti-Cleansing Firm Selection&lt;/strong&gt;
In models with endogenous firm entry and exit, a negative aggregate shock can either raise or lower the productivity threshold for firm survival. &amp;ldquo;Cleansing&amp;rdquo; refers to the standard mechanism where a negative productivity shock raises the exit threshold, selecting out low-productivity firms and improving the average quality of survivors. &amp;ldquo;Anti-cleansing&amp;rdquo; (as in the baseline multi-shock calibration) occurs when separation rate or hiring cost shocks lower the optimal wage contract and reduce the exit threshold, allowing less productive firms to survive and worsening average firm productivity.&lt;/p&gt;</description></item><item><title>Firm idiosyncratic risk and productivity investment: Macroeconomic implications</title><link>https://macropaperwarehouse.com/papers/firm-idiosyncratic-risk-and-productivity-investment-macroeconomic-implications/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/firm-idiosyncratic-risk-and-productivity-investment-macroeconomic-implications/</guid><description>&lt;p&gt;This paper quantifies how idiosyncratic firm-level risk affects aggregate output, TFP, and firm life-cycle growth in an environment where firm productivity evolves endogenously through risky investment. The paper embeds endogenous productivity investment into a Lucas span-of-control model with risk-averse firm owners and endogenous entry and exit, and studies the effects of mean-preserving increases in the variance of returns to productivity investment. A mean-preserving increase in the variance of firm productivity shocks that raises the firm exit rate by 10% (from 0.10 to 0.11) is estimated to cause a 0.73% decline in output, a 0.38% decline in measured TFP, and a 3.69% decline in firm productivity investment; these elasticities remain approximately constant in the empirically relevant range. The driving force is that risk-averse firm owners reduce their risky productivity investment as variance rises; if capital financing constraints are present—as is common in developing economies—these effects are amplified and the increase in uncertainty may also slow firm life-cycle growth. Previously circulated as &amp;ldquo;Uncertainty, Firm Lifecycle Growth, and Aggregate Productivity.&amp;rdquo;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on a working paper version, AI-assisted and human-reviewed. See the linked published article for the authoritative version.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-distinguishes-this-paper-from-standard-models-of-firm-misallocation"&gt;Q1. What distinguishes this paper from standard models of firm misallocation?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Unlike the bulk of firm misallocation literature (Hsieh-Klenow 2009; Gopinath et al. 2017; Sraer-Thesmar 2023), which takes firm productivity as exogenous, this paper models productivity as an endogenous outcome of risky investment, so that idiosyncratic uncertainty affects allocative efficiency not only through selection effects but also through its discouragement of productivity investment by risk-averse owners.&lt;/strong&gt; The paper incorporates endogenous productivity investment into a standard Lucas span-of-control model, allowing the model to capture how higher uncertainty reduces the incentive to invest in productivity, on top of any selection effects from the exit option.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-two-opposing-effects-of-higher-idiosyncratic-risk"&gt;Q2. What are the two opposing effects of higher idiosyncratic risk?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Higher idiosyncratic firm-level risk has two opposing effects on aggregate productivity: (i) a selection effect—a mean-preserving increase in variance leads to stronger selection and raises the productivity of survivors while reallocating exiters to alternative productive uses—that tends to raise average productivity; and (ii) a productivity investment effect—risk-averse owners reduce risky productivity investment in response to higher variance—that tends to reduce aggregate productivity and firm life-cycle growth.&lt;/strong&gt; The paper shows quantitatively that the productivity investment effect dominates in the baseline calibration, so that higher idiosyncratic risk reduces output and TFP despite positive selection effects.&lt;/p&gt;
&lt;h3 id="q3-what-are-the-main-quantitative-findings"&gt;Q3. What are the main quantitative findings?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A mean-preserving increase in the variance of firm productivity shocks calibrated to raise the firm exit rate by 10% (from 0.10 to 0.11) results in a 0.73% decline in output, a 0.38% decline in measured TFP, and a 3.69% decline in firm productivity investment; these elasticities remain approximately constant in the empirically relevant range.&lt;/strong&gt; The exit-rate increase from 0.10 to 0.11 is also associated with a 7.5% increase in the job destruction rate and a 14.6% increase in the standard deviation of firm growth rates—the latter is less than one-fifth of the increases in these risk measures observed when comparing India or Mexico to the U.S.&lt;/p&gt;
&lt;h3 id="q4-how-do-capital-financing-constraints-interact-with-the-results"&gt;Q4. How do capital financing constraints interact with the results?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;When firms face capital financing constraints—as is common in developing economies—the negative effects of higher idiosyncratic risk are amplified and the increase in uncertainty may also slow firm life-cycle growth.&lt;/strong&gt; The mechanism is that constrained firms must rely more heavily on internal financing, making risk-averse owners even more sensitive to increases in variance. The paper implies that the macro-financial implications of idiosyncratic risk are more severe in developing economies where both idiosyncratic risk levels and financing constraints are greater—consistent with cross-country patterns of firm growth dynamics.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;productivity investment&lt;/strong&gt; : endogenous spending by firms on activities that shift their productivity process; in the model, this investment exposes firm owners to idiosyncratic risk via the innovation in the productivity process; the key margin through which higher uncertainty reduces aggregate productivity and output.
&lt;strong&gt;mean-preserving increase in variance&lt;/strong&gt; : a statistical experiment that increases the spread of the distribution of returns to productivity investment while leaving the mean unchanged; used here to isolate the pure risk effect on firm behavior and aggregate outcomes from any change in expected returns.
&lt;strong&gt;span-of-control model&lt;/strong&gt; : the Lucas (1978) model of firm size distribution with decreasing returns to scale in the entrepreneurial input; used as the production environment; extended here by adding endogenous productivity investment and endogenous entry and exit.&lt;/p&gt;</description></item><item><title>Firm Responses and Wage Effects of Foreign Demand Shocks with Fixed Labor Costs and Monopsony</title><link>https://macropaperwarehouse.com/papers/firm-responses-and-wage-effects-of-foreign-demand-shocks-with-fixed-labor-costs-and-monopsony/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/firm-responses-and-wage-effects-of-foreign-demand-shocks-with-fixed-labor-costs-and-monopsony/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; The paper asks three related questions in the context of Belgium, a small open economy: (1) What do firms&amp;rsquo; responses to demand shocks reveal about their cost structures? (2) What are the worker and wage impacts of foreign demand shocks? (3) How sensitive are the aggregate wage effects of foreign demand shifts to firms&amp;rsquo; cost structures and imperfect competition in the labor market?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The analysis combines administrative micro-data from Belgium for 2002–2014, provided by the National Bank of Belgium. The linked dataset covers 995,739 firm-year observations from private, non-financial firms with at least one FTE employee, and integrates: (a) a Business-to-Business (B2B) VAT transactions registry capturing all annual domestic firm-to-firm sales above €250; (b) customs records and intra-EU declarations for imports and exports at the 8-digit product level; (c) annual accounts containing data on sales, labor costs, intermediate inputs, capital, and firm characteristics; and (d) employer-employee matched data from the Belgian social security administration (BCSS) for a random sample of 500,000 workers in firms with 10 or more FTE employees, covering 2003–2014.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Identification Strategy.&lt;/strong&gt; To isolate variation in firms&amp;rsquo; sales driven by foreign demand rather than supply-side factors, the authors construct a firm-specific foreign demand instrument following Hummels et al. (2014) and Dhyne et al. (2021). The instrument is the weighted average of changes in world import demand facing a firm, using lagged export shares as weights and excluding Belgian imports from the world import measure. Crucially, the instrument captures both direct foreign demand exposure (for exporters) and indirect exposure through the domestic production network — including the foreign demand shocks passing through to upstream domestic suppliers via buyer-supplier links. Firm and industry-year fixed effects control for time-invariant heterogeneity and industry-level trends.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key Empirical Facts.&lt;/strong&gt; Within-firm analysis over four-year windows finds that intermediate input purchases respond nearly proportionally to changes in sales (slope coefficient 0.82), while labor costs respond less than proportionally (slope coefficient 0.57). The less-than-proportional response of labor costs — with the employment slope of 0.48 and the average wage slope of 0.09 — is consistent with sizable fixed overhead costs in labor inputs and upward-sloping labor supply curves. Output prices co-move more with input prices than with average wages, consistent with labor constituting a smaller share of variable costs than intermediate inputs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;IV Estimates of Firm Responses.&lt;/strong&gt; In response to a foreign demand shock inducing a 10 percent instantaneous increase in a firm&amp;rsquo;s sales, the firm&amp;rsquo;s cumulative sales over four years increase by approximately 7.6 percent (balanced panel). Over the same four-year horizon, total input purchases increase by about 7.0–7.8 percent, while labor costs increase by only 3.5–4.1 percent — a substantially less-than-proportional response. Roughly one-quarter of the labor cost change comes from changes in average wages rather than employment changes. Domestic input purchases increase by 5.3–6.0 percent, indicating that firms pass on a large share of foreign demand shocks to their domestic suppliers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Structural Parameters.&lt;/strong&gt; The implied IV estimate of the labor cost elasticity with respect to sales is 0.53 (standard error 0.08), statistically significantly below one. The implied elasticity of total input purchases is 1.05 (standard error 0.15), close to one, so the fixed share of intermediate inputs is approximately zero. The labor supply elasticity estimated from the ratio of wage and employment responses is approximately 3.9 in the full sample and 2.3 in the stayer subsample; the implied wage markdown is 21 percent and 30 percent respectively. Incorporating upward-sloping labor supply into equation (15), the estimated share of total labor inputs that is fixed overhead is approximately 53 percent. By comparison, the fixed share of total costs (labor and intermediate inputs combined) is approximately 29 percent in Belgium — higher than the 18–22 percent found in U.S. data (De Loecker et al. 2020) and the 20 percent found in U.S. manufacturing plants (Ederhof et al. 2021).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;General Equilibrium Counterfactuals.&lt;/strong&gt; The authors parameterize and solve a small open economy general equilibrium model with monopsonistic competition in labor markets, monopolistic competition in product markets, and fixed and variable labor and intermediate input costs. Using the Dekle-Eaton-Kortum (2007) &amp;ldquo;hat algebra&amp;rdquo; technique, they simulate a 5 percent increase in foreign tariffs on all Belgian exports and compare four counterfactual economies: (1) baseline Belgium with fixed costs and imperfect labor market competition (ε = 3.9); (2) fixed costs and perfectly elastic labor supply (ε = ∞); (3) no fixed costs with imperfect competition; (4) no fixed costs and perfectly competitive labor markets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings on Wages.&lt;/strong&gt; In the baseline Belgian economy, a 5 percent increase in foreign tariffs produces a 4.9 percent fall in the average real wage. With fixed costs but perfectly elastic labor supply, the real wage falls by 4.8 percent — nearly identical. With upward-sloping labor supply but no fixed costs, the real wage falls by only 3.0 percent; without fixed costs and with perfectly competitive labor supply, the fall is only 2.8 percent. The paper concludes that fixed overhead costs in labor substantially amplify real wage declines, while incorporating upward-sloping labor supply appears quantitatively less consequential for aggregate wage outcomes. Standard models that assume no fixed costs and perfectly elastic labor supply — the typical modeling choice in the trade literature — may substantially understate (by roughly 43–75 percent of the true effect) the aggregate wage decline from a negative foreign demand shock.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanism.&lt;/strong&gt; Fixed overhead costs reduce labor&amp;rsquo;s share of variable costs. When labor is a smaller share of variable costs, output prices are less sensitive to changes in wages. With a fixed aggregate labor supply, the economy must lower prices through wage reductions to restore equilibrium after a negative demand shock; the required wage decline is larger when fixed labor costs are taken into account. The findings are robust to adjustment cost specifications, a nested logit extension of the labor market model, and controlling for location-year fixed effects and import price changes.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-two-motivating-empirical-facts-about-belgian-firms-does-the-paper-establish"&gt;Q1. What two motivating empirical facts about Belgian firms does the paper establish?&lt;/h3&gt;
&lt;p&gt;A1: First, within-firm four-year changes show that intermediate input purchases respond nearly proportionally to changes in sales (slope coefficient 0.82), while labor costs respond less than proportionally (slope coefficient 0.57). The labor cost response decomposes into an employment slope of 0.48 and a wage slope of 0.09. Second, output prices co-move more strongly with input (intermediate goods) prices than with average wages, consistent with labor constituting a smaller share of variable costs than intermediate inputs.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-instrument-for-foreign-demand-shocks-capture-indirect-exposure-through-production-networks"&gt;Q2. How does the instrument for foreign demand shocks capture indirect exposure through production networks?&lt;/h3&gt;
&lt;p&gt;A2: The instrument for firm k is a weighted average of changes in world import demand, where the weights reflect both the firm&amp;rsquo;s own direct export shares across countries and products and the firm&amp;rsquo;s indirect export exposure through its domestic buyers&amp;rsquo; export shares. The term H̃_{kn,t-1} captures the share of firm k&amp;rsquo;s total sales purchased by firm n directly and indirectly through all upstream chains. This means even non-exporting firms receive a non-zero instrument through their sales to directly-exporting firms. In fact, non-directly-exporting firms sell on average nearly 10 percent of their output indirectly to foreign markets.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-estimated-magnitude-of-the-labor-supply-elasticity-facing-belgian-firms-and-what-does-it-imply-for-wage-markdowns"&gt;Q3. What is the estimated magnitude of the labor supply elasticity facing Belgian firms, and what does it imply for wage markdowns?&lt;/h3&gt;
&lt;p&gt;A3: In the full main estimation sample (balanced panel), the IV estimate of the firm-specific labor supply elasticity is approximately 3.9, implying a wage markdown of about 21 percent relative to the marginal revenue product of labor. In the stayer subsample (incumbent workers only, holding workforce composition fixed), the estimated labor supply elasticity is approximately 2.3, implying a markdown of about 30 percent. The paper can reject perfect competition (infinite elasticity, zero markdown) at a significance level of 0.06 in the full sample and 0.001 in the stayer sample using the closure method.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-estimated-labor-cost-elasticity-with-respect-to-demand-driven-sales-changes-and-what-does-it-imply-about-fixed-labor-costs"&gt;Q4. What is the estimated labor cost elasticity with respect to demand-driven sales changes, and what does it imply about fixed labor costs?&lt;/h3&gt;
&lt;p&gt;A4: The IV estimate of the labor cost elasticity with respect to sales is 0.528 (standard error 0.085), statistically significantly below one. If labor supply were perfectly elastic, this would directly imply a fixed labor cost share of approximately 47 percent. Incorporating the estimated upward-sloping labor supply curve through equation (15), the model implies that approximately 53 percent of total labor inputs are fixed overhead. For context, occupational data from Belgium&amp;rsquo;s 2014 Structure of Earnings Survey shows that clerical support workers and managers together account for 21 percent of total earnings, and adding professionals raises this to 51 percent — broadly consistent with the estimated fixed share.&lt;/p&gt;
&lt;h3 id="q5-what-does-the-estimated-elasticity-of-input-purchases-with-respect-to-sales-imply-about-fixed-intermediate-input-costs"&gt;Q5. What does the estimated elasticity of input purchases with respect to sales imply about fixed intermediate input costs?&lt;/h3&gt;
&lt;p&gt;A5: The IV estimate of the elasticity of total input purchases with respect to sales is 1.050 (standard error 0.150), close to one. The implied fixed share of total intermediate inputs is therefore approximately zero. However, there is substantial heterogeneity by input type: purchases from the manufacturing sector (roughly half of all input purchases) have an elasticity close to one, whereas service-sector inputs (roughly 30 percent of total input purchases) have an implied fixed cost share of approximately 36 percent, with a size-weighted average cumulative response of 4.3 percent against a total cumulative sales increase of 6.7 percent.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-paper-rule-out-alternative-explanations-for-the-less-than-proportional-response-of-labor-costs"&gt;Q6. How does the paper rule out alternative explanations for the less-than-proportional response of labor costs?&lt;/h3&gt;
&lt;p&gt;A6: The paper considers three main alternatives. First, adjustment costs: even in the presence of labor adjustment costs, under a homothetic constant-returns production function a permanent shock should eventually produce a proportional labor response. The paper focuses on four-year cumulative responses where firm responses change little after the first couple of years, and shows identification of fixed costs holds even in models with quadratic or Calvo-style adjustment costs. Second, a non-homothetic CES production function without fixed costs: Appendix B.3 shows that such a specification predicts that if the labor cost elasticity is below one, the input purchase elasticity must be above one — at odds with the data, which shows the input purchase elasticity is close to one while the labor cost elasticity is well below one. Third, variable markups: a uniform markup change would reduce both elasticities proportionally, not create the large gap between labor cost and input purchase elasticities observed.&lt;/p&gt;
&lt;h3 id="q7-why-are-firms-domestic-suppliers-affected-by-foreign-demand-shocks-and-how-large-are-the-pass-through-effects"&gt;Q7. Why are firms&amp;rsquo; domestic suppliers affected by foreign demand shocks, and how large are the pass-through effects?&lt;/h3&gt;
&lt;p&gt;A7: Firms pass on foreign demand shocks to their domestic suppliers through buyer-supplier production network links. When a foreign demand shock increases a firm&amp;rsquo;s sales by 10 percent instantaneously, its domestic input purchases increase cumulatively by approximately 5.3–6.0 percent over four years. Total input purchases increase by 7.0–7.8 percent over the same period; the difference between total and domestic input purchases reflects service inputs (which have smaller responses) and the composition of imported versus domestic inputs.&lt;/p&gt;
&lt;h3 id="q8-what-is-the-aggregate-real-wage-effect-of-a-5-percent-increase-in-foreign-tariffs-on-belgian-exports-in-the-baseline-model"&gt;Q8. What is the aggregate real wage effect of a 5 percent increase in foreign tariffs on Belgian exports in the baseline model?&lt;/h3&gt;
&lt;p&gt;A8: In the baseline counterfactual representing the actual Belgian economy (with fixed overhead costs and labor supply elasticity ε = 3.9), a uniform 5 percent increase in foreign tariffs on all Belgian exports produces a 4.9 percent fall in the average real wage. The median firm reduces output by 3.8 percent, marginal costs by 4.8 percent, and wages by 7.9 percent. The fall in wages is driven by a general equilibrium mechanism: since the foreign price is exogenous and trade balance must hold, wages are the key adjusting margin.&lt;/p&gt;
&lt;h3 id="q9-how-much-does-the-modeling-of-fixed-overhead-costs-versus-imperfect-labor-market-competition-matter-for-the-aggregate-wage-counterfactual"&gt;Q9. How much does the modeling of fixed overhead costs versus imperfect labor market competition matter for the aggregate wage counterfactual?&lt;/h3&gt;
&lt;p&gt;A9: Fixed overhead costs account for nearly all of the amplification relative to the standard model. With fixed costs but perfectly elastic labor supply, the real wage falls 4.8 percent — almost identical to the 4.9 percent in the baseline. Without fixed costs but with the estimated upward-sloping labor supply, the fall is only 3.0 percent. Without either, the fall is 2.8 percent. Thus, incorporating fixed overhead costs in labor raises the estimated wage decline by approximately 1.9 percentage points, while incorporating imperfect labor market competition adds only about 0.1 percentage points. The paper concludes that fixed overhead costs, not monopsony, are the essential feature for accurately predicting tariff impacts on wages.&lt;/p&gt;
&lt;h3 id="q10-what-is-the-mechanism-by-which-fixed-overhead-costs-amplify-the-aggregate-wage-decline-from-a-negative-demand-shock"&gt;Q10. What is the mechanism by which fixed overhead costs amplify the aggregate wage decline from a negative demand shock?&lt;/h3&gt;
&lt;p&gt;A10: Fixed overhead costs reduce the share of labor in firms&amp;rsquo; total variable costs. When labor constitutes a smaller fraction of variable costs, output prices are less sensitive to changes in wages. With aggregate labor supply fixed, the economy restores equilibrium after a negative demand shock by reducing prices through wage cuts. To achieve the same magnitude of price reduction when labor is a smaller fraction of variable costs, wages must fall by a larger amount — amplifying the aggregate wage impact. Fixed overhead costs in labor also make foreign inputs relatively more important in variable costs, as shown empirically in Appendix D.1.&lt;/p&gt;
&lt;h3 id="q11-is-the-conclusion-about-the-relative-importance-of-fixed-costs-versus-labor-market-imperfections-robust-to-alternative-specifications-of-the-labor-market"&gt;Q11. Is the conclusion about the relative importance of fixed costs versus labor market imperfections robust to alternative specifications of the labor market?&lt;/h3&gt;
&lt;p&gt;A11: Yes. The paper extends the model to a nested logit structure for worker preferences (following Lamadon et al. 2022), which allows Belgium to contain multiple labor markets (defined as industry-region nests), permits heterogeneous markdowns across markets, and is still identified from the data. Empirically, incorporating multiple labor markets and heterogeneous markdowns does not quantitatively alter the aggregate counterfactual predictions for the wage effects of foreign demand shocks.&lt;/p&gt;
&lt;h3 id="q12-are-heterogeneous-responses-to-the-foreign-demand-shock-observed-across-exporters-importers-and-domestic-only-firms"&gt;Q12. Are heterogeneous responses to the foreign demand shock observed across exporters, importers, and domestic-only firms?&lt;/h3&gt;
&lt;p&gt;A12: The paper finds no systematic differences in the elasticities of labor cost and input purchases between firms that trade internationally and those that do not. This implies that exporters and importers have higher absolute fixed costs (consistent with fixed export and import costs) but comparable fixed cost shares — since these firms tend to be larger and thus spread higher absolute fixed costs over larger output volumes.&lt;/p&gt;
&lt;h3 id="q13-do-the-findings-about-fixed-overhead-costs-extend-beyond-foreign-demand-shocks"&gt;Q13. Do the findings about fixed overhead costs extend beyond foreign demand shocks?&lt;/h3&gt;
&lt;p&gt;A13: Yes. The paper shows in Appendix D.4 that a uniform 5 percent reduction in the productivity of all Belgian manufacturing firms generates qualitatively and quantitatively similar conclusions: fixed overhead costs amplify the predicted wage effects of domestic productivity shocks, while imperfect competition in the labor market matters to a lesser but still meaningful extent.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Fixed Overhead Costs (Fixed Labor Costs / Fixed Intermediate Input Costs):&lt;/strong&gt; In the paper&amp;rsquo;s model, each firm has firm-specific fixed overhead input requirements for labor (denoted ℓ̄_k^f) and intermediate inputs (denoted q̄_k^f) that must be satisfied regardless of the firm&amp;rsquo;s output level. These fixed requirements are separate from the variable inputs used in production. Fixed labor costs may reflect administration, worker management, facility maintenance, and other tasks that do not directly translate into output. Fixed intermediate input costs include waste management, accounting services, and electricity payments that occur irrespective of sales. The share of total labor inputs that is fixed is identified by how much less than proportionally labor costs respond to demand-driven changes in sales.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Monopsonistic Competition in the Labor Market:&lt;/strong&gt; The paper models each firm as facing an upward-sloping firm-specific labor supply curve arising from workers&amp;rsquo; heterogeneous idiosyncratic preferences over non-wage firm attributes (amenities). Because workers&amp;rsquo; idiosyncratic tastes are private information, firms cannot price-discriminate and thus face an increasing marginal cost of labor. Each firm is infinitesimal within the aggregate labor market but has wage-setting power at the firm level. This gives rise to a constant-elasticity firm-level labor supply curve ℓ_k = A_k w_k^ε, where ε is the labor supply elasticity facing the firm.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage Markdown:&lt;/strong&gt; The firm&amp;rsquo;s equilibrium wage is marked down relative to the marginal revenue product of labor by the factor ε/(1+ε), which is less than one when ε is finite. With a labor supply elasticity of 3.9, the implied markdown is approximately 21 percent; with a supply elasticity of 2.3 (stayer sample), the markdown is approximately 30 percent. Perfect competition corresponds to ε = ∞ and a markdown of zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Labor Cost Elasticity:&lt;/strong&gt; The elasticity of a firm&amp;rsquo;s total labor cost with respect to a demand-driven change in the firm&amp;rsquo;s sales, as derived from the model&amp;rsquo;s comparative statics (equation 15). This elasticity depends on both the variable share of labor inputs (ℓ_k^v / ℓ_k) and the labor supply elasticity ε. It lies strictly between zero (all labor fixed) and one (all labor variable), and is declining in ε for a given variable share. The paper estimates this elasticity at 0.528 via IV, implying substantial fixed overhead in labor.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Total Foreign Demand Shock:&lt;/strong&gt; The firm-level measure of foreign demand used as an instrument, defined as the weighted average of changes in world import demand (excluding Belgium) across country-product pairs, where the weights reflect both the firm&amp;rsquo;s own lagged direct export shares and its indirect exposure through the domestic production network (via the Leontief inverse matrix H̃). This measure captures both direct exporter exposure and indirect upstream exposure for non-exporting firms that supply to exporters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Indirect Export Exposure:&lt;/strong&gt; The share of a firm&amp;rsquo;s output that reaches foreign markets indirectly through sales to domestic buyers who subsequently export. Defined recursively: the total export share of firm k equals its direct export revenue share plus the sum over all domestic buyers of the product of k&amp;rsquo;s revenue share from that buyer and the buyer&amp;rsquo;s own total export share. Even non-direct-exporting firms sell on average approximately 10 percent of their output indirectly to foreign markets in the Belgian data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dekle-Eaton-Kortum Hat Algebra:&lt;/strong&gt; A technique for solving general equilibrium counterfactuals in trade models by expressing all outcomes as proportional changes (&amp;ldquo;hats&amp;rdquo;) relative to the observed equilibrium, without needing to recover the underlying structural parameters. The paper uses this approach to compute counterfactual wages under alternative tariff scenarios, holding fixed the observed firm-level expenditure shares from the reference year (2012) while allowing parameters such as productivity and technology weights to vary across counterfactual economies to rationalize identical observed firm-level observables.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Worker Rents:&lt;/strong&gt; In the monopsony model, inframarginal workers earn rents defined as the excess return over what would be required to make them indifferent between employers. These rents arise because firms cannot price-discriminate across workers with heterogeneous amenity valuations. The additional rents accruing to workers from a demand-driven increase in firm sales decompose into: (1) wage increases for incumbent workers multiplied by current employment, (2) rents for new hires (the excess of their wage bill over the amount required to induce them to switch to the expanding firm), and (3) a correction term related to the fraction of the labor cost increase borne by expanding employment rather than wages.&lt;/p&gt;</description></item><item><title>FraNK: Fragmentation in the NK Model</title><link>https://macropaperwarehouse.com/papers/frank-fragmentation-in-the-nk-model/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/frank-fragmentation-in-the-nk-model/</guid><description>&lt;p&gt;Moro and Nispi Landi develop FraNK, a multi-country New Keynesian model designed to study geoeconomic fragmentation — defined, following Aiyar et al. (2023), as a policy-driven reversal of economic integration guided by strategic considerations. The model extends Gali and Monacelli (2005) along three dimensions: it is multi-country rather than small-open-economy; it assumes incomplete international financial markets, relaxing perfect risk sharing; and it incorporates commodities as intermediate inputs in production, capturing both domestic and imported commodity sourcing. A fragmentation shock is modeled as a simultaneous increase in three tax rates imposed on rival countries: a tax on imports of final goods, a tax on imports of commodities, and a tax on the purchase of foreign bonds (capital controls).&lt;/p&gt;
&lt;p&gt;The paper proceeds in two stages. First, under a symmetric two-bloc calibration, closed-form analytical results establish the distinct macroeconomic channels of each tax. The good import tax operates through both demand (households reduce consumption of foreign goods) and supply (firms face higher real marginal costs), with the demand channel dominating: output falls unambiguously and PPI inflation decreases, though CPI inflation rises on impact due to the direct pass-through of import prices. The commodity import tax operates exclusively through supply — raising intermediate input costs — so both output and PPI inflation move in the same direction: output falls and PPI inflation rises. The bond tax is neutral under symmetric calibration: because each country&amp;rsquo;s net foreign asset position is unchanged (each country reduces its holdings of rival-bloc bonds by exactly as much as it reduces its own issuance), output and inflation are unaffected.&lt;/p&gt;
&lt;p&gt;Second, the model is calibrated to four asymmetric regions: the United States (US), US-allied countries including the European Union (WE), the China-Russia-aligned bloc (CR), and a neutral rest of the world (NE). Bloc assignment follows Den Besten et al. (2023), using a political alignment index combining sanctions data, military imports, Belt and Road Initiative participation, and UNGA voting on Russia&amp;rsquo;s invasion of Ukraine. The US and WE impose all three taxes on CR, and vice versa; NE neither imposes nor receives taxes.&lt;/p&gt;
&lt;p&gt;Five main findings emerge from the asymmetric simulation. First, fragmentation predominantly affects CR and WE: both experience substantial declines in consumption and production across all three tax scenarios, with CR most affected when goods or asset taxes are applied. Second, the US is largely insulated: its lower trade and financial exposure to the rival bloc relative to WE limits the pass-through of fragmentation. Third, spillovers to neutral NE are nearly negligible: the expenditure-switching channel (which raises demand for untaxed NE goods) and the global income channel (which reduces demand for all goods as the world becomes poorer) roughly cancel each other out. Fourth, fragmentation is not necessarily inflationary: whether PPI inflation rises or falls depends on the relative weight of commodities in production and the mix of taxes applied — a goods tax lowers PPI inflation, while a commodity tax raises it. Fifth, the bilateral exchange rates most affected are those of the CR bloc, which appreciate under goods and asset taxes and depreciate under commodity taxes.&lt;/p&gt;
&lt;p&gt;Sensitivity analyses confirm robustness across higher elasticity of substitution between domestic and foreign goods (eta raised from 1.5 to 5), lower elasticity of substitution between labor and commodities (xi lowered from 0.4 to 0.1), tighter financial market integration (bond transaction costs multiplied by 5), and permanent shocks (persistence rho raised to 1). Under permanent shocks, the goods-tax effect on PPI inflation approaches zero — consistent with the closed-form result — while commodity-tax effects on production become larger and more persistent.&lt;/p&gt;
&lt;p&gt;Q: What is the core research question of FraNK?
A: The paper asks how geoeconomic fragmentation — modeled as policy-driven increases in taxes on rival countries&amp;rsquo; goods, commodities, and bonds — affects output, inflation, exchange rates, and capital flows at both the global and country level. It also asks whether different sources of fragmentation (real versus financial) have distinct macroeconomic implications, and whether neutral countries experience meaningful spillovers.&lt;/p&gt;
&lt;p&gt;Q: How does the model depart from the Gali-Monacelli (2005) benchmark?
A: Three departures are made. The model is multi-country (N countries) rather than a single small open economy facing the rest of the world. Financial markets are incomplete, so international risk sharing is imperfect — a realistic assumption in a fragmented world. And intermediate-good production uses a CES bundle of labor and a commodity bundle that includes both domestic and imported commodities, which is essential for capturing commodity market disruptions such as those following Russia&amp;rsquo;s invasion of Ukraine.&lt;/p&gt;
&lt;p&gt;Q: What are the three tax instruments and what does each represent?
A: The goods import tax (tau_ijt) is a tariff on final goods imports, representing trade barriers. The commodity import tax (tau_O_ijt) is a tariff on imported commodity inputs, representing sanctions or restrictions on energy and raw material trade. The bond tax (theta_ijt) is a capital control discouraging purchases of bonds issued by rival countries, representing financial fragmentation or sanctions on financial assets.&lt;/p&gt;
&lt;p&gt;Q: What does the closed-form symmetric-calibration result establish about output?
A: Under the symmetric calibration, both the goods import tax and the commodity import tax reduce output unambiguously (Proposition 3.3). The bond tax is neutral for output under symmetry because each country&amp;rsquo;s net foreign asset position is unchanged — any reduction in holdings of rival-bloc bonds is exactly matched by a reduction in own-bond issuance, leaving net positions and aggregate demand unaffected (Proposition 3.4).&lt;/p&gt;
&lt;p&gt;Q: Why does the goods import tax reduce PPI inflation while the commodity import tax raises it?
A: The goods import tax operates through two opposing channels: a demand channel (households substitute away from foreign goods, reducing aggregate demand) and a supply channel (import taxes raise firms&amp;rsquo; real marginal costs). The closed-form solution establishes that the demand channel dominates, so PPI inflation falls. The commodity import tax operates only through the supply channel — raising the cost of intermediate inputs directly — so PPI inflation rises unambiguously. CPI inflation rises on impact under the goods tax because import prices are directly included in the CPI even as PPI falls.&lt;/p&gt;
&lt;p&gt;Q: Under what condition does simultaneous fragmentation (goods and commodity taxes together) produce PPI inflation?
A: When both taxes are imposed simultaneously, the net effect on PPI inflation is ambiguous. The paper shows analytically that PPI inflation rises if and only if omega * gamma_O_tilde &amp;gt; gamma_tilde * (phi/sigma), where omega is the commodity weight in production, gamma_O_tilde captures commodity import weights, and gamma_tilde captures goods import weights. That is, fragmentation tends to be stagflationary the larger the weight of commodities in the production function, consistent with the empirical finding in Caldara et al. (2024) of stagflationary effects from elevated geopolitical risk.&lt;/p&gt;
&lt;p&gt;Q: Why is the US more shielded from fragmentation than its WE allies?
A: The US has relatively lower trade and financial exposure to the CR bloc compared to WE. Because the trade and financial weights calibrated from UN Comtrade, IMF CPIS, BIS LBS, and IMF CDIS data place WE in closer economic relationships with CR countries, a tax on CR imports or assets falls more heavily on WE than on the US. This asymmetry is a direct consequence of the calibration: no structural or strategic advantage of the US is assumed beyond its actual pattern of trade and financial linkages.&lt;/p&gt;
&lt;p&gt;Q: What happens to the CR bloc&amp;rsquo;s exchange rate under each tax scenario?
A: Under the goods import tax, the CR exchange rate appreciates: CR&amp;rsquo;s own tax reduces demand for US/WE goods, increasing domestic demand relative to the rest of the world, and the reduced demand for CR bonds from abroad raises CR interest rates, further attracting capital. Under the commodity import tax, the CR exchange rate depreciates: lower commodity demand reduces CR commodity prices and production, shifting labor toward goods, increasing goods supply, and lowering the CR price level relative to trading partners. Under the bond tax, the CR exchange rate also appreciates, as reduced CR demand for US/WE bonds is interpreted by markets as a shift in capital flows favoring CR assets.&lt;/p&gt;
&lt;p&gt;Q: What explains the near-zero spillovers to neutral countries?
A: Two forces operate on NE in opposite directions. The expenditure-switching channel raises demand for NE goods and commodities, as taxing countries divert purchases away from taxed rival goods toward untaxed NE products — a positive demand shock for NE. The global income channel reduces demand for all goods, including NE&amp;rsquo;s, as the taxing and taxed regions become poorer and reduce imports from everywhere. In the calibration these two forces approximately cancel, leaving NE macroeconomic variables nearly unchanged.&lt;/p&gt;
&lt;p&gt;Q: How is the commodity sector modeled, and why does this matter for the commodity tax result?
A: Each country has a representative commodity firm using a linear production function (Y_iOt = A_iO * H_iOt), where A_iO is interpretable as a per-capita endowment of natural resources. Intermediate-good firms use a CES bundle of labor and commodities (domestic and imported) with elasticity xi=0.4 between the two. When the commodity import tax is imposed, firms face higher commodity input costs, raising real marginal costs and PPI inflation while depressing production. The asymmetry between commodity exporters (CR, NE) and importers (WE) under this tax is the main source of differential regional effects.&lt;/p&gt;
&lt;p&gt;Q: How are financial openness differences across country pairs captured, and what effect do they have?
A: Bond transaction costs psi_ijF differ across pairs: psi_12F = psi_21F = 0.01 for the US-WE pair (reflecting high financial integration), while all other pairs have psi_ijF = 1 — one hundred times higher — reflecting limited cross-bloc financial integration. The sensitivity analysis multiplies all psi_ijF by 5 (less open financial markets) and finds that bond position volatility falls but qualitative results are unchanged, confirming that the financial openness calibration does not drive the main results.&lt;/p&gt;
&lt;p&gt;Q: What are the main caveats acknowledged by the authors?
A: The model omits capital accumulation, so investment dynamics are absent. Cross-country production networks (global value chains) are not modeled, which the authors acknowledge limits the richness of the production structure relative to Baqaee-Farhi (2024) style models. Domestic financial markets are assumed frictionless. The model has no role for dollar dominance in the global economy, which may matter for exchange rate and capital flow dynamics in reality. These are flagged as directions for future research.&lt;/p&gt;
&lt;p&gt;Q: What is the key result for permanent (rho=1) versus temporary (rho=0.9) fragmentation shocks?
A: Under permanent shocks, output reductions become permanent rather than transitory. For the goods import tax, the effect on PPI inflation approaches zero in the permanent case, consistent with the closed-form prediction that the demand channel effect on PPI vanishes when the tax persists indefinitely (households no longer have an intertemporal substitution motive). The commodity tax permanent shock induces a larger and more persistent fall (rise) in production for commodity importers (exporters). Bond tax permanent shock has larger magnitude effects but is otherwise qualitatively similar to the temporary case.&lt;/p&gt;
&lt;p&gt;Q: How does FraNK relate to the existing DSGE literature on sanctions and trade wars?
A: The paper positions FraNK as providing a unified framework covering all three forms of fragmentation (goods, commodity, and financial) simultaneously, with nominal rigidities allowing for inflation analysis, closed-form analytical results for transparency, and a multi-country setup rather than small-open-economy. Ghironi et al. (2024) study sanctions in a three-country model but without nominal rigidities. Itskhoki and Mukhin (2022) analyze sanctions on Russia but in a small-open-economy. Attinasi et al. (2023) and Conteduca et al. (2024b) use richer production networks (Baqaee-Farhi) but are static and exclude financial fragmentation. FraNK trades production network richness for dynamics, nominal rigidities, financial fragmentation, and analytical tractability.&lt;/p&gt;
&lt;p&gt;Geoeconomic fragmentation: A policy-driven reversal of economic integration, often guided by strategic or geopolitical considerations, operationalized in FraNK as simultaneous increases in taxes on rival countries&amp;rsquo; goods imports, commodity imports, and bond purchases.&lt;/p&gt;
&lt;p&gt;Fragmentation shock: A simultaneous increase in three tax rates — goods import tax (tau), commodity import tax (tau_O), and bond tax (theta) — applied by each bloc against the other, representing the policy instruments through which integration is reversed.&lt;/p&gt;
&lt;p&gt;Demand channel (goods tax): The mechanism by which a goods import tax reduces aggregate demand, as households substitute away from now-more-expensive foreign goods, reducing output and — because this channel dominates the supply channel — lowering PPI inflation.&lt;/p&gt;
&lt;p&gt;Supply channel (commodity tax): The mechanism by which a commodity import tax raises intermediate input costs for firms, increasing real marginal costs and PPI inflation while reducing output — a purely cost-push effect with no offsetting demand-side force.&lt;/p&gt;
&lt;p&gt;Bond tax neutrality: Under symmetric calibration, capital controls on rival-bloc bonds are macroeconomically neutral because each country&amp;rsquo;s net foreign asset position is unchanged: the reduction in holdings of rival bonds is exactly matched by a reduction in own-bond issuance, leaving the IS curve and Phillips curve unaffected.&lt;/p&gt;
&lt;p&gt;Expenditure-switching channel: The force by which fragmentation between two blocs diverts import demand toward untaxed third-country (neutral) goods, generating a positive demand spillover for NE countries that roughly offsets the global income channel.&lt;/p&gt;
&lt;p&gt;Global income channel: The negative spillover to neutral countries arising from the reduction in world income caused by fragmentation between the taxing blocs, which reduces demand for all goods including those of neutral producers, approximately canceling the expenditure-switching channel.&lt;/p&gt;</description></item><item><title>Growth Experiences and Trust in Government</title><link>https://macropaperwarehouse.com/papers/growth-experiences-and-trust-in-government/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/growth-experiences-and-trust-in-government/</guid><description>&lt;p&gt;This paper investigates whether individuals who have experienced stronger GDP growth over their lifetimes are more likely to trust their national government. The authors — Besley, Dann, and Dray — assemble a newly harmonized global dataset comprising approximately 3.3 million respondents across 166 countries since 1990, drawn from 11 major opinion surveys (Afrobarometer, Americasbarometer, Arabarometer, Asiabarometer, European Social Survey, Gallup World Poll, Integrated Values Survey, Latinobarometer, Life in Transition Survey, South Asia Barometer, and World Justice Project). They supplement this with longer-run U.S. evidence from the American National Election Studies (ANES) going back to 1958, covering respondents born as early as the 1880s, and longitudinal Swiss evidence from the Swiss Household Panel (SHP) which allows individual fixed-effects estimation.&lt;/p&gt;
&lt;p&gt;The core methodological contribution is the exploitation of country-cohort variation in lifetime GDP growth experiences. Following Malmendier and Nagel (2011), the authors construct a weighted average of past growth realizations across an individual&amp;rsquo;s lifetime, with weights decaying linearly over time (lambda = 1), so that more recent growth receives greater weight. The baseline specification includes country fixed effects, cohort-by-subcontinent fixed effects, survey-by-survey-year fixed effects, controls for log GDP per capita at year of birth, and individual characteristics (sex, marital status, education, religious denomination). More demanding specifications add country-by-survey-year and country-by-age fixed effects. For Switzerland, individual fixed effects are included, fully absorbing time-invariant personal characteristics.&lt;/p&gt;
&lt;p&gt;The main finding is that a one standard deviation increase in lifetime GDP growth experience — corresponding to approximately 2 percentage points of additional growth — is associated with a 2.1 percentage point increase in the probability of trusting the national government, significant at the 1 percent level. This corresponds to roughly 0.042 standard deviations of the trust outcome and approximately 5 percent of the global mean trust in government. The effect is quantitatively meaningful: it approximates between one-quarter and one-half of the difference in average trust between older and younger cohorts in India and Italy, respectively. For the U.S. ANES sample, a one standard deviation increase in growth experience (about 0.2 percentage points) increases trust in the federal government by 2.4 percentage points, explaining more than two-thirds of the average trust gap between Baby Boomers (born 1946–1964) and Millennials (born 1981–1996).&lt;/p&gt;
&lt;p&gt;Several scope conditions and heterogeneity findings sharpen the interpretation. First, the growth-trust link is specific to government institutions: there is no statistically significant effect of growth experience on interpersonal trust or trust in religious organizations, indicating the channel runs through perceptions of state performance rather than generalized social capital. Second, a recency heuristic operates: the linearly decaying weighting function (lambda = 1) outperforms both an unweighted lifetime average (lambda = 0) and a formative-years weighting. Growth experienced during formative years (ages 18–25) or before birth has no detectable effect on trust in government; the pre-birth result serves as a placebo test. Third, the positive growth-trust relationship is stronger in democracies than in autocracies, which the authors interpret as democracies producing citizens more responsive to government performance signals. Fourth, a &amp;ldquo;trust paradox&amp;rdquo; emerges: unconditionally, average trust in government is lower in democracies than in autocracies, and longer democratic experience is associated with lower trust, which the authors attribute to democratic institutions generating greater citizen skepticism about government performance. Fifth, core results are robust to controlling for other lifetime politico-economic experiences including inflation, banking and currency crises, epidemics, political unrest, executive turnover, stock market returns, and income inequality. The Swiss evidence further shows that private income growth experience does not drive the result — only aggregate macroeconomic growth does.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s core quantitative finding on the growth-trust relationship?
A: Using the global harmonized dataset of 3.3 million respondents across 166 countries, a one standard deviation increase in lifetime GDP growth experience (corresponding to approximately 2 percentage points of additional growth) is associated with a 2.1 percentage point increase in the probability of trusting the national government, significant at the 1 percent level. Using only the Gallup World Poll subsample (roughly half the observations), the estimated effect is somewhat larger at 3.6 percentage points per standard deviation increase. These estimates remain statistically significant under more demanding specifications with country-by-survey-year and country-by-age fixed effects, though the magnitudes decrease as these interacted fixed effects absorb variation in recent growth experiences.&lt;/p&gt;
&lt;p&gt;Q: How do the authors measure individual lifetime growth experience?
A: The growth experience variable is a weighted average of all past annual GDP per capita growth rates since an individual&amp;rsquo;s birth, with weights that decay linearly over time (lambda = 1 in the Malmendier-Nagel framework). Under this parameterization, the measure simplifies to how much recent economic performance (in the year prior to the survey) exceeds the long-run mean over the respondent&amp;rsquo;s lifetime, scaled by the respondent&amp;rsquo;s midpoint of life. This implies younger individuals are more sensitive to recent growth outcomes because their shorter life histories give recent events relatively greater weight. The authors validate this lambda = 1 choice via a grid search over alternative weighting structures using minimum residual sum of squares as the criterion.&lt;/p&gt;
&lt;p&gt;Q: How is reverse causality addressed?
A: The empirical strategy identifies the relationship using past, cumulative growth experiences measured prior to the survey, so current trust in government cannot cause past growth. Survey-year fixed effects absorb all aggregate time trends simultaneously affecting trust and growth. The authors also conduct a placebo test showing that GDP growth occurring before an individual&amp;rsquo;s birth has a precisely estimated null effect on their trust in government, which would not be the case if unobserved societal trends were jointly driving both growth histories and political perceptions.&lt;/p&gt;
&lt;p&gt;Q: Does growth experience affect interpersonal trust or trust in non-state institutions?
A: No. The estimated coefficient on lifetime growth experience is statistically insignificant at conventional levels when interpersonal trust replaces trust in government as the dependent variable, with narrow confidence intervals indicating a precisely estimated null. Similarly, growth experience has no systematic effect on trust in religious organizations such as churches or mosques. The authors interpret these null results as evidence against the alternative explanation that broad modernizing social changes are jointly driving both growth experiences and political trust.&lt;/p&gt;
&lt;p&gt;Q: What do the U.S. ANES results add?
A: The ANES data, which extends back to 1958 and captures cohorts born as early as the 1880s, provide a within-country test controlling for state fixed effects, generation dummies, and rich individual characteristics including partisan affiliation and partisan strength. A one standard deviation increase in U.S. growth experience (approximately 0.2 percentage points) raises trust in the federal government by 2.4 percentage points, significant at the 1 percent level. This estimate is quantitatively large enough to explain more than two-thirds of the average trust gap between Baby Boomers and Millennials. Results are robust to adding state-by-survey-year fixed effects and birth-state-by-generation fixed effects, and hold for a broader &amp;ldquo;trust in government index&amp;rdquo; covering beliefs about waste, corruption, and responsiveness of the federal government.&lt;/p&gt;
&lt;p&gt;Q: What do the Swiss Household Panel results contribute?
A: The SHP allows individual fixed-effects estimation, exploiting within-person changes in growth experience and trust over time from 1999 onward, which absorbs all time-invariant individual characteristics that could confound the global and U.S. cross-cohort results. The growth experience coefficient remains positive and significant, with a one standard deviation increase yielding a 1.9 percentage point increase in trust in the Swiss federal government (significant at the 1 percent level). The Swiss data also uniquely allow the authors to test whether personal income growth experience drives the result; they find no significant effect of private income growth experience on trust in government, only aggregate macroeconomic growth matters.&lt;/p&gt;
&lt;p&gt;Q: Does the recency heuristic hold — does growth in formative years matter?
A: No. The authors find no detectable effect of growth experienced specifically during formative years (ages 18–25) on trust in government. Additionally, in a grid-search exercise assessing model fit across different lambda values, the linearly decaying weighting scheme (lambda = 1, giving more weight to recent growth) outperforms both equal-weighted lifetime averages (lambda = 0) and weighting schemes that emphasize earlier life experiences (lambda less than 0). The pre-birth placebo result (null effect) and the absence of a formative-years effect together indicate that the operative mechanism is about evaluating current government performance based on recent macroeconomic experience, not the imprinting of long-lasting political dispositions during youth.&lt;/p&gt;
&lt;p&gt;Q: What is the &amp;ldquo;trust paradox&amp;rdquo; and how is it documented?
A: The trust paradox refers to the empirical finding that average trust in government is lower in democracies than in autocracies at the cross-country level, and that longer experience with democratic institutions within countries is associated with lower levels of trust in government in the micro data. This is counterintuitive given the standard view that good institutions should foster confidence in government. The authors suggest the paradox likely reflects democracies cultivating greater citizen skepticism and more critical judgment of government performance, rather than indicating that democratic governance actually performs worse. Importantly, the positive effect of growth experience on trust remains present in democracies, and the growth-trust relationship is actually stronger in democratic regimes, consistent with citizens in democracies being more responsive to government performance signals.&lt;/p&gt;
&lt;p&gt;Q: How is the growth-trust finding related to corruption perceptions and living standards?
A: Using the Gallup World Poll, the authors find that stronger lifetime growth experience is associated with lower perceived corruption in government, greater satisfaction with personal living standards, and higher likelihood of feeling one lives comfortably on one&amp;rsquo;s present income. These results are consistent with citizens attributing economic success to government competence and integrity, and with growth translating into perceptions of improved personal circumstances through both direct income effects and indirect public goods provision.&lt;/p&gt;
&lt;p&gt;Q: Are the results robust to controlling for other lifetime politico-economic experiences?
A: Yes. When the authors include lifetime experience measures for political unrest, executive turnover, epidemic exposure, banking crises, currency crises, and inflation (both levels and volatility) simultaneously in equation (3), the growth experience coefficient remains consistently positive, stable, and significant across all specifications. Among the other experience variables, only lifetime unrest and epidemic exposure are independently negative and statistically significant at conventional levels. F-tests reject the null hypothesis that the crisis and growth experience coefficients are equal in magnitude. The U.S. results are also robust to adding lifetime experiences with S&amp;amp;P 500 returns, unemployment, and top-income-share inequality measures.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of the findings?
A: The authors note that sustained economic growth may itself be a mechanism for building political trust, with positive downstream effects for policy compliance — a connection they document has been relevant during the COVID-19 pandemic (where higher-trust societies showed lower mobility during lockdowns and higher vaccine acceptance). The growth-trust channel could have implications for increasing compliance across a range of policy domains including climate action and tax morale. Governments that deliver sustained economic growth can expect citizens to update their trust upward, particularly in democracies where citizens are more performance-responsive, while governments that preside over stagnation or contraction face predictable erosion of political legitimacy across cohorts.&lt;/p&gt;
&lt;p&gt;Growth experience: A weighted average of all past annual GDP per capita growth realizations since an individual&amp;rsquo;s birth, with weights that decay linearly over time following Malmendier and Nagel (2011), so that more recent growth receives greater weight. Under the paper&amp;rsquo;s preferred parameterization (lambda = 1), the measure equals how much last year&amp;rsquo;s GDP per capita exceeds the respondent&amp;rsquo;s lifetime mean, scaled by the respondent&amp;rsquo;s midpoint of life.&lt;/p&gt;
&lt;p&gt;Trust in government: A binary dummy variable equal to one if a survey respondent expresses &amp;ldquo;a great deal&amp;rdquo; or &amp;ldquo;quite a lot&amp;rdquo; of trust or confidence in the national government, constructed from harmonized responses across 11 major opinion surveys. The paper treats this as reflecting respondents&amp;rsquo; perceptions of government performance rather than a deep interpersonal trust relationship.&lt;/p&gt;
&lt;p&gt;Trust paradox: The empirical regularity documented in the paper whereby average trust in government is unconditionally lower in democracies than in autocracies at the cross-country level, and whereby longer democratic experience within countries is associated with lower individual trust in government. The authors attribute this to democratic institutions generating more critical citizen judgment of government performance.&lt;/p&gt;
&lt;p&gt;Recency heuristic: The finding that more recent growth experiences carry greater weight in forming trust in government, as captured by the linear decay weighting scheme (lambda = 1) outperforming equal-weighted or early-life-weighted alternatives. Growth before birth and growth during formative years (ages 18–25) have no detectable effect, while recent macroeconomic performance is the operative signal.&lt;/p&gt;
&lt;p&gt;Cohort-level variation: The within-country differences in lifetime growth experiences across birth cohorts that form the paper&amp;rsquo;s primary identification strategy. Because different cohorts in the same country have lived through different sequences of growth episodes, differences in trust across cohorts within a country can be attributed to differential growth exposure rather than time-invariant country characteristics.&lt;/p&gt;
&lt;p&gt;Formative years effect: The hypothesis, tested and rejected in the paper, that economic experiences during ages 18–25 have a lasting imprint on political attitudes analogous to formative-years effects found in other political behavior literatures. The paper finds no statistically significant association between growth experienced during these years and trust in government.&lt;/p&gt;
&lt;p&gt;Source text origin: In the pipeline context relevant to this paper&amp;rsquo;s acquisition, this refers to whether a summary was generated from full working paper text (&amp;ldquo;pdf&amp;rdquo; or &amp;ldquo;oa-html&amp;rdquo;) versus abstract only (which is hard-blocked). The working paper was obtained from LSE Research Online (eprint 129614), classified as published version under CC BY 4.0.&lt;/p&gt;</description></item><item><title>Health Shocks, Health Insurance, Human Capital, and the Dynamics of Earnings and Health</title><link>https://macropaperwarehouse.com/papers/health-shocks-health-insurance-human-capital-and-the-dynamics-of-earnings-and-health/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/health-shocks-health-insurance-human-capital-and-the-dynamics-of-earnings-and-health/</guid><description>&lt;p&gt;Capatina and Keane build and calibrate a life-cycle model of labor supply and savings for U.S. men that incorporates health shocks, endogenous human capital accumulation via learning-by-doing, employer-sponsored health insurance (ESHI), means-tested social insurance, and endogenous medical treatment decisions. The model is calibrated to White males using the Medical Expenditure Panel Survey (MEPS) for 2000–2013, supplemented by CPS, HRS, and PSID data; separate calibrations are presented for Black and Hispanic men with high school or less education.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central research question is how health shocks affect labor supply, earnings, and earnings inequality over the life cycle, and through which mechanisms. Four channels are identified and quantified: (1) the direct labor supply effect — sick days and reduced tastes for work caused by health shocks; (2) the human capital effect — reduced work experience from health-shock-induced employment exits, which deteriorates future job and wage offers in a snowball dynamic; (3) the health-productivity effect — reduced functional health directly lowering wage offers; and (4) the behavioral effect — anticipation of health risk induces low-skill workers lacking ESHI to curtail labor supply to maintain means-tested transfer eligibility.&lt;/p&gt;
&lt;p&gt;The key quantitative findings from eliminating serious health shocks for working-age men (ages 25–64) are: the expected present value of lifetime earnings (PVE) for White men rises by 11% on average, and inequality in PVE falls by 12% (coefficient of variation). For White men with high school or less education the increase in PVE is 17.9%. For the typical White male the four channels contribute 5.7%, 2.7%, 1.4%, and 0.8% respectively. For low-skill White high school men the same channels contribute 10.7%, 14.8%, 1.3%, and 9.8% — with the human capital and behavioral effects dramatically larger for the low-skill group. For comparison, a severe health shock at age 40 reduces the present value of remaining lifetime earnings by 5.6% (approximately $53.9k) for a typical college man and by 11.5% (approximately $55.0k) for a typical high school man.&lt;/p&gt;
&lt;p&gt;Human capital amplification operates through employment persistence: a major health shock causes full-time employment to drop by 12 percentage points one year after the shock for the average man, and by 20 percentage points for high school men, with recovery still incomplete eight years later (employment remains 7.8 pp and 10 pp below baseline, respectively). Holding human capital fixed as in the pre-shock baseline causes employment to recover quickly, confirming that persistent wage-offer deterioration is the mechanism.&lt;/p&gt;
&lt;p&gt;On health insurance policy, the model evaluates providing public insurance to all workers lacking ESHI. This substantially increases medical utilization, improves health and life expectancy (survival to age 65 rises from 82% to 87% when health shocks are eliminated, as a related benchmark), reduces Medicaid and free-care costs, and raises labor supply among low-skill workers by weakening means-tested transfer incentives. The net program cost in a balanced budget simulation is modest, and all agent types are ex ante better off. By contrast, expanding Medicaid access creates perverse labor supply disincentives — workers reduce labor supply to maintain eligibility — does little to improve health, and makes almost all agents worse off in a balanced budget scenario.&lt;/p&gt;
&lt;p&gt;Scope conditions: the primary calibration covers non-institutionalized civilian White males; results for Blacks and Hispanics are presented only for the high school or less education group due to small samples. The model period ends at 2013, before ACA implementation.&lt;/p&gt;
&lt;p&gt;Q: What is the model&amp;rsquo;s overall estimate of how much health shocks reduce lifetime earnings for White men?
A: Eliminating serious health shocks at working ages (25–64) would increase the expected present value of lifetime earnings (PVE) for the average White male by 11% and reduce inequality in PVE by 12% as measured by the coefficient of variation. For White men with high school or less education the PVE gain is larger at 17.9%.&lt;/p&gt;
&lt;p&gt;Q: What are the four channels through which health shocks affect earnings, and how large is each for the average White male versus a low-skill high school male?
A: The four channels are (1) direct labor supply via sick days and reduced tastes for work, (2) human capital deterioration from lost work experience worsening future job/wage offers, (3) reduced health productivity lowering wage offers, and (4) behavioral responses to health risk reducing labor supply to preserve transfer eligibility. For the average White male the contributions to PVE are 5.7%, 2.7%, 1.4%, and 0.8%, respectively. For low-skill White high school men the same channels contribute 10.7%, 14.8%, 1.3%, and 9.8% — the human capital and behavioral effects are roughly five to twelve times larger for the low-skill group.&lt;/p&gt;
&lt;p&gt;Q: Why is the human capital effect so much larger for low-skill high school men than for college men?
A: Low-skill high school men are much more likely to exit full-time employment following a major health shock and are slow to return. Lifetime work years decline by 1.89 for the typical high school man versus only 0.84 for the typical college man following a major shock at age 40. Because job offer probabilities depend on lagged employment, absence from the labor market creates a snowball effect that persistently depresses offer quality; human capital accounts for 42% of the earnings decline for high school men versus 34% for college men.&lt;/p&gt;
&lt;p&gt;Q: How does the paper characterize the persistent employment effects of a major health shock?
A: For the average man, full-time employment drops by 12 percentage points one year after a severe shock and remains 7.8 pp below baseline after eight years. For high school men the initial drop is 20 pp, still 10 pp below baseline after eight years; for college men the figures are 7 pp and 3 pp. When human capital is held fixed at the pre-shock baseline — so wage and job offers do not deteriorate due to lost experience — employment recovers quickly for workers of all skill levels, confirming the human capital mechanism drives the persistence.&lt;/p&gt;
&lt;p&gt;Q: How does the behavioral effect operate for low-skill workers?
A: Workers without ESHI who face health risk have an incentive to maintain sufficiently low income and assets to qualify for means-tested social insurance, which provides a consumption floor approximating Medicaid, Food Stamps, SSDI, and SSI. This perverse incentive leads low-skill workers to curtail labor supply preemptively. When health risk is eliminated, this incentive disappears and labor supply rises, generating the behavioral effect of 9.8% of PVE for low-skill high school men versus only 0.8% for the average White male.&lt;/p&gt;
&lt;p&gt;Q: How does the paper correct for under-reporting of health shocks among the uninsured?
A: The measurement model assumes health shocks are correctly measured for the treated, but uninsured workers who do not seek treatment only record a shock with a shock-specific probability less than one. A key identifying assumption is that, conditional on health status, risk factors, age, and education, the true frequency of health shocks does not differ by insurance status per se — ruling out ex ante moral hazard. The measurement model parameters are calibrated to match observed frequencies of health shocks and high risk in MEPS for the uninsured.&lt;/p&gt;
&lt;p&gt;Q: What does the model estimate regarding the effect of a severe health shock on cumulative earnings relative to existing reduced-form evidence?
A: The model predicts an average cumulative (non-discounted) earnings loss of $42.8k over ten years following a severe shock for men aged 50, compared with Smith&amp;rsquo;s (2004) estimate of $37k from the HRS. The paper argues Smith&amp;rsquo;s estimate identifies effects on workers who actually experience shocks, who are a selected sample with low baseline earnings (as untreated shocks are more likely to be severe, and non-treaters tend to have low earnings). The model&amp;rsquo;s &amp;ldquo;average effect&amp;rdquo; — comparing a world where everyone experiences the shock to one where no one does — yields a substantially higher loss of $59.8k.&lt;/p&gt;
&lt;p&gt;Q: What are the key findings from the public insurance experiment (providing insurance to the uninsured)?
A: Providing public insurance to all workers lacking ESHI substantially increases medical utilization among the previously uninsured, who are intrinsically less healthy. This improves health and life expectancy, raising Social Security costs. However, it also generates positive labor supply incentives for low-skill workers (reducing their reliance on means-tested transfers), substantially reduces Medicaid and free-care costs, and increases tax revenue. On balance, the net program cost in a balanced budget simulation is modest, and all types of workers are ex ante better off.&lt;/p&gt;
&lt;p&gt;Q: Why does expanding Medicaid access produce perverse results in contrast to providing public insurance?
A: Medicaid is means-tested, so expanded access requires workers to maintain sufficiently low income and assets to remain eligible. This creates disincentives to work and save — workers reduce labor supply to preserve eligibility. The result is reduced earnings, lower tax revenue, little improvement in health (as access to care depends on maintaining low income), and almost all agents being worse off in a balanced budget scenario.&lt;/p&gt;
&lt;p&gt;Q: What role does insurance play beyond consumption smoothing in this model?
A: Beyond lowering out-of-pocket (OOP) costs and smoothing consumption, insurance grants access to care: in the US system, proof of insurance is often required before treatment, so uninsured workers may not have the option to treat at all. The model captures three distinct option sets for the uninsured — all options available, treatment not available, or default not available — each motivated by different real-world contexts. Non-treatment worsens health transition probabilities, so the access-granting role of insurance independently affects health trajectories beyond its cost-reducing role.&lt;/p&gt;
&lt;p&gt;Q: What explains the observed positive association between education, income, insurance, and health transitions in the data, and how does the model generate this without education entering the health production function directly?
A: The association between education and health is largely driven by the positive correlation between education and latent health types; controlling for latent health type in a descriptive logit largely eliminates the education coefficient. The association between insurance and health transitions is driven by the fact that the insured are more likely to receive treatment; controlling for treatment and true shocks eliminates the insurance coefficient. Education affects health indirectly through its effects on treatment decisions — via wages, job offers with ESHI, and consumption capacity — without appearing as a direct argument in the health production function.&lt;/p&gt;
&lt;p&gt;Q: How large are the effects of health shocks on key population health statistics according to the model?
A: Eliminating serious health shocks at working ages would increase the fraction of working-age men in good health from 60% to 75% and raise the probability of survival to age 65 from 82% to 87%. Average annual sick days of 16.42 would be eliminated, implying a 6% increase in work days for employed workers and an employment rate increase from 88% to 91%. Average annual medical costs would fall from $4,618 to $1,132.&lt;/p&gt;
&lt;p&gt;Q: How do the results for Black and Hispanic men compare to White men?
A: The results are qualitatively similar, but the magnitudes for Black men are somewhat larger. Eliminating health shocks would raise PVE for Whites, Blacks, and Hispanics with high school or less education by 17.9%, 23.7%, and 17.7%, respectively. Separate access-to-care probabilities are calibrated for each group, reflecting racial disparities in access that explain part of the observed differences in health outcomes and treatment rates.&lt;/p&gt;
&lt;p&gt;Q: What is the role of the consumption floor (means-tested social insurance) in shaping equilibrium outcomes for low-skill workers?
A: The consumption floor guarantees a minimum household consumption level approximating Medicaid, Food Stamps, SSDI, and SSI. It shields low-skill workers from the full cost of health shocks, reducing both the consumption-smoothing value of ESHI and precautionary saving incentives. However, it also creates a powerful disincentive for low-skill workers without ESHI to work, as earning above the eligibility threshold would eliminate benefits. This mechanism amplifies earnings inequality by generating perverse labor supply behavior concentrated among low-skill, uninsured workers.&lt;/p&gt;
&lt;p&gt;Functional Health (H): A discrete stock variable (Poor, Fair, or Good) measuring aspects of health that directly affect worker productivity and tastes for work; distinguished from asymptomatic health risk. Transitions depend on lagged health, latent health type, age, persistent health shocks, and whether shocks are treated.&lt;/p&gt;
&lt;p&gt;Asymptomatic Health Risk (R): A binary state (low or high) capturing risk factors such as obesity, high cholesterol, and hypertension that increase the probability of future health shocks but do not affect current productivity.&lt;/p&gt;
&lt;p&gt;Human Capital Effect: The channel by which health shocks reduce lifetime earnings not directly but indirectly — by causing employment exits that slow work experience accumulation, which in turn deteriorates future job offer probabilities and wage offers in a persistent, self-reinforcing (snowball) dynamic.&lt;/p&gt;
&lt;p&gt;Behavioral Effect: The reduction in labor supply — and associated earnings loss — that occurs because workers facing health risk and lacking ESHI have an incentive to keep income and assets low enough to maintain eligibility for means-tested social insurance, even absent any contemporaneous health shock.&lt;/p&gt;
&lt;p&gt;Tied Wage-Hours-Insurance Offer: The model&amp;rsquo;s labor market structure in which employment offers jointly specify a wage rate, hours (no offer, part-time, or full-time), and whether the offer includes ESHI; workers accept or reject the bundle rather than choosing hours and insurance independently.&lt;/p&gt;
&lt;p&gt;Source Text Origin: The paper&amp;rsquo;s own term distinguishing how the full text of a paper was obtained (PDF, OA-HTML, or abstract-only); used in the summarization pipeline. [Note: this concept is from the summarization pipeline metadata, not from the paper itself — omitting.]&lt;/p&gt;
&lt;p&gt;Treatment/Payment Options: The set of decisions available to a worker after a health shock occurs — whether to seek treatment and, if treated, whether to pay the out-of-pocket cost or default on bills. The available choice set differs by insurance status and context: the uninsured may face denial of access (option to treat unavailable) or required prepayment (default unavailable), or may have all options including free care.&lt;/p&gt;
&lt;p&gt;Latent Health Type: An unobserved permanent individual characteristic capturing innate biological resilience and pre-age-25 health investments; determines baseline transition probabilities for functional health conditional on shocks. Positively correlated with latent skill type within education groups.&lt;/p&gt;</description></item><item><title>Hedge funds and the Treasury cash-futures basis trade</title><link>https://macropaperwarehouse.com/papers/hedge-funds-and-the-treasury-cash-futures-basis-trade/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/hedge-funds-and-the-treasury-cash-futures-basis-trade/</guid><description>&lt;p&gt;The U.S. Treasury market is the deepest and most liquid fixed-income market in the world, yet in March 2020 it experienced unprecedented dysfunction—widening bid-ask spreads, skyrocketing repo rates, and diverging arbitrage spreads that prompted massive Federal Reserve intervention. This paper documents the rise and near-collapse of the Treasury cash-futures basis trade—an arbitrage strategy among hedge funds exploiting a persistent disconnect between cash Treasury prices and futures prices—as a central feature of that episode. Using regulatory datasets on hedge fund exposures and repo transactions, the authors show that at its peak the basis trade accounted for an estimated $400–$500 billion in positions, constituting more than 60% of total hedge fund Treasury exposure, more than 70% of hedge fund repo borrowing, and more than 25% of primary dealers&amp;rsquo; repo lending. A model and empirical evidence link the trade&amp;rsquo;s growth after 2016 to broader Treasury market developments, and show how the trade&amp;rsquo;s reliance on short-term repo financing creates both margin risk and rollover risk. In March 2020 many of these risks materialized, though the unwinding of basis positions was likely a consequence rather than the primary cause of the stress; prompt Federal Reserve intervention may have prevented a liquidity spiral.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-treasury-cash-futures-basis-trade-and-why-did-it-become-popular"&gt;Q1. What is the Treasury cash-futures basis trade and why did it become popular?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The basis trade exploits the arbitrage relationship Pₜ,τ = ΣBₜ,ₛcₛ + Bₜ,T Fₜ,τ,T: when futures prices are too high relative to the present value of the deliverable bond, traders go &amp;ldquo;long the basis&amp;rdquo; by buying the cash bond and shorting the futures, financing the long position in the overnight repo market.&lt;/strong&gt; The trade became popular following 2016 as demand for long Treasury futures positions grew (from institutional investors seeking leveraged duration exposure) while the supply of warehousing capacity from dealers contracted under post-crisis regulatory constraints. Hedge funds stepped in as the marginal warehouser, exploiting the resulting premium embedded in futures prices. The trade is nearly zero net-cash but requires continuous repo rollover.&lt;/p&gt;
&lt;h3 id="q2-how-large-did-the-trade-become-and-how-was-its-size-estimated"&gt;Q2. How large did the trade become and how was its size estimated?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Using regulatory data—specifically CFTC Form 40 (hedge fund futures positions), SEC Form PF (AUM and derivatives exposures), and FR 2004 (primary dealer repo data)—the authors estimate basis trade positions peaked at $400–$500 billion, comprising more than 60% of hedge fund Treasury exposure, more than 70% of hedge fund repo borrowing, and more than 25% of primary dealer repo lending to hedge funds.&lt;/strong&gt; The data allow the authors to identify basis positions directly, distinguishing them from outright long Treasury positions, by matching the simultaneous long cash / short futures pattern that defines the trade. The estimates underscore that hedge funds had become systemically important participants in Treasury market intermediation.&lt;/p&gt;
&lt;h3 id="q3-what-financial-stability-risks-does-the-basis-trade-create"&gt;Q3. What financial stability risks does the basis trade create?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The basis trade creates two interrelated risks: margin risk (variation margin calls on futures positions can force immediate liquidation) and rollover risk (if repo lenders withdraw funding, the cash Treasury position must be sold).&lt;/strong&gt; The paper&amp;rsquo;s model formalizes how limits to arbitrage—specifically repo market illiquidity and margin requirements—impair risk-sharing between dealers and holders of long futures positions. These constraints mean that even a moderate adverse price move can trigger a self-reinforcing cycle: higher basis volatility → margin calls → forced sales → further basis widening → further margin calls.&lt;/p&gt;
&lt;h3 id="q4-what-happened-in-march-2020-and-what-was-the-federal-reserves-role"&gt;Q4. What happened in March 2020 and what was the Federal Reserve&amp;rsquo;s role?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Beginning in early March 2020, the COVID-19 pandemic triggered a &amp;ldquo;dash for cash&amp;rdquo; that disrupted Treasury market functioning: bid-ask spreads widened dramatically, repo rates spiked, and the cash-futures basis moved sharply against basis traders, generating large margin calls.&lt;/strong&gt; The authors find that while Treasury market disruptions spurred hedge funds to sell Treasuries, the unwinding of the basis trade was likely a consequence rather than a primary cause of the stress. The Federal Reserve intervened by dramatically expanding Treasury purchases from dealers and offering unlimited repo and reverse repo facilities, which likely prevented a liquidity spiral by removing the constraint on dealer intermediation capacity. The paper argues this episode highlights structural vulnerabilities in Treasury market intermediation arising from the shift of warehousing capacity to hedge funds.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Treasury cash-futures basis trade&lt;/strong&gt; : an arbitrage strategy in which a trader simultaneously holds a long position in cash Treasury bonds (funded via repo) and a short position in Treasury futures, profiting from the convergence of cash and futures prices at delivery.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;warehousing role of hedge funds&lt;/strong&gt; : the function of holding Treasury bonds on behalf of institutional investors who want long futures exposure, financed in the repo market; this creates a link between Treasury, futures, and repo markets and exposes the system to repo rollover and margin risk.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;rollover risk&lt;/strong&gt; : the risk that short-term repo lenders decline to roll over funding at maturity, forcing the borrower to sell the collateral asset (Treasury bonds) at potentially distressed prices.&lt;/p&gt;</description></item><item><title>Heterogeneous innovations and growth under imperfect technology spillovers</title><link>https://macropaperwarehouse.com/papers/heterogeneous-innovations-and-growth-under-imperfect-technology-spillovers/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/heterogeneous-innovations-and-growth-under-imperfect-technology-spillovers/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Jo and Kim ask two related questions: (1) How do firms use different types of innovation when learning others&amp;rsquo; technology takes time? (2) How does this process alter the aggregate implications of firm innovation, particularly in the context of increasing competition?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The paper develops a discrete-time infinite-horizon endogenous growth model with multi-product firms pursuing two types of innovation — &amp;ldquo;own-innovation&amp;rdquo; (improving existing product quality) and &amp;ldquo;creative destruction&amp;rdquo; (entering new product markets by displacing incumbents) — subject to a novel friction called &amp;ldquo;imperfect technology spillovers.&amp;rdquo; The friction takes the specific form of lagged learning: creative destruction builds on the one-period-lagged technology of the target market&amp;rsquo;s incumbent, while only the incumbent can observe the current frontier technology level. This one-period lag creates a technology gap (Δ = q_t / q_{t−1}) between the incumbent&amp;rsquo;s frontier and the level available to rivals. Four possible technology gap values arise in equilibrium: Δ₁ = 1 (no gap), Δ₂ = λ (one successful own-innovation), Δ₃ = η (one successful creative destruction), and Δ₄ = η/λ. The step sizes satisfy λ² &amp;gt; η &amp;gt; λ, meaning a single creative destruction improves quality more than a single own-innovation, but two consecutive own-innovations dominate a single creative destruction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key Mechanisms.&lt;/strong&gt; The learning friction generates two novel mechanisms. First, the &amp;ldquo;market-protection effect&amp;rdquo;: incumbents with a technology advantage (Δ &amp;gt; 1) intensify own-innovation to widen the gap and protect their product lines when competitive pressure rises. Formally, own-innovation probability is highest for Δ₂ products and declines monotonically (z₂ &amp;gt; z₃ &amp;gt; z₄ &amp;gt; z₁), and ∂z₂/∂x &amp;gt; ∂z₃/∂x &amp;gt; 0 while ∂z₁/∂x &amp;lt; 0, conditional on value coefficients. Second, the &amp;ldquo;technological barrier effect&amp;rdquo;: higher overall own-innovation and creative destruction intensity widens the average technology gap across products, reducing rivals&amp;rsquo; conditional probability of successfully taking over a product market. This is distinct from the standard Schumpeterian effect (lower expected future profits) and from the escape-competition effect in step-by-step models (which apply only to neck-and-neck, single-product firms).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Empirical Strategy.&lt;/strong&gt; The empirical analysis combines the USPTO PatentsView database, the Longitudinal Business Database (LBD), the Longitudinal Firm Trade Transactions Database (LFTTD), the Census of Manufactures (CMF), Compustat, and NBER-CES data, covering the universe of U.S. patenting firms from 1976 to 2016, with main analyses from 1982 to 2007. Own-innovation is proxied by the self-citation ratio of patents (the ratio of self-citations to total backward citations); creative destruction by new products added and low-self-citation patents. Exogenous competitive pressure comes from China&amp;rsquo;s WTO accession in 2001, instrumented by the industry-level NTR tariff gap (the gap between non-NTR and NTR rates in 1999) following Pierce and Schott (2016).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Findings.&lt;/strong&gt; Pre-shock (1982–1999): patents with lower self-citation ratios (closer to creative destruction) have significantly longer backward citation gaps (coefficient −2.29 to −2.59, p &amp;lt; 0.01 across specifications), confirming that learning others&amp;rsquo; technology takes more time. Creative-destruction-type patents also have higher market value (Kogan et al. stock return measure) and scientific value (forward citations), with self-citation ratio negatively associated with both (e.g., coefficient on self-citation for market value: −0.289 without firm FE; −0.110 with firm FE, p &amp;lt; 0.01). Conditional on patenting, higher self-citation ratios are negatively associated with employment growth (coefficient −0.256, p &amp;lt; 0.05), number of industries added (−0.158, p &amp;lt; 0.05), and products added (−0.274, p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;Post-shock (DID): foreign competition had no statistically significant effect on overall patent counts, but firms with above-average innovation intensity in industries with high NTR gaps significantly increased their self-citation ratio — indicating a shift toward own-innovation. The triple-interaction coefficient is 0.795 (p &amp;lt; 0.01) with baseline controls. For a firm with average lagged innovation intensity (0.18) in an industry with an average NTR gap (0.291), this corresponds to a 4.2 percentage point increase in the seven-year growth rate of the self-citation ratio, representing a 15.0% increase relative to the average growth rate of 28.2 percentage points. Consistent with the technological barrier effect, firm entry rates are lower in industries with higher TFPR-skewness-based technological barriers (coefficient −0.012 to −0.016, p &amp;lt; 0.05).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative Analysis.&lt;/strong&gt; Calibrated to the U.S. manufacturing sector in 1992, the model matches six target moments including average number of products (2.3), products added (0.3), firm entry rate (7.6%), average productivity growth (1.9%), high-growth-firm employment growth (22.5%), and import penetration (15.3%). Creative destruction contributes approximately 1.88 times more to growth per unit than own-innovation (step size ratio 0.075/0.04). The aggregate R&amp;amp;D-to-sales ratio (untargeted) is 4.6% in the model vs. 4.1% in data.&lt;/p&gt;
&lt;p&gt;A counterfactual increasing outside entrants by 83% (matching the rise in import penetration from 15.3% to 25.1% between 1992 and 2007) generates a 1.51% increase in aggregate creative destruction arrival rate x, but firm-level creative destruction probability falls 1.33% and startup creative destruction also falls 1.33%. The aggregate R&amp;amp;D-to-sales ratio falls 1.6% and creative destruction R&amp;amp;D intensity falls 1.2%. Average domestic productivity growth declines 11.0%, with growth from creative destruction falling 13.0% and growth from domestic startups falling 1.7%. The total mass of domestic firms falls 6.4%.&lt;/p&gt;
&lt;p&gt;In economies with creative destruction costs 80 times higher than the U.S. baseline, the same competitive pressure shock raises rather than lowers total R&amp;amp;D (by 1.0%), but domestic growth still falls 9.7%, because the marginal decline in creative destruction impedes the growth contribution and firm entry even when aggregate innovation spending rises.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-key-friction-that-distinguishes-this-model-from-the-existing-multi-product-firm-literature-eg-klette-and-kortum-2004-akcigit-and-kerr-2018"&gt;Q1. What is the key friction that distinguishes this model from the existing multi-product firm literature (e.g., Klette and Kortum 2004; Akcigit and Kerr 2018)?&lt;/h3&gt;
&lt;p&gt;A: The key friction is &amp;ldquo;imperfect technology spillovers,&amp;rdquo; modeled as lagged learning: creative destruction can only build on the one-period-lagged technology of the target product (q_{j,t−1}), while the product&amp;rsquo;s current owner observes the frontier technology (q_{j,t}). In models without this friction — such as Akcigit and Kerr (2018) — rivals can instantly learn and copy frontier technology, so firms have no technological advantage and cannot protect their markets. In the current model, own-innovation by the incumbent widens the gap between q_{j,t} and q_{j,t−1}, creating a barrier that a rival must overcome even after successful creative destruction. This makes own-innovation an endogenous function of the technology gap, a feature absent from existing multi-product firm frameworks.&lt;/p&gt;
&lt;h3 id="q2-why-does-the-model-predict-that-own-innovation-increases-with-the-technology-gap-up-to-a-point-then-decreases"&gt;Q2. Why does the model predict that own-innovation increases with the technology gap up to a point, then decreases?&lt;/h3&gt;
&lt;p&gt;A: From Corollary 1, the ordering z₂ &amp;gt; z₃ &amp;gt; z₄ &amp;gt; z₁ reflects competing forces. Products with gap Δ₂ = λ gain the most from additional own-innovation in terms of reducing the probability of losing the product line (equation 2), so own-innovation is highest there. Products with Δ₃ = η or Δ₄ = η/λ already have substantial technological advantages from prior creative destruction, so the marginal value of own-innovation in reducing market loss probability is lower. Products with Δ₁ = 1 have no advantage at all: if a rival succeeds in creative destruction, the incumbent loses the product regardless of own-innovation (equation 1), so z₁ is lowest. Beyond a certain gap level, the incumbent is sufficiently protected that additional own-innovation has diminishing returns in deterrence.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-market-protection-effect-formally-and-for-which-products-is-it-strongest"&gt;Q3. What is the market-protection effect formally, and for which products is it strongest?&lt;/h3&gt;
&lt;p&gt;A: The market-protection effect (Corollary 2) is the positive response of a firm&amp;rsquo;s own-innovation to an increase in the aggregate creative destruction arrival rate x, conditional on the value coefficients A₁ and A₂ being fixed. It is strongest for products with Δ₂ = λ (∂z₂/∂x is the largest and positive), positive but weaker for Δ₃ = η (∂z₃/∂x &amp;gt; 0), of ambiguous sign for Δ₄ = η/λ, and negative for Δ₁ = 1 (∂z₁/∂x &amp;lt; 0). The asymmetry reflects the asymmetric payoff to own-innovation across gap levels: for Δ₂ products, successful own-innovation can turn a losing situation into a winning one because it shifts the technology gap from Δ₁ to Δ₂ from the rival&amp;rsquo;s perspective, effectively defeating the rival&amp;rsquo;s creative destruction attempt. This mechanism provides a micro-foundation for why frontier firms (like Google or NVIDIA) keep innovating intensely despite their technological leads, a pattern the standard step-by-step model cannot explain.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-technological-barrier-effect-and-how-does-it-differ-from-the-schumpeterian-effect"&gt;Q4. What is the technological barrier effect and how does it differ from the Schumpeterian effect?&lt;/h3&gt;
&lt;p&gt;A: The technological barrier effect refers to the reduction in rivals&amp;rsquo; incentive for creative destruction caused by an increase in the average technology gap across product lines. When incumbents do more own-innovation or when outside firms do more creative destruction, the distribution of technology gaps shifts rightward (density at Δ₁ falls; density at Δ₂, Δ₃, Δ₄ rises). This raises the average technology barrier rivals must overcome to successfully take over a product market, reducing the conditional takeover probability x^{takeover} and the expected value of creative destruction B. In the U.S. counterfactual, the technological barrier effect accounts for 17.0% of the total change in the aggregate creative destruction rate x and 15.0% of the change in startup creative destruction x_e. In contrast, the Schumpeterian effect refers to the reduction in expected future profits from owning a product due to increased displacement risk (through the value coefficient A₂), a mechanism present in standard quality-ladder models. Both operate simultaneously but the technological barrier effect is a novel feature of this framework.&lt;/p&gt;
&lt;h3 id="q5-how-is-own-innovation-vs-creative-destruction-measured-empirically-and-what-validates-this-measure"&gt;Q5. How is own-innovation vs. creative destruction measured empirically, and what validates this measure?&lt;/h3&gt;
&lt;p&gt;A: The self-citation ratio (the share of a patent&amp;rsquo;s backward citations that cite the same assignee&amp;rsquo;s earlier patents) is used as the primary measure: a higher ratio indicates greater reliance on the firm&amp;rsquo;s own prior knowledge, hence a higher probability that the innovation improves an existing product line (own-innovation). This is validated empirically in three ways. First, patents with lower self-citation ratios have significantly larger backward citation gaps (coefficient −2.29 to −2.59 across fixed-effect specifications on 728,721 observations), consistent with creative destruction requiring more time to learn others&amp;rsquo; technology. Second, lower self-citation patents have higher market value and scientific value (forward citations), consistent with η &amp;gt; λ (creative destruction contributes more per event to quality). Third, firm-level regressions show that lower self-citation ratios are associated with higher employment growth, more products added, and more industries entered, consistent with creative destruction contributing more to firm expansion.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-did-identification-strategy-work-and-what-are-the-main-results"&gt;Q6. How does the DID identification strategy work, and what are the main results?&lt;/h3&gt;
&lt;p&gt;A: The identification exploits the removal of trade policy uncertainty (TPU) after China&amp;rsquo;s WTO accession in 2001. The treatment variable is the industry-level NTR gap (the gap between non-NTR and NTR tariff rates in 1999): industries with larger gaps experienced a larger reduction in uncertainty and thus a greater increase in Chinese import competition. The DID compares patenting firms across periods (1992–1999 vs. 2000–2007) and across high- vs. low-NTR-gap industries, with a triple interaction for firm-level innovation intensity (lagged five-year average patents per employee, normalized within two-digit NAICS). The main finding (Table 4): the NTR gap × Post interaction has no significant effect on overall patent counts (coefficient 0.238 without controls, standard error 0.237), but the triple interaction (NTR gap × Post × innovation intensity) has a positive and significant effect on the growth rate of the self-citation ratio (0.732 without controls, p &amp;lt; 0.05; 0.795 with baseline controls, p &amp;lt; 0.01). This implies that innovation-intensive firms in high-competition industries shifted their composition toward own-innovation, while overall patenting was unchanged — consistent with an offsetting rise in own-innovation and fall in creative destruction.&lt;/p&gt;
&lt;h3 id="q7-what-are-the-aggregate-growth-effects-of-increasing-competitive-pressure-in-the-calibrated-model"&gt;Q7. What are the aggregate growth effects of increasing competitive pressure in the calibrated model?&lt;/h3&gt;
&lt;p&gt;A: Using an 83% increase in outside entrants (matching the 1992–2007 rise in import penetration from 15.3% to 25.1%), average domestic productivity growth falls 11.0%. Decomposing: growth from domestic own-innovation falls 11.4%, growth from domestic creative destruction falls 13.0%, and growth from domestic startups falls 1.7% (Table 9). The aggregate R&amp;amp;D-to-sales ratio falls 1.6% and the creative destruction R&amp;amp;D intensity falls 1.2%, indicating that the decline in creative destruction R&amp;amp;D outweighs the rise in own-innovation R&amp;amp;D. The total mass of domestic firms falls 6.4% and the average number of products per firm falls 5.5%.&lt;/p&gt;
&lt;h3 id="q8-how-do-results-differ-in-economies-with-high-creative-destruction-costs-vs-the-us"&gt;Q8. How do results differ in economies with high creative destruction costs vs. the U.S.?&lt;/h3&gt;
&lt;p&gt;A: When creative destruction costs (χ̃) are set 80 times higher than the U.S. baseline, the initial equilibrium has much lower creative destruction: R&amp;amp;D-to-sales ratio is 1.39% (vs. 4.58% in U.S.), creative destruction R&amp;amp;D intensity is 8.6% (vs. 63.9%), average number of products is 1.0 (vs. 2.3), and average domestic productivity growth is 1.4% (vs. 1.9%). Under the same competition shock, total R&amp;amp;D actually rises by 1.0% in this high-CD-cost economy (because own-innovation increases more than creative destruction falls, given the already low baseline of creative destruction), in contrast to the −1.6% in the U.S. However, domestic growth still falls 9.7% even in this economy, driven by reductions in creative destruction by incumbents and startups combined with a decline in the mass of domestic incumbents. This result holds even with a fixed firm mass (Table E5), confirming the mechanism is not solely due to entry/exit dynamics.&lt;/p&gt;
&lt;h3 id="q9-what-is-the-technological-barrier-effects-quantitative-contribution-to-the-decline-in-creative-destruction"&gt;Q9. What is the technological barrier effect&amp;rsquo;s quantitative contribution to the decline in creative destruction?&lt;/h3&gt;
&lt;p&gt;A: In the U.S. counterfactual (Table 8 and associated decomposition), 17.0% of the total change in the aggregate creative destruction arrival rate x and 15.0% of the total change in startup creative destruction x_e are attributable specifically to the technological barrier effect — that is, to the shift in the technology gap distribution µ(Δℓ) holding all else equal. The conditional takeover probability x^{takeover} declines from 73.2% to 73.0%. The density at Δ₁ (the easiest gap to overcome) falls 0.4%, while densities at Δ₃ and Δ₄ rise 1.1% and 1.4% respectively, driven by increased creative destruction by outside firms and intensified own-innovation by incumbents.&lt;/p&gt;
&lt;h3 id="q10-what-are-the-policy-implications-the-paper-draws-from-its-framework"&gt;Q10. What are the policy implications the paper draws from its framework?&lt;/h3&gt;
&lt;p&gt;A: The paper argues that policies evaluating innovation should account for composition, not just aggregate R&amp;amp;D levels or patent counts. Increased overall innovation driven by defensive own-innovation contributes less to economic growth than creative destruction and restricts firm entry — so it is less beneficial than it appears. In low-creativity economies (e.g., European economies with high regulatory barriers to creative destruction), increased foreign competition may raise aggregate R&amp;amp;D while still lowering domestic growth, misleading policymakers who track only total innovation spending. The model also suggests that the mixed empirical findings in the competition-innovation literature (Aghion et al. 2005; Bloom et al. 2016; Autor et al. 2020) can be reconciled by accounting for compositional shifts: the net effect of competition on total innovation is ambiguous because it raises own-innovation for technologically advantaged firms while reducing creative destruction for all firms.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Imperfect Technology Spillovers:&lt;/strong&gt; The novel friction introduced in this paper, modeled as lagged learning: firms attempting creative destruction can only access the one-period-lagged technology of the target product market (q_{j,t−1}), while the incumbent product owner observes and can improve from the current frontier (q_{j,t}). This asymmetry creates a persistent technological advantage for incumbents and enables strategic defensive innovation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Own-Innovation:&lt;/strong&gt; R&amp;amp;D investment by a firm to improve the quality of its existing product lines. Successful own-innovation raises product quality by a step size λ &amp;gt; 1. Own-innovation does not require learning others&amp;rsquo; technology and, in the model, constitutes the incumbents&amp;rsquo; defensive margin against creative destruction. At the aggregate level, it contributes more to total growth than creative destruction because it succeeds more frequently, but per successful event it contributes less (λ &amp;lt; η).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Creative Destruction:&lt;/strong&gt; R&amp;amp;D investment enabling a firm to enter a new product market by displacing the incumbent. Successful creative destruction improves the lagged quality of the target product by a step size η &amp;gt; λ, where λ² &amp;gt; η &amp;gt; λ. It requires learning the incumbent&amp;rsquo;s one-period-lagged technology, takes longer to develop (evidenced empirically by longer backward citation gaps), and contributes more to firm growth and product expansion per event than own-innovation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Technology Gap (Δ):&lt;/strong&gt; The ratio of a product&amp;rsquo;s current-period technology to its previous-period technology (Δ_{j,t} = q_{j,t}/q_{j,t−1}). This gap summarizes the technological advantage the incumbent holds in a product market under imperfect spillovers. Four values are possible in equilibrium: Δ₁ = 1, Δ₂ = λ, Δ₃ = η, Δ₄ = η/λ. The gap determines both the incumbent&amp;rsquo;s own-innovation incentive and the rival&amp;rsquo;s probability of successfully completing a product takeover conditional on creative destruction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Market-Protection Effect:&lt;/strong&gt; The mechanism by which incumbents with a technological advantage (Δ &amp;gt; 1) increase own-innovation in response to heightened competitive pressure (an increase in the aggregate creative destruction arrival rate x). This effect is maximized for products with Δ₂ = λ and positive but diminishing for Δ₃. It is absent for Δ₁ = 1 products (where own-innovation cannot prevent displacement) and is formally distinct from the escape-competition effect in step-by-step innovation models, which applies only to neck-and-neck single-product firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Technological Barrier Effect:&lt;/strong&gt; The reduction in rivals&amp;rsquo; incentive for creative destruction caused by an increase in the average technology gap across the economy&amp;rsquo;s product lines. When incumbents intensify own-innovation and/or when outside creative destruction increases, the distribution of technology gaps shifts toward higher Δ values, reducing the conditional probability that a rival successfully takes over any given product market. This feedback mechanism endogenously suppresses creative destruction and firm entry beyond what the Schumpeterian effect alone would predict.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Self-Citation Ratio:&lt;/strong&gt; The share of a patent&amp;rsquo;s backward citations that cite patents previously owned by the same firm. Used in the paper as a continuous proxy for the likelihood that a patent represents own-innovation vs. creative destruction: a ratio of 1 (100% self-citations) implies 100% probability of own-innovation; a ratio of 0 implies 100% probability of creative destruction. This measure follows Akcigit and Kerr (2018) and is validated in the paper against learning time, quality, and firm growth outcomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;NTR Gap (Trade Policy Uncertainty Shock):&lt;/strong&gt; The industry-level difference between non-NTR (column 2) and NTR (column 1) U.S. tariff rates in 1999, used as an instrument for the exogenous increase in Chinese competitive pressure following China&amp;rsquo;s WTO accession and the U.S. granting of Permanent Normal Trade Relations (PNTR) in 2002. Industries with larger NTR gaps experienced a greater reduction in trade policy uncertainty and thus a larger increase in competitive pressure from foreign firms.&lt;/p&gt;</description></item><item><title>Homeownership, Polarization, and Inequality</title><link>https://macropaperwarehouse.com/papers/homeownership-polarization-and-inequality/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/homeownership-polarization-and-inequality/</guid><description>&lt;p&gt;This paper asks why job polarization and income inequality are higher in large U.S. cities, and proposes a novel housing-market mechanism that operates independently of — but interacts with — the skill-biased technical change (SBTC) explanations dominant in the existing literature.&lt;/p&gt;
&lt;p&gt;The core argument is that large cities have experienced faster growth in house prices relative to both wages (price-wage ratio) and rents (price-rent ratio) since 1980. This excess price growth has priced middle-income households out of homeownership in expensive cities. Because low-income households cannot afford to own anywhere and high-income households can afford to own everywhere, it is specifically middle-income (middle-skilled) households whose location choice becomes entangled with their tenure choice. These households increasingly sort toward smaller, more affordable cities where they can purchase a home. This selective out-migration hollows out the middle of the income distribution in large cities, producing greater employment polarization and income inequality there.&lt;/p&gt;
&lt;p&gt;Empirically, the paper uses Census and ACS data from 1980 to 2019 covering 465 commuting zones (CZs). Polarization is measured following Autor and Dorn (2013) by assigning 3-digit occupations to income percentiles fixed at 1980 levels; inequality is measured by the Gini coefficient and variance of log annual wages. Housing costs are captured by hedonic price and rent indices and three derived ratios. OLS and IV results (instrumented using the interaction of land unavailability and long-run changes in real interest rates) show that doubling of prices is associated with a 1 percentage point decline in the middle-skilled employment share; doubling of the price-rent ratio is associated with an 11.3 percentage point decline; doubling of the price-wage ratio with a 5.3 percentage point decline. Inequality follows the same pattern: doubling prices raises 100x the variance of log wages by 2.3 points; doubling the price-rent ratio raises it by 11.7 points; doubling the price-wage ratio by 7.7 points.&lt;/p&gt;
&lt;p&gt;The migration mechanism is documented using 2001–2019 CPS ASEC data, which — uniquely among available sources — reports reasons for moving. A doubling of the price index, price-wage ratio, or price-rent ratio in the origin state relative to the destination raises the probability that a middle-income (2nd–4th quintile) household moves for housing-related reasons by approximately 5–10 percentage points in absolute terms, implying a 50–80% relative increase compared with low- or high-income households making a housing-related move.&lt;/p&gt;
&lt;p&gt;The theoretical framework extends the standard spatial equilibrium (Rosen-Roback) model with two additions: skill heterogeneity and housing tenure choice. Households face a minimum house size constraint and a payment-to-income (PTI) constraint (calibrated at lambda = 0.308). These constraints create distinct skill thresholds for homeownership that vary by city; the interaction between location and tenure choices applies only to middle-skilled households who can afford ownership in cheap but not expensive cities.&lt;/p&gt;
&lt;p&gt;In the quantitative model, calibrated separately for 1980 and 2019 with two locations (top 30 CZs vs. the rest), counterfactual experiments show that holding price-wage ratios at their 1980 levels reduces the excess polarization gap between large and small CZs by 93% and the excess inequality gap by 40%. Holding price-rent ratios constant reduces the polarization gap by 96% and the inequality gap by 27%. By contrast, shutting down SBTC entirely reduces the polarization gap by only 54% and the inequality gap by 73%. These results establish that while SBTC is an important driver, its effect on polarization and inequality is substantially amplified by faster house price growth in large cities; without the housing affordability channel, the effect of SBTC on disproportionate polarization would be 63–81% smaller and on the inequality gap 18–36% smaller.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s central research question?
A: The paper asks why job polarization and income inequality are systematically higher in large U.S. cities than in small ones. Prior literature attributed this to skill-biased technical change, external labor demand shocks, or IT-driven displacement of routine jobs; this paper proposes a complementary, housing-market-based explanation that does not rely on features of the production technology.&lt;/p&gt;
&lt;p&gt;Q: What is the core mechanism linking house prices to polarization?
A: When price-wage and price-rent ratios are higher in large cities, middle-income households face binding minimum-size and payment-to-income constraints that prevent them from owning a home there but not in cheaper cities. Because homeownership carries financial advantages, these households sort toward smaller, more affordable cities. Low-income households cannot afford ownership anywhere and high-income households can afford it anywhere, so only the middle group&amp;rsquo;s location choice is distorted by tenure considerations. This selective out-migration hollows out the middle of the income distribution in expensive large cities.&lt;/p&gt;
&lt;p&gt;Q: What empirical patterns in CZ-level data motivate the paper?
A: Doubling CZ size is associated with a 1.9 percentage point greater fall in the middle-skilled employment share and a 2.7 point higher growth in 100x the variance of log wages from 1980 to 2019. Larger CZs also experienced 3.4% higher price growth, 3.1% higher price-wage ratio growth, and a 10% greater increase in price-rent ratios. These associations persist after controlling for initial CZ size and other characteristics.&lt;/p&gt;
&lt;p&gt;Q: What do the OLS and IV results show about house prices and polarization?
A: A doubling of house prices is associated with a 1 percentage point decline in the middle-skilled share; a doubling of the price-rent ratio with an 11.3 percentage point decline; and a doubling of the price-wage ratio with a 5.3 percentage point decline. IV results using the interaction of land unavailability and the change in real interest rates as an instrument confirm the negative relationship remains statistically significant, suggesting a causal interpretation is plausible.&lt;/p&gt;
&lt;p&gt;Q: What do the OLS and IV results show about house prices and income inequality?
A: A doubling of prices is associated with a 2.3 point increase in 100x the variance of log wages; a doubling of the price-rent ratio with an 11.7 point increase; and a doubling of the price-wage ratio with a 7.7 point increase. IV results suggest a causal relationship between price growth and income inequality at the CZ level.&lt;/p&gt;
&lt;p&gt;Q: What evidence does the paper provide for the migration mechanism?
A: Using 2001–2019 CPS ASEC data (which reports stated reasons for moving, unlike the ACS), the paper estimates logit regressions of interstate migration for housing-related reasons. A doubling of the price index in the origin state relative to the destination raises the probability of a housing-related move for middle-income (2nd–4th quintile) households by 5–6 percentage points; a doubling of the price-wage ratio raises it by 6–7 percentage points; and a doubling of the price-rent ratio raises it by 7–10 percentage points. These effects imply a 50–80% relative increase in housing-related migration probability for the middle quintiles compared with the bottom or top quintile. Housing-related movers constitute over 12% of all interstate migrants in the sample.&lt;/p&gt;
&lt;p&gt;Q: What is the key finding about homeownership rates?
A: There is no statistically significant relationship between the change in homeownership rates and the growth in prices, price-rent, or price-wage ratios from 1980 to 2019. This is consistent with the model&amp;rsquo;s mechanism, in which middle-income households who cannot afford ownership in large cities move away rather than simply switching to renting there — so aggregate local ownership rates need not fall.&lt;/p&gt;
&lt;p&gt;Q: How does the theoretical model generate the polarization result?
A: The model extends the Rosen-Roback spatial equilibrium framework with skill heterogeneity and housing tenure choice. Two skill thresholds — one for minimum-size-constrained ownership and one for unconstrained ownership — interact with the price-wage and price-rent ratios of each city. Proposition 1 proves that a city with higher price-wage and price-rent ratios will have a lower middle-skilled share, because middle-skilled workers (those who can afford to own in cheap but not expensive cities) are drawn to cheaper locations. Proposition 2 shows that in a world with only renters or only owners, skill shares would be identical across cities regardless of price differences — the polarization result requires heterogeneity in tenure choice.&lt;/p&gt;
&lt;p&gt;Q: What does the no-SBTC counterfactual show?
A: Holding the parameters governing local returns to skills at their 1980 levels (shutting down skill-biased technical change) reduces the difference in the decline in the middle-skilled share between large and small CZs by 54% and the gap in the increase in the variance of log wages by 73%. This is broadly consistent with prior literature attributing the bulk of disproportionate polarization and inequality in big cities to SBTC.&lt;/p&gt;
&lt;p&gt;Q: What do the constant price-ratio counterfactuals show?
A: When price-wage ratios are held at 1980 levels (but SBTC is allowed to operate), the excess polarization gap between large and small CZs falls by 93% and the excess inequality gap by 40%. When price-rent ratios are held at 1980 levels, the polarization gap falls by 96% and the inequality gap by 27%. When both are held constant simultaneously, the polarization gap falls by 89% and the inequality gap by 27%. These results show that the effect of SBTC on polarization would be 63–81% smaller in the absence of the housing affordability amplification channel.&lt;/p&gt;
&lt;p&gt;Q: Who are the largest losers from rising price-wage ratios in large cities?
A: The counterfactual welfare analysis identifies middle-skilled workers with skill levels between approximately 0.29 and 0.80 as the primary losers. In the counterfactual with fixed price-wage ratios, workers with skills from 0.29 to 0.57 who previously could not afford ownership in large cities are now able to own there, and those with skills from 0.57 to 0.80 spend a smaller share of income on housing. This group either lost homeownership opportunities or was induced to move to less productive CZs by the actual price growth that occurred.&lt;/p&gt;
&lt;p&gt;Q: How is the quantitative model calibrated and structured?
A: The model is calibrated separately for 1980 and 2019 as two stationary spatial equilibria. It features two locations (the top 30 CZs, which account for 49.3% of employment, and the remaining CZs). Key parameters include a Frechet elasticity of 6.1, an agglomeration externality of 0.04, a PTI constraint of 0.308, and an annual discount factor of 0.96. Land shares differ between large and small CZs (0.3965 vs. 0.2239). The model finds that the price-rent ratio was relatively stable in large cities but fell in small ones, while the price-wage ratio increased much more in large CZs — both indicators point to purchasing a home becoming relatively more expensive in large CZs.&lt;/p&gt;
&lt;p&gt;Q: What are the paper&amp;rsquo;s policy implications?
A: Zoning reforms and other policies that increase housing supply in large, unaffordable cities could produce a more efficient spatial allocation of labor, greater aggregate productivity, and more economically diverse — less polarized and less unequal — cities, while also reducing the wealth gap between owners and renters. Policies that promote homeownership by reducing the cost of owning without raising housing supply may reduce local polarization and inequality but could lower aggregate output and do not necessarily increase homeownership rates.&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to existing explanations for city-level polarization?
A: The paper&amp;rsquo;s housing-market mechanism is explicitly complementary to SBTC-based explanations (Baum-Snow, Freedman, and Pavan, 2018; Cerina et al., 2023), external demand shock explanations (Davis, Mengus, and Michalski, 2020), and IT-displacement explanations (Eeckhout, Hedtrich, and Pinheiro, 2024). The paper&amp;rsquo;s key added contribution is that even if SBTC were the primary driver of disproportionate polarization, its measured effect would be substantially smaller in the absence of faster house price growth in large cities — the housing market amplifies rather than replaces the technology channel.&lt;/p&gt;
&lt;p&gt;Job polarization (city-level): The hollowing out of middle-income employment shares in a commuting zone, measured as the change in the share of workers in occupations assigned to the 21st–80th income percentile (using the 1980 occupation-to-percentile mapping fixed over time). In this paper, polarization is greater in cities where price-wage and price-rent ratios grew faster, attributed to selective out-migration of middle-skilled households.&lt;/p&gt;
&lt;p&gt;Price-wage ratio: The ratio of hedonic house prices to median annual wages in a commuting zone, constructed from Census and ACS data. A higher price-wage ratio tightens the payment-to-income constraint on potential homebuyers and is the primary driver of the skill threshold for homeownership in the model.&lt;/p&gt;
&lt;p&gt;Price-rent ratio: The ratio of hedonic house prices to rents in a commuting zone. In the model, a higher price-rent ratio reduces the financial advantage of owning over renting, raising the skill threshold at which ownership becomes optimal. The paper treats price-rent and price-wage ratios as distinct channels that both independently amplify polarization.&lt;/p&gt;
&lt;p&gt;Housing tenure choice: The household decision to own or rent, modeled as a discrete choice made at the start of life that interacts with location choice. Ownership requires satisfying both a minimum house size constraint and a payment-to-income (PTI) constraint (lambda = 0.308). The interaction between tenure and location choices is the paper&amp;rsquo;s key model innovation; it exists only for middle-skilled workers whose income is sufficient for ownership in cheap but not expensive cities.&lt;/p&gt;
&lt;p&gt;Skill threshold for homeownership (s*_i): The minimum skill level at which a worker in city i chooses to own rather than rent, defined by Lemma 2. This threshold is decreasing in local labor productivity and increasing in price-wage and price-rent ratios. Workers with skill below s*_i in all cities always rent; those with skill above s*_i in all cities always own; those in between face city-dependent tenure choice that distorts their location decision.&lt;/p&gt;
&lt;p&gt;Skill-biased technical change (SBTC): In the paper&amp;rsquo;s quantitative model, SBTC is represented by faster growth in the skill dispersion parameter (alpha_it) in large CZs, reflecting differential productivity growth concentrated at the top of the skill distribution. The paper finds SBTC accounts for 54% of the polarization gap and 73% of the inequality gap in its counterfactual, but argues its effect is amplified 4–5x by the housing affordability channel.&lt;/p&gt;
&lt;p&gt;Payment-to-income (PTI) constraint: The constraint that a homebuyer cannot spend more than a fraction lambda (calibrated at 0.308) of annual labor earnings on the annual housing payment (user cost times price times quantity). This constraint, together with the minimum house size, determines the income threshold for ownership and makes location and tenure choices interdependent for middle-skilled workers.&lt;/p&gt;</description></item><item><title>How Do You Identify a Good Manager?</title><link>https://macropaperwarehouse.com/papers/how-do-you-identify-a-good-manager/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/how-do-you-identify-a-good-manager/</guid><description>&lt;p&gt;This paper develops a novel experimental method to identify the causal contribution of managers to team performance, and uses it to evaluate which characteristics predict managerial effectiveness and how manager selection mechanisms affect organizational outcomes.&lt;/p&gt;
&lt;p&gt;The core identification challenge is that managers are not randomly assigned to teams in the field, and field managers are a highly non-random sample, making it difficult to infer which traits genuinely predict managerial performance. The authors address this by repeatedly randomly assigning managers to multiple teams in a controlled laboratory experiment, then estimating each manager&amp;rsquo;s average causal contribution to group output after conditioning on group members&amp;rsquo; individual productive skills. The intuition is that a good manager is someone who consistently causes their team to produce more than the sum of their parts.&lt;/p&gt;
&lt;p&gt;The experiment was conducted at the University of Essex lab with 555 participants (46% female, mean age 25, ethnically diverse) forming 728 groups of three across four rounds. Each group consisted of one manager and two workers who performed a Collaborative Production Task requiring coordination across three problem-solving modules (numerical, spatial, and analytical reasoning). The team score was the minimum module score — a weakest-link structure making coordination essential. Prior to group testing, all participants completed individual assessments of task-specific skill, fluid intelligence (CFIT), emotional perceptiveness (Reading the Mind in the Eyes Test, RMET), economic decision-making skill (the Assignment Game, which measures resource allocation under comparative advantage), Big 5 personality, and demographic characteristics. Manager selection was randomly varied at the session level: in 20 sessions, the participant with the strongest preference for leadership became manager (self-promotion); in 19 sessions, managers were assigned by lottery.&lt;/p&gt;
&lt;p&gt;The main quantitative findings are as follows. First, there are large, stable, and statistically significant manager effects: a manager one standard deviation above average improves team performance by approximately 0.23 standard deviations (p = 0.04). This estimate is roughly 90% the size of the combined productive skill coefficient for the two workers (approximately 0.26 sd), indicating that a good manager is roughly twice as valuable as a good individual worker. Manager contributions predict out-of-sample group performance in a leave-one-out procedure (p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;Second, among randomly assigned managers, only two predictors significantly explain managerial performance: fluid intelligence (CFIT) and economic decision-making skill (Assignment Game scores), both significant at below the 1% level. Gender, age, and ethnicity do not predict managerial performance.&lt;/p&gt;
&lt;p&gt;Third, self-promoted managers perform substantially worse than lottery-assigned managers, by approximately 0.10 standard deviations — roughly equivalent to being assigned a manager with fluid intelligence one full standard deviation below average. The mechanism is overconfidence: people who strongly prefer management roles are significantly more overconfident (d = 0.41 sd, p &amp;lt; 0.01) and exhibit a strong negative correlation between self-reported social skills and actual emotional perceptiveness on the RMET (r = -0.37, p &amp;lt; 0.001). Among self-promoted managers, self-reported extraversion and political skill are negatively correlated with managerial performance (rho = -0.24 and -0.26, p &amp;lt; 0.05); no such negative relationship appears among lottery managers.&lt;/p&gt;
&lt;p&gt;Fourth, selecting managers on economic decision-making skill rather than self-promotion improves average manager quality by 0.6 standard deviations — equivalent to replacing an average worker in every group with a worker at the 99th percentile of individual productivity.&lt;/p&gt;
&lt;p&gt;The three mechanisms through which good managers improve performance are: (1) monitoring — good managers (1 sd above average) cut monitoring errors from 16% to 8%; (2) optimal task allocation according to comparative advantage — groups with optimally assigned workers score 0.52 sd higher (p &amp;lt; 0.01); (3) worker motivation in late-stage effort — teams led by a 1-sd-above-average manager solve 0.6 more problems in the final two minutes versus only 0.3 more in the first two minutes.&lt;/p&gt;
&lt;p&gt;The experiment was conducted in a university lab in the UK, and the sample skews toward graduate students with limited work experience. Generalizability to field settings is supported by prior evidence that peer productivity spillover experiments yield similar magnitudes in lab versus field settings, and that the estimated manager effects are similar to Lazear et al. (2015) estimates from a large employer dataset.&lt;/p&gt;
&lt;p&gt;Q: What is the core methodological innovation of this paper?
A: The paper requires repeated random assignment of managers to multiple teams, combined with controls for individual productive skill measured prior to group work. This allows identification of each manager&amp;rsquo;s average causal contribution to group output, rather than confounding management quality with team composition or individual worker ability. The key estimand is the standard deviation of individual manager effects (sigma_alpha), interpreted as the impact of having a manager one standard deviation above average.&lt;/p&gt;
&lt;p&gt;Q: How large is the estimated manager effect, and how does it compare to worker effects?
A: A manager one standard deviation above average improves team performance by approximately 0.23 standard deviations (p = 0.04 by randomization inference). This is roughly 90% the size of the combined productive skill effect of both workers together (approximately 0.26 sd), implying a good manager is nearly twice as valuable as a good individual worker. Without conditioning on production skills, the manager effect rises to 0.29 sd.&lt;/p&gt;
&lt;p&gt;Q: What characteristics predict managerial performance among randomly assigned managers?
A: Only two measures predict managerial performance in the lottery arm: fluid intelligence (CFIT) and economic decision-making skill (scores on the Assignment Game), both significant at below the 1% level. These predictors are robust to controls for demographics, education, work experience, emotional perceptiveness, and personality traits. Gender, age, and ethnicity do not predict managerial performance.&lt;/p&gt;
&lt;p&gt;Q: What is the &amp;ldquo;Assignment Game&amp;rdquo; and why is it a strong predictor?
A: The Assignment Game (Caplin et al., 2024) places participants in a simulated managerial role where they must assign fictional workers to tasks. Performing well requires understanding comparative advantage intuitively, managing an attentionally demanding numerical environment, and avoiding biases such as anchoring. The paper argues its strong predictive power reflects that good managers excel at allocating workers according to comparative advantage — which the experiment directly identifies as a key mechanism.&lt;/p&gt;
&lt;p&gt;Q: How do self-promoted managers perform relative to lottery-assigned managers?
A: Self-promoted managers perform approximately 0.10 standard deviations below lottery managers, and this gap is robust across model specifications. The performance deficit is roughly equivalent to being assigned a manager whose fluid intelligence is one full standard deviation below average. This finding implies that common organizational practice of selecting managers partly via self-nomination actively reduces team productivity.&lt;/p&gt;
&lt;p&gt;Q: Why do self-promoted managers underperform?
A: The paper attributes underperformance primarily to overconfidence. People strongly preferring management roles are significantly more overconfident than those without strong preferences (d = 0.41 sd, p &amp;lt; 0.01). Self-promoted managers specifically overestimate their social skills: among them, self-reported people skills are strongly negatively correlated with actual emotional perceptiveness on the RMET (r = -0.37, p &amp;lt; 0.001), and self-reported extraversion and political skill are negatively correlated with managerial performance (rho = -0.24 and -0.26, p &amp;lt; 0.05). None of these negative relationships appear among lottery managers.&lt;/p&gt;
&lt;p&gt;Q: Who wants to be a manager, and does it differ by gender?
A: The three variables most strongly correlated with wanting to be in charge are extraversion, risk appetite, and being male. The relationship between high extraversion and preference for management is driven largely by men. Women are much less likely to nominate themselves for leadership roles despite being equally or more effective on average — a finding consistent with broader experimental evidence on gender and leadership self-selection.&lt;/p&gt;
&lt;p&gt;Q: How large are the potential gains from skill-based manager selection?
A: Compared to self-promotion, selecting managers based on economic decision-making skill yields managers who are 0.6 standard deviations better in terms of estimated manager effects. In terms of group performance, this is equivalent to replacing an average worker in every group with a worker at the 99th percentile of individual productivity. Selecting on both economic decision-making and fluid intelligence outperforms random assignment, selection on social skills, or selection on worker task performance (the Peter Principle).&lt;/p&gt;
&lt;p&gt;Q: What are the three mechanisms through which good managers improve team performance?
A: First, monitoring: good managers (1 sd above average) reduce monitoring errors — defined as having a worker on a module substantially above the minimum score at task end — from 16% to 8% (bivariate correlation with manager performance = -0.40, p &amp;lt; 0.001). Second, optimal task allocation: the probability of finding the optimal comparative-advantage-based assignment is positively associated with manager performance (rho = 0.19, p &amp;lt; 0.01), and groups with always-optimal starting assignments score 0.52 sd higher than those with never-optimal assignments (p &amp;lt; 0.01). Third, worker motivation: team performance in the final two-minute period is about 50% more influential for overall outcomes than the first two minutes (p = 0.038), and 1-sd-above-average managers generate 0.6 more problems solved in the final period versus 0.3 in the first, consistent with differential motivational effects emerging over time.&lt;/p&gt;
&lt;p&gt;Q: What is the Peter Principle, and how does this paper relate to it?
A: The Peter Principle refers to the practice of promoting employees based on their performance as line workers rather than their suitability for management — promoting individuals to their level of incompetence. Benson et al. (2019) document this selection pattern empirically. This paper shows that selecting managers on worker task skill is inferior to selecting on economic decision-making skill or fluid intelligence, confirming that task skill is not the right criterion for manager selection even if it predicts individual worker output.&lt;/p&gt;
&lt;p&gt;Q: How does the paper validate that manager effects are real and not noise?
A: The paper uses randomization inference with 5,000 simulated allocations to compute p-values, obtaining p = 0.04 for the main manager effect. Robustness checks include controlling for pre-existing social relationships, manager risk appetite, variance of individual scores, and granular skill measures — all yielding estimates near 0.22 sd. A leave-one-out out-of-sample prediction test confirms manager contributions significantly predict held-out group performance (p &amp;lt; 0.01), while the analogous worker out-of-sample estimate is less than half the magnitude and not statistically significant.&lt;/p&gt;
&lt;p&gt;Q: What are the scope conditions on the experimental results?
A: The experiment is conducted in a university lab in the UK with graduate students averaging 25 years of age and two years of work experience, limiting direct generalizability to experienced workers or senior management. The task lasts approximately 15 minutes, which may not capture longer-run managerial dynamics. Compensation equalized average earnings between managers and workers, which differs from most real-world settings. The authors note their effect-size estimates closely match Lazear et al. (2015) from a large employer, and that Herbst and Mas (2015) find lab peer-productivity experiments generalize to the field.&lt;/p&gt;
&lt;p&gt;Manager Effect (sigma_alpha): The standard deviation of individual managers&amp;rsquo; average causal contributions to group performance, estimated via repeated random assignment and conditioning on individual productive skill. Represents the impact of having a manager one standard deviation above average, estimated at approximately 0.23 standard deviations of group output.&lt;/p&gt;
&lt;p&gt;Collaborative Production Task: A novel lab group task in which a manager and two workers solve problems across three modules (numerical, spatial, analytical reasoning), with team score defined as the minimum module score (weakest-link structure). Managers are responsible for worker assignment, monitoring, and motivation; workers face no financial performance incentives.&lt;/p&gt;
&lt;p&gt;Economic Decision-Making Skill: Defined by Caplin et al. (2024) as the ability to make good resource allocation decisions, assessed via the Assignment Game in which participants must optimally assign workers to tasks under comparative advantage. The single strongest predictor of managerial performance in the lottery arm.&lt;/p&gt;
&lt;p&gt;Monitoring Failure: Defined in the paper as having any group member working on a module at task end whose score is substantially greater (e.g., 10 points higher) than the minimum module score — meaning the worker&amp;rsquo;s effort is not contributing to the group score. Occurs in 16% of groups overall; managers one sd above average reduce this to 8%.&lt;/p&gt;
&lt;p&gt;Self-Promotion (as selection mechanism): A treatment condition in which the participant with the strongest stated preference for being manager (on a 1-10 scale) is assigned the managerial role. Contrasted with lottery assignment; self-promoted managers perform approximately 0.10 sd worse than lottery managers.&lt;/p&gt;
&lt;p&gt;Overconfidence (in managerial context): The gap between self-assessed skill (particularly social/interpersonal skill) and objectively measured skill (e.g., RMET score). Self-promoters are significantly more overconfident (d = 0.41 sd), and overconfidence is strongly negatively correlated with actual emotional perceptiveness (r = -0.33, p &amp;lt; 0.001).&lt;/p&gt;
&lt;p&gt;Comparative Advantage Allocation: The practice of assigning each worker to the module in which they have the highest relative (not absolute) performance advantage. Captured via whether a manager selects the optimal one-to-one assignment given pre-measured individual module scores; groups with always-optimal allocation score 0.52 sd higher.&lt;/p&gt;</description></item><item><title>Ideas Have Consequences: The Impact of Law and Economics on American Justice</title><link>https://macropaperwarehouse.com/papers/ideas-have-consequences-the-impact-of-law-and-economics-on-american-justice/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/ideas-have-consequences-the-impact-of-law-and-economics-on-american-justice/</guid><description>&lt;p&gt;This paper quantifies the effect of the Manne Economics Institute for Federal Judges — an intensive two-week economics training program run by the Law and Economics Center from 1976 to 1998 — on the decision-making of U.S. federal judges. The research question is whether exposure to a coherent set of economic ideas can directly shift the policy decisions of sitting policymakers, as distinct from effects operating through partisan affiliation or formal legal rules.&lt;/p&gt;
&lt;p&gt;The program trained nearly half of all federal judges over its two decades of operation. By 1990, forty percent of federal judges had attended; by the late 1990s, roughly half of circuit court cases had a Manne-trained judge on the panel. Instructors included Milton Friedman, Armen Alchian, Harold Demsetz, Martin Feldstein, Paul Samuelson, and Orley Ashenfelter, covering supply-and-demand theory, the Coase Theorem, externalities, property rights, and criminal deterrence following Becker (1968). The program was funded by pro-business foundations and had a recognized conservative-leaning orientation, though it invited both Republican- and Democrat-appointed judges and was popular across party lines.&lt;/p&gt;
&lt;p&gt;The identification strategy is a differences-in-differences design exploiting staggered attendance timing. Because the program was oversubscribed and admitted judges on a first-come-first-served basis — with applicants bumped to later cohorts when capacity was reached — the timing of attendance within the ever-attending population has a quasi-random component. The preferred control group consists exclusively of other ever-attending judges who had not yet attended, rather than never-attenders, because never-attenders differ systematically on observables and show a pre-existing positive trend in economics language use, likely from ambient diffusion through clerks, law schools, and organizations such as the Federalist Society. Judge fixed effects and circuit-by-year (or courthouse-by-year) fixed effects absorb time-invariant judge characteristics and court-level time trends. Elastic-net-selected covariates predicting attendance timing, fully interacted with year fixed effects, are added as robustness controls. Standard errors are clustered by judge.&lt;/p&gt;
&lt;p&gt;The data cover approximately 200,000 published circuit court opinions (1970–2005) from Bloomberg Law, a 5% random sample of circuit cases hand-coded for ideological direction from the Songer-Auburn database, machine-coded regulatory agency outcomes, a newly collected antitrust case dataset, and approximately 1.03 million district court criminal sentencing records (1992–2003) from TRAC.&lt;/p&gt;
&lt;p&gt;The main findings are as follows. First, after attending the Manne program, judges increase their use of economics language in written opinions by approximately one-third of a standard deviation, measured via word-embedding similarity to an economics lexicon; this effect is statistically significant in the short-run event-study window but does not persist over the full career. Second, Manne attendance raises conservative voting in economics-related cases (labor and regulation) by approximately one-quarter of a standard deviation — corresponding to judges deciding in the conservative direction about 20 percent more often relative to the mean — with no significant effect on non-economics cases; the interaction effect is robust across specifications including never-attenders. Third, post-Manne judges vote more frequently against federal labor and environmental regulatory agencies, a result that is statistically significant and economically meaningful with no detectable pre-trends. Fourth, post-Manne judges impose longer and more frequent prison sentences, with no increase in sentencing harshness for drug crimes — consistent with Manne instructors having explicitly advocated drug legalization — and with the harshness gap between Manne and non-Manne judges widening after the 2005 Booker decision expanded judicial sentencing discretion. Fifth, there is some evidence of increased voting against antitrust enforcement, though this result is more sensitive to specification. Persuasion rates computed following DellaVigna and Gentzkow (2010) are slightly larger than those estimated for partisan media interventions such as Fox News and are closest to the effect of a 10-week Washington Post subscription on Democratic governor vote share. Neither the legalist model (judges follow statutes mechanically) nor the attitudinal model (judges follow party affiliation) can explain these within-judge, within-party shifts.&lt;/p&gt;
&lt;p&gt;Q: What is the central identification challenge and how do the authors address it?
A: The key threat is that judges who chose to attend the Manne program — or who attended at a particular time — may differ systematically from non-attenders in ways correlated with their decision trajectories. The authors address this in two steps. First, they restrict the control group to other ever-attending judges who had not yet attended, exploiting the first-come-first-served oversubscription rule that created quasi-random variation in timing among applicants. Second, they use judge fixed effects plus circuit-by-year fixed effects, and add elastic-net-selected biographical covariates (e.g., birth cohort indicators) interacted with year fixed effects as a robustness check. Republican affiliation — the most salient ideological predictor of attendance — is not a statistically significant predictor of attendance timing, supporting the exclusion restriction.&lt;/p&gt;
&lt;p&gt;Q: Why are never-attenders excluded from the preferred control group?
A: Never-attenders differ from attenders on observables including political party and show a positively trending use of economics language in their opinions even before any treatment, suggesting ambient diffusion of economics ideas through law clerks, law school curricula, and organizations such as the Federalist Society. Including never-attenders in the control group produces a near-zero coefficient on the language outcome, which the authors interpret as reflecting spillovers rather than a true null effect; the coefficient on conservative voting in the interaction specification, however, remains positive and significant even when never-attenders are included.&lt;/p&gt;
&lt;p&gt;Q: What is the magnitude of the effect on economics language use?
A: The within-judge effect of Manne attendance on the word-embedding similarity between judicial opinions and an economics lexicon is approximately one-third of a standard deviation, statistically significant in the short-run event-study window (covering six years before and after attendance). The effect shrinks and becomes non-significant when the full career of Manne judges is examined (rather than just the event-study window), consistent with broad diffusion of economics language across the judiciary over time rather than a persistent individual-level treatment effect.&lt;/p&gt;
&lt;p&gt;Q: How large is the effect on conservative voting, and is it concentrated in particular case types?
A: Post-Manne attendance raises conservative voting in economics-related cases (labor and regulation) by approximately one-quarter of a standard deviation, corresponding to judges deciding in the conservative direction about 20 percent more often relative to the mean liberal-conservative decision rate. There is no statistically significant effect on non-economics cases. The interaction coefficient — the differential effect on economics versus non-economics cases — is positive and significant across all specifications including the full sample with never-attenders, making this the most robust directional result in the paper.&lt;/p&gt;
&lt;p&gt;Q: What is the effect on regulatory agency voting?
A: Post-Manne judges vote more frequently against federal labor agencies (National Labor Relations Board, OSHA, Department of Labor, Federal Labor Relations Authority, Office of Worker&amp;rsquo;s Compensation Programs) and the Environmental Protection Agency. The event study shows a positive and significant increase that persists across the event-study window with no detectable pre-trends. This result is robust to both the baseline specification and the elastic-net-controls specification.&lt;/p&gt;
&lt;p&gt;Q: What is the effect on criminal sentencing, and what heterogeneity is found?
A: Post-Manne judges impose both more frequent prison sentences and longer sentences, consistent with Becker&amp;rsquo;s deterrence framework taught in the program&amp;rsquo;s criminal law curriculum. The sentencing effects are absent for drug crimes, consistent with Manne instructors — including Milton Friedman — having explicitly advocated against the drug war and for drug legalization. The gap in sentencing harshness between Manne and non-Manne judges widens after the 2005 United States v. Booker decision, which made the Federal Sentencing Guidelines advisory rather than mandatory; this is consistent with the program having shaped latent judicial preferences that are expressed more fully when formal constraints are relaxed.&lt;/p&gt;
&lt;p&gt;Q: How do the persuasion rates compare to benchmark media studies?
A: The persuasion rates computed following DellaVigna and Gentzkow (2010) are slightly larger than those estimated for partisan media interventions such as Fox News (DellaVigna and Kaplan, 2007) and are closest to the persuasion rates implied by a 10-week subscription to the Washington Post on Democratic governor vote share (Gerber et al. 2009). The comparison contextualizes the Manne program as a moderately high-intensity ideational intervention relative to documented cases of political persuasion.&lt;/p&gt;
&lt;p&gt;Q: What do the results imply for theories of judicial behavior?
A: The findings are inconsistent with both the legalist/formalist model — under which judges apply statutes and precedent without regard to extra-legal factors, predicting zero effect — and the attitudinal model — under which judges simply follow partisan preferences, also predicting zero effect since the program attended judges of both parties. The within-judge, within-party shifts point to a third channel: judicial worldviews and economic ideas, independent of formal law and partisan affiliation, shape high-stakes precedent-setting decisions.&lt;/p&gt;
&lt;p&gt;Q: Can the authors distinguish between a pedagogical (informational) and an ideological persuasion mechanism?
A: They cannot definitively distinguish between the two. Both mechanisms predict increased economics language, more conservative rulings in economics cases, deregulatory voting, and harsher non-drug sentences. The drug-crime heterogeneity is somewhat more consistent with a nuanced pedagogical channel, since Manne instructors explicitly discussed drug legalization, but this pattern is also consistent with complex ideological effects. Evidence on decision quality (citation rates, judicial promotion) is mixed and not robust, providing no clean test of the informational mechanism.&lt;/p&gt;
&lt;p&gt;Q: What does the antitrust evidence show?
A: Post-Manne judges tend to vote against antitrust claimants (i.e., in favor of less antitrust enforcement), but this result is more sensitive to specification than the regulatory agency and sentencing results and is not always statistically significant across specifications. The authors treat it as suggestive rather than conclusive.&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to the literature on economics education and normative beliefs?
A: Prior work finds that economics students are less redistributive (Selten and Ockenfels 1998), view surge prices more favorably (Frey and Meier 2005), favor profit maximization (Rubinstein 2006), and that economics professors are less ideologically liberal than other social scientists (Jelveh et al. 2018). The present paper extends this literature by studying established professionals (judges) making high-stakes real-world decisions, and by documenting a direct policy impact rather than a change in survey responses or experimental choices.&lt;/p&gt;
&lt;p&gt;Q: What is the scope of the dataset and the program coverage?
A: The circuit court dataset covers approximately 200,000 published opinions from 1970 through 2005. The district court sentencing dataset covers approximately 1.03 million cases from 1992 through 2003 (event study sample). The Manne program ran from 1976 to 1998, with roughly twenty judges per cohort; by 1990 forty percent of federal judges had attended, and by the late 1990s roughly half of circuit court cases had a Manne-trained panelist. Biographical information comes from the Federal Judicial Center; program attendance lists come from Butler (1999) supplemented by FOIA-obtained annual reports.&lt;/p&gt;
&lt;p&gt;Manne Economics Institute for Federal Judges: An intensive two-week economics training program for sitting U.S. federal judges, run by the Law and Economics Center from 1976 to 1998, covering supply-and-demand theory, the Coase Theorem, externalities, property rights, deterrence theory, and related topics; funded by pro-business foundations; admitted judges on a first-come-first-served basis and trained nearly half of all federal judges over its operation.&lt;/p&gt;
&lt;p&gt;Word-embedding economics language measure: A continuous measure of how closely a judicial opinion&amp;rsquo;s vocabulary aligns with a lexicon of law-and-economics phrases, constructed using word2vec embeddings (Mikolov et al. 2013) trained on the corpus of judicial opinions; measures the semantic proximity of opinion text to the Ellickson (2000) economics lexicon in embedding space, capturing implicit and contextual use of economics reasoning rather than raw phrase counts.&lt;/p&gt;
&lt;p&gt;Deterrence theory (Becker model): The framework, drawn from Becker (1968), taught in the Manne program&amp;rsquo;s criminal law curriculum, which holds that optimal crime deterrence requires setting the expected penalty — the economic cost of punishment times the probability of detection — high enough to outweigh the expected benefits of crime; treated in the paper as the theoretical basis for predicting harsher sentencing among post-Manne judges, and contrasted with retribution- or rehabilitation-based sentencing rationales that dominated before its diffusion.&lt;/p&gt;
&lt;p&gt;Conservative judicial decision (economics cases): In the paper&amp;rsquo;s usage, a ruling against the liberal/pro-plaintiff position in a case involving labor or regulation, as hand-coded by the Songer-Auburn database; includes ruling against a labor agency, rejecting a regulatory claimant, or voting against antitrust enforcement; the paper finds Manne attendance shifts judges in this direction in economics cases but not in non-economics cases.&lt;/p&gt;
&lt;p&gt;First-come-first-served oversubscription: The admission rule of the Manne program during its oversubscribed heyday (from the second cohort in 1977 through the late 1980s), under which applicants who did not secure a spot were bumped to the next year&amp;rsquo;s cohort; the authors argue this rule generates quasi-random variation in the timing of attendance among ever-attending judges, conditional on applying, providing the identifying variation for the differences-in-differences design.&lt;/p&gt;
&lt;p&gt;Persuasion rate: A summary statistic, following DellaVigna and Gentzkow (2010), measuring the fraction of the &amp;ldquo;persuadable&amp;rdquo; population that is convinced by a treatment; used in the paper to benchmark the Manne program&amp;rsquo;s effect size against documented media persuasion interventions such as Fox News and Washington Post subscriptions.&lt;/p&gt;</description></item><item><title>Identifying Preference for Early Resolution from Asset Prices</title><link>https://macropaperwarehouse.com/papers/identifying-preference-for-early-resolution-from-asset-prices/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/identifying-preference-for-early-resolution-from-asset-prices/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper develops a revealed-preference theory that uses asset-market data to identify whether investors have a preference for early resolution of uncertainty (PER), a property of non-expected utility preferences that is distinct from risk aversion. The central theorem shows that, under a condition called generalized risk sensitivity (GRS), the representative agent prefers early resolution if and only if claims to future stock market volatility earn a positive premium during the period in which the informativeness of upcoming macroeconomic announcements is resolved — a window the authors call the Resolution of Information Quality (ROIQ) period. Using S&amp;amp;P 500 index option data from 1996 to 2019, the paper identifies the ROIQ period as the five weekdays before FOMC announcements, demonstrates that the inverse slope of the implied-volatility term structure (9-day/90-day VIX ratio) significantly predicts the informativeness of upcoming announcements, and finds a statistically significant positive ROIQ premium on synthetic variance claims (beta = 1.085, t = 2.44) and on at-the-money straddles (beta = 0.428, t = 2.25). The evidence supports Epstein-Zin recursive utility with the intertemporal elasticity of substitution exceeding the reciprocal of risk aversion, and hence is consistent with the Bansal-Yaron long-run risk framework. Crucially, this identification requires no parametric calibration of the full asset pricing model.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a published paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-preference-for-early-resolution-per-and-why-is-it-hard-to-identify"&gt;Q1. What is preference for early resolution (PER) and why is it hard to identify?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;PER means that an agent with a given distribution over future outcomes strictly prefers to learn the outcome sooner rather than later, as formalized by Kreps and Porteus (1978); under Epstein-Zin recursive utility, PER is equivalent to risk aversion exceeding the reciprocal of the IES (or IES &amp;gt; 1/risk aversion).&lt;/strong&gt; In standard applied asset pricing models with constant-elasticity recursive utility, PER is intertwined with risk aversion and the IES, so that the separate role of the timing of resolution is obscured. Existing papers either test joint implications of the full calibrated model (conflating PER with other preference properties) or use thought-experiment willingness-to-pay calculations without market-data grounding. The authors&amp;rsquo; goal is to provide a necessary and sufficient condition for PER directly from asset prices, independent of a fully specified model.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-role-of-generalized-risk-sensitivity-grs-in-the-identification-theorem"&gt;Q2. What is the role of Generalized Risk Sensitivity (GRS) in the identification theorem?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;GRS — the condition that the certainty-equivalent functional I is increasing in second-order stochastic dominance — provides the bridge between the unobservable ranking of utility levels across states and the observable ranking of marginal utilities (stochastic discount factors) across those states.&lt;/strong&gt; The authors prove that under GRS (Theorem 1), the vector of partial derivatives of I with respect to continuation utility is strictly negatively comonotone with the level of continuation utility: higher utility states have lower marginal utility. This inversion is what allows asset prices to reveal the ordering of utility levels. GRS itself is empirically supported by the well-documented fact that assets earn positive announcement premia around scheduled macroeconomic releases (Savor and Wilson, 2013).&lt;/p&gt;
&lt;h3 id="q3-how-does-the-main-theorem-theorem-2-identify-per-from-a-single-asset-class"&gt;Q3. How does the main theorem (Theorem 2) identify PER from a single asset class?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Theorem 2 establishes that, under strict GRS, the premium earned by any asset comonotone with the informativeness of upcoming macroeconomic announcements during the ROIQ period is strictly positive if and only if the agent has PER; a negative ROIQ premium would indicate preference for late resolution.&lt;/strong&gt; The intuition is that if the agent prefers early resolution, she assigns higher continuation utility to the early-resolution state (0E) than to the late-resolution state (0L); under strict GRS, higher continuation utility maps to lower marginal utility, meaning assets paying off more in the early-resolution state are negatively correlated with the SDF and therefore carry a positive risk premium. Claims to stock market return variance serve as the test asset because expected variance is high before informative announcements (early resolution) and low before uninformative ones (late resolution).&lt;/p&gt;
&lt;h3 id="q4-how-do-the-authors-operationalize-the-roiq-period-empirically"&gt;Q4. How do the authors operationalize the ROIQ period empirically?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The ROIQ period is identified as the five weekdays before FOMC announcements, during which market attention to the Fed (measured by RavenPack Fed-related news intensity) is significantly positively correlated with the change in the inverse slope of the implied-volatility term structure (coefficient = 1.076, t = 4.09), while no such correlation exists in the ten days 6–10 before or after the announcement.&lt;/strong&gt; This correlation arises because, during those five days, investors regularly update their expectations about whether the upcoming FOMC statement will be informative; more expected informativeness raises the demand for short-dated options (driving up the 9-day VIX relative to the 90-day VIX) and simultaneously raises Fed-related news coverage. Outside the ROIQ window, the two series are uncorrelated (coefficient = −0.242, t = −1.13 unconditionally), confirming that the window is the correct testing period.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-empirical-evidence-for-a-positive-roiq-premium-and-how-is-it-constructed"&gt;Q5. What is the empirical evidence for a positive ROIQ premium, and how is it constructed?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Synthetic variance claims constructed as option portfolios following Bakshi, Kapadia, and Madan (2003) earn a ROIQ premium (coefficient beta in the panel regression) of 1.085 percentage points per day (t = 2.44) above their average daily return; at-the-money straddles earn 0.428 pp/day (t = 2.25), both significantly positive.&lt;/strong&gt; The panel regression controls for maturity fixed effects (11 dummies for weeks to expiration), FOMC-day effects, and day-of-week effects. Crucially, the market itself earns approximately 8 basis points lower than average during the ROIQ period, and the market loading on variance claims does not increase during the ROIQ window (Table 5), ruling out an interpretation in which the premium simply reflects a higher market beta at announcement times.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-paper-rule-out-alternative-explanations-for-the-roiq-premium"&gt;Q6. How does the paper rule out alternative explanations for the ROIQ premium?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A placebo test using VIX futures — which pay the forward-looking VIX level (expected volatility over the next 30 days after expiry) rather than realized variance over the announcement — shows no significant ROIQ premium, confirming that the effect operates specifically through exposure to volatility during the announcement itself rather than through general volatility-level exposure.&lt;/strong&gt; The paper also shows that controlling for the Fama-French three factors does not appreciably change the ROIQ coefficient. An additional test using individual stock options (5 weekdays before earnings announcements) also yields positive ROIQ premiums, extending the result beyond FOMC to firm-level announcements.&lt;/p&gt;
&lt;h3 id="q7-what-does-the-finding-imply-for-macroeconomic-preference-modeling-and-policy"&gt;Q7. What does the finding imply for macroeconomic preference modeling and policy?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The empirical finding that investors have a positive ROIQ premium — i.e., PER — without assuming any particular utility functional form confirms the central calibration assumption of Bansal-Yaron long-run risk models (risk aversion &amp;gt; 1/IES) and provides the market-based evidence that Epstein, Farhi, and Strzalecki (2014) stated was unavailable.&lt;/strong&gt; The paper&amp;rsquo;s approach is significant for macro modeling because it establishes PER from minimal assumptions (GRS and monotonicity of preferences), meaning that the result holds across expected utility deviations including robust control, smooth ambiguity, and disappointment aversion preferences — as long as they satisfy GRS — making it a broadly applicable empirical anchor for calibrating non-expected utility models.&lt;/p&gt;
&lt;h3 id="q8-what-are-the-identification-limitations-and-scope-conditions"&gt;Q8. What are the identification limitations and scope conditions?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The identification relies on three maintained conditions: (i) GRS holds for the representative agent, (ii) FOMC announcements genuinely resolve macro uncertainty (so that the ROIQ window is correctly specified), and (iii) the pre-announcement period does not contain price-relevant news (so that market return premia during the ROIQ are not confounded with the news content of the announcement itself).&lt;/strong&gt; The empirical support for condition (iii) comes from the fact that the market does not earn abnormal returns during the ROIQ (negative, not positive, as expected from the announcement drift literature), and from the lack of a ROIQ premium for VIX futures that expire after but not over the announcement. The framework abstracts from heterogeneous agents and assumes a representative-agent economy, which is standard but may not fully capture distributional effects.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;preference for early resolution of uncertainty (PER)&lt;/strong&gt; : the property of a dynamic preference that the agent strictly prefers to learn the realization of a future uncertain outcome earlier rather than later, holding the distribution unchanged; equivalent in Epstein-Zin recursive utility to risk aversion exceeding the reciprocal of the IES.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;generalized risk sensitivity (GRS)&lt;/strong&gt; : the condition that the certainty-equivalent functional I is strictly increasing in second-order stochastic dominance; equivalent to the existence of strictly positive announcement premia for all assets comonotone with continuation utility; the paper&amp;rsquo;s key maintained assumption connecting utility levels to asset prices.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;resolution of information quality (ROIQ) period&lt;/strong&gt; : the period during which investors learn whether the upcoming macroeconomic announcement will be informative; empirically identified as the five weekdays before FOMC meetings, during which Fed-related news intensity co-moves with the inverse slope of the VIX term structure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;ROIQ premium&lt;/strong&gt; : the excess return earned by a claim to market volatility (synthetic variance claim or straddle) during the ROIQ period over its average daily return on non-ROIQ days; the paper&amp;rsquo;s operational test for PER; estimated at 1.085 percentage points per day for variance claims.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;inverse slope of the implied-volatility term structure&lt;/strong&gt; : the ratio IV9/IV90 (9-day CBOE VIX divided by 90-day CBOE VIX); the paper&amp;rsquo;s market-based predictor of FOMC announcement informativeness; a higher ratio reflects investor anticipation of large announcement-day volatility relative to long-run baseline uncertainty.&lt;/p&gt;</description></item><item><title>Inequality and asset prices during Sudden Stops</title><link>https://macropaperwarehouse.com/papers/inequality-and-asset-prices-during-sudden-stops/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/inequality-and-asset-prices-during-sudden-stops/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper studies the cross-sectional dimension of Fisher&amp;rsquo;s (1933) debt-deflation mechanism as it operates during Sudden Stop crises — episodes characterized by large, abrupt reversals in the current account. The central question is how the distribution of wealth and leverage across households shapes the macroeconomic dynamics of financial crises, and whether greater inequality makes Sudden Stops more or less severe.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The empirical analysis uses panel microdata from the Mexican Family Life Survey (MxFLS) across three waves (2002, 2005, 2009), covering a representative sample of approximately 8,400 households in 150 localities. The 2009 wave captures a Sudden Stop in which Mexico&amp;rsquo;s current account reversed by 1.5 percentage points of GDP, per capita consumption fell 7 percent, and housing prices fell 4 percent below pre-crisis trend by 2010. Households are sorted by net wealth and leverage ratio — defined as total debt divided by total assets — to identify how balance sheet heterogeneity drove differentiated asset-holding dynamics during the crisis.&lt;/p&gt;
&lt;p&gt;The theoretical framework is a Bewley small open economy model with heterogeneous agents, incomplete markets, aggregate risk (simultaneous shocks to the international interest rate and total factor productivity), and an occasionally-binding loan-to-value (LtV) collateral constraint. Households hold two assets: a one-period risk-free international bond and a risky domestic collateralizable asset (land). Households face persistent non-insurable idiosyncratic risk in both labor income and dividend returns; the latter creates an endogenous risk-wealth tradeoff, since larger asset holdings raise future income volatility while simultaneously expanding debt capacity. The model is calibrated to Mexican data — matching the leverage ratio distribution in 2005 (10 percent of households financially constrained) and a net foreign asset position of −35 percent of GDP — and solved using the FiPIt algorithm combined with the Krusell-Smith stochastic-simulation approach.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The empirical evidence from Mexico&amp;rsquo;s 2009 crisis reveals sharply divergent asset dynamics across the household balance sheet distribution. Wealthy households (top net-wealth decile) with low leverage increased their real estate holdings by 61.4 percent (annualized, relative to the average) between 2005 and 2009, consistent with a crisis-dampening effect whereby unconstrained agents absorb fire-sales. Wealthy households in the top decile of both net wealth and leverage ratio — financially constrained — reduced their real estate holdings by 36.6 percent, consistent with a crisis-amplifying effect. Cross-country descriptive evidence shows that Sudden Stop episodes are associated with significantly larger contractions in consumption and GDP in more unequal economies (Gini index, World Bank data, 58 Sudden Stop episodes identified by Bianchi and Mendoza 2020).&lt;/p&gt;
&lt;p&gt;In the calibrated model, the crisis-dampening effect dominates relative to the representative agent baseline: the heterogeneous-agents economy produces a smaller decline in asset prices (−0.99 percent vs. −2.57 percent in the representative agent model during crisis episodes), but a larger and more persistent consumption decline (−2.97 percent vs. −1.17 percent) and current account reversals (1.56 percentage points vs. 0.09 percentage points). The wealth Gini index generated by the calibrated model is 0.61, close to the untargeted 2005 Mexican estimate of 0.73. The aggregate equity premium generated is 5.1 percent, close to the data estimate of 6.5 percent; of this, 55.3 percent is attributable to the risk component, 35.9 percent to the persistence effect, and 8.6 percent to the constraint effect.&lt;/p&gt;
&lt;p&gt;When comparing the baseline emerging economy (wealth Gini 0.61) to an advanced economy calibration in which idiosyncratic dividend risk is set to zero (wealth Gini 0.29), crises are milder and less frequent in the more equal economy: consumption drops 1.0 percentage point less, asset prices drop 0.2 percentage points less, and the net foreign debt position is 6.2 percentage points larger relative to GDP. The implied slope coefficient from the model relating consumption declines during Sudden Stops to the income Gini (−11.1) closely matches the cross-country empirical estimate (−11.5). An economy with an income Gini index 0.10 points lower experiences a decline in consumption 1.1 percentage points smaller during a crisis.&lt;/p&gt;
&lt;p&gt;An impulse response to a two-standard-deviation aggregate shock confirms that, conditional on starting from a perfectly equal (symmetric) initial distribution via complete redistribution, declines in consumption and asset prices are approximately 0.5 percentage points smaller than in the baseline economy with the stationary ergodic distribution as initial condition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Redistributive Dividend Tax&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A flat 30 percent dividend income tax, redistributed as lump-sum transfers, reduces Sudden Stop severity by lowering average asset prices by 9.6 percent relative to the benchmark, which shrinks effective debt capacity and limits bond adjustment during crises. The average current account reversal during a crisis falls by 0.54 percentage points, and aggregate consumption falls by 0.63 percentage points less than in the benchmark. Crisis probability under the benchmark threshold falls from 4.3 to 1.83 percent (less than half). Average welfare improves by a gain equivalent to 2.8 percent of consumption. However, 26.7 percent of households — those more leveraged and three times wealthier than the beneficiaries — experience welfare losses averaging 6.8 percent of consumption, due to asset price declines and tighter financial conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Overall Conclusion&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Both the empirical evidence and the model suggest that economies with lower inequality, whether due to reduced idiosyncratic risk (as in advanced versus emerging economy calibrations) or wealth redistribution across agents with identical idiosyncratic risk processes, experience less severe Sudden Stop crises.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-are-the-two-cross-sectional-channels-through-which-household-heterogeneity-affects-the-debt-deflation-mechanism-and-in-which-direction-do-they-move-asset-prices"&gt;Q1. What are the two cross-sectional channels through which household heterogeneity affects the debt-deflation mechanism, and in which direction do they move asset prices?&lt;/h3&gt;
&lt;p&gt;A1: The dampening effect operates when unconstrained wealthy households — who hold diversified portfolios and have precautionary savings in bonds — purchase fire-sold assets from constrained households, relieving downward pressure on asset prices. The amplifying effect operates when highly leveraged households, once pushed into binding credit constraints by declining asset prices, must further liquidate asset positions, deepening the price decline and tightening the collateral constraint for additional households via the pecuniary externality. These two effects move in opposite directions, so the net effect of inequality on crisis severity is theoretically ambiguous and depends on calibration.&lt;/p&gt;
&lt;h3 id="q2-what-specific-empirical-evidence-from-mexicos-2009-sudden-stop-supports-both-cross-sectional-effects"&gt;Q2. What specific empirical evidence from Mexico&amp;rsquo;s 2009 Sudden Stop supports both cross-sectional effects?&lt;/h3&gt;
&lt;p&gt;A2: Using MxFLS microdata, Table 1 in the paper shows that wealthy households (top net-wealth decile) with low leverage (deciles I–VII of leverage) increased their real estate holdings by 61.4 percent between 2005 and 2009 — evidence for the dampening effect. Wealthy households in the top decile of both net wealth and leverage reduced their real estate holdings by 36.6 percent — evidence for the amplifying effect. Between 2005 and 2009, the share of financially constrained households (leverage ratio above 0.168, the 90th percentile) increased by 1.7 percentage points, while the share of financial savers dropped by 5.0 percentage points. The pre-crisis period (2002–2005) shows no comparable divergence, ruling out a mechanical mean-reversion explanation.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-risk-wealth-tradeoff-and-why-is-it-central-to-generating-a-realistic-wealth-and-leverage-distribution-in-the-model"&gt;Q3. What is the risk-wealth tradeoff, and why is it central to generating a realistic wealth and leverage distribution in the model?&lt;/h3&gt;
&lt;p&gt;A3: The risk-wealth tradeoff arises because idiosyncratic dividend risk is endogenous to asset holdings: holding more risky domestic assets increases debt capacity (relaxing borrowing constraints) but also raises future income volatility, since the variance of household flow income is convex in asset holdings. For households earning high dividend realizations, there exists a threshold beyond which precautionary savings motives — driven by rising income risk — dominate the benefit from expanded debt capacity, causing these households to begin accumulating bonds and eventually become net savers. This mechanism generates an empirically plausible distribution in which some households are financially constrained at the LtV limit, others are unconstrained borrowers, and a fraction are net savers holding both domestic assets and positive international bonds.&lt;/p&gt;
&lt;h3 id="q4-how-does-the-model-calibration-match-the-stationary-distribution-of-mexican-households"&gt;Q4. How does the model calibration match the stationary distribution of Mexican households?&lt;/h3&gt;
&lt;p&gt;A4: Three parameters governing the dividend income risk process (average dividend yield, autocorrelation, and standard deviation) are jointly calibrated to match three statistics from the MxFLS 2005 distribution of households: 14.1 percent financial savers (data: 14.2 percent), 75.9 percent unconstrained indebted (data: 75.8 percent), and 10.0 percent financially constrained (data: 10.0 percent). The collateral fraction κ = 0.168 is set equal to the 90th percentile of the leverage ratio distribution in 2005, reflecting that the average delinquency rate for commercial bank household credit was 10.3 percent between 2004 and 2008. The discount factor β = 0.90 matches the average net foreign asset position relative to GDP of −35 percent for Mexico.&lt;/p&gt;
&lt;h3 id="q5-how-does-the-heterogeneous-agents-model-compare-to-the-representative-agent-model-in-terms-of-crisis-dynamics"&gt;Q5. How does the heterogeneous-agents model compare to the representative agent model in terms of crisis dynamics?&lt;/h3&gt;
&lt;p&gt;A5: In the heterogeneous-agents benchmark, the average current account reversal during a Sudden Stop is 1.56 percentage points, consumption falls 2.97 percent, and asset prices fall 0.99 percent below the steady state. In the representative agent model with the same average leverage ratio (κ = 0.12), the current account reversal is only 0.09 percentage points, consumption falls 1.17 percent, and asset prices fall 2.57 percent. The crisis-dampening effect in the heterogeneous economy produces a smaller asset price drop but a larger consumption decline, because leveraged households must make larger consumption adjustments when hit by negative idiosyncratic shocks in addition to the aggregate shock. Impulse response analysis shows the heterogeneous-agents economy generates current account reversals 1.9 percentage points larger than the representative agent, and consumption responses approximately four times larger.&lt;/p&gt;
&lt;h3 id="q6-what-is-the-mechanism-by-which-comparing-emerging-and-advanced-economy-calibrations-shows-that-lower-inequality-leads-to-less-severe-crises"&gt;Q6. What is the mechanism by which comparing emerging and advanced economy calibrations shows that lower inequality leads to less severe crises?&lt;/h3&gt;
&lt;p&gt;A6: The advanced economy calibration sets idiosyncratic dividend risk to zero, eliminating the risk-wealth tradeoff and resulting in a wealth Gini of 0.29 (compared to 0.61 in the baseline). Without dividend risk, households have weaker incentives to accumulate assets as a precautionary buffer against income volatility, so they hold less debt on average and the long-run net foreign debt relative to GDP is 6.2 percentage points larger (i.e., less debt). During a Sudden Stop under this calibration, consumption drops 1.0 percentage point less, asset prices drop 0.2 percentage points less, and the economy is less frequently in crisis. The model-implied slope of consumption decline on income Gini is −11.1, matching the cross-country empirical estimate of −11.5.&lt;/p&gt;
&lt;h3 id="q7-what-does-the-impulse-response-analysis-reveal-about-the-effect-of-wealth-redistribution-on-crisis-severity-holding-idiosyncratic-risk-constant"&gt;Q7. What does the impulse response analysis reveal about the effect of wealth redistribution on crisis severity, holding idiosyncratic risk constant?&lt;/h3&gt;
&lt;p&gt;A7: The impulse response analysis compares the baseline heterogeneous-agents economy (with the stationary ergodic distribution as the initial condition) against a version in which all households are given a perfectly symmetric initial distribution — identical bond and asset holdings equal to long-run averages — while retaining the same idiosyncratic risk processes. The symmetric initial condition corresponds to a complete redistribution of wealth without changing fundamentals. In the first three periods after a two-standard-deviation aggregate shock, the symmetric economy shows declines in consumption and asset prices approximately 0.5 percentage points smaller than the baseline. This demonstrates that even holding the risk environment constant, reducing wealth dispersion mitigates crisis severity.&lt;/p&gt;
&lt;h3 id="q8-how-does-the-equity-premium-decomposition-work-in-the-heterogeneous-agents-model-and-which-components-are-quantitatively-most-important"&gt;Q8. How does the equity premium decomposition work in the heterogeneous-agents model, and which components are quantitatively most important?&lt;/h3&gt;
&lt;p&gt;A8: The aggregate equity premium is decomposed into five components (Equation 7 in the paper): a constraint effect (positive, increasing in the measure and intensity of constrained households), a risk effect (positive, from the negative covariance between the individual stochastic discount factor and individual equity return, weighted more heavily on constrained households), a persistence effect (positive, from the covariance between idiosyncratic dividend return and asset holdings, since high-dividend households accumulate more assets), a trading cost effect (approximately zero in aggregate), and a no-short-sales effect (negative, since households at the short-sales constraint add to asset demand without increasing the marginal benefit of saving). In the calibrated model, the equity premium is 5.1 percent; the risk effect accounts for 55.3 percent, the persistence effect for 35.9 percent, and the constraint effect for 8.6 percent.&lt;/p&gt;
&lt;h3 id="q9-what-is-the-mechanism-by-which-the-dividend-income-tax-reduces-crisis-severity"&gt;Q9. What is the mechanism by which the dividend income tax reduces crisis severity?&lt;/h3&gt;
&lt;p&gt;A9: A flat 30 percent dividend income tax lowers average after-tax dividend returns, reducing households&amp;rsquo; incentive for precautionary accumulation of domestic assets and weakening the risk-wealth tradeoff. As a result, households demand fewer domestic assets and fewer international bonds in normal times. The reduced demand for the domestic asset lowers the equilibrium asset price by 9.6 percent on average relative to the benchmark, which — through the pecuniary externality embedded in the LtV constraint — tightens borrowing constraints, raising the share of financially constrained households from 5.6 to 7.8 percent. Nevertheless, the reduction in equilibrium debt positions means that during a crisis, bond adjustments and consumption drops are more limited: the average current account reversal during crises falls by 0.54 percentage points, and aggregate consumption falls by 0.63 percentage points less than in the benchmark. Crisis probability under the benchmark threshold falls from 4.3 to 1.83 percent.&lt;/p&gt;
&lt;h3 id="q10-who-benefits-and-who-loses-from-the-dividend-income-tax-and-by-how-much"&gt;Q10. Who benefits and who loses from the dividend income tax, and by how much?&lt;/h3&gt;
&lt;p&gt;A10: Among the simulated population, 73.3 percent of households experience welfare gains averaging 6.2 percent of consumption in consumption-equivalent terms, while 26.7 percent experience welfare losses averaging 6.8 percent of consumption. The average welfare gain across all households is equivalent to 2.8 percent of consumption. The households experiencing losses are more leveraged and three times wealthier on average than those that benefit; the policy reduces their net worth through lower asset prices and tightens their financial constraints. The welfare analysis accounts for the transition to the new tax policy.&lt;/p&gt;
&lt;h3 id="q11-why-does-the-representative-agent-model-miss-the-cross-sectional-effects-that-are-central-to-the-papers-mechanism"&gt;Q11. Why does the representative agent model miss the cross-sectional effects that are central to the paper&amp;rsquo;s mechanism?&lt;/h3&gt;
&lt;p&gt;A11: In the representative agent model, all households behave identically and either collectively want to buy or sell assets, but since there is no one to trade with domestically, actual asset holdings remain unchanged by cross-sectional forces. Additionally, the average debt constraint multiplier in the representative agent equals the single household&amp;rsquo;s multiplier, whereas in the heterogeneous model a small fraction of highly constrained households can have much larger individual multipliers, amplifying the aggregate debt-deflation effect. In the calibrated stationary model, 10 percent of constrained households own 7.7 percent of assets and have a consumption share of 9.0 percent, while 75.9 percent of unconstrained indebted households hold 88.1 percent of assets with a consumption share of 78.1 percent — distributional features invisible to a representative agent.&lt;/p&gt;
&lt;h3 id="q12-what-robustness-does-the-model-validation-provide-for-the-quantitative-results"&gt;Q12. What robustness does the model validation provide for the quantitative results?&lt;/h3&gt;
&lt;p&gt;A12: The model reproduces the untargeted net wealth and asset distributions across deciles from MxFLS 2005 closely, with slight underestimation at the top deciles; the exception is the bottom decile of debt (where the model cannot generate households with negative net wealth since default is not modeled). The aggregate law of motion for the Krusell-Smith algorithm fits with R² = 0.99 for bond position and R² = 0.93 for asset price, and Den Haan (2010) accuracy checks show maximum forecast errors of 2.8 (current account) and 1.1 (asset price). The model replicates the untargeted magnitude of current account reversals observed in Mexican Sudden Stops. The wealth Gini of 0.61 is close to the untargeted 2005 Mexican estimate of 0.73, and the equity premium of 5.1 percent is close to the data estimate of 6.5 percent.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Sudden Stop&lt;/strong&gt;: An episode characterized by a large, abrupt reversal in the current account, typically triggered by a sudden halt in foreign capital inflows. In this paper, Sudden Stops are modeled as endogenous crises that arise from the interaction of a negative aggregate shock (simultaneous rise in the international interest rate and decline in total factor productivity) with an occasionally-binding LtV collateral constraint. The paper follows Bianchi and Mendoza (2020) in identifying 58 such episodes over the past four decades.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Debt-deflation mechanism (cross-sectional dimension)&lt;/strong&gt;: The paper studies Fisher&amp;rsquo;s (1933) debt-deflation spiral — in which declining asset prices tighten credit constraints, forcing further asset sales, further depressing prices — through the lens of household heterogeneity. The cross-sectional dimension refers to the fact that different households (wealthy unconstrained vs. highly leveraged constrained) respond differently to price declines, generating two opposing effects: dampening (wealthy buyers absorb fire-sales) and amplifying (constrained households fire-sell additional assets).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Risk-wealth tradeoff&lt;/strong&gt;: A novel feature of the model in which holding more risky domestic assets simultaneously (a) expands debt capacity by relaxing the LtV constraint and (b) increases future income volatility through higher exposure to idiosyncratic dividend risk, since the variance of household flow income is convex in asset holdings. This tradeoff generates the endogenous transition of households from indebted to net-saver status and gives rise to the empirically plausible distribution of savers, unconstrained borrowers, and constrained households.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Loan-to-value (LtV) collateral constraint&lt;/strong&gt;: A borrowing limit requiring that households&amp;rsquo; international debt (negative bond holdings) cannot exceed a fixed fraction κ of the market value of their domestic asset holdings. In the paper, κ = 0.168 (the 90th percentile of the Mexican leverage ratio distribution in 2005). The constraint is occasionally binding and generates a pecuniary externality: households fail to internalize that their individual portfolio choices affect the aggregate asset price, which in turn determines the borrowing limits of all other households.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pecuniary externality&lt;/strong&gt;: The externality arising from the LtV constraint in which each household&amp;rsquo;s choice of asset holdings affects the equilibrium asset price, thereby changing the borrowing limits of all households simultaneously. This externality drives the debt-deflation spiral and is the source of Sudden Stop crises in the model: no single household internalizes the aggregate impact of its fire-sales on credit conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fire-sale&lt;/strong&gt;: In the context of this paper, the forced liquidation of domestic asset holdings by financially constrained households during a crisis. Fire-sales are triggered when the LtV constraint becomes binding, forcing households to sell assets to reduce debt; the resulting price decline tightens the constraint further, producing additional fire-sales. The paper documents that, during Mexico&amp;rsquo;s 2009 Sudden Stop, wealthy constrained households (top decile of both net wealth and leverage) reduced real estate holdings by 36.6 percent, while wealthy unconstrained households increased holdings by 61.4 percent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dampening and amplifying effects&lt;/strong&gt;: Two opposing cross-sectional effects on asset prices during a crisis. The dampening effect: unconstrained wealthy households purchase depressed assets fire-sold by constrained households, relieving downward pressure on prices and weakening the debt-deflation spiral. The amplifying effect: highly leveraged households that are pushed into binding constraints by falling prices must also fire-sell assets, further depressing prices and tightening financial conditions. The net impact on crisis severity depends on which effect dominates, which the paper establishes empirically and quantitatively is inequality-dependent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Equity premium decomposition&lt;/strong&gt;: A decomposition derived in the paper (Equation 7) that expresses the aggregate excess return on the risky domestic asset as the sum of five components: a constraint effect (positive, from the measure and intensity of binding LtV constraints), a risk effect (positive, from the covariance of individual stochastic discount factors with individual equity returns), a persistence effect (positive, from the covariance of idiosyncratic dividend returns with asset holdings due to return persistence), a trading cost effect (approximately zero in aggregate), and a no-short-sales effect (negative). In the calibrated model, the risk and persistence effects account for 91 percent of the 5.1 percent equity premium.&lt;/p&gt;</description></item><item><title>Insuring Peace: Index-Based Livestock Insurance, Droughts, and Conflict</title><link>https://macropaperwarehouse.com/papers/insuring-peace-index-based-livestock-insurance-droughts-and-conflict/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/insuring-peace-index-based-livestock-insurance-droughts-and-conflict/</guid><description>&lt;p&gt;This paper provides quasi-experimental evidence that Index-Based Livestock Insurance (IBLI) — a remote-sensing-triggered, automated payout scheme for pastoralists — substantially reduces drought-induced conflict in Kenya over the 2001–2020 period.&lt;/p&gt;
&lt;p&gt;The research question is whether a market-based financial instrument can mitigate the causal chain running from drought shocks to violent conflict between nomadic pastoralists and sedentary farmers and other land users. The authors motivate the study by documenting that droughts force pastoralists out of their traditional grazing grounds and into mixed-land-use areas (farms, ranches, urban settlements, nature reserves), where miscoordination with other land users escalates into violence. A case study of the Samburu-Laikipia-Isiolo-Meru region in central Kenya — drawing on georeferenced survey data from Lengoiboni et al. (2010) and ACLED conflict events — validates this spatial mechanism: during droughts, roughly 60–90% of non-pastoral land users report encounters with pastoralists, and conflicts accumulate precisely where drought migration routes cross into non-pastoral land.&lt;/p&gt;
&lt;p&gt;The empirical design combines two sources of variation: (1) plausibly exogenous changes in rainfall deficits at the 0.1 × 0.1-degree grid-cell level (roughly 10 × 10 km), derived from NASA GPM satellite data; and (2) the staggered, five-wave rollout of IBLI across 146 insurance districts in Kenya from 2010 onward, which the authors argue was driven primarily by technical challenges rather than pre-existing conflict or drought patterns. The unit of observation is 94,300 cell-periods. Because conflicts due to pastoralist drought migration occur in the neighborhood of affected areas rather than within them, both drought and IBLI coverage are measured as inverse-distance-weighted averages over surrounding cells. The estimating equation is a linear probability model with cell and period fixed effects, interacting neighborhood rainfall deficit with neighborhood IBLI coverage; the coefficient on this interaction term (delta3) is the parameter of interest.&lt;/p&gt;
&lt;p&gt;The main finding is that a one-standard-deviation increase in neighborhood IBLI coverage reduces the semi-elasticity of neighborhood rainfall deficit on conflict probability by approximately 23%. In absolute terms, a one-percentage-point increase in the rainfall deficit raises the probability of conflict by 6.92 percentage points at average IBLI coverage; with one additional standard deviation of neighborhood IBLI, that same deficit raises conflict probability by only 5.34 percentage points — a reduction of 1.58 percentage points against a baseline conflict probability of roughly 2.5%.&lt;/p&gt;
&lt;p&gt;Scope conditions: the effect is estimated for Kenya specifically, over a pastoralist-heavy population of approximately 8.8 million out of 53 million Kenyans, during 2001–2020. The conflict-mitigating effect is approximately four times larger in mixed-land-use areas (nine times when rollout-cluster-times-period fixed effects are included), consistent with the theoretical expectation that IBLI matters most where pastoralists are most likely to encounter other land users during drought migration.&lt;/p&gt;
&lt;p&gt;Two mechanisms are identified. First, IBLI reduces migratory pressure: when pastoral homelands have IBLI coverage, the distance between the ethnic homeland centroid and conflict events involving that group decreases, indicating reduced drought migration. Second, IBLI smooths incomes — corroborated with Afrobarometer geo-coded data — raising the opportunity cost of fighting. An instrumental-variable specification finds that actual IBLI payouts in the neighborhood reduce conflict probability by approximately 150% relative to the baseline risk.&lt;/p&gt;
&lt;p&gt;A cost-effectiveness analysis finds that even using conservative World Health Organization or World Bank estimates of the value of statistical life, IBLI delivers fatality savings of between 10 and 22 cents per dollar spent on government subsidies for the program, making it a cost-effective complement to political and institutional conflict-mitigation approaches.&lt;/p&gt;
&lt;p&gt;Q: What is the core causal mechanism linking droughts to conflict that IBLI interrupts?&lt;/p&gt;
&lt;p&gt;A: Droughts deplete forage in pastoralists&amp;rsquo; traditional grazing grounds, forcing them to migrate into mixed-land-use areas — farms, ranches, urban settlements, and nature reserves — where encounters with other land users are more likely to escalate into violence. Without insurance, pastoralists hold excess livestock as precautionary savings, amplifying the extent of necessary migration during dry periods. IBLI payouts allow pastoralists to purchase forage locally, reducing migration distance and intensity, and also smooth income, raising the opportunity cost of engaging in violence.&lt;/p&gt;
&lt;p&gt;Q: How does IBLI work technically, and why does it overcome problems of traditional livestock insurance?&lt;/p&gt;
&lt;p&gt;A: IBLI uses satellite remote sensing to calculate whether a district-specific drought threshold has been crossed; if so, automated payments are triggered immediately without requiring direct loss assessment or field inspections. This design eliminates moral hazard and adverse selection problems inherent in traditional indemnity insurance, reduces monitoring costs, and enables fast delivery via mobile payment platforms such as MPESA even to remote households. The Kenyan government rebranded the program as the Kenyan Livestock Insurance Program (KLIP) in 2015 and fully subsidizes coverage for up to five tropical livestock units per household.&lt;/p&gt;
&lt;p&gt;Q: What is the magnitude of the main conflict-mitigation result?&lt;/p&gt;
&lt;p&gt;A: A one-standard-deviation increase in neighborhood IBLI coverage reduces the semi-elasticity of the neighborhood rainfall deficit on conflict probability by approximately 23% (delta3/delta1 = -0.0158/0.0692). In absolute terms, this translates to a reduction from a 6.92 percentage-point increase in conflict probability per one-percentage-point rainfall deficit to a 5.34 percentage-point increase — a decline of 1.58 percentage points against a mean conflict probability of roughly 2.5%.&lt;/p&gt;
&lt;p&gt;Q: Why do the authors use a neighborhood rather than cell-level treatment measure?&lt;/p&gt;
&lt;p&gt;A: Drought-induced pastoralist conflicts occur primarily not in the pastoral home areas themselves but in neighboring regions where drought migration routes cross into non-pastoral land. The case study documents this pattern directly: ACLED conflict events accumulate where migration routes from Namelok, Lodungokwe, and Ngaremara communities intersect urban or agricultural areas, not within the pastoral zones. The neighborhood approach, using inverse-distance-weighted averages, captures both the probability of migration from surrounding cells and the declining probability of migration with distance.&lt;/p&gt;
&lt;p&gt;Q: What is the main identification concern and how do the authors address it?&lt;/p&gt;
&lt;p&gt;A: The main concern is that the timing of the IBLI rollout is endogenously determined — areas with a higher latent drought-conflict elasticity might receive coverage earlier or later, biasing the interaction coefficient. The authors show that the pre-treatment drought-conflict elasticity has no systematic correlation with either IBLI eligibility or the timing of coverage receipt. Placebo tests interacting the neighborhood rainfall deficit with pre-treatment eligibility or eventual coverage indicators yield positive, statistically insignificant coefficients, suggesting any bias would run in the direction of underestimating the mitigation effect. A permutation test randomly reassigning IBLI coverage across the six rollout clusters finds the actual point estimate is in the bottom 2.2% of the simulated distribution, indicating it is unlikely to arise from cluster-level confounders.&lt;/p&gt;
&lt;p&gt;Q: How do the authors rule out that other programs — cash transfers or development aid — explain the result?&lt;/p&gt;
&lt;p&gt;A: The authors control for cell-level and neighborhood-level coverage of Kenya&amp;rsquo;s Hunger Safety Net Programme (HSNP), which provides unconditional cash transfers to vulnerable households and covers most IBLI-eligible areas, as well as for World Bank agricultural aid projects. Across these specifications, the estimated conflict mitigation ranges from -19.16% to -42.24%, with the baseline estimate of -22.79% remaining robust, indicating neither HSNP nor development aid is a plausible alternative explanation.&lt;/p&gt;
&lt;p&gt;Q: What is the alternative identification strategy using within-rollout-cluster variation?&lt;/p&gt;
&lt;p&gt;A: The authors exploit pre-determined (1984 government land-use map) variation in mixed-land-use status across cells within the same IBLI rollout cluster-period, including rollout-cluster-times-period fixed effects that absorb any omitted variable related to the potentially endogenous rollout steps. The conflict-mitigating effect of IBLI is approximately four times larger in mixed-land-use cells, and approximately nine times larger in the most restrictive specification with rollout-cluster-times-period fixed effects, consistent with the prediction that IBLI matters most where pastoralists encounter other land users.&lt;/p&gt;
&lt;p&gt;Q: How do the authors establish the migratory pressure mechanism?&lt;/p&gt;
&lt;p&gt;A: Following Eberle et al. (2023), the authors match conflict actors to ethnic homelands using Murdock (1967) boundaries and test whether IBLI coverage in a homeland reduces the distance between the homeland centroid and conflict events involving that group. They find that it does, indicating that IBLI coverage reduces the spatial range of pastoralist drought migration and thus the probability of conflict-generating encounters with other land users.&lt;/p&gt;
&lt;p&gt;Q: How do the authors establish the income-smoothing mechanism?&lt;/p&gt;
&lt;p&gt;A: Using geo-coded Afrobarometer survey data, the authors show that IBLI coverage is associated with higher reported incomes among pastoralist households, consistent with Jensen et al. (2017). Higher incomes raise the opportunity cost of fighting (following Grossman, 1991), contributing to the overall conflict-mitigating effect alongside reduced migratory pressure.&lt;/p&gt;
&lt;p&gt;Q: What does the instrumental variable specification find?&lt;/p&gt;
&lt;p&gt;A: The authors instrument inverse-distance-weighted IBLI payouts in the neighborhood with the interaction of neighborhood rainfall deficit and neighborhood IBLI coverage. The first stage confirms that rainfall deficits trigger payouts conditional on coverage. The second stage finds that the occurrence of payouts in the neighborhood reduces the probability of conflict by approximately 150% relative to the baseline risk, corroborating the reduced-form results.&lt;/p&gt;
&lt;p&gt;Q: How do the authors assess cost-effectiveness?&lt;/p&gt;
&lt;p&gt;A: The authors predict plausible drought-induced conflict fatalities in Kenya over the pre-treatment period and calculate yearly lives saved from the main estimates, then compare the monetary value of saved lives to government subsidy expenditures on IBLI. Using conservative VSL estimates from the WHO and World Bank, IBLI delivers between 10 and 22 cents of pure fatality savings per dollar of public subsidy expenditure.&lt;/p&gt;
&lt;p&gt;Q: How robust are the results to alternative drought and conflict measures?&lt;/p&gt;
&lt;p&gt;A: Results are qualitatively similar using an Aridity Index or Dry Matter Productivity (DMP) as drought proxies instead of rainfall deficit. The estimated interaction effect maintains a t-statistic above two for spatial decay functions ranging from distance^-0.5 to distance^-1.5 and for Conley standard error cutoffs from 200 km up to 400 km. Results also hold when restricting to conflict events not involving the government, or to battles, riots, and violence against civilians only, and when excluding the pre-IBLI period (2000–2009) entirely.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications regarding scalability?&lt;/p&gt;
&lt;p&gt;A: Pastoralism covers 43% of the African landmass across 36 countries, supporting approximately 268 million people (FAO, 2018). The World Bank and private equity were planning to invest close to 900 million dollars in East African pastoralist programs over 2023–2027. The authors argue that IBLI&amp;rsquo;s cost structure — high fixed costs of technology and setup but low marginal costs of expansion — gives it a scalability advantage over cash transfer programs or public works schemes that require sustained state capacity. Market-based IBLI complements rather than substitutes for political and institutional reforms.&lt;/p&gt;
&lt;p&gt;Index-Based Livestock Insurance (IBLI): A financial instrument that uses satellite remote sensing to automatically trigger preemptive cash payouts to pastoralists when a pre-determined district-specific drought threshold is crossed, bypassing direct loss assessment and thereby eliminating moral hazard and adverse selection problems inherent in traditional indemnity insurance.&lt;/p&gt;
&lt;p&gt;Drought-conflict semi-elasticity: The percentage-point change in the probability of conflict associated with a one-percentage-point increase in the rainfall deficit; the paper&amp;rsquo;s main outcome quantity, estimated at 6.92 percentage points at mean IBLI coverage, reduced by 23% for a one-standard-deviation increase in neighborhood IBLI coverage.&lt;/p&gt;
&lt;p&gt;Neighborhood approach: An empirical strategy that measures both drought severity and IBLI coverage as inverse-distance-weighted averages over all surrounding grid cells, reflecting the authors&amp;rsquo; finding that pastoralist drought-migration generates conflicts not in the pastoral home area but in neighboring mixed-land-use zones where migration routes intersect other land users.&lt;/p&gt;
&lt;p&gt;Migratory pressure: The mechanism by which drought forces pastoralists — who hold excess livestock as precautionary savings in the absence of insurance — to migrate farther from traditional grazing grounds into mixed-land-use areas, increasing the probability of encounters and violent miscoordination with farmers, urban dwellers, and protected-area managers.&lt;/p&gt;
&lt;p&gt;Mixed land use: Areas, designated using a 1984 Kenyan government land-use map, where pastoral grazing zones are proximate to farms, ranches, urban settlements, or nature reserves; the paper identifies these as the locations with the highest expected treatment intensity, where IBLI coverage reduces drought-induced conflict approximately four to nine times more than elsewhere.&lt;/p&gt;
&lt;p&gt;Tropical Livestock Unit (TLU): The standard unit of account for IBLI contracts in Kenya; one TLU corresponds to one head of cattle or ten goats or sheep; the Kenyan government fully subsidizes IBLI for up to five TLUs per household.&lt;/p&gt;
&lt;p&gt;Rollout-cluster-times-period fixed effects: A restrictive set of fixed effects included in the alternative identification strategy that absorbs all omitted variables varying at the level of the six IBLI spatial rollout clusters over time, allowing the authors to identify the conflict-mitigating effect purely from within-cluster variation in mixed-land-use exposure.&lt;/p&gt;</description></item><item><title>International trade and macroeconomic dynamics with sanctions</title><link>https://macropaperwarehouse.com/papers/international-trade-and-macroeconomic-dynamics-with-sanctions/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/international-trade-and-macroeconomic-dynamics-with-sanctions/</guid><description>&lt;p&gt;Sanctions are increasingly used as an instrument of economic statecraft, yet their macroeconomic consequences—especially the transitional dynamics and their effects on business cycles—are poorly understood. This paper develops a micro-founded framework combining the intertemporal general-equilibrium structure of standard open-economy macro models with the rich trade-theoretic microfoundations of modern trade theory to study sanctions systematically. In a two-country, two-sector model where Home specializes in differentiated consumption goods (heterogeneous firms, endogenous entry, Melitz-style) and Foreign specializes in homogeneous intermediate goods (Cournot oligopoly in extraction), sanctions—modeled as trade bans and financial restrictions excluding particular Foreign agents—reallocate resources across and within countries, affect production, exchange rates, and welfare, and are shown to inflict larger welfare losses when they target sectors of comparative disadvantage. A central finding is that focusing only on long-run outcomes and overlooking initial transitional dynamics substantially misdirects welfare assessments; sanctions weaken international comovement and fragment markets but, contrary to some claims, leave the structure of business cycles largely intact.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-model-structure-and-what-types-of-sanctions-does-it-capture"&gt;Q1. What is the model structure and what types of sanctions does it capture?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The model is a two-country, two-sector economy: Home has a comparative advantage in differentiated consumption goods (produced by heterogeneous firms with endogenous entry under monopolistic competition, as in Melitz 2003), while Foreign specializes in homogeneous intermediate goods (produced via Cournot competition among a fixed number of upstream firms and processed by a representative distributor).&lt;/strong&gt; This structure is motivated by the pattern in which Western economies specialize in high-value, firm-entry-intensive industries while sanctioned countries often specialize in commodity production (energy, natural gas). Sanctions are modeled as two distinct instruments: trade bans (restrictions on commerce in goods) and financial restrictions (excluding particular Foreign agents from capital markets). The model accommodates both Ricardian and Melitz-type comparative advantage.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-key-results-on-welfare-effects-and-the-role-of-transitional-dynamics"&gt;Q2. What are the key results on welfare effects and the role of transitional dynamics?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Sanctions reallocate resources across and within countries, with welfare losses larger when sanctions target sectors of comparative disadvantage rather than sectors of comparative advantage; and focusing only on long-run welfare ignores significant transitional costs that substantially change the total welfare assessment.&lt;/strong&gt; The model implies that initial transitional dynamics—disruptions to trade flows, entry and exit of firms, exchange rate movements, and adjustment of resource allocation—can be quantitatively important and even dominate long-run effects in welfare calculations. Assessments based solely on steady-state comparisons may therefore produce seriously misleading conclusions about whether and how severely sanctions harm the imposing or target economy.&lt;/p&gt;
&lt;h3 id="q3-how-do-sanctions-affect-international-comovement-and-business-cycles"&gt;Q3. How do sanctions affect international comovement and business cycles?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Sanctions weaken international business cycle comovement and fragment markets—reducing the degree to which shocks in one country transmit to the other—but leave the structural properties of business cycles within each country largely intact, in the sense that the cyclical dynamics of output, consumption, and investment retain their qualitative features.&lt;/strong&gt; This result has policy implications: sanctions can reduce the interdependence of the sanctioned country&amp;rsquo;s business cycle from the rest of the world (which could be either beneficial or harmful depending on the source of shocks), but cannot fundamentally restructure the domestic cycle.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-baseline-application-and-what-general-lessons-emerge"&gt;Q4. What is the baseline application and what general lessons emerge?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;While the model&amp;rsquo;s comparative advantage structure is explicitly motivated by the 2022 Western sanctions against Russia—with intermediate goods interpretable as energy—the framework is designed to be more broadly applicable to other geopolitical conflicts involving sanctioned commodity producers facing differentiated-goods exporters, such as US-China trade tensions.&lt;/strong&gt; The general lessons are: (1) the sector targeted by sanctions matters greatly for their welfare costs; (2) transitional dynamics are not second-order; (3) financial sanctions operate through different channels than trade bans and should be modeled separately.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;comparative disadvantage in sanctions&lt;/strong&gt; : the paper&amp;rsquo;s finding that sanctions are more costly when they target goods in which the targeted country has a comparative disadvantage—sectors the country cannot efficiently produce domestically—because those are the sectors where trade provides the highest value and substitution is hardest.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;financial restrictions&lt;/strong&gt; : one of the two types of sanctions modeled, in which particular Foreign agents (firms or sovereign entities) are excluded from international capital markets, distinct from trade bans that restrict commerce in goods.&lt;/p&gt;</description></item><item><title>International Trade Responses to Labor Market Regulations</title><link>https://macropaperwarehouse.com/papers/international-trade-responses-to-labor-market-regulations/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/international-trade-responses-to-labor-market-regulations/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; This paper asks whether differences in labor market regulations — specifically payroll taxes and minimum wages — shape countries&amp;rsquo; comparative advantage in the cross-border provision of labor-intensive services. The question has broad policy relevance: if lower labor standards confer a systematic trade advantage, countries may face pressure to race to the bottom in labor protections, and political support for economic integration may erode.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting and Identification.&lt;/strong&gt; The paper exploits the EU &amp;ldquo;posting policy,&amp;rdquo; a large trade program established in 1959 that allows firms in one EU member state to temporarily send their employees to perform service contracts in another member state. In 2017, posting accounted for roughly one-third of all within-EU trade in services (approximately 2% of EU GDP), involving about 2 million workers (in full-time equivalents) in 2019. The setting is analytically attractive because competing foreign and domestic firms serve the same customers at the same physical location using shared capital, holding most determinants of comparative advantage constant while labor market regulations vary by the firm&amp;rsquo;s country of origin.&lt;/p&gt;
&lt;p&gt;Under posting rules, payroll taxes are generally origin-based (exporting firms pay their home country&amp;rsquo;s tax rate) but become destination-based when contracts exceed a regulatory duration threshold (12 months pre-2010, 24 months from 2010–2020, 18 months from 2020 onward). Minimum wages are destination-based: foreign firms must match the importing country&amp;rsquo;s statutory minimum wage floor when it exceeds the workers&amp;rsquo; home-country wage level. This generates the paper&amp;rsquo;s key identifying variation — payroll taxes and minimum wages vary across countries, over time, and within countries across sectors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The author uses administrative A1 social security forms filed for every EU posting contract from 2007–2018, collected from 25 EU member states, supplemented by micro-level national posting registries in Belgium (LIMOSA), France (SIPSI), and Luxembourg (matched employer-employee data). Labor cost data (wages, payroll tax rates, minimum wages) come from Eurostat and the OECD Taxing Wages Dataset.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology.&lt;/strong&gt; The paper proceeds in three steps. First, it documents steady-state cross-sectional correlations between bilateral posting flows and labor cost differentials. Second, it estimates difference-in-differences (DiD) elasticities from four quasi-natural experiments. Third, it estimates a theory-consistent gravity model using all sources of variation across 25 EU countries from 2009–2018.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Steady-state correlation:&lt;/em&gt; A strong negative relationship exists between bilateral posting flows and labor cost differentials, with a cross-sectional elasticity of approximately –0.58 (SE 0.08). In sharp contrast, the relationship between bilateral goods trade and labor cost differentials is weak and if anything marginally positive (point estimate +0.13), confirming that labor cost differences are a distinctive driver of trade specifically in labor-intensive services rather than goods.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Belgian tax shift (2016–2018):&lt;/em&gt; When Belgium cut employers&amp;rsquo; social security contributions from 33% to 25%, imports of posting services into Belgium slowed relative to France (a neighboring control country on parallel pre-reform trends). The reduced-form elasticity of posting imports with respect to the payroll tax rate is 1.45 (SE 0.3).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Luxembourg EU regulation reform (2010):&lt;/em&gt; A new EU regulation required temporary employment agencies in border regions to pay destination-based payroll taxes, raising statutory rates faced by Luxembourgish exporters from 15% to 44%. Posting exports from Luxembourg&amp;rsquo;s temporary employment sector fell by 40% relative to the pre-reform level and relative to the domestic (control) sector, while the sheltered road transportation sector showed no response. The reduced-form elasticity with respect to the statutory payroll tax rate is –1.55 (SE 0.24), and the triple-difference estimate is –1.37 (SE 0.08).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Bunching at duration thresholds:&lt;/em&gt; The distribution of posting contract lengths in France (which has the EU&amp;rsquo;s highest payroll taxes) shows a sharp spike just below the 24-month payroll tax threshold. When the threshold was moved to 18 months in 2020, excess mass migrated to the new threshold, confirming that bunching reflects behavioral responses to the tax notch rather than reference-point effects. This documents that payroll tax differentials shape not only the quantity (extensive margin) but also the length (intensive margin) of posting contracts.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;German minimum wage reform (2015):&lt;/em&gt; Germany&amp;rsquo;s introduction of a national minimum wage of €8.50 per hour — which was already binding on construction workers through a sectoral minimum, but not on foreign firms providing non-construction services — caused postings to Germany in manufacturing to fall by approximately 60% relative to the construction (control) sector. The reduced-form elasticity is –1.34 (SE 0.43). Heterogeneity analysis shows that export declines were monotonically larger for low-wage origin countries where the new minimum wage was binding, and placebo estimates using Germany&amp;rsquo;s high-wage neighboring countries (where minimum wage requirements did not change) are statistically indistinguishable from zero.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Gravity estimates:&lt;/em&gt; The preferred specification (PPML with origin-year, destination-year, and pair fixed effects, exploiting bilateral variation in minimum wage bindingness across origin countries) yields a model-implied trade elasticity θ of –1.2 (SE 0.2). The range across specifications is –1.2 to –2.4. These estimates are smaller than the goods trade elasticity (typically estimated around 5) and below the medium-run reduced-form elasticities from the DiD case studies, consistent with short-run gravity estimates capturing only partial adjustment while DiD designs measure longer-run equilibrium responses.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Policy Counterfactual.&lt;/strong&gt; The paper&amp;rsquo;s estimates imply that the Bolkestein Directive — which proposed exempting foreign firms from all destination-country labor regulations — would have doubled exports of physical services from Eastern European countries (upper bound), as their cost advantage would have been dramatically amplified by removal of minimum wage requirements. Counterpart to this export boom, average posted workers&amp;rsquo; wages would have fallen by approximately 16%, since workers would lose their entitlement to destination-country minimum wages. The paper documents that the Bolkestein controversy — sparked by the &amp;ldquo;Polish plumber&amp;rdquo; debate in early 2005 — coincided with a sharp and persistent drop in French voter support for the EU constitutional treaty, which was subsequently rejected.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; Results apply specifically to trade in physical (labor-intensive) services traded via temporary worker posting within the EU, where productivity differences across countries for these tasks are plausibly small (Balassa-Samuelson), making institutional factors a primary driver of wage differences. The paper estimates intent-to-treat effects, assuming perfect compliance by exporting firms. The paper does not perform a comprehensive welfare analysis covering consumer price effects or general equilibrium wage and trade-balance responses.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-eu-posting-policy-and-why-does-it-provide-an-unusually-clean-setting-for-identifying-the-causal-effect-of-labor-regulations-on-trade"&gt;Q1. What is the EU posting policy and why does it provide an unusually clean setting for identifying the causal effect of labor regulations on trade?&lt;/h3&gt;
&lt;p&gt;The EU posting policy, established in 1959, allows firms in one EU member state to temporarily send employees to perform service contracts in another member state. The policy keeps most determinants of comparative advantage constant — competing foreign and domestic firms serve the same customers at the same physical location using shared capital — while labor market regulations vary by the firm&amp;rsquo;s country of origin. Productivity differences for physical services across countries are also plausibly limited (Balassa-Samuelson), making institutional wage differences the primary cost driver. Enforcement is facilitated by the on-site nature of the service, and administrative A1 forms create a direct measure of the number of workers involved in cross-border transactions without a minimum reporting threshold.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-three-sources-of-labor-cost-differences-the-paper-identifies-and-quantifies"&gt;Q2. What are the three sources of labor cost differences the paper identifies and quantifies?&lt;/h3&gt;
&lt;p&gt;Foreign firms competing for posting contracts face different costs through three channels: (i) equilibrium gross wages differ across origin countries, reflecting both productivity differences and institutional/information frictions that allow wage discrimination between posted and domestic workers; (ii) payroll tax rates are origin-based and differ substantially across countries (for example, France&amp;rsquo;s employer payroll tax is approximately 40% versus approximately 15% for Luxembourg before the 2010 reform); and (iii) destination-specific minimum wages impose a &amp;ldquo;posting allowance&amp;rdquo; on firms from countries with lower wages, equal to the shortfall between the firm&amp;rsquo;s home-country wage and the importing country&amp;rsquo;s minimum wage floor. Micro-level wage data from France confirm that most posted workers from low-wage countries are paid exactly at the French minimum wage, demonstrating the bindingness of the third channel, while French workers performing the same tasks receive wages near the French average (approximately €21.1 per hour versus a minimum wage of approximately €10 per hour in 2018).&lt;/p&gt;
&lt;h3 id="q3-what-does-the-cross-sectional-evidence-show-about-the-relationship-between-labor-cost-differentials-and-posting-flows-and-how-does-this-compare-to-goods-trade"&gt;Q3. What does the cross-sectional evidence show about the relationship between labor cost differentials and posting flows, and how does this compare to goods trade?&lt;/h3&gt;
&lt;p&gt;Bilateral posting flows and bilateral labor cost differentials have a tight negative cross-sectional relationship with an estimated elasticity of –0.58 (SE 0.08), indicating that countries export more posting services when their labor costs are substantially below those of the destination country. The same exercise applied to bilateral goods trade yields a coefficient of +0.13 (SE 0.07) — weak and marginally positive — consistent with goods trade being driven by capital, technology, and scale rather than labor cost differentials. The gap confirms that labor cost differences are a distinctive comparative advantage mechanism for labor-intensive services but not for less labor-intensive goods.&lt;/p&gt;
&lt;h3 id="q4-what-does-the-belgian-tax-shift-reform-demonstrate-and-how-is-identification-established"&gt;Q4. What does the Belgian tax shift reform demonstrate, and how is identification established?&lt;/h3&gt;
&lt;p&gt;Belgium cut employer social security contributions from 33% to 25% between 2016 and 2018 in a revenue-neutral reform (financed by VAT, excise duties, and dividend taxes). The DiD compares posting imports into Belgium with those into France (a neighboring, similarly sized importer on parallel pre-reform trends). Belgium and France imported posting services at similar rates before 2015; Belgian imports slowed immediately after the reform while French imports continued growing. The reduced-form elasticity of posting flows with respect to the destination payroll tax rate is 1.45 (SE 0.3). The elasticity with respect to total labor cost is 3.7 (SE 0.7). No discernible response is detected for trade in manufacturing goods, providing a within-reform placebo. A synthetic control using all available importing countries yields a smaller elasticity of 0.6 (SE 0.22).&lt;/p&gt;
&lt;h3 id="q5-how-does-the-luxembourg-eu-regulation-reform-2010-improve-on-the-belgian-case-for-identification"&gt;Q5. How does the Luxembourg EU regulation reform (2010) improve on the Belgian case for identification?&lt;/h3&gt;
&lt;p&gt;The 2010 EU regulation required temporary employment agencies in border regions to pay destination-based (rather than origin-based) payroll taxes, raising statutory rates for Luxembourgish exporters from 15% to 44%. Unlike the Belgian reform, this created within-country variation: the same Luxembourgish firms were exposed in the temporary employment sector but not in road transportation (which received a 10-year exemption). This within-exporter, cross-sector design controls for all Luxembourg-wide demand or supply shocks. Posting exports by the temporary employment sector fell 40% relative to pre-reform levels and relative to the domestic (control) sector, while road transportation posting showed zero response. The monthly data confirm the drop occurred in the exact month following the regulation with no anticipation. The triple-difference elasticity (with respect to the payroll tax rate) is –1.37 (SE 0.08).&lt;/p&gt;
&lt;h3 id="q6-what-does-the-bunching-evidence-at-payroll-tax-duration-thresholds-add-to-the-did-findings"&gt;Q6. What does the bunching evidence at payroll tax duration thresholds add to the DiD findings?&lt;/h3&gt;
&lt;p&gt;When posting contracts exceed a regulatory duration threshold (24 months during 2010–2020, then 18 months from July 2020), payroll taxes become destination-based. Because France has the highest payroll tax in the EU, all exporting firms face strong incentives to avoid crossing the threshold. The distribution of posting contract lengths in France shows sharp excess mass just below 24 months in 2017. When the threshold moved to 18 months in 2020, the excess mass migrated to the new threshold while diminishing at the old one, confirming that bunching is tax-motivated rather than driven by a reference-point at 24 months. This establishes that labor tax differentials shape not only the quantity of posting contracts (extensive margin) but also their length (intensive margin).&lt;/p&gt;
&lt;h3 id="q7-what-are-the-main-findings-from-the-german-minimum-wage-reform-and-how-do-the-heterogeneity-tests-strengthen-identification"&gt;Q7. What are the main findings from the German minimum wage reform, and how do the heterogeneity tests strengthen identification?&lt;/h3&gt;
&lt;p&gt;Germany&amp;rsquo;s January 2015 introduction of a national minimum wage of €8.50 per hour (preceded by a sectoral minimum in meat processing in August 2014) raised wage costs for foreign firms providing non-construction services, but not for construction firms already covered by a higher sectoral minimum. Postings to Germany in manufacturing fell by approximately 60% relative to the construction (control) sector, implying a reduced-form elasticity of –1.34 (SE 0.43). Two heterogeneity tests reinforce identification: (i) within the treated German sector, posting declines are monotonically increasing in the degree to which the new minimum wage is binding in the origin country, with Luxembourg (where the minimum is non-binding) showing no statistically significant effect; (ii) the same industry-by-country comparison in Germany&amp;rsquo;s high-wage neighboring countries (which did not change minimum wage rules) yields placebo estimates statistically indistinguishable from zero. The reform raised wages for German workers by an average of 6% (and up to 10% for most affected workers) but automatically raised wages for posted workers by an average of 40%, doubling them for workers from the poorest sending countries.&lt;/p&gt;
&lt;h3 id="q8-how-do-the-gravity-model-estimates-compare-to-the-reduced-form-did-estimates-and-what-explains-the-difference"&gt;Q8. How do the gravity model estimates compare to the reduced-form DiD estimates, and what explains the difference?&lt;/h3&gt;
&lt;p&gt;Across gravity specifications, model-implied elasticities range from –0.75 to –2.4. The preferred specification — PPML with pair fixed effects, destination-year fixed effects, and origin-year fixed effects — yields θ = –1.2 (SE 0.2). These estimates are systematically below the medium-run reduced-form DiD estimates because: (a) the gravity model uses nationwide average tax and minimum wage measures that introduce measurement error relative to the sector-specific reforms in the case studies; and (b) the gravity model captures year-to-year (short-run) adjustments, while the DiD designs compare outcomes several years before and after the reform, picking up longer-run equilibrium reallocation. The finding that responses grow over time mirrors evidence on dynamic adjustment in goods trade (Boehm, Levchenko and Pandalai-Nayar, 2023), and contradicts the conventional belief that fiscal devaluations boost exports only in the short run.&lt;/p&gt;
&lt;h3 id="q9-what-does-the-gravity-model-reveal-about-trade-in-goods-as-a-function-of-posting-specific-wage-costs"&gt;Q9. What does the gravity model reveal about trade in goods as a function of posting-specific wage costs?&lt;/h3&gt;
&lt;p&gt;When the same gravity specification is applied to bilateral goods trade rather than posting flows, posting-specific wage costs have a positive — not negative — coefficient on goods trade. This is inconsistent with a model where unobserved shocks affect all exports symmetrically, and instead suggests a small substitution effect: as the cost to import labor services rises (due to tighter posting regulations), countries substitute toward importing goods. For some activities (such as meat processing), importing finished goods is a partial substitute for importing labor services to produce on-site.&lt;/p&gt;
&lt;h3 id="q10-what-are-the-bolkestein-directive-counterfactual-implications-and-how-do-they-connect-to-the-political-economy-evidence"&gt;Q10. What are the Bolkestein Directive counterfactual implications, and how do they connect to the political economy evidence?&lt;/h3&gt;
&lt;p&gt;The Bolkestein Directive (proposed 2005) would have enforced a &amp;ldquo;country of origin principle,&amp;rdquo; exempting foreign posting firms from destination-country minimum wages. Using the preferred lower-bound elasticity from the gravity model (column 5, θ = –1.2) and an upper bound averaging gravity and DiD estimates, the paper predicts this would have at least doubled exports of labor services from Eastern European countries. Tax revenues collected on posted workers in origin countries would also double. However, average posted workers&amp;rsquo; wages would fall by approximately 16%, as workers would lose their entitlement to destination-country minimum wages. The paper documents that the Bolkestein controversy — introduced to the EU Parliament in March 2005 and popularized via the &amp;ldquo;Polish plumber&amp;rdquo; trope — coincided with a sharp and permanent drop in French voter support for the EU constitutional treaty, which was subsequently rejected in referendum. This is consistent with Rodrik&amp;rsquo;s (1998) hypothesis that voters withdraw support for economic integration when comparative advantage appears to be based on institutional choices that conflict with importing countries&amp;rsquo; social norms.&lt;/p&gt;
&lt;h3 id="q11-how-does-the-paper-handle-the-incidence-of-payroll-taxes--does-the-canonical-result-that-payroll-taxes-are-fully-passed-through-to-workers-hold-in-this-context"&gt;Q11. How does the paper handle the incidence of payroll taxes — does the canonical result that payroll taxes are fully passed through to workers hold in this context?&lt;/h3&gt;
&lt;p&gt;The canonical competitive labor market model predicts full pass-through of payroll taxes to workers&amp;rsquo; net wages, leaving firms&amp;rsquo; labor costs unchanged. The paper finds substantial trade responses to payroll tax reforms, inconsistent with full pass-through. Nominal rigidities — including binding minimum wages that constrain downward wage adjustment — help rationalize incomplete pass-through in the EU context. The paper estimates elasticities both with respect to statutory tax rates (the reduced-form, making no incidence assumption) and with respect to total wage costs (instrumented with the reform, allowing for gross wage responses). Wage data from Belgium show no distinguishable wage response to the Belgian tax cut, suggesting the incidence fell largely on firms&amp;rsquo; costs rather than workers&amp;rsquo; wages in that episode.&lt;/p&gt;
&lt;h3 id="q12-what-do-the-destination-based-taxation-counterfactual-tax-cooperation-proposal-calculations-show"&gt;Q12. What do the destination-based taxation counterfactual (tax cooperation proposal) calculations show?&lt;/h3&gt;
&lt;p&gt;A proposal to shift all posting payroll taxation to destination-based rates would decrease posting exports from Eastern European countries by between 10% and 25%. Despite the volume reduction, total taxes collected on posted workers would still increase under this reform even when the upper-bound elasticity (approximately –3.7 with respect to total wage cost) is used, because a 1% increase in the payroll tax rate translates to a much smaller proportional increase in total wage cost.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Posted workers / posting policy:&lt;/strong&gt; Employees temporarily sent by their employer (the &amp;ldquo;exporting firm&amp;rdquo;) to perform a service contract in another EU member state. Posted workers maintain their employment contract with the firm in the origin country but physically work in the destination country. This creates a setting where competing domestic and foreign firms serve the same customers at the same location under different labor regulations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Posting allowance:&lt;/strong&gt; The additional wage component that exporting firms must pay to posted workers to satisfy the destination country&amp;rsquo;s minimum legal wage when that minimum exceeds the firm&amp;rsquo;s home-country wage level. The posting allowance is zero when the exporting country&amp;rsquo;s average wage already exceeds the destination minimum wage; it can be large for low-wage origin countries. The allowance enters directly into firms&amp;rsquo; labor costs and is the minimum-wage channel of the paper&amp;rsquo;s labor cost formula.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Origin-based vs. destination-based payroll taxation:&lt;/strong&gt; Under posting, payroll taxes are normally assessed in the country where the exporting firm is registered (origin-based), creating tax rate differentials between competing firms in the same job site. EU regulations convert payroll taxes to destination-based when posting contracts exceed a duration threshold, eliminating the tax advantage of lower-tax origin countries for those contracts. The 2010 EU regulation additionally imposed destination-based taxation on border-region temporary employment agencies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Trade elasticity for physical services (θ):&lt;/strong&gt; The structural parameter from the Eaton-Kortum (2002) gravity model that governs the elasticity of bilateral posting flows with respect to changes in firms&amp;rsquo; total wage costs when exporting services from country i to country j. The paper&amp;rsquo;s preferred estimate is –1.2 (from gravity estimation) to approximately –1.3 to –1.5 (from reduced-form DiD designs), substantially smaller in absolute value than the goods trade elasticity (typically estimated around 5).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Social standards as comparative advantage:&lt;/strong&gt; The paper uses &amp;ldquo;standards&amp;rdquo; to refer to countries&amp;rsquo; domestic policy choices about payroll taxes (which finance social insurance programs) and minimum wages (which set worker protection floors). The paper demonstrates that these regulatory choices — distinct from productivity differences, factor abundance, or technology — create measurable cost advantages that shape specialization in labor-intensive service sectors. This is in contrast to &amp;ldquo;benign&amp;rdquo; sources of comparative advantage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bolkestein Directive / country of origin principle:&lt;/strong&gt; A 2005 EU legislative proposal that would have required posting firms to operate under the laws of their home country when supplying services in other EU member states, eliminating the hard core of destination-country regulations (including minimum wages) that the 1996 Posted Workers Directive had imposed on foreign firms. The proposal was withdrawn after a wave of protests and its association with a sharp fall in French support for the EU constitutional treaty.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bunching / notch at duration threshold:&lt;/strong&gt; A behavioral response in which exporting firms strategically keep posting contract lengths below the duration threshold that triggers destination-based payroll taxation, generating an excess mass in the distribution of contract lengths just below the threshold. The paper uses this bunching, together with the movement of the threshold from 24 to 18 months in 2020, as additional evidence that payroll tax differentials affect the intensive margin of posting.&lt;/p&gt;</description></item><item><title>Is it AI or data that drives firm market power?</title><link>https://macropaperwarehouse.com/papers/is-it-ai-or-data-that-drives-firm-market-power/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/is-it-ai-or-data-that-drives-firm-market-power/</guid><description>&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Summary blocked:&lt;/strong&gt; No freely available working paper or full text was found. Metadata record only; paper suppressed from site until a working paper is located.&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>Labor Market Competition and the Assimilation of Immigrants</title><link>https://macropaperwarehouse.com/papers/labor-market-competition-and-the-assimilation-of-immigrants/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/labor-market-competition-and-the-assimilation-of-immigrants/</guid><description>&lt;h2 id="labor-market-competition-and-the-assimilation-of-immigrants"&gt;Labor Market Competition and the Assimilation of Immigrants&lt;/h2&gt;
&lt;h3 id="research-question"&gt;Research Question&lt;/h3&gt;
&lt;p&gt;Why have immigrant-native wage gaps widened substantially across arrival cohorts in the United States since the 1960s, and why has the speed of wage convergence slowed? The paper argues that the existing literature, which attributes these trends entirely to declining immigrant cohort quality, omits a critical general-equilibrium channel: labor market competition arising from imperfect substitutability between immigrants and natives. The paper quantifies how much of the observed deterioration in wage assimilation profiles can be attributed to (i) increasing immigrant cohort sizes raising labor market competition, (ii) secular shifts in relative skill demand, and (iii) genuine changes in immigrant cohort quality.&lt;/p&gt;
&lt;h3 id="data-and-methodology"&gt;Data and Methodology&lt;/h3&gt;
&lt;p&gt;The analysis uses U.S. Census microdata for 1970, 1980, 1990, and 2000, combined with American Community Survey (ACS) data pooled for 2009–2011 (labeled 2010) and 2018–2019 (labeled 2020), all drawn from IPUMS-USA. The sample covers individuals aged 25–64 who are employed in the civilian sector, not self-employed, not in group quarters, and report positive earnings. Immigrant cohort sizes grew from approximately 800,000 individuals in the 1960s cohort to 2.3 million in the 1980s cohort and 4.6 million in the 2000s cohort.&lt;/p&gt;
&lt;p&gt;The theoretical framework is a constant elasticity of substitution (CES) production function in which workers supply two types of skills: &amp;ldquo;general&amp;rdquo; skills portable across countries and &amp;ldquo;specific&amp;rdquo; skills particular to the host country (including language proficiency and knowledge of cultural and institutional environment). Immigrants arrive with the same general skills as observationally equivalent natives but only a fraction of their specific skills; they accumulate specific skills over time. Because immigrants disproportionately supply general skills upon arrival, increasing immigrant inflows raise the relative supply of general skills, depress the relative price of general skills, and thereby widen the immigrant-native wage gap. This mechanism operates only when immigrants and natives are imperfect substitutes (elasticity of substitution σ &amp;lt; ∞).&lt;/p&gt;
&lt;p&gt;The model is estimated in two steps using nonlinear least squares (NLS). First, productivity factor parameters are estimated from native wages year by year, with state dummies identifying state-level skill prices. Second, specific skill accumulation parameters and the elasticity of substitution σ are jointly identified from immigrant wage differences across labor markets (defined as U.S. states) and over time. The demand shift parameter δ_t, which captures changes in the relative demand for specific skills (e.g., technology that favors communication over manual tasks), enters as a linear time trend in the baseline specification.&lt;/p&gt;
&lt;h3 id="main-findings-with-quantitative-magnitudes"&gt;Main Findings with Quantitative Magnitudes&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Competition effect:&lt;/strong&gt; Immigration-induced increases in labor market competition explain 14.2, 43.9, and 40.8 percent of the increase in the initial wage gap of the 1970s, 1980s, and 1990s cohorts relative to the 1960s cohort, respectively. Averaged across all years spent in the United States, the competition effect alone accounts for 14.1, 22.4, and 20.4 percent — approximately one fifth overall.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Competition plus demand effect:&lt;/strong&gt; Adding secular shifts in relative skill demand raises these figures to 24.8, 68.3, and 109.5 percent at arrival and 21.2, 33.6, and 36.4 percent averaged across years — approximately one third overall.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Elasticity of substitution:&lt;/strong&gt; The baseline estimate of σ (elasticity of substitution between general and specific skills) is 0.020 (s.e. 0.002), implying an inverse elasticity of approximately 50.5. The relative supply of general skills increased by 1.67 log points between 1970 and 2020, producing a predicted increase in the relative price of specific skills of approximately 59.6 log points. The demand shift trend is estimated at 1.3 log points per year.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cohort quality:&lt;/strong&gt; Once competition and demand effects are netted out, the remaining deterioration in assimilation profiles is entirely attributable to observable changes in immigrants&amp;rsquo; educational attainment and country-of-origin composition. Conditional on these two observable characteristics, unobservable skill quality improved across cohorts (consistent with English language proficiency trends), reversing the conventional narrative of declining cohort quality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Specific skills gap at arrival:&lt;/strong&gt; The 1960s cohort faced a specific skills gap of approximately 52.4 percent relative to native equivalents; this narrowed to 41.8 percent for the 1970s cohort, 35.6 percent for the 1980s cohort, and 17.6 percent for the 1990s cohort, conditional on origin and education. After 20–30 years, all cohorts reach 83.7–92.0 percent of their native counterparts&amp;rsquo; specific skill levels.&lt;/p&gt;
&lt;h3 id="scope-conditions"&gt;Scope Conditions&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;The analysis focuses on employed men in the main text (women are analyzed in an Online Appendix, showing qualitatively similar but quantitatively smaller patterns).&lt;/li&gt;
&lt;li&gt;Labor markets are defined at the U.S. state level in the baseline; robustness checks use state-education and state-gender cells.&lt;/li&gt;
&lt;li&gt;The decomposition covers the period from the 1960s to the 1990s arrival cohorts.&lt;/li&gt;
&lt;li&gt;Results are robust to corrections for selective outmigration, undercounting of undocumented immigrants, immigrant network effects, alternative demand shift specifications, alternative labor market definitions, and endogenous immigrant location choice (using shift-share instruments in the spirit of Card, 2001).&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-theoretical-mechanism-by-which-increasing-immigrant-inflows-widen-the-immigrant-native-wage-gap"&gt;Q1. What is the core theoretical mechanism by which increasing immigrant inflows widen the immigrant-native wage gap?&lt;/h3&gt;
&lt;p&gt;A: Because immigrants disproportionately supply general (country-portable) skills upon arrival, while natives disproportionately supply specific (host-country) skills, an increase in immigrant inflows raises the ratio of general to specific skills in the economy. Under imperfect substitutability (σ &amp;lt; ∞), this lowers the relative price of general skills and raises the relative price of specific skills, thereby widening the wage gap between immigrants (who earn predominantly from general skills) and natives (who earn more from specific skills). The effect is larger in the early years after arrival when immigrants&amp;rsquo; specific skill endowment s is small, and diminishes as immigrants accumulate specific skills over time.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-paper-model-immigrants-skill-accumulation-and-how-do-accumulation-profiles-differ-across-groups"&gt;Q2. How does the paper model immigrants&amp;rsquo; skill accumulation, and how do accumulation profiles differ across groups?&lt;/h3&gt;
&lt;p&gt;A: Immigrants&amp;rsquo; specific skill endowment s(·) upon arrival and over time is modeled as a flexible polynomial in years since migration, interacted with dummies for region of origin, education, cohort of entry, and potential experience abroad. Mexican high school dropouts (the reference group) are estimated to arrive with approximately 80 percent of the specific skills of equivalent natives. Immigrants from Latin America, Asia, and other regions arrive with lower specific skills than Western immigrants, who arrive near native parity. Higher-educated immigrants arrive relatively less similar to equivalently educated natives than low-educated immigrants, reflecting the greater importance of language-intensive skills in high-skill occupations. Conditional on origin and education, more recent cohorts arrive with narrower specific skill deficits: the 1990s cohort faces a gap of 17.6 percent at arrival compared to 52.4 percent for the 1960s cohort.&lt;/p&gt;
&lt;h3 id="q3-what-are-the-estimated-technology-parameters-and-how-are-they-interpreted"&gt;Q3. What are the estimated technology parameters, and how are they interpreted?&lt;/h3&gt;
&lt;p&gt;A: The elasticity of substitution between general and specific skills is estimated at σ = 0.020 (s.e. 0.002), with a confidence interval of [0.017, 0.024]. This implies an inverse elasticity of approximately 50.5, meaning a one percent increase in the relative supply of general skills raises the relative price of specific skills by about 50.5 percent. The implied elasticity of substitution between natives and immigrants (evaluated at market-level averages) is approximately 0.013 in 1990, 0.020 in 2000, and 0.025 in 2010 — in the same range as the Ottaviano and Peri (2012) benchmark of 0.034 (s.e. 0.008). The demand shift trend is estimated at δ̃ = 0.013 (s.e. 0.001) log points per year, reflecting secular increases in the relative demand for specific (host-country) skills.&lt;/p&gt;
&lt;h3 id="q4-how-does-the-paper-identify-the-elasticity-of-substitution-σ-and-the-skill-accumulation-parameters-separately"&gt;Q4. How does the paper identify the elasticity of substitution σ and the skill accumulation parameters separately?&lt;/h3&gt;
&lt;p&gt;A: The estimation proceeds in two steps. First, productivity factor parameters (returns to education and experience) are estimated from native wage regressions, with state-year dummies absorbing state-specific skill prices. Second, skill accumulation parameters θ are identified from wage differences between immigrants with different characteristics working in the same labor market, while σ and the demand shift δ̃ are identified from variation in immigrant wage gaps across states (which have different immigrant population shares) and over time. Specifically, states with higher immigrant shares display lower relative prices of general skills, providing the identifying variation for σ.&lt;/p&gt;
&lt;h3 id="q5-what-are-the-quantitative-magnitudes-of-the-competition-effect-for-specific-cohorts-at-different-time-horizons"&gt;Q5. What are the quantitative magnitudes of the competition effect for specific cohorts at different time horizons?&lt;/h3&gt;
&lt;p&gt;A: At the time of arrival, the competition effect explains 14.2 percent (1970s cohort), 43.9 percent (1980s cohort), and 40.8 percent (1990s cohort) of the increase in initial wage gaps relative to the 1960s cohort. After 10 years, these figures are 17.1, 22.7, and 22.2 percent respectively. After 20 years, they are 12.2, 16.9, and 16.2 percent. After 30 years, 10.9, 15.3, and 13.7 percent. The declining share across years reflects the fact that as immigrants accumulate specific skills, their wages become less sensitive to equilibrium skill prices. Averaged across all years since migration, the competition effect accounts for 14.1, 22.4, and 20.4 percent for the three cohorts.&lt;/p&gt;
&lt;h3 id="q6-how-does-labor-market-competition-affect-the-speed-of-wage-assimilation-and-does-it-prevent-full-convergence"&gt;Q6. How does labor market competition affect the speed of wage assimilation, and does it prevent full convergence?&lt;/h3&gt;
&lt;p&gt;A: The effect on assimilation speed is theoretically ambiguous and depends on whether future cohorts are larger or smaller than the reference cohort, and whether immigrants fully converge to native skill levels. In the stylized examples, a one-time permanent increase in competition raises both the initial wage gap and the speed of subsequent convergence (since the gap between immigrant and native skill levels is larger and therefore more responsive to changes in skill prices). However, continuous inflows of increasingly large cohorts counteract this speedup by continuously shifting the wage profile downward — the &amp;ldquo;dynamic competition effect.&amp;rdquo; For immigrants who fully converge (s → 1), competition delays but does not prevent convergence; for those who only partially converge (s → &amp;lt; 1), competition permanently widens the long-run wage gap. Quantitatively, the paper finds the effect on assimilation speed to be small in the full-sample decomposition.&lt;/p&gt;
&lt;h3 id="q7-what-do-the-illustrative-examples-for-specific-immigrant-groups-reveal-about-heterogeneous-competition-effects"&gt;Q7. What do the illustrative examples for specific immigrant groups reveal about heterogeneous competition effects?&lt;/h3&gt;
&lt;p&gt;A: For a Mexican male high school dropout (1960s cohort skills), facing the same competition level as the 1990s cohort would widen the initial wage gap by 10.2 log points; facing 2010 competition levels would widen it by 21.1 log points. However, because this group fully converges (s → 1), the effect dissipates entirely after approximately 25 years, and long-run wage assimilation is not prevented. For a Latin American male high school graduate who only partially converges (s → &amp;lt; 1), facing 1990s competition would widen the initial gap by 17.4 log points and leave a 3.8 log-point larger long-run wage gap. For a Western college graduate who arrives near native skill parity, competition effects are negligible throughout.&lt;/p&gt;
&lt;h3 id="q8-what-are-the-changes-in-absolute-wage-gaps-documented-in-the-baseline-data"&gt;Q8. What are the changes in absolute wage gaps documented in the baseline data?&lt;/h3&gt;
&lt;p&gt;A: The 1960s cohort arrived with an initial wage gap of approximately 17.2 log points relative to natives. The 1970s cohort arrived with a gap of 30.1 log points, the 1980s cohort 29.2 log points, and the 1990s cohort 20.8 log points. Under the no-competition counterfactual, these initial gaps narrow to 13.6, 24.7, 20.3, and 15.7 log points respectively. Removing both competition and demand effects further narrows them to 13.7, 23.4, 17.5, and 13.3 log points.&lt;/p&gt;
&lt;h3 id="q9-what-does-the-paper-find-about-the-role-of-observable-versus-unobservable-immigrant-quality"&gt;Q9. What does the paper find about the role of observable versus unobservable immigrant quality?&lt;/h3&gt;
&lt;p&gt;A: Once competition and demand effects are accounted for, all remaining cohort differences in assimilation profiles are attributable to observable changes in immigrants&amp;rsquo; educational attainment and country-of-origin composition. Conditional on these two observable characteristics, immigrants in more recent cohorts display higher levels of unobservable skills (smaller specific skill deficits conditional on origin and education), consistent with rising English language proficiency across cohorts. This reverses the standard interpretation that unobservable immigrant quality has declined.&lt;/p&gt;
&lt;h3 id="q10-how-do-aggregate-skill-supplies-and-relative-skill-prices-evolve-over-the-sample-period"&gt;Q10. How do aggregate skill supplies and relative skill prices evolve over the sample period?&lt;/h3&gt;
&lt;p&gt;A: Between 1970 and 2020, the total supply of general skills from immigrants grew by a factor of 16.3, while the supply of specific skills grew by a factor of 15.0. The resulting increase in the relative supply of general skills caused the relative price of general skills to fall from 0.89 to 0.38. Accounting for growing relative demand for specific skills (the δ_t trend), the ratio of relative skill prices fell further to 0.20 by 2020. At the state level, relative prices of general skills are well below 0.3 in high-immigration states like California, Florida, and New York, and approach 1.0 in states with low immigrant shares.&lt;/p&gt;
&lt;h3 id="q11-are-the-results-robust-to-selective-outmigration-undocumented-immigrants-and-alternative-specifications"&gt;Q11. Are the results robust to selective outmigration, undocumented immigrants, and alternative specifications?&lt;/h3&gt;
&lt;p&gt;A: Yes. Across twelve robustness checks covering selective outmigration corrections (using Borjas and Bratsberg 1996 or Rho and Sanders 2021 outmigration rates, and synthetic cohort reweighting), undocumented immigrant undercounting corrections, immigrant network controls (share and stock of compatriots in the same state), alternative demand shift specifications (quadratic and time dummies), alternative labor market definitions (state-education and state-gender cells), and endogenous immigrant location choice (GMM with shift-share instruments), the estimated elasticity of substitution σ ranges from 0.017 to 0.033 and the average competition effects remain stable. Averaged across all robustness checks, competition effects are 1.3 log points (1960s cohort), 3.0 log points (1970s), 5.2 log points (1980s), and 4.3 log points (1990s), compared to baseline values of 1.4, 3.1, 5.5, and 4.6 log points.&lt;/p&gt;
&lt;h3 id="q12-what-are-the-policy-implications-highlighted-by-the-authors"&gt;Q12. What are the policy implications highlighted by the authors?&lt;/h3&gt;
&lt;p&gt;A: First, since assimilation and competition effects are intertwined, the wage impact of immigration on natives is intrinsically dynamic: newly arrived immigrants initially compete relatively little with natives but increasingly substitute for them as their specific skills grow. Second, labor market competition may reduce immigrants&amp;rsquo; incentives to invest in host-country-specific skills, a channel not modeled in most existing structural models. Third, dispersal policies (such as those used during refugee crises) that reallocate immigrants across regions will affect local skill price ratios and therefore alter wage assimilation trajectories — a potentially unintended consequence of geographic allocation policies.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;General skills:&lt;/strong&gt; Skills that are portable across countries and can be used productively in any labor market. In the paper&amp;rsquo;s framework, general skills are those required for tasks (such as manual or physical labor) that are similar across national contexts. Upon arrival, immigrants are assumed to supply the same amount of general skills as observationally equivalent natives, making immigrants&amp;rsquo; relative supply of general skills high at arrival.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Specific skills (host-country-specific skills):&lt;/strong&gt; Skills particular to the host country, including language proficiency (English in the U.S. context) as well as familiarity with the institutional and cultural environment. Immigrants arrive with only a fraction s of the specific skills of comparable natives; this fraction evolves over time as immigrants spend time in the host country. The level of specific skills governs how substitutable a given immigrant worker is with native workers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Labor market competition effect:&lt;/strong&gt; The mechanism by which increasing immigrant inflows affect relative wages through equilibrium changes in skill prices rather than through individual skill accumulation. When immigrants and natives are imperfect substitutes, rising immigrant inflows raise the relative supply of general skills, depress the relative price of general skills, and widen the immigrant-native wage gap. This effect is larger for recently arrived immigrants (small s) and diminishes as immigrants assimilate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dynamic competition effect:&lt;/strong&gt; The combined effect on a given cohort&amp;rsquo;s observed assimilation profile of continuous, growing immigrant inflows over its time in the country. Unlike a one-time permanent increase in competition (which would raise both the initial gap and assimilation speed), continuously growing inflows both widen the initial gap and exert a continuous downward shift on the cohort&amp;rsquo;s wage profile, with an ambiguous net effect on the speed of convergence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Demand shift (δ_t):&lt;/strong&gt; A time-varying parameter in the CES production function capturing secular changes in the relative demand for specific versus general skills beyond what is explained by standard skill-biased technological change. A positive trend in δ_t (estimated at 1.3 log points per year in the baseline) reflects technological change that favors communication-intensive (specific-skill-intensive) tasks over manual (general-skill-intensive) tasks, and amplifies the competition effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Elasticity of substitution between general and specific skills (σ):&lt;/strong&gt; The key technology parameter governing the degree of imperfect substitutability between natives and immigrants in equilibrium. Estimated at σ = 0.020 in the baseline. When σ = ∞, immigrants and natives are perfect substitutes and labor market competition has no effect on relative wages. As σ decreases, the competition effect on relative wages becomes stronger for a given change in relative skill supplies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Specific skill accumulation function s(·):&lt;/strong&gt; A flexible parametric function of years since migration, interacted with region of origin, education level, cohort of entry, and potential experience at arrival, that governs the rate at which immigrants acquire host-country-specific skills over time. The intercept of s(·) at arrival (relative to a native s = 1) measures the initial specific skill deficit; the polynomial in years since migration captures how quickly this deficit closes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage assimilation profile:&lt;/strong&gt; The trajectory of the immigrant-native log wage gap as a function of years spent in the host country, conditional on a cohort of arrival. The paper distinguishes between changes in the level of the profile (the initial wage gap) and changes in its slope (the speed of convergence), and decomposes both dimensions into competition effects, demand effects, and cohort quality effects.&lt;/p&gt;</description></item><item><title>Lender concentration of external debts and sudden stops</title><link>https://macropaperwarehouse.com/papers/lender-concentration-of-external-debts-and-sudden-stops/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/lender-concentration-of-external-debts-and-sudden-stops/</guid><description>&lt;h1 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h1&gt;
&lt;h2 id="research-question"&gt;Research Question&lt;/h2&gt;
&lt;p&gt;This paper studies how the lender structure of external debt — specifically, the degree to which a borrowing country&amp;rsquo;s external debt is concentrated among a small number of large lenders — affects open economies&amp;rsquo; credit conditions, borrowing behavior, and the severity of sudden stops.&lt;/p&gt;
&lt;h2 id="core-mechanism"&gt;Core Mechanism&lt;/h2&gt;
&lt;p&gt;The paper argues that the pecuniary externality arising from collateral foreclosure can be internalized not only by borrowers (as in the standard Bianchi 2011 framework) but also by lenders. When a large lender holds a substantial share of total loans, it has an incentive to foreclose only partially on seized collateral. Selling foreclosed collateral injects asset supply and depresses the collateral price; a sufficiently large lender internalizes this price impact and therefore restrains foreclosure. Atomistic lenders, by contrast, take the collateral price as given and sell all seized collateral (foreclosure rate = 1). Consequently, concentrating external debt in fewer, larger lenders supports a higher collateral price during financial downturns. This higher collateral price raises borrowing capacity, weakens borrowers&amp;rsquo; precautionary saving motive, and causes them to overborrow relative to the social optimum.&lt;/p&gt;
&lt;h2 id="empirical-evidence"&gt;Empirical Evidence&lt;/h2&gt;
&lt;p&gt;Using FFIEC 009a data — quarterly exposure of individual U.S. banks to the external debts of other countries, covering 2003Q1–2022Q2 — the paper documents two new empirical facts. First, lender concentration of emerging countries&amp;rsquo; external debt has been considerably higher than that of advanced countries since the Global Financial Crisis. The average difference in the mean top-3 lender concentration (LTop3) between emerging and advanced economies is 0.11 (= 0.93 − 0.82), with a t-statistic of 13.87. Second, higher lender concentration alleviates sudden stop events in terms of both current account reversal and the decline in asset price proxies. In a difference-in-differences specification interacting sudden stop indicators with lagged lender concentration, the coefficient on the interaction term is negative and statistically significant across all concentration measures. A one-standard-deviation increase in LTop3 (7.2 percentage points) results in a 2.6 percentage point reduction in current account-to-GDP reversal during sudden stops, constituting 7.5% of the overall sudden stop increase. Lender concentration also mitigates real effective exchange rate depreciation during sudden stops, consistent with the mechanism operating through the collateral price channel. Results hold when controlling for rollover risk motives.&lt;/p&gt;
&lt;h2 id="model"&gt;Model&lt;/h2&gt;
&lt;p&gt;The model extends a standard small open economy DSGE framework (Bianchi 2011) by introducing one large lender who holds share eta of total loans and internalizes the pecuniary externality of collateral foreclosure, alongside atomistic lenders who hold share (1 − eta) and take the collateral price as given. When tradable endowment falls short of debt obligations (foreclosure state), lenders optimally choose their foreclosure rate: atomistic lenders set foreclosure rate = 1 (sell all seized collateral), while the large lender sets foreclosure rate &amp;lt; 1 (partial foreclosure to maintain the collateral price). Higher lender concentration (larger eta) leads to lower aggregate foreclosure, less collateral sold, a higher nontradable goods price, a higher borrowing capacity, more tradable consumption, and a weaker precautionary saving motive — generating overborrowing relative to the social planner&amp;rsquo;s allocation.&lt;/p&gt;
&lt;p&gt;Two channels through which concentration affects overborrowing are identified: (1) a debt capacity channel, whereby concentration raises the nontradable price in foreclosure states and thereby increases borrowing capacity; and (2) an amplification channel, whereby concentration steepens the decline in nontradable price per unit fall in tradable consumption, amplifying the pecuniary externality that the social planner internalizes.&lt;/p&gt;
&lt;h2 id="quantitative-results-calibrated-to-argentina"&gt;Quantitative Results (Calibrated to Argentina)&lt;/h2&gt;
&lt;p&gt;In the competitive equilibrium, agents encounter foreclosure with probability 2%, and the large lender sells two-thirds of seized collateral. The social planner&amp;rsquo;s allocation eliminates foreclosure entirely. The social planner&amp;rsquo;s allocation can be implemented via a state-dependent debt tax; the implied consumption-equivalent welfare gain is 0.78%. The pecuniary externality internalized by lenders is estimated to equal two-thirds of the externality internalized by borrowers. Overborrowing is increasing in lender concentration.&lt;/p&gt;
&lt;h2 id="optimal-lender-structure"&gt;Optimal Lender Structure&lt;/h2&gt;
&lt;p&gt;When lender countries optimally choose their lender structure, they select further concentration relative to the baseline in order to gain higher foreclosure repayment. Under optimal lender structure, domestic agents consume and borrow more and encounter sudden stops with higher probability, but completely avoid foreclosure events. Borrower welfare improves by 0.1% in consumption-equivalent terms relative to the baseline competitive equilibrium. The paper concludes that managing lender structure benefits both sides of the international credit market, and notes that policies targeting creditor coordination — such as collective action clauses — may be insufficient to fully correct the efficiency implications of lender structure.&lt;/p&gt;
&lt;h2 id="key-implication"&gt;Key Implication&lt;/h2&gt;
&lt;p&gt;Because lender concentration alleviates crisis severity, emerging economies (which are documented to have substantially more concentrated lender structures than advanced economies) face a reduced precautionary saving motive and therefore tend to overborrow more than advanced economies, compounding their vulnerability to sudden stops.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-papers-central-departure-from-the-bianchi-2011-sudden-stops-framework"&gt;Q1. What is the paper&amp;rsquo;s central departure from the Bianchi (2011) sudden stops framework?&lt;/h3&gt;
&lt;p&gt;The standard Bianchi (2011) model features atomistic lenders who take the collateral price as given, so the pecuniary externality of collateral fire-sales is internalized only by the borrower&amp;rsquo;s social planner. This paper introduces a large lender who holds a non-trivial share eta of total loans and therefore internalizes the price impact of selling foreclosed collateral. This creates a second source of pecuniary externality internalization — on the lender side — that is absent from the canonical framework.&lt;/p&gt;
&lt;h3 id="q2-why-do-atomistic-lenders-sell-all-seized-collateral-while-the-large-lender-does-not"&gt;Q2. Why do atomistic lenders sell all seized collateral, while the large lender does not?&lt;/h3&gt;
&lt;p&gt;Atomistic lenders take the collateral price as given and therefore face no downside from selling their entire share of seized collateral — they cannot individually affect the price. The large lender, holding share eta of total loans, recognizes that selling a large quantity of collateral depresses the nontradable goods price, which reduces the value of any remaining collateral claims. It therefore optimally sets foreclosure rate &amp;lt; 1, retaining some seized collateral to support the equilibrium price.&lt;/p&gt;
&lt;h3 id="q3-what-are-the-two-channels-through-which-lender-concentration-amplifies-overborrowing-and-how-do-they-differ"&gt;Q3. What are the two channels through which lender concentration amplifies overborrowing, and how do they differ?&lt;/h3&gt;
&lt;p&gt;The debt capacity channel operates in foreclosure states: higher concentration reduces foreclosure, raises the nontradable price, and increases the collateral value that backs borrowing. This directly expands the borrowing capacity available to agents and weakens their precautionary saving motive. The amplification channel operates through the slope of the nontradable price response: greater concentration steepens the decline in the nontradable price per unit fall in tradable consumption, which amplifies the pecuniary externality that the social planner internalizes. The two channels reinforce each other in driving overborrowing.&lt;/p&gt;
&lt;h3 id="q4-what-empirical-dataset-is-used-and-what-does-it-measure"&gt;Q4. What empirical dataset is used, and what does it measure?&lt;/h3&gt;
&lt;p&gt;The paper uses FFIEC 009a data, which records the quarterly exposure of individual U.S. banks to the external debts of other countries, covering 2003Q1–2022Q2. From these data, the paper constructs lender concentration measures — including LTop3, the combined share of the top three lenders — at the borrowing-country level for each quarter.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-quantitative-magnitude-of-the-lender-concentration-gap-between-emerging-and-advanced-economies"&gt;Q5. What is the quantitative magnitude of the lender concentration gap between emerging and advanced economies?&lt;/h3&gt;
&lt;p&gt;The average difference in mean top-3 lender concentration (LTop3) between emerging countries and advanced countries is 0.11 (= 0.93 − 0.82), and this difference is highly statistically significant, with a t-statistic of 13.87. This gap emerged and persisted notably since the Global Financial Crisis.&lt;/p&gt;
&lt;h3 id="q6-how-does-lender-concentration-affect-sudden-stop-severity-in-the-empirical-specification-and-how-large-is-the-effect"&gt;Q6. How does lender concentration affect sudden stop severity in the empirical specification, and how large is the effect?&lt;/h3&gt;
&lt;p&gt;The paper estimates a difference-in-differences specification in which current account reversal (and other sudden stop outcome variables) is regressed on a sudden stop indicator, lagged lender concentration, and their interaction, with country and time fixed effects. The coefficient on the interaction term is negative and statistically significant across all concentration measures. A one-standard-deviation increase in LTop3 (7.2 percentage points) reduces current account-to-GDP reversal by 2.6 percentage points, which corresponds to 7.5% of the overall increase in the current account during a sudden stop episode.&lt;/p&gt;
&lt;h3 id="q7-does-higher-lender-concentration-also-mitigate-exchange-rate-and-asset-price-pressures-during-sudden-stops"&gt;Q7. Does higher lender concentration also mitigate exchange rate and asset price pressures during sudden stops?&lt;/h3&gt;
&lt;p&gt;Yes. Lender concentration is also found to mitigate real effective exchange rate depreciation during sudden stops, which is consistent with the model&amp;rsquo;s proposed mechanism: higher concentration supports the collateral (nontradable goods) price, which in turn limits the depreciation of the real exchange rate. The paper reports results on asset price proxy declines as well.&lt;/p&gt;
&lt;h3 id="q8-what-is-the-welfare-cost-of-overborrowing-under-the-baseline-calibration-to-argentina"&gt;Q8. What is the welfare cost of overborrowing under the baseline calibration to Argentina?&lt;/h3&gt;
&lt;p&gt;The social planner&amp;rsquo;s allocation, implemented by a state-dependent debt tax, delivers a consumption-equivalent welfare gain of 0.78% relative to the competitive equilibrium. This measures the efficiency cost of overborrowing under the calibrated model in which the large lender sells two-thirds of seized collateral and competitive equilibrium agents encounter foreclosure with probability 2%.&lt;/p&gt;
&lt;h3 id="q9-how-large-is-the-lender-side-pecuniary-externality-relative-to-the-borrower-side-externality"&gt;Q9. How large is the lender-side pecuniary externality relative to the borrower-side externality?&lt;/h3&gt;
&lt;p&gt;Under the baseline calibration, the pecuniary externality internalized by lenders is estimated to be two-thirds of the externality internalized by borrowers. This is described as a &amp;ldquo;plausible parameterization,&amp;rdquo; meaning that lender-side internalization of the externality is quantitatively substantial relative to the classic borrower-side effect.&lt;/p&gt;
&lt;h3 id="q10-what-does-the-optimal-lender-structure-exercise-find-and-what-does-it-imply-for-welfare"&gt;Q10. What does the optimal lender structure exercise find, and what does it imply for welfare?&lt;/h3&gt;
&lt;p&gt;When lender countries are allowed to optimally choose lender structure, they select a more concentrated structure than the baseline in order to maximize foreclosure repayment. Under this optimal structure, domestic (borrowing-country) agents consume and borrow more, face sudden stops with higher probability, but completely avoid foreclosure events. Borrower welfare improves by 0.1% in consumption-equivalent terms relative to the baseline competitive equilibrium. This implies that concentrating lender structure can be mutually beneficial for both sides of the international credit market.&lt;/p&gt;
&lt;h3 id="q11-why-might-collective-action-clauses-be-insufficient-to-correct-the-efficiency-implications-of-lender-structure"&gt;Q11. Why might collective action clauses be insufficient to correct the efficiency implications of lender structure?&lt;/h3&gt;
&lt;p&gt;Collective action clauses are policies designed to improve creditor coordination in sovereign debt restructuring. The paper argues that the efficiency distortions arising from lender structure go beyond pure coordination failures: because a concentrated lender structure generates welfare-relevant pecuniary externalities through the collateral price channel — affecting overborrowing and crisis severity — addressing creditor coordination alone is insufficient to fully resolve these inefficiencies.&lt;/p&gt;
&lt;h1 id="key-concepts"&gt;Key Concepts&lt;/h1&gt;
&lt;p&gt;&lt;strong&gt;Lender concentration (LTop3):&lt;/strong&gt; The combined loan share held by the top three lenders in a borrowing country&amp;rsquo;s external debt. Measured using FFIEC 009a data. Used as the primary empirical proxy for the degree to which external debt is concentrated in a few large creditors rather than dispersed among many atomistic lenders.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pecuniary externality (lender-side):&lt;/strong&gt; The price impact that a large lender imposes on the collateral market when selling foreclosed assets. Unlike in the standard Bianchi (2011) framework where only borrowers (via the social planner) internalize this externality, a sufficiently large lender also internalizes it by restraining collateral sales to support the collateral price.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Foreclosure rate (zeta):&lt;/strong&gt; The fraction of seized collateral that a lender sells after foreclosure. Atomistic lenders set zeta = 1 (sell everything); the large lender sets zeta &amp;lt; 1 (partial foreclosure) to prevent collateral price depression. The aggregate foreclosure rate is a weighted average across lender types.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Overborrowing:&lt;/strong&gt; Borrowing in excess of the social planner&amp;rsquo;s optimal level, arising because competitive equilibrium agents do not internalize the pecuniary externality of their borrowing on the collateral price. In this model, overborrowing is increasing in lender concentration because a more concentrated lender structure supports a higher collateral price, reducing precautionary saving.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sudden stop:&lt;/strong&gt; An abrupt reversal of capital inflows to an emerging economy, typically associated with a sharp current account reversal, real exchange rate depreciation, and a decline in asset prices. In the model, sudden stops are associated with foreclosure states in which tradable endowment falls short of debt obligations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Debt capacity channel:&lt;/strong&gt; The mechanism by which higher lender concentration raises the nontradable goods price in foreclosure states, thereby increasing the collateral value and expanding agents&amp;rsquo; borrowing capacity, which weakens the precautionary saving motive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Amplification channel:&lt;/strong&gt; The mechanism by which higher lender concentration steepens the slope of the nontradable price response to a fall in tradable consumption, amplifying the magnitude of the pecuniary externality that the social planner internalizes and thus increasing the social planner&amp;rsquo;s incentive to restrict borrowing.&lt;/p&gt;</description></item><item><title>Leveraging Virtual Contact and Social Networks to Foster Interethnic Harmony</title><link>https://macropaperwarehouse.com/papers/leveraging-virtual-contact-and-social-networks-to-foster-interethnic-harmony/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/leveraging-virtual-contact-and-social-networks-to-foster-interethnic-harmony/</guid><description>&lt;p&gt;This paper investigates whether virtual contact — exposure to an outgroup through a documentary film — can promote interethnic harmony, and whether targeting network-central individuals amplifies effects on untreated community members. The study addresses a context of deep, historically rooted discrimination: the Santal ethnic minority in northwestern Bangladesh have faced colonial-era land dispossession, ongoing violence, labor market discrimination, and structural exclusion by the Bengali ethnic majority. The Santals are the second-largest ethnic-minority group in Bangladesh; in the study villages, their share ranges from 13% to 83% of the population.&lt;/p&gt;
&lt;p&gt;The authors conducted a cluster-randomized field experiment across 121 multiethnic villages in the Rajshahi and Naogaon districts of Bangladesh, involving over 3,300 households. Villages were randomly assigned to three arms: a random treatment arm (RR, 40 villages, N=562 Bengalis) in which approximately 14 randomly selected ethnic-majority households per village watched a 45-minute documentary film (&amp;ldquo;Ami Santal&amp;rdquo; / &amp;ldquo;I Am Santal&amp;rdquo;) portraying Santal culture, economic hardships, and aspirations; a central treatment arm (41 villages) in which approximately 7 randomly selected Bengalis (RC) and 7 network-central Bengalis identified via a diffusion-centrality nomination exercise (CC) watched the same film; and a control arm (40 villages) in which households watched a placebo documentary on flower farming. The documentary, costing approximately $13 per participant, was screened individually at participants&amp;rsquo; homes on tablets. Data were collected at baseline (September–October 2022), first end line approximately 3 months post-screening (February–March 2023), and a casual-work field experiment second end line approximately 4.5–5 months post-screening (April–May 2023). Outcomes were measured via lab-in-the-field experiments (dictator game, solidarity game), an experimentally validated interethnic trust survey item (Falk et al. 2018), self-reported behaviors, administrative police complaint data, and facial emotion detection during screening.&lt;/p&gt;
&lt;p&gt;The main findings are as follows. First, treated Bengalis in the central arm (RC) gave 14.7% more in the dictator game (p &amp;lt; .01) and exhibited 21.7% greater trust toward Santals (p &amp;lt; .01) compared to controls; RR participants showed a 7.1% increase in solidarity game giving (p &amp;lt; .10) and 11.8% greater trust (p &amp;lt; .01). Effects on reducing negative stereotypes and discriminatory opinions were not statistically significant, suggesting that affective components of prejudice are more responsive to the intervention than cognitive components. About 82% of treated Bengalis reported acquiring new information about Santals, primarily regarding occupational struggles, educational aspirations, and economic potential. Facial expression analysis using emotion-detection software found sadness to be significantly more prevalent among viewers (p &amp;lt; .05), particularly among network-central participants, consistent with an empathetic response.&lt;/p&gt;
&lt;p&gt;Second, untreated Bengalis in the central arm — who never watched the documentary — showed 20.9% higher altruism (p &amp;lt; .10), 27.3% higher solidarity (p &amp;lt; .05), and 8.1% higher trust (p &amp;lt; .05) toward Santals relative to controls. No significant effects on untreated Bengalis were found in the random arm. Untreated Santals in both arms exhibited greater trust toward Bengalis (11% increase in random arm, p &amp;lt; .05; 21.7% increase in central arm, p &amp;lt; .01) and higher subjective well-being (p &amp;lt; .01 in both arms). Village-level administrative data show a significant reduction in Bengali police complaints against Santals post-intervention (p &amp;lt; .05), but only in the central arm.&lt;/p&gt;
&lt;p&gt;Third, in the casual-work field experiment, multiethnic pairs jointly produced paper bags under piece-rate compensation. Overall productivity increased approximately 5% (p &amp;lt; .05) in the central arm only. Both Bengali and Santal workers increased productivity specifically in the finisher role — the most critical role for determining earnings — in the central arm. The authors interpret Bengali productivity gains as reflecting increased prosociality toward Santal co-workers, and Santal productivity gains as reflecting conformism or peer pressure in response to Bengali effort. The scope of all effects is limited to multiethnic villages in northwestern Bangladesh, a context of historically severe and ongoing majority-minority inequality; the intervention deliberately did not challenge the socioeconomic hierarchy of the villages.&lt;/p&gt;
&lt;p&gt;Q: What was the documentary film&amp;rsquo;s content and design rationale?
A: The 45-minute film &amp;ldquo;Ami Santal&amp;rdquo; featured three narrative layers: Santal culture (rituals, cuisine, the Baha festival), economic hardships (housing, water access, low incomes, labor market struggles, educational barriers), and aspirational stories of Santals who achieved success. All stories were narrated by non-actor local Santals, filmed outside the study region, and deliberately avoided attributing blame to Bengalis. The film was designed under the supervision of anthropologists at the University of Rajshahi to maintain ethnographic authenticity and a non-moralistic, observational tone (moral judgment language was much lower than in comparison Bangladeshi documentaries and general films, per LIWC-22 analysis).&lt;/p&gt;
&lt;p&gt;Q: How were network-central individuals identified and why might targeting them matter?
A: In central-arm villages, enumerators surveyed approximately 18–20 randomly selected passers-by at village markets and asked them to nominate the 15 people most effective at disseminating information. The seven most consistently and highly ranked individuals per village were selected as network-central (CC). These individuals were expected to have high diffusion centrality — meaning information they receive spreads widely — so targeting them with the documentary could shift attitudes and behavior among untreated community members through persuasion, visibility, credibility, or diffusion (the paper cannot separately identify which mechanism operates).&lt;/p&gt;
&lt;p&gt;Q: What were the primary behavioral effects on treated Bengalis (the ethnic majority who watched the film)?
A: Randomly selected participants in the central arm (RC) gave 14.7% more in the dictator game (p &amp;lt; .01) and 8% more in the solidarity game (not statistically significant), and exhibited 21.7% greater trust toward Santals (p &amp;lt; .01), all relative to controls. In the random arm (RR), participants showed a 6.4% increase in dictator game giving (not statistically significant), a 7.1% increase in solidarity game giving (p &amp;lt; .10), and 11.8% greater trust toward Santals (p &amp;lt; .01). Effects on self-reported behaviors — interethnic friendships, social interactions, amount charged to minorities for water — were not statistically significant.&lt;/p&gt;
&lt;p&gt;Q: Did the intervention change Bengali stereotypes or discriminatory opinions toward Santals?
A: No. Despite treated Bengalis acquiring substantial new information (approximately 82% reported learning new things, primarily about Santal occupational struggles and educational aspirations), the authors find no significant effects on the stereotypes index or the discriminatory-opinions index among treated Bengalis. They propose two explanations: cognitive components of prejudice (stereotypes) are harder to change through indirect contact than affective components (emotions, prosocial behavior), consistent with Tropp and Pettigrew (2005) and Turner, Crisp, and Lambert (2007); and a single documentary may be insufficient to counter deeply ingrained generational biases due to resistance to change.&lt;/p&gt;
&lt;p&gt;Q: What emotional responses did the documentary elicit, and how was this measured?
A: Field assistants took candid photographs of participants&amp;rsquo; faces at a random point during the screening; these were analyzed using Emotimeter software (machine learning-based emotion detection) that assigns scores across seven emotion categories summing to 100%. Sadness was significantly more prevalent among documentary viewers compared to placebo viewers (p &amp;lt; .05), particularly among network-central participants (CC). The authors interpret this as consistent with an empathetic response to the film&amp;rsquo;s content about Santal hardships, and connect it to increased prosocial behavior via emotion-regulation mechanisms (alleviating sadness through prosocial action).&lt;/p&gt;
&lt;p&gt;Q: What were the spillover effects on untreated Bengalis in the central arm?
A: Untreated Bengalis in central-arm villages — who never watched the documentary — showed 20.9% higher altruism (p &amp;lt; .10), 27.3% higher solidarity (p &amp;lt; .05), and 8.1% higher trust toward Santals (p &amp;lt; .05) relative to controls. By contrast, untreated Bengalis in random-arm villages showed no statistically significant effects on any of these outcomes. The authors attribute the central-arm spillovers to the presence of network-central individuals being treated in those villages, though whether these patterns reflect persuasion, visibility, credibility, or information diffusion cannot be separately identified.&lt;/p&gt;
&lt;p&gt;Q: How did the intervention affect the Santal ethnic minority (who never watched the documentary)?
A: Untreated Santals in both arms exhibited greater trust toward Bengalis: an 11% increase in the random arm (p &amp;lt; .05) and a 21.7% increase in the central arm (p &amp;lt; .01) compared to controls. Santals in both arms also reported higher subjective well-being (p &amp;lt; .01). A weakly significant increase in food security was observed among Santals in the central arm (p &amp;lt; .10), possibly reflecting increased material support from Bengalis. No statistically significant effects were found on Santal altruism or solidarity.&lt;/p&gt;
&lt;p&gt;Q: What did the village-level administrative complaint data show?
A: Using data collected from two police stations covering all 121 villages, the authors find a significant reduction in Bengali complaints against Santals post-intervention in the central arm (p &amp;lt; .05). No significant reduction was found in Santals&amp;rsquo; complaints against Bengalis (p &amp;gt; .10) in any arm. Data from village counselors&amp;rsquo; offices (shalish arbitration complaints) showed no significant change in any arm. The distinction matters because police complaints involve more serious, violent matters, while village-counselor complaints involve routine arbitration.&lt;/p&gt;
&lt;p&gt;Q: How was the casual-work field experiment designed, and what did it find?
A: Approximately 4.5 months after the documentary screenings, 720 participants (360 Bengalis, 360 Santals) drawn equally from the three study arms were paired into multiethnic dyads to jointly produce paper bags for a local supplier under piece-rate compensation, with earnings split equally. One worker was randomly assigned the preparer role and the other the finisher role; roles were switched halfway through the three-hour session. The paper finds an approximately 5% overall productivity increase (p &amp;lt; .05) in the central arm only, concentrated in the finisher role (the role most critical for final output). Bengalis and Santals both increased productivity specifically as finishers in the central arm.&lt;/p&gt;
&lt;p&gt;Q: What mechanisms explain the productivity effects in the casual-work experiment?
A: For Bengali finishers, the productivity gain is interpreted as prosocial behavior: treated Bengalis who showed greater altruism toward Santals worked harder to increase the earnings of their Santal co-workers. For Santal finishers, the productivity gain is interpreted as conformism or peer pressure: Santals increased effort more when they worked as finisher after swapping roles (i.e., after observing Bengalis&amp;rsquo; higher effort as finisher first), suggesting responsiveness to the higher productivity of Bengalis rather than an independent prosocial motivation. The authors present a simple theoretical model to formalize these interpretations, citing Rotemberg (1994) on prosocial effort and Kandel and Lazear (1992) and Mas and Moretti (2009) on peer pressure mechanisms.&lt;/p&gt;
&lt;p&gt;Q: Why was virtual rather than direct contact used in this intervention?
A: The authors argue that encouraging direct contact between Bengalis and Santals in this setting carries specific risks: the unequal status of the groups may generate anxiety during interactions, potentially limiting engagement or provoking backlash. By contrast, the documentary provides an indirect, low-cost ($13 per participant) form of contact that presents Santal lives without disrupting the socioeconomic hierarchy of the villages and without attributing blame to Bengalis. The film&amp;rsquo;s entertaining veneer and emotional storytelling make it more scalable and logistically feasible in contexts where direct contact is socially difficult or impractical.&lt;/p&gt;
&lt;p&gt;Q: What are the primary limitations acknowledged by the authors?
A: The authors acknowledge that the study&amp;rsquo;s sampling protocol relied on a door-to-door skip procedure without systematic records of approached households, raising the possibility of convenience or snowball-type recruitment and potential deviations from random sampling — this is reflected in some imbalances in baseline characteristics across arms. CC-control comparisons are explicitly descriptive (not causal) because network-central individuals were selected on centrality. Differential attrition was found among untreated Santals (both treatment arms had significantly lower attrition than control, p &amp;lt; .05), which could bias estimates for that subgroup. The authors cannot separately identify the mechanisms (persuasion, visibility, credibility, diffusion) underlying spillover effects in central villages.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of this study?
A: The findings suggest that media-based virtual contact interventions are a low-cost, scalable tool for improving interethnic prosociality even in contexts of deep-rooted discrimination where direct contact may be socially impractical. Targeting network-central individuals — identified via a simple nomination exercise requiring no pre-existing network data — amplifies village-wide effects, including among untreated community members and the minority group itself. The productivity gains in multiethnic work teams imply that improved interethnic relations can have tangible economic consequences beyond attitudinal change. However, the null effects on stereotypes and discriminatory opinions suggest that single documentary interventions may not be sufficient to alter deep-seated cognitive biases, and more intensive or repeated interventions may be needed to achieve durable attitude change.&lt;/p&gt;
&lt;p&gt;Virtual contact: Indirect exposure to an ethnic outgroup through a documentary film, as distinct from direct intergroup contact; posited to influence majority-group attitudes and behavior by increasing empathy and identification with the outgroup without requiring face-to-face interaction.&lt;/p&gt;
&lt;p&gt;Diffusion centrality: A network measure of how effectively an individual can spread information through a community, operationalized via a nomination exercise in which community members identify those best positioned to disseminate information; used to select the seven highest-ranked individuals per village for targeted treatment.&lt;/p&gt;
&lt;p&gt;Prosociality (altruism and solidarity): Measured using incentivized lab-in-the-field games — the dictator game (unilateral allocation of an endowment to a passive outgroup recipient) and the solidarity game (precommitted transfers to an outgroup member who may incur a random loss) — capturing willingness to benefit non-coethnic others at personal cost.&lt;/p&gt;
&lt;p&gt;Affective versus cognitive components of prejudice: A distinction between emotional aspects of prejudice (feelings, empathy) — which the authors find to be more responsive to the documentary intervention — and cognitive aspects (negative stereotypes, discriminatory opinions) — which show no significant change despite new information acquisition.&lt;/p&gt;
&lt;p&gt;Spillover effects (untreated individuals): Changes in behavior or attitudes among community members who did not directly receive the intervention (did not watch the documentary), attributed to the influence of treated individuals in their village, particularly network-central individuals in the central arm.&lt;/p&gt;
&lt;p&gt;Piece-rate casual-work field experiment: A second end line in which multiethnic pairs of Bengali and Santal workers jointly produced paper bags for a local supplier, with individual earnings determined by joint piece-rate output; designed to measure whether improved interethnic attitudes translated into higher workplace productivity in ethnically mixed teams.&lt;/p&gt;
&lt;p&gt;Source text origin: The provenance classification of the text used to generate a paper summary (full PDF, open-access HTML, or abstract only); the paper&amp;rsquo;s pipeline rules impose a hard block on abstract-only summarization.&lt;/p&gt;</description></item><item><title>Linking Social and Personal Preferences: Theory and Experiment</title><link>https://macropaperwarehouse.com/papers/linking-social-and-personal-preferences-theory-and-experiment/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/linking-social-and-personal-preferences-theory-and-experiment/</guid><description>&lt;p&gt;This paper asks whether an individual&amp;rsquo;s attitude toward risk in the personal domain (choices affecting only oneself) can be linked to that same individual&amp;rsquo;s attitude toward risk in the social domain (choices affecting both oneself and others). The authors provide a theoretical answer in the form of necessary and sufficient conditions, and then test those conditions experimentally.&lt;/p&gt;
&lt;p&gt;The formal model posits a decision maker (DM) with a preference relation over lotteries on a set of social states, where a distinguished subset of states are personal (consequences for the DM alone). The authors assume preferences satisfy Completeness, Transitivity, Continuity, and State Monotonicity — the last being equivalent to respect for First-Order Stochastic Dominance (FOSD), a condition weaker than the Expected Utility Independence Axiom and satisfied by virtually all extant decision theories including Weighted Expected Utility, Rank-Dependent Utility, and Prospect Theory. The key theoretical result (Theorem 1) establishes that the full preference relation over all social lotteries can be uniquely deduced from the partial observations of (i) riskless social choices and (ii) risky personal choices if and only if the DM finds every social state indifferent to some personal state. When this condition fails, there exist social lotteries whose ranking cannot be recovered from the partial data.&lt;/p&gt;
&lt;p&gt;For two empirically relevant preference types, this condition generates directly testable predictions: for selfish subjects (who allocate nothing to others in deterministic social choices), risky personal preferences must coincide with risky social preferences; for impartial subjects (who treat self and other symmetrically in deterministic social choices), riskless social preferences must coincide with risky social preferences.&lt;/p&gt;
&lt;p&gt;The experiment was conducted at the University of Bergen and NHH Norwegian School of Economics with 276 undergraduate subjects. Each subject faced 50 budget-line choice problems in each of three domains: Personal Risk (equiprobable binary lotteries over own payoffs only), Social Choice (deterministic splits between self and an anonymous other), and Social Risk (equiprobable binary lotteries over symmetric payout pairs for self and other). The graphical interface of Choi et al. (2007b) was used throughout. One randomly selected decision per domain was paid out; each token was worth 1.2 NOK (approximately 0.2 USD), with average earnings of approximately 270 NOK.&lt;/p&gt;
&lt;p&gt;Within-domain consistency, measured by the Critical Cost Efficiency Index (CCEI), is high: mean CCEIs are 0.959, 0.952, and 0.902 in the Personal Risk, Social Choice, and Social Risk domains respectively. At the CCEI &amp;gt; 0.90 threshold, 89.9%, 85.9%, and 69.9% of subjects pass in the three domains. Using a 0.95 share-to-self threshold, 103 subjects (37.3%) are classified as selfish; using revealed-preference criteria at the 5% significance level, 33 subjects (12.0%) are classified as impartial.&lt;/p&gt;
&lt;p&gt;Testing is done via an individual-level nonparametric permutation test that draws 10,000 random data sets per subject and compares simulated CCEI distributions to actual cross-domain CCEIs, with Bonferroni correction. At the 1% significance level, the null that Personal Risk and Social Risk preferences coincide is rejected for only 5.9%–9.3% of selfish subjects (varying by classification threshold), compared with 14.7%–16.3% rejection rates for non-selfish subjects. For impartial subjects at the 1% level, the null that Social Choice and Social Risk preferences coincide is rejected for 0.0%–11.1%, compared with 19.8%–26.8% for non-impartial subjects. The theory&amp;rsquo;s predictions are thus supported for a large majority of both selfish and impartial subjects.&lt;/p&gt;
&lt;p&gt;A theoretical extension (Theorem 2) shows that if one additionally observes comparisons between social states and personal lotteries, unique deduction of the full preference relation requires that preferences in both personal and social domains satisfy Expected Utility (Independence Axiom) and that every social state is indifferent to some personal lottery — a strictly stronger set of conditions.&lt;/p&gt;
&lt;p&gt;Q: What is the central theoretical question and why does it matter?
A: The paper asks whether preferences over risky social choices (lotteries over outcomes for self and others) can be deduced from observing only riskless social choices and risky personal choices. This matters because people frequently observe or predict the risky social choices of leaders and representatives, but may have access only to those leaders&amp;rsquo; personal risk-taking behavior and their expressed social preferences under certainty.&lt;/p&gt;
&lt;p&gt;Q: What is the main theoretical result (Theorem 1)?
A: Under Completeness, Transitivity, Continuity, and State Monotonicity, the unique extension of the partial preference relation (over social states and personal lotteries) to the full domain of social lotteries exists if and only if every social state is indifferent to some personal state. When this condition is not met, multiple distinct preference relations can extend the partial observations, making deduction impossible.&lt;/p&gt;
&lt;p&gt;Q: What is State Monotonicity and how does it relate to standard axioms?
A: State Monotonicity requires that if each social state in one lottery dominates the corresponding state in another lottery, then the first lottery is weakly preferred. The paper shows this is equivalent to respect for First-Order Stochastic Dominance (FOSD) given the other axioms, and is strictly weaker than the von Neumann–Morgenstern Independence Axiom. It is satisfied by Weighted Expected Utility, Rank-Dependent Utility, and Prospect Theory, making it a broadly applicable assumption.&lt;/p&gt;
&lt;p&gt;Q: What are the testable predictions for selfish subjects?
A: Proposition 2 establishes that if a subject&amp;rsquo;s Social Choice preferences are selfish — meaning any bundle (x, y) is indifferent to (0, y), so the subject is indifferent between keeping x for self and giving it to other — then preferences in the Personal Risk domain must coincide with preferences in the Social Risk domain. In the experiment, selfish subjects are those allocating more than 95% of tokens to themselves in the Social Choice domain (103 of 276 subjects, or 37.3%).&lt;/p&gt;
&lt;p&gt;Q: What are the testable predictions for impartial subjects?
A: Proposition 3 establishes that if a subject&amp;rsquo;s Social Choice preferences are symmetric — meaning (x, y) is indifferent to (y, x) for all pairs — then preferences in the Social Choice domain must coincide with preferences in the Social Risk domain, implying risk neutrality toward social lotteries. The intuition is that such a subject treats self and other identically, so risky splits are evaluated by expected value alone. In the experiment, 33 subjects (12.0%) are classified as impartial by the revealed-preference criterion at the 5% significance level.&lt;/p&gt;
&lt;p&gt;Q: How does the experiment measure within-domain rationality?
A: Choices within each domain are evaluated using the Critical Cost Efficiency Index (CCEI, following Afriat 1967), which measures how much a budget constraint must be relaxed to remove all GARP violations. Mean CCEIs are 0.959 (Personal Risk), 0.952 (Social Choice), and 0.902 (Social Risk). At the CCEI &amp;gt; 0.90 threshold, 248 subjects (89.9%), 237 (85.9%), and 193 (69.9%) pass in the three domains respectively, compared to a simulated mean CCEI of only 0.585 for subjects randomizing uniformly.&lt;/p&gt;
&lt;p&gt;Q: How does the cross-domain test work and why is it nonparametric?
A: The test uses individual-level permutation inference: under the null that preferences in domains I and J are identical, any 50-element subset drawn from the pooled 100 choices should satisfy GARP as well as the actual domain-specific choices. For each subject, 10,000 such random draws are generated, their CCEI scores are computed, and the distribution is compared to the actual cross-domain CCEI with Bonferroni correction. The test makes no functional form assumptions about utility and accommodates the observed within-domain errors without parametric error modeling.&lt;/p&gt;
&lt;p&gt;Q: What are the rejection rates for the selfish-subject prediction?
A: At the 1% significance level, the null that Personal Risk and Social Risk preferences coincide is rejected for only 5.9%–9.3% of selfish subjects (range across four classification thresholds from 0.99 to 0.90 share-to-self), compared to 14.7%–16.3% for non-selfish subjects. At the 5% level, rejection rates rise to 20.4%–25.6% for selfish and 22.4%–31.8% for non-selfish subjects.&lt;/p&gt;
&lt;p&gt;Q: What are the rejection rates for the impartial-subject prediction?
A: At the 1% significance level, the null that Social Choice and Social Risk preferences coincide is rejected for 0.0%–11.1% of impartial subjects (range depending on threshold and classification method), compared to 19.8%–26.8% for non-impartial subjects. At the 5% and 10% levels, rejection rates for impartial subjects range from 0.0% to 22.2%.&lt;/p&gt;
&lt;p&gt;Q: Does the theory predict how risk aversion should map across domains for non-selfish, non-impartial subjects?
A: The theory does not directly produce testable cross-domain predictions for subjects who are neither selfish nor impartial without additional parametric assumptions, because the specific personal-state equivalent of each social state depends on the form of preferences. The paper restricts its nonparametric tests to the two polar cases where the equivalence mapping is determinate from social choice behavior alone.&lt;/p&gt;
&lt;p&gt;Q: What is the extended result (Theorem 2) and what stronger conditions does it require?
A: When one additionally observes comparisons between social states and personal lotteries (not just within each domain separately), unique deduction of the full preference relation is possible if and only if preferences in both the personal and social domains are consistent with an Expected Utility representation and every social state is indifferent to some personal lottery. This requires the Independence Axiom — a strictly stronger condition than State Monotonicity — highlighting that the main Theorem 1 result exploits the weaker observational structure.&lt;/p&gt;
&lt;p&gt;Q: What is the distribution of social preferences in the sample?
A: Of 276 subjects, 103 (37.3%) are classified as selfish at the 0.95 share-to-self threshold. Only 6 subjects (2.2%) kept fewer than 0.45 of tokens on average, making purely altruistic subjects rare. In the Personal Risk domain, 41 subjects (14.9%) allocated more than 95% to the cheaper account (consistent with risk neutrality), while 9 (3.3%) allocated fewer than 55% (consistent with infinite risk aversion). In the Social Risk domain, 30 subjects (10.9%) are consistent with utilitarianism in money and 9 (3.3%) with Rawlsianism in money.&lt;/p&gt;
&lt;p&gt;Q: How does the Social Risk domain compare to the Personal Risk and Social Choice domains in terms of rationality scores?
A: The Social Risk domain shows lower consistency than the other two: mean CCEI is 0.902 versus 0.959 and 0.952, and only 69.9% of subjects exceed the 0.90 threshold versus 89.9% and 85.9%. The CCEI distribution is shifted left for Social Risk, suggesting the novel combined dimension of social and risky choice introduces more decision complexity or error.&lt;/p&gt;
&lt;p&gt;Q: What is the relationship to the prior experimental literature on social and risk preferences?
A: The Personal Risk domain replicates the symmetric risk experiment of Choi et al. (2007a), and the Social Choice domain replicates the linear two-person dictator experiment of Fisman et al. (2007). The Social Risk domain is new to this paper. The theoretical framework connects to Saito (2013) on social preferences under risk, and to the preference extension literature of Grant et al. (1992) and Nishimura et al. (2017).&lt;/p&gt;
&lt;p&gt;State Monotonicity: The axiom requiring that if each social state in one lottery weakly dominates the corresponding social state in another lottery, the first lottery is weakly preferred. The paper proves this is equivalent to respect for First-Order Stochastic Dominance given Completeness, Transitivity, and Continuity, and distinguishes it from the stronger Independence Axiom by noting that Independence compares lotteries over lotteries while State Monotonicity only compares lotteries over states.&lt;/p&gt;
&lt;p&gt;Selfish preferences (in the paper&amp;rsquo;s sense): Preferences in the Social Choice domain such that (x, y) is indifferent to (0, y) for all bundles — the subject is indifferent between receiving x themselves versus giving x to the other person. Operationally measured as allocating more than a threshold share (e.g., 95%) of tokens to self across Social Choice decisions.&lt;/p&gt;
&lt;p&gt;Impartial preferences (in the paper&amp;rsquo;s sense): Preferences in the Social Choice domain such that (x, y) is indifferent to (y, x) for all bundles — the subject treats self and other symmetrically. Operationally identified by the revealed preference criterion that choices in the Social Choice domain satisfy GARP and are consistent with symmetric treatment.&lt;/p&gt;
&lt;p&gt;Unique extension (deducibility): The property that there exists exactly one complete preference relation over all social lotteries that is consistent with the axioms and agrees with the observed partial relation over social states and personal lotteries. Theorem 1 identifies the necessary and sufficient condition for unique extension under State Monotonicity.&lt;/p&gt;
&lt;p&gt;Personal state indifference condition: The condition that for every social state omega in Omega minus P, there exists some personal state in P to which the DM is indifferent. This is the necessary and sufficient condition in Theorem 1 for deducibility of the full preference relation. Interpreted as: for every proposed social allocation, there exists a &amp;ldquo;bribe&amp;rdquo; — a personal allocation with nothing for others — that the DM finds equally desirable.&lt;/p&gt;
&lt;p&gt;Critical Cost Efficiency Index (CCEI): A measure of how much budget constraints must be scaled down to eliminate all GARP violations in a dataset of choices from budget lines (following Afriat 1967). A CCEI of 1 indicates perfect rationality; the paper uses 0.90 as a practical threshold. Mean values are 0.959, 0.952, and 0.902 in the Personal Risk, Social Choice, and Social Risk domains respectively.&lt;/p&gt;
&lt;p&gt;Nonparametric permutation test: The individual-level test used to assess consistency across choice domains. Under the null that preferences are identical in domains I and J, any random 50-element draw from the pooled 100 choices should achieve CCEI scores no worse than the actual domain scores. The test draws 10,000 permuted datasets per subject and uses the Bonferroni correction for multiple comparisons, making no assumptions about the functional form of utility.&lt;/p&gt;</description></item><item><title>Lives Versus Livelihoods: The Impact of the Great Recession on Mortality and Welfare</title><link>https://macropaperwarehouse.com/papers/lives-versus-livelihoods-the-impact-of-the-great-recession-on-mortality-and-welfare/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/lives-versus-livelihoods-the-impact-of-the-great-recession-on-mortality-and-welfare/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Does the Great Recession reduce or increase mortality, and what are the welfare implications of incorporating recession-induced mortality changes into standard macroeconomic welfare frameworks?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting and Identification.&lt;/strong&gt; The authors exploit spatial variation in the severity of the 2007–2009 Great Recession across 741 U.S. Commuting Zones (CZs), following the empirical design of Yagan (2019). The primary shock variable is the percentage-point change in the CZ unemployment rate between 2007 and 2009. The key identifying assumption is that no concurrent shocks to mortality coincide with the timing and geographic pattern of the Great Recession shock. Pre-trend evidence supports this: CZs subsequently harder hit experienced a slight relative &lt;em&gt;increase&lt;/em&gt; in mortality before 2007, which is the opposite sign from the main effect, supporting the validity of the design.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; Mortality data come from CDC restricted-use death certificate microdata (2003–2016) covering the universe of U.S. deaths, combined with SEER population denominators. A 20 percent random sample of Medicare enrollees aged 65–99 provides an individual-level panel that directly addresses concerns about endogenous migration. The main outcome is the log age-adjusted CZ mortality rate; economic indicators come from BLS, BEA, and FHFA; air pollution data from the EPA AQS monitor network (PM2.5); morbidity from the BRFSS; nursing home characteristics from federal certification inspections.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Mortality Finding.&lt;/strong&gt; A one-percentage-point increase in the local unemployment rate between 2007 and 2009 is associated with a 0.50 percent decline (SE = 0.15) in the annual age-adjusted mortality rate in 2007–2009, and a 0.58 percent decline (SE = 0.34) in 2010–2016; the two periods are statistically indistinguishable (p = 0.78). Because the national average unemployment rate rose by 4.6 percentage points, the Great Recession on average reduced the annual age-adjusted mortality rate by approximately 2.3 percent, with effects persisting for at least 10 years. The authors note this is equivalent to approximately two years of secular mortality improvement at the pre-recession trend pace of 1.1 percent per year. For a 55-year-old, the estimates imply that 1 in 25 gained an extra year of life from a shock of this magnitude.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity by Cause of Death.&lt;/strong&gt; Mortality declines appear across most major causes. Cardiovascular disease (34 percent of 2006 deaths) declines by 0.65 percent per percentage-point unemployment increase (SE = 0.21) and accounts for approximately 48 percent of the total estimated mortality reduction. Motor vehicle mortality falls by 1.7 percent (SE = 0.56) and liver disease by 1.1 percent (SE = 0.43). Suicides show a statistically significant 1.7 percent decline (SE = 0.5) in the 2010–2016 period. The notable exception is cancer (the second-largest cause of death), for which the estimated effect is a precise null of 0.02 percent (SE = 0.11). The null cancer result is interpreted as a specification check: if mortality declines were spurious (e.g., driven by population mismeasurement), cancer mortality should also decline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity by Demographics.&lt;/strong&gt; Recession-induced mortality declines are similar in percentage terms across gender and race/ethnicity, and statistically equi-proportional across age groups (p-value for equality across 25–64 versus 65+: 0.76). Because mortality is heavily concentrated in the elderly, those aged 65 and over account for approximately 74.3 percent of averted deaths, roughly proportional to their 72.5 percent share of 2006 mortality. The most striking heterogeneity is by education: the entire mortality decline is concentrated among the approximately 52 percent of the population with a high school degree or less. The estimated 2007-2016 effect is −1.3 percent per percentage-point unemployment increase (SE = 0.56) for those with high school or less, compared to +0.34 percent (SE = 0.68) for those with more than high school (statistically distinguishable at p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanisms.&lt;/strong&gt; The authors distinguish internal effects (own reduced employment or consumption improving health) from external effects (externalities from reduced aggregate economic activity, holding own employment/consumption fixed). Evidence strongly favors external effects as the primary driver. Three-quarters of averted deaths accrue to the elderly, who experienced no direct income effects from the labor market shock. Moreover, the timing pattern—an immediate mortality drop that does not grow over time—is inconsistent with health-behavior channels (e.g., smoking cessation, improved diet) that would build up gradually. Direct tests find no statistically significant impact on self-reported health behaviors (smoking, drinking, exercise) and no impact on healthcare use among Medicare enrollees.&lt;/p&gt;
&lt;p&gt;Among external channels, neither reduced spread of infectious disease nor improved nursing home staffing receives empirical support. Reduced air pollution (PM2.5) is identified as a quantitatively important channel. A one-percentage-point increase in CZ unemployment is associated with a 0.16 µg/m³ decline in PM2.5 (SE = 0.04), a 1.3 percent decline relative to the 2006 national average of 12 µg/m³. A mediation analysis (controlling for the PM2.5 shock) attenuates the estimated mortality effect by 37 percent, from −0.52 percent to −0.33 percent per percentage-point unemployment increase. Back-of-the-envelope calculations combining the PM2.5 decline with external estimates of PM2.5-mortality elasticities suggest pollution can explain 17 to 35 percent of total recession-induced mortality declines.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Lag Structure.&lt;/strong&gt; Exploiting variation in the speed of post-recession labor market recovery (measured by 2010–2016 EPOP ratio changes) conditional on the initial shock, the authors find that mortality reductions persist in areas that have fully recovered economically by 2016, suggesting lagged mortality effects of the initial economic downturn beyond what contemporaneous economic conditions alone explain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Welfare Analysis.&lt;/strong&gt; The authors extend the Krebs (2007) consumption-based welfare cost-of-recessions model to incorporate endogenous mortality. For a 45-year-old with γ = 2 and a value of a statistical life-year (VSLY) of $250k (five times annual consumption), accounting for endogenous mortality reduces the willingness to pay to avoid all future recessions from 2.00 percent of average annual consumption to 0.91 percent—a reduction of approximately 55 percent. Starting around age 55, recessions become welfare-improving on net. For the Great Recession specifically, at age 55 endogenous mortality reduces the welfare cost by approximately 25 percent (from 2.39 to 1.80 percent of average annual consumption). Because mortality declines are concentrated among those with high school or less, accounting for endogenous mortality also substantially mitigates—and at older ages reverses—the finding that the Great Recession was more costly for the less educated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions and Caveats.&lt;/strong&gt; (i) The design captures only differential local effects, not nationwide impacts (e.g., stock market collapse, nationwide malaise). (ii) Mortality impacts may not generalize to milder recessions, though the relationship appears approximately linear in shock size. (iii) The analysis excludes morbidity, though limited evidence suggests morbidity is also pro-cyclical and roughly equi-proportional across ages. (iv) The welfare analysis begins at age 35 and does not account for longer-run mortality costs of recession entry for younger cohorts.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-baseline-empirical-specification-and-why-does-the-design-exploit-cross-sectional-variation-rather-than-time-series-panel-regressions"&gt;Q1. What is the baseline empirical specification, and why does the design exploit cross-sectional variation rather than time-series panel regressions?&lt;/h3&gt;
&lt;p&gt;The estimating equation regresses the log age-adjusted CZ mortality rate on an interaction of the CZ-level Great Recession shock (2007–2009 unemployment change) with year indicators, plus CZ and year fixed effects, weighted by 2006 CZ population. The authors prefer this to the standard two-way fixed effects panel approach (area and year FE with contemporaneous unemployment rate) for three reasons: (1) it directly identifies the full dynamic lag structure of the shock rather than imposing contemporaneity; (2) exploiting a single spatially differentiated shock reduces risk of confounding from other concurrent area-level shocks; (3) the panel can be linked to individual-level Medicare data, allowing explicit control for endogenous migration, which the existing literature cannot do.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-paper-address-the-concern-that-mortality-rate-declines-might-simply-reflect-unmeasured-population-outflows-from-hard-hit-areas-rather-than-genuine-reductions-in-deaths"&gt;Q2. How does the paper address the concern that mortality rate declines might simply reflect unmeasured population outflows from hard-hit areas rather than genuine reductions in deaths?&lt;/h3&gt;
&lt;p&gt;The authors offer two main responses. First, cancer mortality shows a precise null effect despite being the second-leading cause of death; if unmeasured population losses were driving the results, cancer deaths should decline proportionally. Second, using the Medicare individual-level panel, they fix each enrollee&amp;rsquo;s location at their 2003 CZ and find a statistically significant mortality decline of 0.35 percent per percentage-point unemployment increase in the reduced-form (2007–2009 period). A control function approach that instruments current-year location with 2003 location yields an estimate of −0.37 percent (SE = 0.17), similar to the baseline −0.50 percent from the aggregate specification, confirming that migration bias is not the primary driver.&lt;/p&gt;
&lt;h3 id="q3-how-long-do-the-mortality-reductions-from-the-great-recession-persist-and-does-the-paper-identify-whether-these-are-contemporaneous-or-lagged-effects"&gt;Q3. How long do the mortality reductions from the Great Recession persist, and does the paper identify whether these are contemporaneous or lagged effects?&lt;/h3&gt;
&lt;p&gt;The 2007–2009 period estimate is −0.50 percent per percentage-point unemployment increase and the 2010–2016 period estimate is −0.58 percent, and these are statistically indistinguishable (p = 0.78). To identify whether persistence reflects ongoing economic effects or true lagged mortality effects, the authors compare CZs with above- vs. below-median 2010–2016 EPOP recovery (conditional on initial shock decile). Both groups show similar 2010–2016 mortality declines despite the above-median recovery CZs having returned to pre-recession employment levels by 2016. This finding is consistent with lagged mortality effects of the initial economic downturn that persist independently of current economic conditions.&lt;/p&gt;
&lt;h3 id="q4-are-mortality-reductions-concentrated-among-individuals-already-near-death-harvesting-or-do-they-represent-meaningful-longevity-gains"&gt;Q4. Are mortality reductions concentrated among individuals already near death (&amp;ldquo;harvesting&amp;rdquo;), or do they represent meaningful longevity gains?&lt;/h3&gt;
&lt;p&gt;The authors use a Medicare auxiliary model to predict counterfactual remaining life expectancy for each enrollee based on age, demographics, and chronic conditions. The marginal life saved has only about 6 percent lower counterfactual remaining life expectancy than a typical decedent of the same age, and this difference is statistically insignificant. Because effects persist over 10 years (not just days or weeks), short-run mortality displacement (harvesting) is not the operative concern. The 6 percent difference is also small enough that the authors do not adjust their welfare analysis for it.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-educational-gradient-in-mortality-impacts-and-is-it-explained-by-age-composition-or-other-confounders"&gt;Q5. What is the educational gradient in mortality impacts, and is it explained by age composition or other confounders?&lt;/h3&gt;
&lt;p&gt;Mortality declines are entirely concentrated among those with a high school degree or less: the 2007–2016 estimate is −1.3 percent per percentage-point unemployment increase (SE = 0.56) for this group versus +0.34 percent (SE = 0.68) for those with more than high school, distinguishable at p &amp;lt; 0.01. This gradient holds within age groups (confirmed in Appendix analysis), and further disaggregation shows no mortality declines for those with some college or college-or-more separately. In Medicare data, the elderly mortality effect is concentrated among the approximately 12 percent enrolled in Medicaid (a proxy for low income), reinforcing the socioeconomic concentration.&lt;/p&gt;
&lt;h3 id="q6-what-evidence-rules-out-improved-health-behaviors-increased-exercise-reduced-smoking-reduced-alcohol-as-the-main-mechanism"&gt;Q6. What evidence rules out improved health behaviors (increased exercise, reduced smoking, reduced alcohol) as the main mechanism?&lt;/h3&gt;
&lt;p&gt;Two types of evidence argue against this channel. First, three-quarters of averted deaths are among the elderly, who experienced no direct income or employment effects from the local labor market shock and would not plausibly change their health behaviors in response to someone else losing employment. Second, the mortality decline is immediate in 2007 and flat through 2016 rather than growing over time; smoking cessation, for example, takes 10–15 years to accumulate mortality effects. Direct tests of behavioral outcomes from BRFSS find no statistically significant impact on smoking, drinking, exercise, or flu vaccination rates, individually or pooled. The pooled average treatment effect on six morbidity measures is statistically significant and negative (suggesting morbidity improvements), but behavioral covariates show no movement.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-evidence-for-and-against-improved-nursing-home-care-as-a-mechanism"&gt;Q7. What is the evidence for and against improved nursing home care as a mechanism?&lt;/h3&gt;
&lt;p&gt;Prior literature (Stevens et al. 2015; Konetzka et al. 2018; Antwi and Bowblis 2018) documents that recessions increase nursing home staffing and reduce nursing home deaths in earlier decades. However, the authors find no evidence for this channel in the Great Recession context. Estimated mortality impacts are virtually identical (approximately 0.5 percent per percentage-point unemployment increase) for the 7 percent of the elderly in nursing home care and the 93 percent not in nursing home care. Direct measures of nursing home staffing (direct-care staff hours per resident-day, highly skilled nurses ratio) show no statistically significant change in harder-hit areas: the point estimate for direct-care hours is −0.11 percent (SE = 0.22) in 2007–2009. Nursing home occupancy rates and resident characteristics also show no significant changes.&lt;/p&gt;
&lt;h3 id="q8-how-is-the-quantitative-importance-of-the-air-pollution-channel-estimated-and-what-are-the-two-complementary-approaches-used"&gt;Q8. How is the quantitative importance of the air pollution channel estimated, and what are the two complementary approaches used?&lt;/h3&gt;
&lt;p&gt;Approach 1 (back-of-the-envelope): The authors combine their estimate that a one-percentage-point unemployment increase reduces PM2.5 by 0.16 µg/m³ with external estimates from Deryugina et al. (2019) of PM2.5&amp;rsquo;s effect on elderly daily mortality, rescaled to annual exposure. This calculation implies pollution explains 17–35 percent of total recession-induced mortality declines, depending on which Deryugina et al. mortality estimates are used. Approach 2 (mediation analysis): Adding the county-level PM2.5 shock as an additional control in the mortality regression attenuates the Great Recession mortality coefficient from −0.52 percent to −0.33 percent per percentage-point unemployment increase—a 37 percent attenuation. Both approaches are suggestive rather than definitive, as the mediation analysis requires the strong assumption that the recession shock and PM2.5 shock are conditionally independent of other unmeasured mediators.&lt;/p&gt;
&lt;h3 id="q9-what-are-the-specific-calibration-parameters-in-the-welfare-model-and-how-does-the-paper-set-the-mortality-decline-parameter"&gt;Q9. What are the specific calibration parameters in the welfare model and how does the paper set the mortality decline parameter?&lt;/h3&gt;
&lt;p&gt;The authors extend Krebs (2007)&amp;rsquo;s income process calibration (pH = 0.03, pL = 0.05, dH = 0.09, dL = 0.21, g = 0.02, σ = 0.01, πH = 0.5) and use 2007 SSA life tables for age-specific mortality rates in normal times. The recession mortality parameter is set to dm = −0.015 for all ages, derived from a 3.1 percentage-point unemployment increase in a typical recession multiplied by the estimated 0.5 percent mortality decline per percentage-point. VSLY values are parameterized at two, five, or eight times annual consumption ($100k, $250k, or $400k at $50k annual consumption). Risk aversion γ takes values 1.5, 2, and 2.5. For the Great Recession-specific exercise, dmA = −0.023 (4.6 × 0.5 percent), dmHS = −0.037, and dmC = 0.0006.&lt;/p&gt;
&lt;h3 id="q10-how-does-accounting-for-endogenous-mortality-change-the-distributional-welfare-analysis-of-the-great-recession-by-education-group"&gt;Q10. How does accounting for endogenous mortality change the distributional welfare analysis of the Great Recession by education group?&lt;/h3&gt;
&lt;p&gt;Under exogenous mortality, the welfare cost of the Great Recession at age 35 is 2.89 percent of average annual consumption for those with high school or less versus 1.23 percent for those with more than high school—the less educated bear roughly twice the burden. Under endogenous mortality, the mortality declines are concentrated entirely among the less educated (dmHS = −0.037 vs. dmC ≈ 0), so accounting for mortality disproportionately offsets welfare losses for that group. By around age 65, the welfare costs of the Great Recession converge across education groups, and after age 65, the less educated bear &lt;em&gt;lower&lt;/em&gt; welfare costs than the more educated, reversing the exogenous-mortality ranking. This result depends on the same education differential in mortality impacts that drives the main empirical finding.&lt;/p&gt;
&lt;h3 id="q11-what-robustness-checks-demonstrate-that-the-baseline-mortality-estimates-are-not-driven-by-geographic-or-functional-form-choices"&gt;Q11. What robustness checks demonstrate that the baseline mortality estimates are not driven by geographic or functional-form choices?&lt;/h3&gt;
&lt;p&gt;The baseline CZ-level estimate of −0.50 percent (SE = 0.15) is replicated almost exactly at the state level (−0.62, SE = 0.25) and county level (−0.49, SE = 0.10). A Poisson regression yields −0.45 percent (SE = 0.14). Dropping the top/bottom decile of CZs by shock size yields −0.46 percent (SE = 0.16). Adding Census-division-by-year fixed effects attenuates the estimate slightly to −0.38 percent (SE = 0.14) but retains statistical significance. Dropping CZs with high fracking activity and dropping the ten most populous CZs both produce estimates similar to baseline. Quartile regressions show monotone mortality reductions across quartiles of the unemployment shock, consistent with approximate linearity.&lt;/p&gt;
&lt;h3 id="q12-what-does-the-expert-survey-reveal-about-prior-beliefs-and-how-does-the-papers-finding-compare"&gt;Q12. What does the expert survey reveal about prior beliefs, and how does the paper&amp;rsquo;s finding compare?&lt;/h3&gt;
&lt;p&gt;In a spring 2023 survey of over 300 experts, 50 percent predicted the Great Recession would &lt;em&gt;increase&lt;/em&gt; mortality and only 27 percent predicted a decrease. Of those predicting a decrease, 93 percent gave a magnitude larger (in absolute value) than the paper&amp;rsquo;s negative point estimate of 0.50 percent per percentage-point unemployment increase, and 82 percent gave a prediction larger than the upper bound of the 95 percent confidence interval. This illustrates that the paper&amp;rsquo;s finding—mortality is meaningfully pro-cyclical during the Great Recession—was highly surprising to the empirical and policy economics community.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Pro-cyclical mortality&lt;/strong&gt;: The phenomenon whereby mortality rates fall during economic downturns and rise during expansions. The paper documents this for the Great Recession using a spatial identification strategy, in contrast to the time-series correlation that had weakened in the two decades before the Great Recession. The term &amp;ldquo;pro-cyclical&amp;rdquo; means mortality moves in the same direction as the business cycle (up in booms, down in recessions), implying recessions are associated with fewer deaths.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Internal vs. external effects (of recessions on mortality)&lt;/strong&gt;: The paper distinguishes internal effects—whereby an individual&amp;rsquo;s own reduced employment or consumption affects her own mortality—from external effects, which are changes in mortality from reduced aggregate economic activity that hold constant one&amp;rsquo;s own employment and consumption. This distinction has direct welfare implications: external effects (e.g., less pollution from lower industrial output) are genuine welfare improvements for people who did not lose income, while internal effects of behavioral change are mitigated by the envelope theorem if behavior is privately optimal.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Commuting Zone (CZ) shock&lt;/strong&gt;: The paper&amp;rsquo;s primary treatment variable, defined as the percentage-point change in the CZ unemployment rate between 2007 and 2009. CZs are aggregations of counties (741 total) designed to approximate local labor markets. The median CZ experienced a 4.6-percentage-point increase, with substantial variation ranging from roughly 2.9 points (bottom quartile) to 6.7 points (top quartile).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Value of a Statistical Life-Year (VSLY)&lt;/strong&gt;: The dollar value placed on one additional year of life in expectation, used in the welfare calibration. In the paper&amp;rsquo;s framework it equals VSLY = bcγ − c/(γ−1), where b is a preference parameter governing the marginal utility of life-years. Results are reported for VSLYs of $100k, $250k, and $400k corresponding to two, five, and eight times average annual consumption of $50k, following Hall and Jones (2007).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous mortality in welfare analysis&lt;/strong&gt;: The paper&amp;rsquo;s central theoretical contribution is augmenting the Krebs (2007) welfare cost-of-recessions framework to allow mortality to vary with the aggregate state of the economy. When mortality is endogenously lower in recessions, the willingness to pay to eliminate recession risk falls—and at high enough VSLY or old enough ages, recessions become welfare-improving because the mortality benefit outweighs the consumption cost.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mortality displacement (harvesting)&lt;/strong&gt;: The possibility that short-run mortality declines merely reflect the premature death of already-frail individuals being slightly delayed, without meaningful longevity gains. The paper argues this is not the operative concern given 10-year persistence and uses auxiliary Medicare models to show marginal lives saved have only 6 percent shorter counterfactual life expectancy than average decedents of the same age.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;PM2.5 mediation analysis&lt;/strong&gt;: An empirical approach in which the county-level change in fine particulate matter (PM2.5, in µg/m³) between 2006 and 2010 is added as a covariate in the mortality regression. Under the assumption that the recession shock and the PM2.5 shock are conditionally independent of other unmeasured mediators, the attenuation in the recession-mortality coefficient when controlling for PM2.5 identifies the share of the mortality effect operating through the pollution channel. A 37 percent attenuation is found in the 2007–2009 period.&lt;/p&gt;</description></item><item><title>Marginal Returns to Public Universities</title><link>https://macropaperwarehouse.com/papers/marginal-returns-to-public-universities/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/marginal-returns-to-public-universities/</guid><description>&lt;p&gt;This paper asks whether enrolling in an American public university generates positive net returns for marginal students — those who barely qualify for admission — and whether those returns justify public expenditures. The question is policy-relevant because marginal students have weak academic preparation, face high dropout risk, and the net returns to expanding admission margins are theoretically ambiguous.&lt;/p&gt;
&lt;p&gt;The author assembles administrative records spanning all 35 public universities in Texas, covering the universe of Texas public high school graduates from 2004–2014 (approximately 2.7 million students). Texas public universities collectively enroll over 10 percent of all American public university students. The data link high school records (test scores, demographics, coursework, attendance, disciplinary infractions) to college application and admission records, postsecondary enrollment and degree completion records, financial aid packages, institutional expenditure data from IPEDS, and quarterly earnings records from the Texas Workforce Commission unemployment insurance system.&lt;/p&gt;
&lt;p&gt;The identification strategy exploits hundreds of decentralized SAT/ACT score cutoffs in university admissions — varying across schools and application years — that generate sharp discontinuities in admission probability. A fuzzy regression discontinuity design compares applicants just above versus just below each cutoff. On average, crossing a cutoff raises the probability of admission by 27 percentage points and the probability of enrolling at the target university by 15 percentage points. Density tests and pre-college covariate balance validate the smoothness assumptions. The typical cutoff complier is more disadvantaged than the average college applicant but comparable to the average Texas high school graduate.&lt;/p&gt;
&lt;p&gt;Roughly half of cutoff compliers would fall back to another, typically less selective, four-year institution if rejected; 43 percent would fall back to a two-year community college; and only about 6 percent would forgo higher education entirely. The pooled estimates therefore blend intensive-margin effects (more selective versus less selective four-year college) with extensive-margin effects (four-year college versus community college or no college).&lt;/p&gt;
&lt;p&gt;Main causal findings for enrollment compliers: the typical marginally admitted student completes approximately one additional year of credits in the four-year sector and becomes 12 percentage points more likely to ever earn a bachelor&amp;rsquo;s degree from any institution. About half of the additional four-year credits are offset by 15 fewer credits in the two-year sector, and associate degree or certificate completion falls by 7 percentage points. All bachelor&amp;rsquo;s degree gains are in non-STEM fields; STEM degree completion shows no detectable increase. Compliers become about 3 percentage points more likely to hold a graduate degree by 10 years out.&lt;/p&gt;
&lt;p&gt;On earnings, admitted compliers earn less than rejected counterparts in the first five years due to continued enrollment. Year six is the crossover point; by years 8–12, compliers earn a stable 8.6 percent earnings premium in log terms (8.2 percent in dollar ratio terms, representing a LATE of $3,339 against an untreated complier mean of $40,829), with earnings ranks rising approximately 4 percentiles from a base near the 50th percentile.&lt;/p&gt;
&lt;p&gt;Marginally admitted students pay no additional net tuition on average: $4,600 in additional gross tuition is nearly fully offset by grant aid, though they take on $5,300 more in student loans. Society incurs approximately $10,000 in additional educational expenditures per complier. Internal rates of return are 26 percent for students, 16 percent for society, and 7 percent for the government budget. At a 3 percent discount rate, the lifetime net present value of enrolling the typical marginal applicant is approximately $80,000 — $70,000 accruing to the student and $10,000 to taxpayers.&lt;/p&gt;
&lt;p&gt;Earnings gains are similar across institutions of varying selectivity, but significantly smaller for low-income compliers, who spend more time enrolled, complete fewer degrees, and major in less lucrative fields. A bounding method shows that extensive-margin compliers (those who would otherwise not attend any four-year college) experience larger effects than intensive-margin compliers.&lt;/p&gt;
&lt;p&gt;Q: What is the core research question and why is credible evidence scarce?
A: The paper asks whether enrolling marginal students in American public universities generates positive net returns — private, social, and fiscal — and what drives heterogeneity in those returns. Credible evidence is scarce because most existing work is correlational and fails to account for selection bias: individuals with more college education may have had pre-existing advantages, confounding college&amp;rsquo;s causal effect with systematic sorting into it. Even if average returns are positive, the policy-relevant question is whether the marginal student — who has weak preparation and high dropout risk — represents a good investment.&lt;/p&gt;
&lt;p&gt;Q: What is the regression discontinuity design, and what does the first stage look like?
A: The author infers hundreds of decentralized SAT/ACT score cutoffs across approximately 700 application cells (combinations of university, year, GPA quartile, and test type) by searching for the score value with the largest discontinuity in admission and enrollment within each cell. This procedure delivers a superconsistent estimator of each cell&amp;rsquo;s true cutoff. Pooled across all cells, crossing a cutoff raises the probability of admission by 27 percentage points and the probability of enrollment at the target university by a precisely estimated 15 percentage points. The density of applicants and a rich set of pre-college characteristics run smoothly through the cutoffs, supporting the exclusion restriction.&lt;/p&gt;
&lt;p&gt;Q: Who are the cutoff compliers, and are they representative of any broader population?
A: Compliers — applicants who enroll in the target university if and only if they barely cross its cutoff — comprise approximately 15 percent of marginal applicants. In observable characteristics, compliers are roughly representative of the broader population of marginal applicants at the cutoff. They are significantly more disadvantaged than the average public university applicant, but broadly comparable to the average Texas public high school graduate in terms of academic preparation and family income.&lt;/p&gt;
&lt;p&gt;Q: What are the next-best alternatives for marginal applicants who are rejected?
A: Approximately 47 percent of compliers would fall back to another Texas four-year college (mostly public), 43 percent to a two-year community college, and approximately 9 percent would not enroll in any Texas institution. National Student Clearinghouse data for the 2008–2014 cohorts confirm that only 4 percent of untreated compliers attend a college outside the THECB universe, meaning approximately 6 percent of all compliers truly forgo higher education altogether if rejected. The empirically relevant extensive margin is therefore between the four-year sector and the two-year sector, not between college and no college.&lt;/p&gt;
&lt;p&gt;Q: How does cutoff crossing change the institutional characteristics a complier experiences?
A: Compliers are propelled into substantially better-resourced environments: the average math test score of college peers rises by half a standard deviation; peers are 12 percentage points less likely to have been low-income; gross tuition rises by $2,400 (a 42 percent increase over the untreated complier mean of $5,700); educational spending per student rises by $3,200 (43 percent over the untreated mean); peers&amp;rsquo; 10-year BA completion rate rises by 28 percentage points; and peer mean earnings 8–12 years after college entry are $6,700 higher.&lt;/p&gt;
&lt;p&gt;Q: What are the educational attainment effects?
A: Cutoff crossing causes compliers to complete approximately 28 additional credits at any four-year institution (roughly one full year of a four-year program) and increases the probability of ever earning a bachelor&amp;rsquo;s degree by 12 percentage points, raising the completion rate from approximately 40 percent to just above 50 percent. About 15 fewer two-year sector credits are offset against the four-year gains, and associate degree or certificate completion falls by 7 percentage points. All bachelor&amp;rsquo;s degree gains are in non-STEM fields; there is no detectable increase in STEM degrees. Graduate degree completion rises by approximately 3 percentage points by 10 years out.&lt;/p&gt;
&lt;p&gt;Q: What is the earnings trajectory, and when does the premium materialize?
A: Admitted compliers earn less than rejected counterparts in the first five years after application because they remain enrolled longer. Year six is the crossover point. By years 8–12, the earnings premium stabilizes at approximately 8.6 percent in log terms and 8.2 percent in dollar ratio terms (a LATE of $3,339 against an untreated complier mean of $40,829). Earnings rank rises by approximately 4 percentiles from a base near the 50th percentile. These results are robust across sandwich earnings, all-quarters-with-earnings, and zero-imputed specifications.&lt;/p&gt;
&lt;p&gt;Q: What does the cost-benefit analysis show?
A: Marginally admitted students pay no additional net tuition on average: $4,600 in additional gross tuition is nearly fully offset by additional grant aid. They do borrow $5,300 more in student loans, likely financing higher room, board, and consumption costs at four-year colleges. From society&amp;rsquo;s perspective, compliers generate approximately $10,000 in additional educational expenditures. Cumulative undiscounted earnings benefits surpass costs after 8 years for students, 11 years for society, and 19 years for taxpayers. At a 3 percent discount rate, the lifetime net present value is approximately $80,000 total — $70,000 accruing to the student and $10,000 to taxpayers — with internal rates of return of 26 percent for students, 16 percent for society, and 7 percent for the government budget.&lt;/p&gt;
&lt;p&gt;Q: Does selectivity of the admitting institution predict larger earnings returns?
A: No. Compliers at more selective institutions experience substantially larger increases in peer quality than those at less selective institutions, but they are also less likely to be on the extensive margin of four-year enrollment and experience smaller BA attainment gains. These factors roughly offset, producing no systematic difference in earnings gains across institutions of varying selectivity. More selective institutions also impose no additional cumulative cost on society, while compliers actually pay slightly less in additional net tuition at more selective schools.&lt;/p&gt;
&lt;p&gt;Q: How does the commonly used measure of college value-added (mean peer earnings) compare to actual complier returns?
A: Mean peer earnings overpredicts actual value-added for marginal students by a factor of two: compliers attend an institution with $6,700 higher average peer earnings as a result of admission but gain only $3,300 themselves. The measure also overpredicts the earnings return to selectivity by a factor of three: a 100-SAT-point increase in target school selectivity predicts $3,000 higher peer earnings but only a statistically insignificant $900 higher gain in the complier&amp;rsquo;s own earnings.&lt;/p&gt;
&lt;p&gt;Q: How do earnings returns differ by family income?
A: Compliers from low-income families experience significantly smaller earnings gains compared to higher-income compliers. The gap is not explained by differential changes in college quality induced by admission. Instead, low-income compliers gain fewer degrees despite spending more time in college and major in less lucrative fields, consistent with related findings in the literature on family income gaps in degree completion and major choice.&lt;/p&gt;
&lt;p&gt;Q: How do earnings returns differ by gender and by race?
A: Female and male compliers eventually earn similar log earnings and earnings rank gains, but women reach their gains more quickly — likely because men take longer to finish college. White and Asian compliers experience similar earnings gains and BA completion improvements as Black and Hispanic compliers, despite white and Asian students experiencing larger increases in college selectivity and spending per student as a result of admission.&lt;/p&gt;
&lt;p&gt;Q: What is the method for separating intensive- and extensive-margin effects?
A: The two complier types are not directly distinguishable in the data. The author first uses an endogenous but strong stratification variable — having at least one other Texas public university admission offer — to identify some mean potential outcomes for each type. He then imposes an empirically-informed rank assumption to bound the remaining unknown mean potential outcomes, delivering tightly informative upper and lower bounds on each margin&amp;rsquo;s effects without requiring full nonparametric identification. The results show that pooled effects are driven by larger returns for extensive-margin compliers who would not have attended any four-year college, with smaller contributions from intensive-margin compliers shifting between four-year institutions.&lt;/p&gt;
&lt;p&gt;Q: How do this paper&amp;rsquo;s earnings estimates compare to prior studies, and what explains the differences?
A: This paper&amp;rsquo;s 8 percent earnings gain is smaller than the 17–26 percent reported in prior studies (Zimmerman 2014: 22%; Kozakowski 2023: 26%; Smith, Goodman, and Hurwitz 2025: 17%; Bleemer 2024: 21%; Hoekstra 2009: 20%). The differences are likely explained by the much larger educational attainment and institutional quality gains induced by those studies&amp;rsquo; natural experiments: in Zimmerman (2014), enrollment compliers gain roughly three additional years of four-year education versus one year in this paper; in Bleemer (2024), compliers experience roughly $30,000 more in institutional spending per student versus approximately $3,000 in this paper.&lt;/p&gt;
&lt;p&gt;Q: What are the scope conditions for these results?
A: The results pertain to marginal applicants to Texas public universities (excluding UT-Austin, which uses holistic admission with no detectable SAT/ACT cutoffs) from the 2004–2014 high school graduation cohorts. The identified effects are local average treatment effects for compliers — applicants who would enroll in the target university if and only if they barely crossed its admission cutoff — and do not represent effects for always-takers or infra-marginal students. Earnings are measured only for Texas-based workers covered by the state unemployment insurance system, which captures an estimated 90 percent of the civilian labor force.&lt;/p&gt;
&lt;p&gt;Cutoff complier: An applicant who enrolls in their target university if and only if their SAT/ACT score barely exceeds that university&amp;rsquo;s admission cutoff. Compliers are the population whose behavior — and thus whose treatment effects — are identified by the fuzzy RD design. They comprise approximately 15 percent of marginal applicants and are more disadvantaged than the average public university applicant but broadly comparable to the average high school graduate.&lt;/p&gt;
&lt;p&gt;Extensive versus intensive margin: The extensive margin refers to the contrast between attending any four-year college versus falling back to a two-year community college or no college. The intensive margin refers to the contrast between attending a more selective versus a less selective four-year institution. Approximately half of cutoff compliers are on each margin; the paper treats them as economically distinct parameters requiring separate identification.&lt;/p&gt;
&lt;p&gt;Fuzzy regression discontinuity (RD) design: An identification strategy that uses the discontinuous jump in admission probability at a test score cutoff as an instrument for enrollment, recovering the LATE for compliers via the ratio of the reduced-form discontinuity in outcomes to the first-stage discontinuity in enrollment. &amp;ldquo;Fuzzy&amp;rdquo; refers to the fact that crossing the cutoff changes admission and enrollment probabilities with a discrete jump rather than with certainty.&lt;/p&gt;
&lt;p&gt;Internal rate of return (IRR): The discount rate at which the net present value of an investment equals zero — here, the discount rate equating the discounted stream of earnings benefits to the discounted stream of costs. The paper estimates IRRs separately for students (26 percent), society (16 percent), and the government budget (7 percent), reflecting different cost and benefit definitions from each perspective.&lt;/p&gt;
&lt;p&gt;Rank assumption (bounding method): An empirically-informed assumption about the ordering of mean potential outcomes across latent complier types (extensive vs. intensive margin) that, combined with partial identification from a strong endogenous stratification variable, yields tight upper and lower bounds on each margin&amp;rsquo;s causal effects without requiring full nonparametric identification.&lt;/p&gt;
&lt;p&gt;Net tuition: Gross tuition charges minus grant aid. For the typical marginal complier, gross tuition rises by $4,600 but is nearly fully offset by additional grant aid, yielding approximately zero additional net tuition cost — meaning the private financial cost of attending a public university for marginal students is effectively zero on net, though they take on $5,300 more in student loans to finance room, board, and consumption.&lt;/p&gt;
&lt;p&gt;Sandwich earnings measure: A procedure applied to quarterly state earnings data that retains only quarters with positive earnings sandwiched between other quarters with positive earnings, discarding high-variance transition quarters between employment spells. Annualized by multiplying the quarterly average by four; used to reduce noise from entry and exit transitions in administrative earnings records.&lt;/p&gt;</description></item><item><title>Micro MPCs and Macro Counterfactuals: The Case of the 2008 Rebates</title><link>https://macropaperwarehouse.com/papers/micro-mpcs-and-macro-counterfactuals-the-case-of-the-2008-rebates/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/micro-mpcs-and-macro-counterfactuals-the-case-of-the-2008-rebates/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; Do the high marginal propensities to consume (MPCs) estimated in the leading household studies of the 2008 U.S. tax rebates—particularly Parker et al. (2013), which found MPCs of 50–90 percent within three months—imply plausible macroeconomic counterfactuals? And if not, what combination of micro-level bias corrections and general equilibrium forces reconciles the micro evidence with aggregate data?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting.&lt;/strong&gt; The 2008 Economic Stimulus Act distributed approximately $100 billion in tax rebates, totaling eleven percent of January 2008 monthly disposable income. Among the 85 percent of households receiving a check, the average amount was $1,000. Rebates were distributed primarily from April through July 2008, with nearly half delivered in May alone. The timing of receipt was determined by the last two digits of Social Security numbers, providing quasi-random variation exploited by the household-level literature.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology.&lt;/strong&gt; The paper proceeds in two halves. In the first, the authors construct macro counterfactuals by calibrating a standard medium-scale two-good, two-agent New Keynesian (TANK) model with the micro MPCs from the literature and simulating what aggregate consumption would have been absent the rebate. The model contains life-cycle permanent income households and hand-to-mouth households whose dynamic spending propensities are calibrated directly to match the household-level estimates. General equilibrium effects—including Keynesian income multipliers, real interest rate movements, and changes in the relative price of durable goods—are incorporated. Counterfactual consumption paths are constructed by subtracting model-simulated deviations from steady state from actual NIPA consumption data.&lt;/p&gt;
&lt;p&gt;In the second half, the authors revisit both the micro estimates and the macro model. On the micro side, they identify three upward biases in standard two-way fixed effects (TWFE) estimates applied to CEX data: (1) omitted variable bias from excluding the lagged rebate indicator; (2) &amp;ldquo;forbidden comparisons&amp;rdquo; bias arising from comparing cohorts with heterogeneous treatment effects, following Borusyak et al. (2022) and Sun and Abraham (2020); and (3) a rebate reporting bias in which households are systematically more likely to report receiving the rebate in the month that coincides with large expenditure increases, causing spurious positive correlation between reported receipt and contemporaneous spending. On the macro side, the baseline model is modified to incorporate an upward-sloping supply curve for durable goods (calibrated to a supply elasticity of 5, midway between House and Shapiro (2008) and Goolsbee (1998)), replacing the baseline assumption of frictionless conversion between nondurable and durable intermediates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings with quantitative magnitudes.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Implausibility of baseline counterfactuals.&lt;/em&gt; When calibrated to Parker et al.&amp;rsquo;s (2013) micro MPC of 0.9, the baseline model implies that real PCE absent the rebate would have collapsed by 6.0 percent from April through July 2008—a decline exceeded historically only by the Covid-19 lockdowns. Even the more modest micro MPC of 0.5 implies a 2.7 percent three-month PCE decline, comparable only to the 1980 Volcker disinflation with credit controls. For motor vehicle expenditures, the counterfactual drops range from 38 percent (micro MPC = 0.3) to 67 percent (micro MPC = 0.9)—larger than any historical experience, including the 30 percent Covid decline. Contemporaneous professional forecasters (Federal Reserve Greenbooks, Survey of Professional Forecasters, Goldman Sachs) predicted at most small consumption declines in summer 2008. Even the authors&amp;rsquo; own pessimistic forecast model—incorporating actual oil price paths and a Lehman Brothers bankruptcy dummy—implies that the cumulative difference between actual and forecast consumption attributable to the rebate was at most $20 billion out of $100 billion in rebates, for an implied GE-MPC of at most 0.2.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Bias correction in micro MPC estimates.&lt;/em&gt; Applying all three bias corrections to CEX data (the preferred specification with lagged rebate indicator, cohort-level treatment effects, and lagged expenditure controls), the estimated three-month MPC falls from 0.50 to 0.28 in the full sample and from 0.82 to 0.34 in the rebate-recipients-only sample, with both rounding to approximately 0.3. The Borusyak-Jaravel-Spiess (BJS) imputation method yields an MPC of 0.20 in the full sample and 0.37 in the rebate-only sample, consistent with the OLS corrections.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Composition of spending.&lt;/em&gt; In the preferred corrected specification, essentially all of the total expenditure MPC of 0.3 is accounted for by motor vehicle spending: the MPC on motor vehicles is 0.30 in the full sample and 0.26 in the rebate-only sample, while the MPC on all other expenditures is −0.02 (full sample) and 0.08 (rebate-only sample).&lt;/p&gt;
&lt;p&gt;&lt;em&gt;General equilibrium dampening via inelastic durable supply.&lt;/em&gt; In the model with a calibrated durable supply elasticity of 5, rebate-induced demand for motor vehicles raises the relative vehicle price by approximately 1.1 percent in July 2008. This price increase crowds out durable expenditure by optimizing households through intertemporal substitution. At the preferred micro MPC of 0.3, the general equilibrium MPC (GE-MPC) for total PCE is only 0.07, well below the 0.3 micro estimate. At a micro MPC of 0.5, the GE-MPC is 0.22. The combination of the bias-corrected micro MPC and dampening general equilibrium forces implies a general equilibrium consumption multiplier below 0.2 for the 2008 rebates.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Importance of durable goods composition for HANK models.&lt;/em&gt; A model that abstracts from durable goods and calibrates the full expenditure micro MPC to nondurable spending predicts a GE-MPC of 0.36 when the micro MPC is 0.30—five times larger than the 0.07 implied by the model with durable goods. This contrast illustrates that the distribution of spending across nondurable and durable goods is a key determinant of the aggregate fiscal multiplier, in addition to heterogeneity in wealth and income emphasized by the existing HANK literature.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-central-empirical-puzzle-the-paper-addresses"&gt;Q1. What is the central empirical puzzle the paper addresses?&lt;/h3&gt;
&lt;p&gt;A. The leading household studies of the 2008 rebates estimate very high three-month MPCs (50–90 percent). When these estimates are plugged into a standard New Keynesian model to construct counterfactual consumption paths absent the rebate, the model implies that PCE would have collapsed by 2.7–6.0 percent from April through July 2008 and then sharply recovered just as Lehman Brothers failed in September. No contemporaneous forecaster or narrative evidence suggests such extreme, short-lived macroeconomic stress was present. The Lehman collapse itself caused only a 1.1 percent three-month PCE decline—smaller than all three counterfactual declines implied by micro MPCs of 0.3, 0.5, or 0.9.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-features-of-the-tank-model-used-to-construct-the-counterfactuals"&gt;Q2. What are the features of the TANK model used to construct the counterfactuals?&lt;/h3&gt;
&lt;p&gt;A. The model is a two-good (nondurable and durable), two-agent (optimizing life-cycle and hand-to-mouth) New Keynesian model calibrated at monthly frequency, building on Ramey (2021) and Galí et al. (2007). Intermediate goods can, in the baseline, be frictionlessly converted into either nondurable or durable goods (implying a fixed relative price of one). Durable goods (interpreted as motor vehicles) enter household utility, with optimizing households facing a Calvo-type adjustment friction motivated by Evans and Ramey (1992) calculation costs. The fraction of hand-to-mouth consumers and their dynamic propensities to spend are calibrated directly to match the micro MPC estimates from the household literature. The model incorporates a Calvo-style price-adjustment structure for nondurables, sticky wages set by unions, capital with adjustment costs and variable utilization, and an inertial monetary policy rule.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-model-translate-micro-mpcs-into-macro-counterfactuals-and-why-does-it-amplify-rather-than-dampen-the-micro-estimates-in-the-baseline"&gt;Q3. How does the model translate micro MPCs into macro counterfactuals, and why does it amplify rather than dampen the micro estimates in the baseline?&lt;/h3&gt;
&lt;p&gt;A. The model&amp;rsquo;s GE-MPC equals the micro MPC&amp;rsquo;s direct demand effect plus Keynesian income multiplier effects. Because the rebate is highly transitory, there is little movement in the real interest rate (the Phillips curve is flat and monetary policy is inertial), so the dominant general equilibrium force is the income multiplier. This amplifies, rather than dampens, the micro MPCs. As a result, the GE counterfactuals exhibit even sharper V-shapes than the pure micro counterfactuals.&lt;/p&gt;
&lt;h3 id="q4-what-narrative-and-forecast-evidence-do-the-authors-use-to-argue-the-baseline-counterfactuals-are-implausible"&gt;Q4. What narrative and forecast evidence do the authors use to argue the baseline counterfactuals are implausible?&lt;/h3&gt;
&lt;p&gt;A. Contemporary forecasts from the Federal Reserve Greenbooks, the Survey of Professional Forecasters, and Goldman Sachs all predicted at most small consumption declines in summer 2008—Goldman Sachs forecast only −0.125 percent (not annualized) per quarter in Q2–Q3 2008. The authors also construct their own &amp;ldquo;pessimistic&amp;rdquo; time-series forecast that incorporates actual oil price paths (which rose from $98 to $140 per barrel by July 2008) and a Lehman Brothers bankruptcy dummy; even this forecast lies above all three model counterfactuals in summer 2008 and displays no V-shape. Furthermore, the cumulative difference between actual PCE and the pessimistic forecast over April–October 2008 totals only $20 billion—implying a GE-MPC of at most 0.2 even if the entire gap were attributed to the rebate.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-first-bias-in-standard-twfe-estimates-of-the-mpc-and-how-large-is-its-effect"&gt;Q5. What is the first bias in standard TWFE estimates of the MPC, and how large is its effect?&lt;/h3&gt;
&lt;p&gt;A. The first bias is omitted variable bias from excluding the lagged rebate indicator. In a first-differenced panel regression, lagged treatment enters the error term. Because current treatment reduces the probability of past treatment, current and lagged treatment are negatively correlated, and omitting the lag inflates the OLS estimate of the contemporaneous effect. Including a lagged rebate indicator reduces the contemporaneous spending response by $40 in the full CEX sample (from $470 to $434) and by approximately $237 in the rebate-only sample (from $764 to $527).&lt;/p&gt;
&lt;h3 id="q6-what-is-the-forbidden-comparisons-bias-and-how-is-it-corrected"&gt;Q6. What is the &amp;ldquo;forbidden comparisons&amp;rdquo; bias and how is it corrected?&lt;/h3&gt;
&lt;p&gt;A. When treatment effects are heterogeneous across cohorts (e.g., the June rebate cohort has a larger MPC than the September cohort), standard homogeneous TWFE estimates use later-treated cohorts as control groups for earlier-treated cohorts even after accounting for average mean-reversion. Because the mean-reversion of the earlier (larger-effect) cohort is larger than that of the later cohort, this comparison is contaminated, inflating the estimate. The authors correct for this by allowing cohort-specific treatment effects, following Sun and Abraham (2020). This reduces the contemporaneous effect by a further $90 in the full sample; in the rebate-only sample the correction raises the estimate slightly (by $70) because later treatment effects are larger in that sample.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-rebate-reporting-bias-and-what-mechanism-underlies-it"&gt;Q7. What is the rebate reporting bias and what mechanism underlies it?&lt;/h3&gt;
&lt;p&gt;A. The rebate reporting bias arises because households in the CEX are systematically more likely to report receiving the rebate in the interview month that coincides with high expenditure. Although the true timing of rebate checks is determined by Social Security number last-digits (and is thus random), the reported timing may reflect recall issues: households more readily remember and report receiving the rebate when it was accompanied by a large purchase. The empirical signature is a statistically significant negative effect of future rebate receipt on current expenditure (−$863 in the full sample, −$575 in the rebate-only sample at the 10% level), indicating that rebate reporters had unusually low spending in the period prior to reporting receipt. Controlling for lagged expenditure and income decile fixed effects corrects for this bias, reducing the three-month MPC in the full sample from 0.37 to 0.28.&lt;/p&gt;
&lt;h3 id="q8-what-are-the-authors-preferred-bias-corrected-mpc-estimates-and-how-do-they-compare-across-specifications-and-estimators"&gt;Q8. What are the authors&amp;rsquo; preferred bias-corrected MPC estimates, and how do they compare across specifications and estimators?&lt;/h3&gt;
&lt;p&gt;A. After correcting for all three biases (preferred specification, column 4 of Table 3), the implied three-month MPC is 0.28 in the full sample and 0.34 in the rebate-only sample, both approximately 0.3. The Borusyak-Jaravel-Spiess imputation method, which imposes weaker assumptions and overcomes the first two biases by construction, yields an MPC of 0.20 (full sample) and 0.37 (rebate-only sample), with an average consistent with the OLS-corrected estimates. Both methods point to an MPC around 0.3, substantially below the 0.5–0.9 range from the baseline Parker et al. (2013) approach.&lt;/p&gt;
&lt;h3 id="q9-how-is-almost-all-of-the-total-expenditure-mpc-concentrated-in-motor-vehicles"&gt;Q9. How is almost all of the total expenditure MPC concentrated in motor vehicles?&lt;/h3&gt;
&lt;p&gt;A. After bias correction, the MPC on motor vehicles is 0.30 in the full sample and 0.26 in the rebate-only sample. The MPC on all other PCE is −0.02 (full sample) and 0.08 (rebate-only sample), neither statistically significant. This concentration in durables is consistent with Adams et al. (2009) and Aaronson et al. (2012), and is corroborated by CEX vehicle-expenditure data showing a car-purchase response concentrated in the three months surrounding receipt of the rebate.&lt;/p&gt;
&lt;h3 id="q10-how-does-introducing-an-upward-sloping-supply-curve-for-durable-goods-change-the-models-general-equilibrium-predictions"&gt;Q10. How does introducing an upward-sloping supply curve for durable goods change the model&amp;rsquo;s general equilibrium predictions?&lt;/h3&gt;
&lt;p&gt;A. In the modified model, durable goods producers face a production externality (or fixed factor) that makes the short-run supply of motor vehicles upward-sloping, with supply elasticity calibrated to 5. When rebate recipients increase demand for motor vehicles, the relative price of motor vehicles rises by approximately 1.1 percent in July 2008 (consistent with the observed 1.5 percent spike in the BLS new vehicle price index relative to core CPI around the rebate distribution). This price increase induces optimizing households to intertemporally substitute away from durable goods. Because durable demand is highly price-elastic (long-run elasticity of −1 to −15 depending on the study), even a modest relative price increase generates substantial crowding out of durable expenditure by non-recipients.&lt;/p&gt;
&lt;h3 id="q11-what-are-the-ge-mpc-estimates-in-the-modified-model-with-less-elastic-durable-supply-and-how-do-they-decompose"&gt;Q11. What are the GE-MPC estimates in the modified model with less elastic durable supply, and how do they decompose?&lt;/h3&gt;
&lt;p&gt;A. At the preferred micro MPC of 0.3, the GE-MPC for total PCE is 0.07—general equilibrium forces dampen the micro effect. At micro MPC of 0.5, GE-MPC is 0.22 (modest dampening). At micro MPC of 0.9, the GE-MPC rises to 1.42 (amplification). Decomposing by good type at micro MPC of 0.3: the GE-MPC on motor vehicles is 0.09 and the GE-MPC on nondurables is −0.03. The dampening is concentrated almost entirely in durable expenditure.&lt;/p&gt;
&lt;h3 id="q12-how-sensitive-are-the-ge-mpc-results-to-the-calibration-of-durable-demand-elasticity"&gt;Q12. How sensitive are the GE-MPC results to the calibration of durable demand elasticity?&lt;/h3&gt;
&lt;p&gt;A. The baseline calibration uses a long-run vehicle demand elasticity of −15, based on household-level evidence from Bachmann et al. (2021). When the authors instead use the lower-bound estimate of −6.4 from Baker et al. (2019), the GE-MPC at micro MPC of 0.3 rises from 0.07 to 0.12. Even at this lower demand elasticity there is substantial crowding out in general equilibrium, so the qualitative conclusion is robust.&lt;/p&gt;
&lt;h3 id="q13-why-does-a-nondurables-only-model-with-the-same-overall-mpc-substantially-overstate-the-fiscal-multiplier"&gt;Q13. Why does a nondurables-only model with the same overall MPC substantially overstate the fiscal multiplier?&lt;/h3&gt;
&lt;p&gt;A. When abstracting from durable goods and calibrating a nondurable MPC of 0.30 (to match the overall expenditure MPC), the model predicts a GE-MPC of 0.36—five times larger than the 0.07 from the two-good model. This occurs because nondurable demand is far less price-elastic than durable demand, and the nearly-flat Phillips curve makes nondurable supply very elastic, so there is no relative-price-driven crowding out channel. The comparison illustrates that the distribution of spending across nondurable and durable goods is a quantitatively important determinant of the fiscal multiplier, independent of the level of the MPC.&lt;/p&gt;
&lt;h3 id="q14-what-evidence-is-provided-that-the-control-group-in-the-household-regressions-is-itself-affected-by-the-rebate-in-general-equilibrium"&gt;Q14. What evidence is provided that the control group in the household regressions is itself affected by the rebate in general equilibrium?&lt;/h3&gt;
&lt;p&gt;A. Figure 9 in the paper plots motor vehicle spending per household by rebate-receipt status using CEX data. When rebate recipients begin reporting receipt in June 2008, motor vehicle expenditure in the rebate group rises while simultaneously falling in the never-rebate group. This pattern is consistent with the model&amp;rsquo;s prediction that the rebate-induced rise in relative motor vehicle prices crowds out purchases by non-recipient households. This general equilibrium spillover means the difference-in-differences micro MPC estimate remains valid as a micro estimate (the symmetric crowding out does not affect the treated-versus-control difference), but the aggregate GE-MPC is less than the micro MPC.&lt;/p&gt;
&lt;h3 id="q15-how-do-the-authors-verify-that-their-preferred-corrected-specification-recovers-true-mpcs"&gt;Q15. How do the authors verify that their preferred corrected specification recovers true MPCs?&lt;/h3&gt;
&lt;p&gt;A. In Appendix C.6 the authors simulate household-level data from the modified Section 5 model and apply both the original Parker et al. (2013) specification (Equation 1) and their preferred corrected specification (Equation 5). The Parker et al. specification produces upward-biased MPC estimates in the simulated data, consistent with Kaplan and Violante&amp;rsquo;s (2014) theoretical argument. The preferred corrected specification recovers the true MPCs from the model, validating the correction methodology.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;GE-MPC (General Equilibrium Marginal Propensity to Consume).&lt;/strong&gt; The paper&amp;rsquo;s term for the aggregate increase in total consumer spending per dollar of tax rebate, incorporating both the direct micro-level demand effect of the rebate on hand-to-mouth households&amp;rsquo; consumption and the induced macroeconomic income effects from Keynesian multipliers and relative price changes. Distinct from the micro MPC, which captures only the household-level spending response before any general equilibrium feedbacks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Micro MPC.&lt;/strong&gt; The causal effect of receiving a temporary lump-sum transfer on a household&amp;rsquo;s own consumer expenditure, expressed as a fraction of the transfer amount, estimated from household panel data via difference-in-differences event studies. In the paper&amp;rsquo;s usage, this is a partial equilibrium concept that excludes any impact of the policy on prices, wages, or other households&amp;rsquo; incomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Forbidden comparisons bias.&lt;/strong&gt; A form of bias in two-way fixed effects event study estimates that arises when treatment effects are heterogeneous across cohorts and later-treated units are used as control groups for earlier-treated units whose outcomes are still reverting after treatment. Named and formalized in Borusyak and Jaravel (2017) and Borusyak et al. (2022); in this paper it manifests because cohorts receiving rebates in June have systematically larger spending responses than those receiving in September, so using September recipients as a &amp;ldquo;clean&amp;rdquo; control for June reversal yields contaminated estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Rebate reporting bias.&lt;/strong&gt; A bias specific to the CEX survey data in which the timing of a household&amp;rsquo;s self-reported rebate receipt is correlated with unusually high contemporaneous expenditure (and correspondingly low prior-period expenditure), likely due to recall effects. Because the true rebate timing is random but the reported timing is not, this correlation inflates the difference-in-differences estimate of the spending effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Two-good, two-agent New Keynesian (TANK) model.&lt;/strong&gt; A medium-scale New Keynesian model containing two types of households (optimizing life-cycle consumers and hand-to-mouth consumers who exhaust current income) and two goods (nondurables and durable goods interpreted as motor vehicles). The model is used in this paper as a framework to translate micro MPC estimates into aggregate general equilibrium counterfactuals, calibrated at monthly frequency.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Durable supply elasticity.&lt;/strong&gt; The elasticity of real durable goods production with respect to the relative price of durable goods, calibrated in the paper to 5. In the baseline model, this elasticity is infinite (the relative price is fixed at one because intermediates convert frictionlessly). With a finite supply elasticity of 5, rebate-induced durable demand causes the relative vehicle price to rise, generating crowding out of optimizing households&amp;rsquo; durable expenditure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Calvo durable adjustment friction.&lt;/strong&gt; An adjustment friction imposed on optimizing households&amp;rsquo; durable goods purchases, motivated by Evans and Ramey&amp;rsquo;s (1992) calculation cost model. Only a fraction 1−θd of households reoptimize their durable stock each period (with probability drawn randomly), producing a Calvo-type reduced form. This friction limits both the extensive and intensive margins of durable adjustment and prevents unrealistically large intertemporal substitution of durable purchases in response to price changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Macro counterfactual.&lt;/strong&gt; In this paper&amp;rsquo;s usage, the simulated path of aggregate consumption that would have occurred in the absence of the 2008 tax rebate, constructed by subtracting the model-implied impulse response to the rebate from the actual observed NIPA consumption series. Plausibility of the counterfactual is assessed by comparison to contemporaneous forecasts and to historical episodes of large consumption declines.&lt;/p&gt;</description></item><item><title>Minimum Wages, Efficiency, and Welfare</title><link>https://macropaperwarehouse.com/papers/minimum-wages-efficiency-and-welfare/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/minimum-wages-efficiency-and-welfare/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; Can minimum wages improve welfare through efficiency — by correcting monopsony-driven under-employment — and, if so, by how much? What is the optimal minimum wage, and how much of the welfare gain from a higher minimum wage comes from efficiency versus redistribution?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and methodology.&lt;/strong&gt; The paper develops a tractable general equilibrium oligopsony model with heterogeneous workers (four types: non-high-school, high-school, college workers, and capital owners) and heterogeneous firms (varying in total factor productivity), embedded in a continuum of local labor markets where firms compete strategically in Cournot fashion. Firms face downward-sloping labor supply curves; their market power generates wages below the marginal revenue product of labor (markdowns). The model is calibrated to US data using the Census Longitudinal Business Database (LBD, 2014), the Bureau of Labor Statistics Current Population Survey (CPS, 2019), and the Survey of Consumer Finances (SCF). Key calibration targets include: average firm size of 22.83 workers (LBD), 29 percent of workers earning below $15/hr (CPS), labor and capital income shares, and household-level earnings and capital income ratios. The model is validated by quantitatively replicating four strands of empirical evidence: (i) reallocation effects of the German minimum wage introduction (Dustmann et al., 2021); (ii) employer spillover responses to Amazon&amp;rsquo;s voluntary $15 minimum wage (Derenoncourt et al., 2021); (iii) wage distribution compression evidence from Brazil (Engbom and Moser, 2021); and (iv) heterogeneous employment effects by market concentration (Azar et al., 2019).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Three channels for efficiency gains.&lt;/strong&gt; The model identifies three mechanisms through which a minimum wage can improve efficiency under oligopsony: (1) a &lt;em&gt;direct effect&lt;/em&gt; in which constrained firms with monopsony markdowns increase wages and expand employment toward the competitive level (Region II firms); (2) a &lt;em&gt;spillover effect&lt;/em&gt; in which unconstrained competitor firms narrow their own markdowns in response to constrained firms&amp;rsquo; increased wages and market shares; (3) a &lt;em&gt;reallocation effect&lt;/em&gt; in which employment is shifted away from low-productivity firms (which enter Region III — constrained on labor demand) toward more productive firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings on efficiency versus redistribution.&lt;/strong&gt; Under the $15.12/hr minimum wage that maximizes social welfare under utilitarian weights (population-share weights), less than 5 percent of the welfare gains come from improved efficiency, while more than 95 percent come from redistribution. When the government is additionally given access to budget-neutral lump-sum transfers that fully address redistribution goals, the efficiency-maximizing minimum wage narrows to a range of approximately $7.50–$10.00 per hour, which is robust across social welfare weight specifications. The welfare gains attributable to efficiency alone are approximately 0.16–0.20 percent in consumption-equivalent terms, representing only about 1–2 percent of the welfare gains achievable in an economy with no labor market power at all (which would be 15.26 percent in consumption-equivalent terms under the same conditions with optimal transfers).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why efficiency gains are small.&lt;/strong&gt; Three structural reasons limit efficiency gains: (i) low-productivity firms — which are the firms most affected by a binding minimum wage in Region II — have endogenously narrow markdowns even absent a minimum wage, because they face more elastic labor supply and command small market shares; (ii) the calibrated production function has relatively flat marginal revenue product of labor schedules (decreasing returns parameter α = 0.940), so once firms enter Region III, employment rationing occurs rapidly; (iii) the large, high-productivity firms with the widest markdowns are not materially affected by the minimum wages of their small, low-wage competitors because those competitors have small market shares — making spillovers quantitatively negligible even though the model matches empirical cross-employer wage elasticities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Optimal minimum wages under alternative frameworks.&lt;/strong&gt; Without transfers and under utilitarian weights, the optimal minimum wage is $15.12. Without transfers but under Negishi weights (which rationalize the observed competitive equilibrium and load approximately 62 percent of weight on college workers and owners versus their 35 percent population share), the optimal is $6.97. Under a 97 percent weight on high-school graduates, the optimal rises to $18.32. With optimal lump-sum transfers, the optimal collapses to $7.76–$10.11 regardless of social welfare weights — a range robust across Frisch elasticity variants (ϕ ∈ {0.30, 0.62, 0.86}), regional decompositions (low, medium, and high income US states), short-run capital-fixed scenarios (where the optimum declines by approximately $1 under utilitarian weights), and the removal of household heterogeneity entirely (which yields an optimum of $7.74).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Distributional proxies versus welfare.&lt;/strong&gt; Wage inequality (college–non-college log wage premium, cross-sectional variance of log wages) and the labor income share are monotonically improving as the minimum wage rises, even as welfare is hump-shaped and eventually declining. A rise in the minimum wage from $7.50 to $15 reduces the college–non-college log wage premium from 0.53 to 0.43 (roughly one-fifth), reduces the cross-sectional variance of log wages by nearly half, and raises the aggregate labor income share by approximately 3 percentage points — all while welfare (under utilitarian weights with no transfers) reaches its maximum at $15.12 and then declines. These standard proxies therefore do not reliably indicate welfare.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; All results are long-run steady-state comparisons unless otherwise noted. Results assume no price passthrough and a unit elasticity of substitution between capital and labor. The paper abstracts from capital–labor substitution responses and occupational choice. The redistribution channel quantified here is specific to the utilitarian welfare criterion and to the existing distribution of capital and profit income, in which owners (6 percent of households) earn 92 percent of dividends.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-are-the-three-regions-of-firm-behavior-in-response-to-a-binding-minimum-wage-and-what-are-their-efficiency-implications"&gt;Q1. What are the three regions of firm behavior in response to a binding minimum wage, and what are their efficiency implications?&lt;/h3&gt;
&lt;p&gt;A: A firm can be in one of three regions. In Region I the minimum wage is not binding: the firm pays its optimal monopsony wage and employment is inelastically below the competitive level. In Region II the minimum wage binds and exceeds the firm&amp;rsquo;s optimal monopsony wage, but labor supply at the minimum wage still falls short of labor demand: employment and efficiency improve as the shadow markdown narrows. In Region III the minimum wage exceeds the competitive wage, so unconstrained labor supply would exceed demand: the firm rations employment and the rationing constraint binds, reducing efficiency. At the boundary of Region II and Region III, the shadow markdown equals one and the firm is at its efficient employment level. Only a firm-specific minimum wage targeting each firm&amp;rsquo;s competitive wage could deliver economy-wide efficiency.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-paper-define-and-use-shadow-wages-to-characterize-equilibrium"&gt;Q2. How does the paper define and use &amp;ldquo;shadow wages&amp;rdquo; to characterize equilibrium?&lt;/h3&gt;
&lt;p&gt;A: The shadow wage for a firm is the effective wage that rationalizes equilibrium employment given rationing constraints. Formally, when a firm rations employment (Region III), households act as if facing a shadow wage equal to the actual minimum wage multiplied by a rationing factor p &amp;lt; 1 (the Lagrange multiplier on the rationing constraint, normalized as a fraction). Shadow wages aggregate across firms into market- and type-level shadow wages via CES aggregation. The key insight is that shadow wages, not observed wages, are allocative: aggregate labor supply for each worker type is determined by the type-level shadow wage, not by the minimum wage that firms actually pay. This allows the paper to express aggregate efficiency via two wedges — the aggregate shadow markdown (capturing average market power) and a misallocation term — without tracking all firm-specific constraints individually.&lt;/p&gt;
&lt;h3 id="q3-what-are-the-two-aggregate-efficiency-wedges-and-how-do-they-behave-as-the-minimum-wage-rises"&gt;Q3. What are the two aggregate efficiency wedges and how do they behave as the minimum wage rises?&lt;/h3&gt;
&lt;p&gt;A: The two wedges are: (i) the aggregate shadow markdown µ̃, which is a productivity-weighted average of firm-level shadow markdowns and measures the extent to which aggregate wages fall short of marginal revenue products; and (ii) the misallocation term ω, which measures whether employment is allocated toward more productive firms and equals one when all shadow markdowns are identical. As the minimum wage rises from zero, µ̃ initially narrows (improving efficiency) because firms in Region II expand toward their competitive employment level and constrained firms&amp;rsquo; market shares rise, tightening the residual labor supply of unconstrained competitors and narrowing their markdowns. But as the minimum wage rises further, Region III rationing causes shadow markdowns to widen rapidly — first for low-productivity firms and then progressively for more productive ones — so µ̃ turns back downward. The misallocation term ω first improves as low-productivity firms are pushed out, but then worsens because rationing at intermediate-productivity firms redirects employment from high- to medium-productivity firms.&lt;/p&gt;
&lt;h3 id="q4-what-does-the-model-validation-exercise-on-the-german-minimum-wage-dlsub-2021-show"&gt;Q4. What does the model validation exercise on the German minimum wage (DLSUB 2021) show?&lt;/h3&gt;
&lt;p&gt;A: The paper calibrates the model to the German context by setting a minimum wage of $8.95/hr equivalent to 48 percent of the pre-reform median wage — matching Germany&amp;rsquo;s 8.50 euro introduction in 2015, where 15 percent of workers earned below the threshold. The model produces employment effects that are slightly positive (consistent with empirical findings of no disemployment), average wage increases consistent with both constrained and unconstrained firms raising wages, a negative elasticity of the number of operating firms with respect to minimum wage exposure (correctly signed, moderately smaller than data), and a positive elasticity of average firm size with respect to exposure (slightly larger than the data). The reallocation direction — small unproductive firms shrinking and workers moving to larger, more productive firms — matches the data qualitatively and within the range of data estimates across specifications.&lt;/p&gt;
&lt;h3 id="q5-what-does-the-amazon-spillover-replication-dnwt-2021-show-and-what-does-it-imply-about-the-minimum-wage-spillover-channel"&gt;Q5. What does the Amazon spillover replication (DNWT 2021) show, and what does it imply about the minimum wage spillover channel?&lt;/h3&gt;
&lt;p&gt;A: Derenoncourt et al. (2021) estimate a cross-employer wage elasticity of 0.26: when Amazon raised wages by approximately 18.1 percent, competitors raised wages by 4.7 percent on average. The model replicates this by treating Amazon as the largest (or second-largest) firm in each market, exogenously narrowing its markdown by a fraction ζ calibrated to deliver an 18.1 percent wage increase. Competitors in the model raise wages through the strategic interaction mechanism: Amazon&amp;rsquo;s higher wage and market share tightens competitors&amp;rsquo; residual supply curves, inducing them to narrow their own markdowns. The model matches the 0.26 cross-employer elasticity when Amazon is the largest firm in markets with at least 36 competitors, or the second-largest in markets with at least 12. Critically, the authors note that this empirical evidence concerns responses to a &lt;em&gt;large&lt;/em&gt; firm raising wages; for minimum wages the question is whether &lt;em&gt;large&lt;/em&gt; firms respond to their small wage competitors, which the model shows they do not substantially, because small firms have negligible market shares.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-paper-separate-efficiency-from-redistribution-and-what-is-the-key-methodological-innovation"&gt;Q6. How does the paper separate efficiency from redistribution, and what is the key methodological innovation?&lt;/h3&gt;
&lt;p&gt;A: The paper gives the government access to budget-neutral, unrestricted lump-sum transfers across households in addition to the minimum wage. With transfers available, the government can use them to meet any redistributive objective encoded in arbitrary social welfare weights. Whatever is left for the minimum wage to do must be purely efficiency-improving. The paper shows (via aggregation theorems) that optimal lump-sum transfers can be computed in closed form for any social welfare weights, and that the social welfare maximizing allocation subject to transfers can be decentralized by transfers that sum to zero across households. Under this framework, the efficiency-maximizing minimum wage lies between $7.50 and $10.00 per hour regardless of whether utilitarian, Negishi, or 97 percent high-school-weighted social welfare functions are used — collapsing the original $0–$31 range to a tight interval.&lt;/p&gt;
&lt;h3 id="q7-how-are-negishi-weights-computed-and-why-are-they-important-for-interpreting-the-results"&gt;Q7. How are Negishi weights computed, and why are they important for interpreting the results?&lt;/h3&gt;
&lt;p&gt;A: The Negishi weights are the social welfare weights under which a planner would choose the observed competitive equilibrium with zero lump-sum transfers. They are computed by inverting the planner&amp;rsquo;s first-order conditions: for the competitive equilibrium to be optimal under some set of weights, the implied consumption ratios must match observed data. The calibrated Negishi weights assign a combined weight of approximately 62 percent to college workers and owners, who constitute only 35 percent of the population. This means the competitive equilibrium is disproportionately aligned with higher-income households. A utilitarian planner, which weights households by population shares, therefore sees large scope for redistribution toward non-college workers — which is exactly why the utilitarian-optimal minimum wage is $15.12 and why 94 percent of its welfare gains come from redistribution rather than efficiency.&lt;/p&gt;
&lt;h3 id="q8-what-are-the-quantitative-welfare-gains-from-the-efficiency-maximizing-minimum-wage-and-how-small-are-they-relative-to-the-potential-gains-from-eliminating-monopsony"&gt;Q8. What are the quantitative welfare gains from the efficiency-maximizing minimum wage, and how small are they relative to the potential gains from eliminating monopsony?&lt;/h3&gt;
&lt;p&gt;A: With optimal lump-sum transfers, the welfare gains from the efficiency-maximizing minimum wage are approximately 0.16–0.20 percent in consumption-equivalent terms, robust across social welfare weight specifications, Frisch elasticity variations, and regional decompositions. The welfare gains associated with an economy in which all firms&amp;rsquo; markdowns are set to one (no labor market power at all), also evaluated with optimal transfers, are 15.26 percent in consumption-equivalent terms. The efficiency-maximizing minimum wage therefore recovers approximately 1–2 percent of the potential welfare gains from eliminating monopsony. Equivalently, the efficiency gains correspond to roughly a 0.1 percent increase in TFP. These gains are small despite the model matching all empirical evidence on the channels through which efficiency gains could occur.&lt;/p&gt;
&lt;h3 id="q9-how-do-employment-effects-of-minimum-wages-vary-by-market-concentration-and-why"&gt;Q9. How do employment effects of minimum wages vary by market concentration, and why?&lt;/h3&gt;
&lt;p&gt;A: In concentrated markets (upper tercile of HHI), firms have larger monopsony markdowns, so a binding minimum wage pushes them into Region II — where employment expands — over a wider range of minimum wage values before entering Region III. This produces large, positive employment effects in concentrated markets. In less concentrated markets, firms already have narrow markdowns (they are closer to competitive), so even small minimum wage increases push them into Region III, where employment contracts. The model replicates the statistically significant positive effects in high-concentration markets and negative effects in low-concentration markets documented by Azar et al. (2019), for initial minimum wages below approximately $8/hr. At higher initial minimum wages, however, even high-concentration markets exhibit negative employment effects as more firms enter Region III.&lt;/p&gt;
&lt;h3 id="q10-what-does-the-robustness-exercise-for-mississippi-reveal"&gt;Q10. What does the robustness exercise for Mississippi reveal?&lt;/h3&gt;
&lt;p&gt;A: Mississippi has the lowest per capita income in the US, and a $15 minimum wage would bind for 41.3 percent of its workers (versus 29.4 percent nationally). Despite this, the model finds that Mississippi would benefit from a $15 federal minimum wage under utilitarian weights, and the Mississippi-specific optimal minimum wage is $14.89 — nearly identical to the national optimum. The reason is an offsetting compositional effect: while Mississippi has lower average wages (pushing toward a lower optimal), it has a larger share of high-school graduates (63 percent versus 52.8 percent nationally) who prefer higher minimum wages (around $17 in the model). These two forces wash out, producing a stable optimal close to the national figure.&lt;/p&gt;
&lt;h3 id="q11-what-happens-to-common-empirical-proxies-for-inequality-and-worker-power-as-the-minimum-wage-rises"&gt;Q11. What happens to common empirical proxies for inequality and worker power as the minimum wage rises?&lt;/h3&gt;
&lt;p&gt;A: The college–non-college log wage premium declines from 0.53 to 0.43 (a fall of roughly one-fifth) as the minimum wage rises from $7.50 to $15. The cross-sectional variance of log wages falls by nearly half over this range, driven equally by declining within- and between-type inequality. The aggregate labor income share rises by approximately 3 percentage points, and the share of output created in non-high-school jobs paid to non-high-school workers rises by 7 percentage points. All of these proxies are monotonically improving in the minimum wage throughout, even as aggregate welfare under the model&amp;rsquo;s social welfare function is hump-shaped and declining past the optimum. The paper concludes that observations of declining inequality or a rising labor share are consistent with falling welfare, so these proxies cannot serve as reliable welfare indicators.&lt;/p&gt;
&lt;h3 id="q12-how-does-the-short-run-fixed-capital-analysis-differ-from-the-long-run-baseline"&gt;Q12. How does the short-run (fixed-capital) analysis differ from the long-run baseline?&lt;/h3&gt;
&lt;p&gt;A: In the short run, capital at each firm is fixed at the type-specific level chosen under a zero minimum wage. This creates sharper decreasing returns in labor (parameter γα rather than α̃), overhead costs that can make operation unprofitable, and a narrower range of minimum wages over which firms remain in Region II. The result is that firms in the short run enter Region III at lower minimum wages than in the long run, limiting the range of efficiency gains. Quantitatively, the efficiency-maximizing optimal minimum wage declines by approximately $1 under utilitarian weights (from about $10 to about $9 in the short-run exercise) and by only about $0.20 under Negishi weights. The robustness conclusion is that the difference between short- and long-run optimal minimum wages is modest, and the main finding that efficiency gains are small is preserved.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Shadow wage (w̃ᵢⱼ):&lt;/strong&gt; The effective wage that rationalizes a firm&amp;rsquo;s equilibrium employment in the presence of a minimum wage. When labor is rationed at firm ij (Region III), the shadow wage equals the actual minimum wage multiplied by a rationing factor pᵢⱼ &amp;lt; 1, where pᵢⱼ is derived from the Lagrange multiplier on the household&amp;rsquo;s rationing constraint. The shadow wage is allocative — it determines labor supply decisions — while the observed minimum wage wage is not. When the rationing constraint is slack (Regions I and II), the shadow wage coincides with the observed wage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Shadow markdown (µ̃ᵢⱼ):&lt;/strong&gt; The ratio of a firm&amp;rsquo;s shadow wage to its marginal revenue product of labor. In Region I (unconstrained), this equals the standard monopsony markdown. In Region II (constrained, on the labor supply curve), the shadow markdown narrows as the minimum wage increases, moving the firm toward its efficient employment level. In Region III (constrained, on the labor demand curve), the shadow markdown equals the rationing multiplier pᵢⱼ and widens, reflecting efficiency losses from rationing. An aggregate shadow markdown µ̃ is computed as a productivity-weighted average of firm-level shadow markdowns across all firms in the economy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Misallocation wedge (ω):&lt;/strong&gt; A productivity-weighted measure of how well employment is allocated across firms. In an efficient allocation with identical shadow markdowns, ω = 1. When high-productivity firms have wider markdowns than low-productivity firms (the baseline oligopsony outcome), ω &amp;lt; 1 because employment is directed away from productive firms. A minimum wage can improve ω by shrinking low-productivity firms but worsens it when high-productivity firms enter Region III and are over-rationed relative to medium-productivity firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Oligopsony with Cournot competition:&lt;/strong&gt; The specific form of labor market power in this model. In each local labor market (defined as a NAICS 3-digit industry × commuting zone cell), a finite number of firms compete strategically in employment quantities, taking their competitors&amp;rsquo; employment levels as given (Cournot assumption). Each firm has an upward-sloping labor supply curve derived from nested CES household preferences, and exercises a markdown on the marginal revenue product of labor. This differs from monopsony (one firm) or perfect competition (infinitely many firms), and generates both direct effects and spillover effects of minimum wages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Negishi weights:&lt;/strong&gt; The vector of social welfare weights under which the observed competitive equilibrium allocation would be the solution to a social planner&amp;rsquo;s problem with zero lump-sum transfers. In this model, the calibrated Negishi weights assign roughly 62 percent combined weight to college workers and owners (who constitute only 35 percent of the population), reflecting the fact that the market equilibrium allocates a disproportionate share of consumption to high-income households. The Negishi weights are used both to identify the gap between market outcomes and utilitarian objectives (motivating redistribution) and as one alternative normative benchmark.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Efficiency-maximizing minimum wage:&lt;/strong&gt; The minimum wage that maximizes social welfare when the government additionally has access to budget-neutral lump-sum transfers across households. Because transfers can be optimized to handle any redistributive objective encoded in any arbitrary social welfare weights, the minimum wage under this framework serves solely to improve productive efficiency. In the calibrated model, the efficiency-maximizing minimum wage is approximately $7.50–$10.00 per hour, robust to social welfare weight specifications, Frisch elasticity variations (ϕ ∈ {0.30, 0.86}), and regional income differences.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Rationing constraint (n̄ᵢⱼₖ):&lt;/strong&gt; A firm-specific, type-specific upper bound on the labor a household may supply to a firm in equilibrium. These constraints are taken as given by households and determined in equilibrium by firms&amp;rsquo; labor demand decisions. When the minimum wage is above the firm&amp;rsquo;s competitive wage (Region III), the firm&amp;rsquo;s labor demand is less than what households would want to supply at that wage, so the rationing constraint binds. The binding rationing constraint generates the shadow wage discount (pᵢⱼ &amp;lt; 1) and is the mechanism by which high minimum wages reduce efficiency in the model.&lt;/p&gt;</description></item><item><title>Monetary Cooperation during Global Inflation Surges</title><link>https://macropaperwarehouse.com/papers/monetary-cooperation-during-global-inflation-surges/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/monetary-cooperation-during-global-inflation-surges/</guid><description>&lt;p&gt;In a multicountry model with nominal wage rigidities, two sectors (tradable with convex supply, nontradable with flat supply), and free capital mobility, the paper studies optimal monetary policy during a global demand reallocation shock — a shift in preferences toward tradables (ω₀ &amp;gt; ω). Under cooperation (Proposition 1), the optimal response allows inflation to rise: higher tradable goods prices reduce real wages (restoring labor demand), generate expenditure switching back toward nontradables, and boost nontradable employment through an income effect. Cooperation achieves full employment as long as the inflation cost is below the full-employment threshold; otherwise it strikes the optimal inflation-unemployment balance. Under noncooperation (Proposition 3), each national central bank perceives it can attract capital inflows by raising its policy rate — inflows sustain nontradable demand and reduce the domestic sacrifice ratio of disinflation. But in a symmetric Nash equilibrium, synchronized rate hikes cancel each other through global credit market clearing; only the global monetary contraction remains. The result is lower inflation than under cooperation but higher unemployment — a &lt;strong&gt;competitive appreciation&lt;/strong&gt; trap that mirrors the competitive depreciation failures of the Great Depression and the 2008 crisis, but in the opposite direction (global scarcity rather than deficiency of tradables). In a numerical example calibrated to α = 0.64 (convex tradable supply, implying 0.57 price-output elasticity, from Boehm and Pandalai-Nayar 2022) and ω = 0.3 (US pre-COVID tradables share), a 3 percentage point demand reallocation (matching the US COVID episode) requires approximately 1.5 percentage points of inflation to maintain full employment under cooperation; without any inflation, unemployment rises by approximately 8 percentage points. At ω₀ = 0.35, the uncooperative equilibrium reduces inflation by approximately 1 percentage point relative to cooperation but pushes unemployment to approximately 7 percent. For the COVID-19 episode, the authors conclude gains from cooperation were likely small (full employment maintained globally); for the 1980s synchronized tightening — when central banks explicitly sacrificed employment to fight inflation — the model implies substantially positive gains, consistent with the heated cooperation debates and the 1985 Plaza Accord.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-how-does-a-demand-reallocation-shock-generate-an-inflation-unemployment-tradeoff"&gt;Q1. How does a demand reallocation shock generate an inflation-unemployment tradeoff?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A shift in preferences toward tradable goods (ω₀ &amp;gt; ω) reduces demand for nontradable goods, causing nontradable firms to fire workers; since nominal wages are rigid, the only way to sustain full employment is through a rise in the price of tradables (P^T), which operates through three distinct channels.&lt;/strong&gt; First, higher P^T raises tradable sector firms&amp;rsquo; real revenue per worker (nominal wages fixed), inducing them to hire more workers and expand output — the direct labor demand channel. Second, higher P^T generates income effects: as tradable output and income rise, households increase consumption of both tradable and nontradable goods, boosting nontradable employment through the income channel. Third, higher P^T generates expenditure switching away from tradables and toward nontradables (since nontradable goods become relatively cheaper), which also sustains nontradable employment. All three channels require letting P^T rise, which means tolerating inflation. In this sense, the demand reallocation shock acts as a cost-push shock — it shifts the Phillips curve upward, so that higher inflation is required to achieve any given level of employment. If the inflation cost is sufficiently low, the optimal response allows full employment; otherwise an interior solution trades off inflation against economic slack.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-optimal-cooperative-monetary-policy-and-how-large-are-the-quantitative-tradeoffs"&gt;Q2. What is the optimal cooperative monetary policy, and how large are the quantitative tradeoffs?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Proposition 1: under international cooperation, the optimal response to ω₀ &amp;gt; ω entails a rise in inflation; if the full-employment inflation level P^fe satisfies χ&amp;rsquo;(P^fe) ≤ (1/ω₀)(α/(1−α) + 1 − ω₀), the cooperative optimum achieves full employment; otherwise the interior optimum sets χ&amp;rsquo;(P̄) equal to that expression, balancing marginal inflation cost against marginal employment benefit.&lt;/strong&gt; The cooperative optimum is strictly superior to strict inflation targeting (P = 1) because the latter allows large unemployment without achieving any structural rebalancing. The global central bank internalizes the income effect from tradable expansion: as Y^T rises, households immediately spend the income on consumption of both goods, further boosting nontradable employment — an amplification mechanism that self-oriented national banks will not fully internalize. In the calibrated numerical example (α = 0.64, ω = 0.3, χ(P) = χ̄(P−1)²/2 with χ̄ = 299.25), a reallocation shock matching the US COVID-19 episode (ω₀ − ω ≈ 0.03) requires approximately 1.5 percentage points of inflation to maintain full employment; under strict inflation targeting (P = 1), unemployment rises by approximately 8 percentage points. These magnitudes are consistent with the observation that during the pandemic inflation cycle, central banks were willing to allow inflation rather than trigger a labor market collapse.&lt;/p&gt;
&lt;h3 id="q3-how-does-capital-mobility-change-the-inflation-unemployment-tradeoff-faced-by-individual-countries"&gt;Q3. How does capital mobility change the inflation-unemployment tradeoff faced by individual countries?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Capital mobility reduces the domestic sacrifice ratio — the employment cost of disinflation — through two channels: trade deficits directly sustain nontradable demand (offsetting the fall in tradable sector employment), and they buffer tradable consumption from drops in domestic tradable output.&lt;/strong&gt; When a single country contracts its monetary policy and P^T falls, domestic tradable output falls; but households react by borrowing internationally, so domestic consumption of tradables falls by less than one-for-one with output (formally: ∂C^T/∂Y^T = ω_{i,0}(1−β)/(ω_{i,0}(1−β)+β) &amp;lt; 1). Capital inflows thus sustain nontradable demand and nontradable employment, partially offsetting the contractionary effect on employment. From each country&amp;rsquo;s perspective, containing inflation &amp;ldquo;exports&amp;rdquo; part of the output loss abroad, making disinflation individually less costly than in a closed economy. This is precisely what creates the coordination failure in the global case: each country perceives a lower sacrifice ratio for disinflation because it does not internalize that this lower sacrifice ratio exists only if the rest of the world continues to produce and lend tradable goods.&lt;/p&gt;
&lt;h3 id="q4-how-does-the-coordination-failure-arise-in-a-global-reallocation-shock-and-what-is-the-precise-mechanism-of-competitive-appreciations"&gt;Q4. How does the coordination failure arise in a global reallocation shock, and what is the precise mechanism of competitive appreciations?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Proposition 3: in a Nash equilibrium with a global symmetric shock, the full-employment inflation level P^fe coincides with the cooperative benchmark (since C^T_i = Y^T_i in symmetric equilibrium and capital flows net to zero), but if the inflation cost is high enough, self-oriented central banks impose a lower inflation ceiling (MP^u &amp;lt; MP^c) — resulting in lower inflation and higher unemployment than cooperation.&lt;/strong&gt; Each national central bank individually seeks to reduce domestic inflation by hiking its policy rate to attract capital inflows (which ease the nontradable sector employment constraint through the open economy Phillips curve). But the individual strategy of hiking to attract inflows — which amounts to trying to appreciate the exchange rate (S_i = P^T_{i,t}/P^T_t) — is frustrated in a symmetric Nash equilibrium: when all countries hike simultaneously, capital flows net to zero globally, exchange rates remain unchanged, and only the synchronized monetary contraction remains. This is the mechanism of &lt;strong&gt;competitive appreciations&lt;/strong&gt;: countries try to fight domestic inflation by appreciating their currencies, but appreciate against each other, leaving only a global slump. In the numerical example at ω₀ = 0.35, the uncooperative equilibrium reduces inflation by approximately 1 percentage point relative to cooperation but pushes unemployment to approximately 7 percent (vs. full employment under cooperation at that shock size).&lt;/p&gt;
&lt;h3 id="q5-how-do-competitive-appreciations-differ-from-competitive-depreciations-and-what-are-the-scope-conditions"&gt;Q5. How do competitive appreciations differ from competitive depreciations, and what are the scope conditions?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Competitive appreciations are the mirror image of competitive depreciations (which characterized the Great Depression and the aftermath of the 2008 GFC): in both cases each country uses its monetary policy to shift costs abroad, but the direction differs — depreciations arise during periods of weak global demand when countries try to steal demand from neighbors; appreciations arise during periods of global tradable goods scarcity and high inflation when countries try to export inflation.&lt;/strong&gt; The structural difference is the initial state: competitive depreciations occur when global aggregate demand is deficient and the zero lower bound binds — each country wants to depreciate to boost exports; competitive appreciations occur when global demand for tradables is strong relative to supply (ω₀ &amp;gt; ω) and inflation is high — each country wants to appreciate to attract capital inflows that buffer domestic employment from disinflation. The key asymmetry is the direction of the international spillover: in the depreciation case, countries export demand; in the appreciation case, countries export inflation costs. The gains from cooperation in both cases arise for the same reason — the Nash equilibrium involves globally excessive monetary tightening or loosening relative to the cooperative benchmark — but the policy recommendation is opposite in sign.&lt;/p&gt;
&lt;h3 id="q6-what-do-the-models-predictions-imply-for-the-covid-19-episode-and-the-1980s-disinflation-and-when-do-gains-from-cooperation-materialize"&gt;Q6. What do the model&amp;rsquo;s predictions imply for the COVID-19 episode and the 1980s disinflation, and when do gains from cooperation materialize?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Gains from monetary cooperation arise only when condition (28) is violated — when central banks are willing to sacrifice full employment to fight inflation; for the COVID-19 episode, gains were likely small (labor markets remained strong throughout); for the 1980s synchronized tightening, the model implies positive gains that would have been achievable through international cooperation.&lt;/strong&gt; For the COVID-19 episode: throughout the 2021–2023 inflation cycle, unemployment rates in advanced economies remained low and fiscal support maintained aggregate demand, suggesting monetary policy did not sacrifice employment — the model implies condition (28) did not bind and the cooperative optimum was approximately achieved. The world &amp;ldquo;escaped competitive appreciations this time.&amp;rdquo; For the 1980s disinflation: the synchronized monetary tightening under Volcker (US), Bundesbank (Germany), and others was accompanied by a deep global recession and explicitly prioritized inflation reduction over employment — precisely the conditions under which condition (28) binds and competitive appreciations generate a suboptimal outcome. These dynamics motivated the heated international cooperation debates of the period, culminating in the Plaza Accord of 1985 (Sachs 1985; Frankel 2015). The model also applies to negative tradable supply shocks (supply chain disruptions, tariffs) in Supplemental Appendix E, so its predictions about cooperation gains extend to protectionist-driven scarcity.&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;demand reallocation shock&lt;/strong&gt; : a shift in the preference weight on tradable goods (ω₀ &amp;gt; ω) that reduces nontradable demand relative to tradable demand; in the model it corresponds to a structural demand shift toward durables and goods (as observed during the COVID-19 recovery), creating simultaneous inflationary pressure in tradables and deflationary pressure in nontradables, and generating an inflation-unemployment tradeoff absent in standard cost-push formulations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;convex tradable supply&lt;/strong&gt; : the feature of the tradable sector (parameterized by α &amp;gt; 0) whereby supply is upward-sloping due to capacity constraints — a 1% rise in the tradable goods price P^T raises tradable output by (1−α)/α percent; calibrated to α = 0.64 (implying a 0.57 price-output elasticity) following Boehm and Pandalai-Nayar (2022) for sectors at high capacity utilization; without this feature, tradable supply would be perfectly elastic and the inflation-unemployment tradeoff would disappear.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;competitive appreciations&lt;/strong&gt; : the Nash equilibrium coordination failure in which each national central bank hikes its policy rate to attract capital inflows (reducing domestic disinflation costs), generating nominal exchange rate appreciation; since all countries do this simultaneously, appreciations cancel out in equilibrium, leaving only a globally excessive monetary contraction with lower-than-cooperative inflation and higher-than-cooperative unemployment; mirror image of competitive depreciations but arising from global scarcity (not deficiency) of tradables.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;sacrifice ratio&lt;/strong&gt; : the employment cost per unit of disinflation; reduced in open economies relative to closed economies because capital inflows buffer domestic tradable consumption from drops in domestic tradable output, and sustain nontradable demand; self-oriented central banks perceive a lower sacrifice ratio than a global central bank, which is the source of the competitive appreciation externality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;nominal wage rigidity&lt;/strong&gt; : the short-run friction that makes demand reallocation shocks costly: with flexible wages, reallocation from nontradable to tradable employment would occur through real wage adjustment alone; with rigid nominal wages, real wages fall only if tradable goods prices rise (inflation), so monetary accommodation is required for structural reallocation without unemployment.&lt;/p&gt;</description></item><item><title>Monetary Policy and Sovereign Risk in Emerging Economies (NK-Default)</title><link>https://macropaperwarehouse.com/papers/monetary-policy-and-sovereign-risk-in-emerging-economies-nk-default/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/monetary-policy-and-sovereign-risk-in-emerging-economies-nk-default/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper develops a New Keynesian small open economy model with endogenous sovereign default — the NK-Default framework — and uses it to study the interplay between monetary policy and sovereign risk in emerging markets. The core finding is that sovereign default risk amplifies inflation volatility through an expectations channel: when default risk rises, forward-looking firms increase prices in expectation of high future inflation and depressed consumption during a potential default, so that current inflation rises even before any default occurs. Conversely, tight monetary policy disciplines government overborrowing by raising the cost of domestic monetary distortions, which the government internalizes by reducing its borrowing. Calibrated to eight emerging-market inflation targeters (Brazil, Chile, Colombia, Mexico, Peru, Philippines, Poland, South Africa) over 2004–2019, the model quantitatively matches the positive comovement of spreads with inflation and nominal rates, and the temporary nature of inflation events (approximately 4.5% inflation spike, 2.3% spread increase, resolved within roughly a year). Counterfactual experiments find that default risk accounts for approximately 50% of both inflation business-cycle volatility and the inflation increase during these events, and that a 1% tighter monetary policy would reduce spreads by about 0.3% during inflation events. An interest rate rule augmented to respond to default risk dominates strict inflation targeting in welfare and reduces mean spreads by 2.2 percentage points; strict inflation targeting is not the optimal monetary regime when sovereign risk is present.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a published paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-structural-architecture-of-the-nk-default-model"&gt;Q1. What is the structural architecture of the NK-Default model?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The NK-Default framework combines the standard New Keynesian small open economy model of Gali and Monacelli (2005) with the Eaton-Gersovitz (1981) sovereign default structure extended to long-term foreign-currency debt, producing a model in which households, firms, a monetary authority, and a fiscal government all interact.&lt;/strong&gt; Households consume domestic and foreign goods and supply labor; intermediate goods producers are monopolistically competitive and set prices subject to Rotemberg (1982) quadratic adjustment costs, generating a forward-looking New Keynesian Phillips Curve (NKPC); the monetary authority follows a nominal interest rate rule targeting domestic goods inflation; and the government borrows internationally in long-term foreign-currency perpetuity bonds, choosing each period whether to repay or default, with default leading to temporary exclusion from international financial markets and a transitory productivity reduction. The bond price schedule compensates risk-neutral international lenders for expected losses from default and falls with the government&amp;rsquo;s indebtedness. A key methodological choice is the use of global solution methods rather than local approximations, because the nonlinear dynamics around default are central to the mechanisms.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-default-amplification-mechanism-and-how-does-it-transmit-to-inflation"&gt;Q2. What is the default amplification mechanism, and how does it transmit to inflation?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Default amplification operates through an expectations channel encoded in the forward-looking NKPC: when default risk rises, firms&amp;rsquo; expectations of higher future inflation (during the inflation that would accompany a default event) and lower future consumption (because default depresses productivity and restricts borrowing) both increase, causing firms to raise current prices, generating current inflation without any contemporaneous policy change.&lt;/strong&gt; Formally, the NKPC relates current inflation π to a unit-cost term and to the expectation term E[Y&amp;rsquo;u&amp;rsquo;_C(π&amp;rsquo;-π)π&amp;rsquo;], which increases with default risk because default states feature high inflation and high marginal utility. The resulting current inflation increase then triggers the monetary authority&amp;rsquo;s interest rate rule to tighten, which in turn depresses consumption through the Euler equation, amplifying the monetary distortion (wedge). In the simplified quasi-linear preferences setting, higher borrowing B&amp;rsquo; increases functions F and M — the expectation terms in the NKPC and Euler equation — and Proposition 1 establishes formally that higher borrowing raises default risk, inflation, the nominal domestic rate, and the monetary wedge under Assumption 1.&lt;/p&gt;
&lt;h3 id="q3-how-does-monetary-policy-discipline-sovereign-borrowing"&gt;Q3. How does monetary policy discipline sovereign borrowing?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Tight monetary policy disciplines government overborrowing because the government internalizes the additional costs that monetary distortions impose on the economy: when the monetary authority raises interest rates, the resulting monetary wedge — the gap between the marginal product of labor and households&amp;rsquo; marginal rate of substitution — acts as an additional cost on borrowing from the government&amp;rsquo;s perspective, discouraging excessive debt accumulation.&lt;/strong&gt; Proposition 2 establishes this formally: under Assumption 2 (one-time deviation from constrained efficiency), a policy rate i &amp;gt; i_ST (above the strict-inflation-targeting rate) generates a positive monetary wedge that modifies the government&amp;rsquo;s optimal borrowing condition with an additional term reflecting the cost to the sovereign of the higher wedge its borrowing induces. Contractionary monetary policy thus reduces the incentive to borrow and lowers equilibrium default risk. The paper also derives Proposition 3: a default-risk monetary rule of the form i = ī·Φ^αD can achieve the constrained-efficient default risk and an arbitrarily small monetary wedge simultaneously, by choosing αD appropriately — meaning that targeting default risk can address both the pricing friction and the overborrowing incentive.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-quantitative-findings-on-default-amplification-and-the-disciplining-mechanism"&gt;Q4. What are the quantitative findings on default amplification and the disciplining mechanism?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Quantitatively, default risk accounts for approximately 50% of inflation business-cycle volatility and approximately 50% of the inflation increase during the temporary inflation events (4.5 p.p. inflation spike, 2.3 p.p. spread increase, nominal rate rise from baseline 5–6% to 8–9%), based on comparison with a reference model without default.&lt;/strong&gt; For the disciplining mechanism, panel-data regressions using monetary policy shocks recovered from estimated Taylor rules across the eight countries find that a 1% contractionary monetary shock reduces sovereign spreads, consistent with model predictions. During the inflation events, a 1% tighter monetary policy would have reduced spreads by approximately 0.3 percentage points. Comparing alternative monetary policy regimes against strict inflation targeting (which implements flexible-price allocation): the baseline interest rate rule (responding only to inflation) reduces mean spreads by 0.5 percentage points relative to strict inflation targeting; an augmented rule that also responds to default risk reduces mean spreads by 2.2 percentage points. Welfare under the baseline rule exceeds that under strict inflation targeting, and welfare under the default-risk rule exceeds both, with the ranking holding across all robustness extensions.&lt;/p&gt;
&lt;h3 id="q5-how-does-the-model-fit-the-data-across-targeted-and-untargeted-moments"&gt;Q5. How does the model fit the data across targeted and untargeted moments?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The model is calibrated to match key business-cycle statistics of the eight emerging-market inflation targeters and successfully replicates several untargeted moments, including the positive correlations of spreads with inflation (mean 0.5 across countries in data) and nominal rates (mean 0.3), the relative volatility of inflation to output (mean 0.8), and the mean spread level of approximately 2%.&lt;/strong&gt; The temporary inflation events — constructed as windows around periods of elevated inflation — are matched with a combination of low productivity shocks and expansionary monetary shocks, and the model&amp;rsquo;s impulse response functions for inflation, output, nominal rates, and spreads during these events align with the empirical paths. The model also fits the positive elasticity of inflation expectations to default risk and the negative elasticity of spreads to monetary policy shocks, both of which are estimated from data and used as untargeted validation moments. Structurally, the model is parameterized to match the mean and volatility of inflation, spreads, and the correlation of spreads with output (mean -0.5 across countries), among other moments.&lt;/p&gt;
&lt;h3 id="q6-how-do-the-models-results-hold-up-across-extensions-especially-local-currency-debt-and-discretionary-monetary-policy"&gt;Q6. How do the model&amp;rsquo;s results hold up across extensions, especially local currency debt and discretionary monetary policy?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The main results — default amplifies inflation, tight monetary policy disciplines borrowing, and the default-risk rule dominates strict inflation targeting — are robust across all extension economies, including the case of local currency sovereign debt, alternative default costs (no productivity loss, endogenous domestic financial frictions), and loose monetary policy during defaults.&lt;/strong&gt; In the local currency debt extension, which introduces the classic incentive to erode debt via inflation, the paper shows that monetary discretion delivers substantially worse outcomes: average inflation doubles relative to the commitment case and — crucially — sovereign spreads also double under discretion, because market participants anticipate the inflationary incentive. This result shows that the disciplining benefits of commitment in monetary policy rules extend to the sovereign debt dimension: the country&amp;rsquo;s ability to commit to a rule lowers spreads by reducing the expected future inflation that lenders must be compensated for. The endogenous financial frictions extension — in which banking sector health depends on nominal rates and spreads — generates similar monetary-fiscal interactions, confirming that the mechanisms are not specific to the productivity-cost assumption.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-papers-relationship-to-the-literature-on-nominal-rigidities-and-sovereign-default"&gt;Q7. What is the paper&amp;rsquo;s relationship to the literature on nominal rigidities and sovereign default?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The NK-Default framework differs critically from related papers that introduce downward nominal wage rigidity (e.g., Na, Schmitt-Grohe, Uribe, Yue 2018; Bianchi, Ottonello, Presno 2023) in that price-setting frictions arise from optimal forward-looking pricing by monopolistically competitive firms under Rotemberg costs, not from a mechanical wage floor, so that inflation expectations matter for current inflation and output in a standard NKPC.&lt;/strong&gt; This means that expected future default events — through their effects on expected inflation and expected marginal utility — transmit to current equilibrium in a way that downward-rigid-wage models cannot replicate. The paper also differs from the literature studying the inflation incentive for local-currency debt dilution (e.g., Calvo 1988; Du, Pflueger, Schreger 2020): the baseline model assumes foreign-currency debt and a rule-based monetary authority that has no incentive to inflate away debt, so the mechanisms operate through expectations and discipline rather than through the debt-erosion channel. The paper connects these strands in the local-currency extension.&lt;/p&gt;
&lt;h3 id="q8-what-are-the-welfare-and-policy-implications-for-central-bank-mandates-in-emerging-markets"&gt;Q8. What are the welfare and policy implications for central bank mandates in emerging markets?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper provides formal support for monetary policy rules that respond to financial or sovereign-risk conditions — beyond standard inflation targeting — in emerging economies: the welfare ranking is default-risk rule &amp;gt; baseline rule &amp;gt; strict inflation targeting, with the gap between the default-risk rule and strict inflation targeting driven by lower mean and volatility of spreads, which reduce the frequency and severity of default amplification events.&lt;/strong&gt; Strict inflation targeting, which delivers the flexible-price allocation, is not optimal because it leaves the overborrowing incentive of the fiscal government unchecked, generating excessive default risk that feeds back into inflation volatility through the expectations channel. A monetary rule with sufficient responsiveness to inflation or to default risk disciplines fiscal behavior and reduces welfare costs from both pricing frictions and default risk, suggesting that emerging-market central bank mandates that focus exclusively on inflation targeting at the expense of financial stability considerations may be suboptimal relative to rules that jointly address monetary and fiscal distortions.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;dl&gt;
&lt;dt&gt;&lt;strong&gt;NK-Default framework&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the paper&amp;rsquo;s model combining a New Keynesian small open economy (Gali-Monacelli structure with Rotemberg price-setting frictions and a Taylor-type interest rate rule) with the Eaton-Gersovitz endogenous sovereign default structure extended to long-term foreign-currency perpetuity bonds; the joint treatment of monetary policy and sovereign risk for emerging economies.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;default amplification&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the mechanism by which elevated sovereign default risk increases current inflation and depresses output through the forward-looking NKPC expectations channel: firms raise prices in anticipation of high future inflation and low consumption during a potential default, so current inflation rises even without any contemporaneous fiscal action; established as Proposition 1 in the simplified model and confirmed quantitatively.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;monetary discipline&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the mechanism by which contractionary monetary policy raises the cost of government borrowing through monetary distortions (the monetary wedge), inducing the fiscal government to reduce its indebtedness and thereby lowering equilibrium default risk; established as Proposition 2 and confirmed empirically using panel-data regressions of spreads on monetary policy shocks.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;monetary wedge&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the deviation of the marginal product of labor from households&amp;rsquo; marginal rate of substitution between labor and consumption, arising from price-setting frictions; serves as the quantitative measure of monetary distortions and is the channel through which monetary policy affects government borrowing incentives.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;default-risk monetary rule&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;an interest rate rule of the form i = ī·Φ^αD that responds directly to the one-period-ahead default probability Φ; shown in Proposition 3 to achieve both the constrained-efficient level of government debt and an arbitrarily small monetary wedge simultaneously, by incorporating an additional cost of borrowing for the fiscal government through the rule&amp;rsquo;s response.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;temporary inflation events&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;empirical regularities in eight emerging-market inflation targeters in which inflation, spreads, and nominal policy rates temporarily spike together (inflation rises approximately 4.5%, spreads by 2.3%, within roughly one year) before reverting to lower levels; the model replicates these patterns using a combination of low productivity shocks and expansionary monetary shocks.&lt;/dd&gt;
&lt;/dl&gt;</description></item><item><title>Monetary policy in open economies with production networks</title><link>https://macropaperwarehouse.com/papers/monetary-policy-in-open-economies-with-production-networks/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/monetary-policy-in-open-economies-with-production-networks/</guid><description>&lt;p&gt;This paper studies the design of monetary policy in a multi-sector small open economy with domestic input-output linkages and cross-border production networks, under nominal price rigidities in domestic sectors. The main result is that the monetary policy that closes the domestic output gap is nearly optimal, and it is implemented by stabilizing an aggregate inflation index that weights each sector&amp;rsquo;s inflation by its role as a supplier of inputs and a net exporter within the international production network. Sectors with small direct or indirect import shares receive large weight in the index; ignoring cross-border linkages leads monetary policy to overemphasize inflation in sectors that are intensive exporters directly or indirectly through downstream sectors. Three channels link sectoral markup wedges to the aggregate output gap: the CPI channel (present in closed economies too) and the net export income and net profit income channels (unique to open economies with cross-border linkages). Using the World Input-Output Database, the output-gap-closing policy is shown to outperform alternatives that abstract from economic openness or input-output linkages.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-output-gap-monetary-policy-and-how-is-it-implemented"&gt;Q1. What is the output gap monetary policy and how is it implemented?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The output gap (OG) monetary policy stabilizes the aggregate inflation index proportional to the aggregate output gap—defined as the difference between output in the sticky-price equilibrium and in the efficient flexible-price equilibrium—by weighting each sector&amp;rsquo;s inflation according to the product of its price rigidity and its OG weight.&lt;/strong&gt; The price rigidity component maps positive sectoral inflation into a negative sectoral markup wedge under nominal rigidities; the OG weight measures the sector&amp;rsquo;s contribution to the aggregate output gap through domestic and cross-border network linkages. This policy eliminates first-order aggregate distortions and is shown to be nearly optimal.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-cpi-net-export-income-and-net-profit-income-channels"&gt;Q2. What are the CPI, net export income, and net profit income channels?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Three channels link a negative sectoral markup wedge to a positive aggregate output gap: the CPI channel (lower domestic prices raise real factor prices and stimulate supply), the net export income channel (lower domestic prices increase net exports and domestic labor income), and the net profit income channel (two opposing effects: lower prices increase net export profits but also raise the cost of imported inputs).&lt;/strong&gt; The CPI channel operates in closed economies as well, while the net export income and net profit income channels are unique to open economies with cross-border input-output linkages.&lt;/p&gt;
&lt;h3 id="q3-why-does-ignoring-cross-border-linkages-lead-monetary-policy-to-overweight-intensive-exporters"&gt;Q3. Why does ignoring cross-border linkages lead monetary policy to overweight intensive exporters?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Failing to account for cross-border production networks causes monetary policy to overemphasize inflation in sectors that export intensively directly and indirectly—because the net profit income channel, which reduces the OG contribution of intensive exporters by raising the cost of their imported inputs, is omitted when the economy is treated as closed.&lt;/strong&gt; Without the cross-border linkages, intensive direct or indirect exporting sectors appear to have larger aggregate output gap contributions through domestic channels alone, causing the aggregate inflation index to over-weight those sectors.&lt;/p&gt;
&lt;h3 id="q4-how-do-the-results-relate-to-existing-monetary-policy-frameworks"&gt;Q4. How do the results relate to existing monetary policy frameworks?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper bridges the multi-sector closed-economy result that optimal policy targets a Domar-weighted aggregate inflation index and the one-sector open-economy result that optimal policy trades off domestic inflation against terms-of-trade distortions, showing that cross-border input-output linkages modify the Domar weights through the net export income and net profit income channels.&lt;/strong&gt; In the limit where all sectors have no cross-border linkages, the OG weights reduce to Domar weights; the one-sector open economy policy prescription is a special case of the general framework.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-empirical-validation"&gt;Q5. What is the empirical validation?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Using the World Input-Output Database, the paper computes the theoretical sectoral OG weights for actual economies and shows that the OG monetary policy outperforms alternative policies that ignore either economic openness or input-output linkages.&lt;/strong&gt; The database provides cross-country cross-sector data on intermediate input flows that allow computation of the model&amp;rsquo;s OG weight formulas for real economies.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;output gap (OG) monetary policy&lt;/strong&gt; : the monetary policy that closes the aggregate output gap (difference between sticky-price and efficient flexible-price output), implemented by stabilizing the network-weighted aggregate inflation index; shown to be nearly optimal in the open-economy production network framework.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;sectoral OG weight&lt;/strong&gt; : the weight assigned to a sector&amp;rsquo;s inflation in the aggregate inflation index under OG monetary policy; measures the sector&amp;rsquo;s contribution to the aggregate output gap through the CPI, net export income, and net profit income channels; differs from the Domar weight in open economies due to cross-border linkages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Domar weight&lt;/strong&gt; : the ratio of a sector&amp;rsquo;s gross output to GDP; the weight used in the closed-economy multi-sector optimal inflation index literature; coincides with the OG weight when there are no cross-border production linkages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;labor wedge&lt;/strong&gt; : a weighted average of sectoral markup wedges proportional to the aggregate output gap; the monetary policy target in the OG framework.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;efficiency wedge&lt;/strong&gt; : a weighted average of exogenous sectoral shocks; determines the efficient flexible-price equilibrium; independent of sectoral markup wedges at first order, so the OG policy can separately close the aggregate distortions caused by markup wedges.&lt;/p&gt;</description></item><item><title>Monetary policy trade-offs amid global supply chain disruptions</title><link>https://macropaperwarehouse.com/papers/monetary-policy-trade-offs-amid-global-supply-chain-disruptions/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/monetary-policy-trade-offs-amid-global-supply-chain-disruptions/</guid><description>&lt;p&gt;This paper employs a proxy structural VAR model to examine the effects of global supply chain (GSC) shocks on U.S. macroeconomic variables and the Federal Reserve&amp;rsquo;s historical response, and evaluates two counterfactual monetary policy rules using the COVID-19 episode. Large fiscal stimulus amplifies inflation while cushioning the output downturn from GSC shocks. Historically, the Fed adopted a loose stance, looking through price surges from supply chain disruptions. The first counterfactual—which stabilizes inflation—entails less accommodation and yields a more favorable inflation-output trade-off, reflecting greater price flexibility and limited output losses. The second counterfactual—which minimizes a dual-mandate loss function—calls for greater initial easing; under inflation targeting (IT) this involves moderate accommodation, while under average inflation targeting (AIT) the looser initial policy generates more persistent inflation and ultimately requires a contractionary response, worsening the trade-off.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on a working paper version, AI-assisted and human-reviewed. See the linked published article for the authoritative version.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-empirical-strategy"&gt;Q1. What is the empirical strategy?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper estimates a proxy structural VAR model that identifies GSC shocks using the news-based Supply Bottleneck Index (SBI) of Burriel et al. (2024) as a proxy, then evaluates the Fed&amp;rsquo;s historical response to those shocks and two counterfactual policy rules that substitute for the historical stance.&lt;/strong&gt; The proxy SVAR approach identifies the GSC shock&amp;rsquo;s impulse response function and then traces the macroeconomic dynamics that would have obtained under alternative policy rules, holding the non-policy shocks at their historical values. The SBI captures sudden decreases in supply chain functioning from natural disasters, geopolitical events, strikes, and pandemics.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-role-of-fiscal-stimulus-in-amplifying-gsc-shock-effects"&gt;Q2. What is the role of fiscal stimulus in amplifying GSC shock effects?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Large fiscal stimulus—such as the U.S. transfers and spending during COVID-19—amplifies the inflationary impact of GSC shocks while cushioning the output downturn; the interaction between supply disruptions and fiscal expansion is thus an important determinant of the inflation-output dynamics.&lt;/strong&gt; Without the large fiscal stimulus, GSC shocks would generate the standard supply-shock trade-off with less amplified inflation. With stimulus, the combination of higher aggregate demand (from fiscal transfers) and reduced aggregate supply (from GSC disruptions) creates a strongly inflationary environment.&lt;/p&gt;
&lt;h3 id="q3-what-does-the-first-counterfactual-inflation-stabilizing-policy-show"&gt;Q3. What does the first counterfactual (inflation-stabilizing policy) show?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The counterfactual that stabilizes inflation requires less monetary accommodation than the historical stance and yields a more favorable inflation-output trade-off, suggesting that the Fed&amp;rsquo;s historical &amp;rsquo;look-through&amp;rsquo; approach was suboptimal given the interaction with fiscal stimulus.&lt;/strong&gt; The intuition is that earlier and firmer monetary tightening in response to GSC-driven inflation would have reduced inflation expectations pass-through and prevented a larger buildup of price pressures, while the output cost of that tighter stance was limited by the greater price flexibility the model identifies in this environment.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-comparison-between-it-and-ait-in-the-second-counterfactual"&gt;Q4. What is the comparison between IT and AIT in the second counterfactual?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The second counterfactual calls for greater initial easing than the historical stance; under inflation targeting (IT) this involves moderate accommodation, while average inflation targeting (AIT) implies an even looser initial policy that generates more persistent inflation and ultimately requires a contractionary response, worsening the inflation-output trade-off relative to IT.&lt;/strong&gt; The AIT result reflects the design of that framework: making up for periods of below-target inflation with above-target periods creates a commitment to easy policy even when supply-side inflationary pressures are elevated, producing a worse outcome when supply shocks drive inflation above target.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;proxy structural VAR&lt;/strong&gt; : a structural VAR identified using an external instrument (the proxy variable) that is correlated with the structural shock of interest but uncorrelated with other shocks; used here to identify GSC shocks using the Supply Bottleneck Index.
&lt;strong&gt;global supply chain (GSC) shock&lt;/strong&gt; : a sudden decrease in the supply provision or functioning of supply chains stemming from adverse events (natural disasters, pandemics, geopolitical events); identified in this paper as acting like supply shocks, lowering output and raising prices.
&lt;strong&gt;average inflation targeting (AIT)&lt;/strong&gt; : a monetary policy framework in which the central bank targets the average rate of inflation over time, implying accommodation of below-target periods with above-target periods; shown here to imply looser initial policy and more persistent inflation in response to supply shocks, worsening the trade-off relative to standard IT.&lt;/p&gt;</description></item><item><title>Motivating banks to lend? Credit spillover effects of the Main Street Lending Program</title><link>https://macropaperwarehouse.com/papers/motivating-banks-to-lend-credit-spillover-effects-of-the-main-street-lending-program/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/motivating-banks-to-lend-credit-spillover-effects-of-the-main-street-lending-program/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Minoiu, Zarutskie, and Zlate ask whether participation in the Main Street Lending Program (MSLP)—a Federal Reserve emergency facility launched in mid-2020 to channel credit to small and mid-sized firms during the COVID-19 pandemic—caused banks to lend more &lt;em&gt;outside&lt;/em&gt; the program. The authors focus on credit spillover effects: did MSLP-participating banks ease standards and expand volumes on their general commercial and industrial (C&amp;amp;I) loan books, beyond the direct loans originated under the program itself?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Institutional Context.&lt;/strong&gt; The MSLP opened for lender registration on June 15, 2020 and began accepting loan submissions on July 6, 2020, expiring December 31, 2020. Of $600 billion in available SPV capacity, only $16.05 billion was actually deployed, making overall take-up approximately 2.7% of capacity. Despite this, the program required participating banks to retain 5% of each loan&amp;rsquo;s credit risk while offloading 95% to the SPV, and charged borrowers LIBOR plus 300 bps. Registration rate among all Call Report banks was 11.7% (614 out of 5,242 banks), with participation rising steeply with bank size: from 6.5% of banks in the below-$1-billion asset group to 63.8% of banks with assets above $50 billion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology.&lt;/strong&gt; The analysis draws on multiple data sources: (a) supervisory Y-14Q H1 loan-level data covering C&amp;amp;I loans above $1 million commitments, reported by 32 bank holding companies (BHCs) that account for roughly three-quarters of total U.S. C&amp;amp;I loans; (b) Y-14Q A9 loan portfolio segment data for small business C&amp;amp;I loans (below $1 million commitments) from 22 BHCs; (c) quarterly Senior Loan Officer Opinion Survey (SLOOS) microdata for April, July, and October 2020, providing bank-level assessments of lending standard changes, loan terms, demand shifts, and stated reasons for tightening; (d) Dealscan syndicated loan originations for 262 banks (51 MSLP participants); and (e) bank balance sheet data from Call Reports, including the Ellul-Yerramilli risk management index (RMI) for 16 BHCs. The core empirical design is a difference-in-differences (DiD) comparing MSLP-participating vs. non-participating banks before (2020:Q1–Q2) and after (2020:Q3) program implementation. To address nonrandom selection, the authors instrument MSLP participation with three variables: (i) a dummy for banks that cited registration as &amp;ldquo;too burdensome&amp;rdquo; in the September 2020 supplementary SLOOS; (ii) a dummy for banks with prior experience pledging loan collateral at the Fed&amp;rsquo;s discount window; and (iii) a dummy for banks with prior experience pledging securities collateral at the discount window. Firm×quarter fixed effects absorb time-varying credit demand at the borrower level (Khwaja-Mian design), and bank×borrower fixed effects further control for relationship-specific lending patterns.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings — Extensive Margin (Large Business Loans).&lt;/strong&gt; In the Y-14Q H1 data, MSLP banks were 30–32% more likely to renew existing loans than non-MSLP banks in 2020:Q3, with the probability of renewal 1.6–1.7 percentage points higher (against a sample average renewal rate of 5.3%). New loan originations were 22–27% more likely at MSLP banks, or 1.1–1.4 percentage points higher (against a sample average origination rate of 5.1%). 2SLS estimates are similar in magnitude to OLS, indicating selection bias is modest.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings — Extensive Margin (Small Business Loans and Survey Data).&lt;/strong&gt; In the A9 small business segment data, MSLP lenders had 17.3% more small business loan accounts outstanding in 2020:Q3 than non-MSLP banks. In SLOOS microdata, MSLP banks were approximately 15 percentage points less likely to report tightening C&amp;amp;I lending standards in 2020:Q3 (conditional on demand controls), compared to an actual tightening rate of 37.5%. This effect is larger for small (more financially constrained) firms (16–17 percentage points) than for large firms (13–14 percentage points).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings — Intensive Margin.&lt;/strong&gt; On loan terms, MSLP banks charged spreads that were approximately 9 basis points lower on renewed/originated C&amp;amp;I loans in the Y-14Q data, and 13.5 basis points lower in the Dealscan syndicated loan sample, compared to non-MSLP banks in 2020:Q3. 2SLS estimates are somewhat larger (19–30 bps). In the Dealscan sample, MSLP banks also extended syndicated loans that were 11.2% larger (about $2.4 million more given a $22 million average loan size). Survey data confirm MSLP banks were less likely to tighten most individual loan terms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Aggregate Magnitude.&lt;/strong&gt; The authors estimate that, in the absence of the MSLP, total loan renewals and originations at Y-14Q reporting banks in 2020:Q3 would have been approximately 10% lower. Scaling to the broader banking sector, the estimated credit spillover effect is approximately $44.8 billion in C&amp;amp;I lending—nearly three times the $16.05 billion in direct MSLP loan purchases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanism.&lt;/strong&gt; Survey and objective evidence both point to reduced risk aversion as the primary channel, rather than immediate balance sheet constraint relief. MSLP banks were significantly less likely to cite &amp;ldquo;reduced tolerance for risk&amp;rdquo; as a reason for tightening lending standards after the program&amp;rsquo;s introduction, while showing no differential propensity to cite capital or liquidity deterioration. Banks with higher risk management index scores (more risk-averse institutions) exhibited larger spillover effects on two of three lending margins. Indicators of immediate balance sheet tightness (excess capital cushions, cost of capital, core deposit reliance) do not predict larger spillovers, with a partial exception for lower excess capital and higher loan loss reserves — suggesting future rather than current balance sheet constraints may have played some role.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions and Robustness.&lt;/strong&gt; The backstop mechanism is explicitly tied to the program&amp;rsquo;s credibility period: the spillover effects are smaller in 2020:Q4, consistent with the Treasury&amp;rsquo;s November 19, 2020 announcement that the program would not be extended, which diminished its backstop role. Placebo regressions using 2018 and 2019 data find no differential lending behavior between MSLP and non-MSLP banks before the program, supporting parallel trends. Results are robust to controls for PPP participation, credit line drawdown exposure, loan loss provisioning, and bank-level loan portfolio cyclicality.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-precisely-is-the-spillover-effect-that-the-paper-measures-and-how-does-it-differ-from-the-direct-effect-of-the-mslp"&gt;Q1. What precisely is the &amp;ldquo;spillover effect&amp;rdquo; that the paper measures, and how does it differ from the direct effect of the MSLP?&lt;/h3&gt;
&lt;p&gt;A: The direct effect is the $16.05 billion in MSLP loans purchased by the SPV — credit extended specifically through the program. The spillover effect refers to changes in banks&amp;rsquo; general C&amp;amp;I lending behavior outside the program: renewals and originations of non-MSLP loans, changes in lending standards and terms for all business borrowers, and changes in small business loan volumes. The sample in the Y-14Q regression explicitly excludes MSLP loans themselves, so the estimates reflect only the indirect, broader credit effects.&lt;/p&gt;
&lt;h3 id="q2-what-instruments-does-the-paper-use-for-mslp-participation-and-why-are-they-plausibly-exogenous"&gt;Q2. What instruments does the paper use for MSLP participation, and why are they plausibly exogenous?&lt;/h3&gt;
&lt;p&gt;A: Three IVs are employed: (1) a dummy for banks that cited program registration as &amp;ldquo;too burdensome&amp;rdquo; as a very important reason for not joining (from the September 2020 supplementary SLOOS); (2) a dummy for banks that pledged loan collateral at the Fed&amp;rsquo;s discount window in December 2019; and (3) a dummy for banks that pledged securities collateral at the discount window in the same period. The exclusion restriction argument is that (1) reflects banks&amp;rsquo; administrative capacity and prior Fed engagement rather than underlying balance sheet strength or lending appetite, and that (2) and (3) reflect familiarity with Fed collateral processes in ways that made a loan-based program easier to understand and join — without independently affecting lending standards or volumes in 2020:Q3.&lt;/p&gt;
&lt;h3 id="q3-how-large-are-the-spillover-effects-on-the-extensive-margin-of-large-corporate-lending"&gt;Q3. How large are the spillover effects on the extensive margin of large corporate lending?&lt;/h3&gt;
&lt;p&gt;A: In the Y-14Q H1 data across 32 BHCs, MSLP banks renewed loans 1.6–1.7 percentage points more frequently and originated new loans 1.1–1.4 percentage points more frequently in 2020:Q3, relative to non-MSLP banks. Against sample averages of 5.3% renewal rate and 5.1% origination rate, these translate to MSLP banks being 30–32% more likely to renew and 22–27% more likely to originate loans. The 2SLS estimates are broadly similar in magnitude, suggesting that self-selection bias in OLS is limited.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-estimated-aggregate-dollar-spillovers-from-the-mslp"&gt;Q4. What are the estimated aggregate dollar spillovers from the MSLP?&lt;/h3&gt;
&lt;p&gt;A: The paper calculates that, in the absence of the program, total loan renewals and originations at Y-14Q H1 MSLP banks in 2020:Q3 would have been lower by approximately $33.6 billion (derived from 44,274 bank-borrower pairs × 1.38 existing loans per pair × 3.06 percentage points of extra loan activity × $17.98 million average loan size). Scaling to all Y-14Q banks (MSLP and non-MSLP alike), the shortfall would represent roughly a 10% reduction in total 2020:Q3 loan renewals and originations. Extrapolating to the full banking sector (since Y-14Q banks cover about 75% of total C&amp;amp;I lending), and assuming similar spillover magnitudes for banks outside the sample, total MSLP spillovers amount to roughly $44.8 billion — approximately three times the $16.05 billion in direct MSLP loan purchases.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-estimated-effect-on-ci-lending-standards-using-survey-data"&gt;Q5. What is the estimated effect on C&amp;amp;I lending standards using survey data?&lt;/h3&gt;
&lt;p&gt;A: Using SLOOS microdata, the paper estimates that MSLP banks were approximately 15 percentage points less likely to tighten C&amp;amp;I lending standards in 2020:Q3 compared to non-MSLP banks, after controlling for demand conditions. The actual tightening rate in 2020:Q3 was 37.5%, meaning the counterfactual tightening rate absent the program would have been approximately 5 percentage points higher. In a further hypothetical where all SLOOS sample banks had participated, the counterfactual tightening rate would have been nearly 10 percentage points higher than actual.&lt;/p&gt;
&lt;h3 id="q6-are-spillover-effects-larger-for-small-or-large-borrowers-and-what-does-this-imply"&gt;Q6. Are spillover effects larger for small or large borrowers, and what does this imply?&lt;/h3&gt;
&lt;p&gt;A: The SLOOS-based estimates show that MSLP banks were 16–17 percentage points less likely to tighten lending standards for small firms (annual sales below $50 million), compared to 13–14 percentage points less likely for large and middle-market firms — a statistically significant difference. The authors interpret this as consistent with the MSLP reducing risk aversion broadly, with the largest effect on borrowers facing greater credit constraints where uncertainty about creditworthiness was highest.&lt;/p&gt;
&lt;h3 id="q7-what-evidence-supports-the-risk-aversion-psychological-backstop-mechanism-over-the-balance-sheet-constraint-mechanism"&gt;Q7. What evidence supports the risk aversion (psychological backstop) mechanism over the balance sheet constraint mechanism?&lt;/h3&gt;
&lt;p&gt;A: From SLOOS data, MSLP banks were significantly less likely (at the 1% level) to cite &amp;ldquo;reduced tolerance for risk&amp;rdquo; as a reason for tightening lending standards after the program&amp;rsquo;s introduction, while showing no differential likelihood of citing deteriorating capital or liquidity positions as reasons. Furthermore, splitting banks by the risk management index (RMI), the spillover effects are stronger for high-RMI (more risk-averse) banks on two of three lending outcomes. Conversely, proxies for immediate balance sheet constraints — excess capital cushions, core deposit ratios, equity issuance, and cost of capital — do not yield consistently stronger spillover effects for more constrained banks. The only partial exception is lower excess capital and higher loan loss reserves, which are associated with more loan renewals, suggesting future rather than current balance sheet constraints may have contributed.&lt;/p&gt;
&lt;h3 id="q8-what-is-the-risk-management-index-rmi-and-how-is-it-used-here"&gt;Q8. What is the risk management index (RMI), and how is it used here?&lt;/h3&gt;
&lt;p&gt;A: The RMI is an index developed by Ellul and Yerramilli (2013) that captures the strength of a bank&amp;rsquo;s internal risk management function, constructed from variables including whether the bank has a chief risk officer (CRO), the CRO&amp;rsquo;s executive status and relative compensation, risk committee member experience, and meeting frequency. Available for 61 BHCs over 2011–2013, it is matched to 16 BHCs in the Y-14Q H1 sample and used as a pre-COVID proxy for institutional risk aversion. Banks above the median RMI show larger MSLP spillover effects on loan renewals and tightening standards, consistent with the interpretation that the MSLP reduced effective risk aversion more for banks that had higher baseline risk-consciousness.&lt;/p&gt;
&lt;h3 id="q9-how-do-the-authors-address-the-concern-that-ppp-participation--not-mslp-participation--might-drive-the-results"&gt;Q9. How do the authors address the concern that PPP participation — not MSLP participation — might drive the results?&lt;/h3&gt;
&lt;p&gt;A: First, they test directly that MSLP participation does not predict outstanding PPP/federally-guaranteed loan balances (in Q2 or Q3 2020) in the A9 loan segment data, finding no correlation. Second, they add an interaction of PPP loan balances (divided by total assets) × Post to the baseline regression in Table A10 and find that while PPP lending is positively associated with loan renewals and originations, the MSLP bank × Post coefficient remains statistically significant and similar in magnitude to the baseline, ruling out PPP participation as the driver of the baseline results.&lt;/p&gt;
&lt;h3 id="q10-what-explains-the-low-take-up-of-the-mslp-despite-its-large-designed-capacity"&gt;Q10. What explains the low take-up of the MSLP despite its large designed capacity?&lt;/h3&gt;
&lt;p&gt;A: Survey responses from the September 2020 supplementary SLOOS indicate several demand- and supply-side constraints: banks reported they could generally meet credit demand outside the program; borrower leverage limits (capped at 4–6× EBITDA depending on facility) were seen as too restrictive; the LIBOR plus 300 bps interest rate was high relative to historical pricing for eligible firms; and registration and loss-sharing arrangements were viewed as burdensome and uncertain. The paper interprets these findings as consistent with banks treating the MSLP primarily as a backstop — a facility they would activate only if economic conditions deteriorated significantly — rather than a primary lending channel.&lt;/p&gt;
&lt;h3 id="q11-how-does-the-paper-address-the-threat-that-mslp-participation-reflects-bank-level-cyclicality-in-loan-portfolios"&gt;Q11. How does the paper address the threat that MSLP participation reflects bank-level cyclicality in loan portfolios?&lt;/h3&gt;
&lt;p&gt;A: Table 10 controls for bank-specific C&amp;amp;I loan portfolio cyclicality, measured as the correlation between each bank&amp;rsquo;s C&amp;amp;I loan growth and aggregate banking-sector C&amp;amp;I loan growth estimated over 1985:Q1–2021:Q2 using two functional forms. The MSLP bank × Post coefficient estimates remain very similar to the baseline after including these controls, ruling out the concern that MSLP participants were simply banks with naturally more procyclical or countercyclical lending patterns.&lt;/p&gt;
&lt;h3 id="q12-what-happens-to-the-estimated-spillover-effects-in-2020q4-and-what-does-this-reveal"&gt;Q12. What happens to the estimated spillover effects in 2020:Q4, and what does this reveal?&lt;/h3&gt;
&lt;p&gt;A: The paper shows (Table A6) that extending the sample to include 2020:Q4 yields somewhat smaller estimated spillover effects than in the baseline 2020:Q3 period. The authors attribute this to the November 19, 2020 announcement by Treasury Secretary Mnuchin that the MSLP would not be extended beyond year-end, which effectively ended the program&amp;rsquo;s backstop role and — consistent with the psychological backstop mechanism — reduced banks&amp;rsquo; confidence in the program&amp;rsquo;s future availability and thus the spillover motivation.&lt;/p&gt;
&lt;h3 id="q13-does-the-paper-find-spillover-effects-on-intensive-margin-loan-terms-and-how-large-are-they"&gt;Q13. Does the paper find spillover effects on intensive margin loan terms, and how large are they?&lt;/h3&gt;
&lt;p&gt;A: On loan spreads, MSLP banks charged approximately 9 basis points lower spreads on floating-rate C&amp;amp;I loans renewed or originated in 2020:Q3 in the Y-14Q data (2SLS: 19 bps), and approximately 13.5 bps lower spreads in the Dealscan syndicated loan sample (2SLS: 30 bps). The 9 bps OLS estimate implies the average spread across all LIBOR-indexed C&amp;amp;I loans in 2020:Q3 would have been approximately 4 bps higher absent the program (i.e., 0.43 × 9 bps), relative to an actual average spread of 235 bps — an effect the authors characterize as economically small. On loan size, the Dealscan evidence indicates MSLP banks extended syndicated loans that were 11.2% larger (2SLS: 25% larger).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Credit Spillover Effects:&lt;/strong&gt; As used in this paper, spillover effects refer to the impact of MSLP participation on participating banks&amp;rsquo; lending behavior &lt;em&gt;outside and beyond&lt;/em&gt; the program itself — specifically, changes in loan renewal rates, new loan origination rates, lending standards, and loan terms for non-MSLP C&amp;amp;I loans. This is distinct from the direct effect (i.e., loans originated through the MSLP proper).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Psychological Backstop:&lt;/strong&gt; The paper&amp;rsquo;s term for the mechanism by which the MSLP reduced participating banks&amp;rsquo; effective risk aversion without necessarily easing their immediate balance sheet constraints. By committing to provide lending support if conditions deteriorated, the program built banks&amp;rsquo; confidence to lend ex ante, functioning as &amp;ldquo;insurance&amp;rdquo; against bad outcomes rather than a direct funding facility. The mechanism is distinguished from balance sheet easing by the fact that constrained and unconstrained banks exhibited similar spillover effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Extensive Margin of Lending:&lt;/strong&gt; The binary dimension of lending activity — specifically, whether a bank renews an existing loan or originates a new loan within a bank-borrower pair. In this paper, measured as the share of existing loan commitments within each bank-borrower pair that are renewed or newly originated each quarter. Contrasted with the intensive margin.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intensive Margin of Lending:&lt;/strong&gt; The quantitative dimension of existing lending relationships — specifically, the average loan size and average spread on loans renewed or originated in a given period, conditional on a loan being extended.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Senior Loan Officer Opinion Survey (SLOOS):&lt;/strong&gt; A quarterly Federal Reserve survey of senior lending officers at large U.S. banks covering self-reported changes in C&amp;amp;I lending standards, terms (including spreads, maximum loan size, maturity, covenants, collateral requirements), demand conditions, and — in supplementary editions — reasons for changing standards. Used in this paper both as an outcome variable (tightening standards) and as a control variable (changes in loan demand) and as a source of IV variation (burden of MSLP registration).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Risk Management Index (RMI):&lt;/strong&gt; An index developed by Ellul and Yerramilli (2013) measuring the strength of a bank&amp;rsquo;s internal risk management function, combining information on the presence and compensation of a chief risk officer, risk committee composition, and meeting frequency. Used in this paper as a pre-pandemic proxy for institutional risk aversion to test whether the MSLP disproportionately reduced risk aversion in banks with stronger risk controls.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Difference-in-Differences with Granular Fixed Effects:&lt;/strong&gt; The primary identification strategy, comparing changes in lending outcomes between MSLP-participating and non-participating banks before (2020:Q1–Q2) and after (2020:Q3) program implementation. The paper uses firm×quarter fixed effects following Khwaja and Mian (2008) to absorb borrower-level credit demand, and bank×borrower fixed effects following Chodorow-Reich (2013) to absorb relationship-specific supply factors — isolating the bank credit supply effect attributable to MSLP participation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Originate-and-Distribute Feature (of MSLP):&lt;/strong&gt; The MSLP&amp;rsquo;s design in which banks originate MSLP loans but sell 95% of the credit exposure to the SPV, retaining only 5%. This feature was intended to free up balance sheet capacity for further lending. The paper tests whether this channel (easing current balance sheet constraints) explains the observed spillovers, finding limited support relative to the risk aversion reduction channel.&lt;/p&gt;</description></item><item><title>Multinational production and global shock propagation during the great recession</title><link>https://macropaperwarehouse.com/papers/multinational-production-and-global-shock-propagation-during-the-great-recession/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/multinational-production-and-global-shock-propagation-during-the-great-recession/</guid><description>&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Summary blocked:&lt;/strong&gt; No freely available working paper or full text was found. Metadata record only; paper suppressed from site until a working paper is located.&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>Mussa Puzzle Redux</title><link>https://macropaperwarehouse.com/papers/mussa-puzzle-redux/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/mussa-puzzle-redux/</guid><description>&lt;p&gt;The Mussa (1986) puzzle is the empirical observation of a sharp, simultaneous increase in the volatility of both nominal and real exchange rates following the end of the Bretton Woods fixed exchange rate system in 1973 — a fact commonly interpreted as evidence for monetary non-neutrality. This paper resolves the puzzle by developing a model in which the dominant driver of nominal exchange rate fluctuations is a &amp;ldquo;financial shock&amp;rdquo; — a shock to the international demand for a country&amp;rsquo;s assets that is orthogonal to goods market fundamentals. Under a fixed rate, the central bank offsets financial shocks through reserve intervention, preventing them from moving the exchange rate; under a float, financial shocks freely move the nominal and real exchange rate simultaneously. The same framework also reconciles the Meese-Rogoff disconnect (exchange rates are unpredictable from macro fundamentals), the Backus-Smith puzzle, and the forward premium puzzle within a single unified model, with the financial shock accounting for the dominant share of exchange rate variance in each case.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-financial-shock-and-how-does-it-differ-from-standard-macro-shocks"&gt;Q1. What is the financial shock and how does it differ from standard macro shocks?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The financial shock is an orthogonal disturbance to international portfolio demand — the preference of foreign investors for holding domestic versus foreign assets — that is disconnected from productivity, monetary policy, and goods-market conditions.&lt;/strong&gt; Because it is uncorrelated with macro fundamentals, it generates exchange rate movements without corresponding movements in output, prices, or interest rate differentials, producing the observed disconnect between exchange rates and macro variables.&lt;/p&gt;
&lt;h3 id="q2-why-does-the-mussa-pattern-arise-from-regime-switching"&gt;Q2. Why does the Mussa pattern arise from regime switching?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Under a fixed rate, the central bank absorbs financial shocks via reserve intervention, sterilizing their exchange rate effects; the real exchange rate is equally insulated because the nominal rate is fixed and prices adjust slowly. Under a float, the same financial shocks freely move the nominal exchange rate, and with sticky prices this passes through to the real exchange rate.&lt;/strong&gt; The variance of the real exchange rate therefore jumps discontinuously at the regime switch, matching the sharp Mussa empirical finding without requiring any change in the shock process.&lt;/p&gt;
&lt;h3 id="q3-how-unified-is-the-resolution-across-exchange-rate-puzzles"&gt;Q3. How unified is the resolution across exchange rate puzzles?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A single model with the financial shock, sticky prices, and a standard asset pricing kernel simultaneously matches the Mussa pattern (regime-switching real volatility), the Meese-Rogoff disconnect (exchange rates unpredictable from fundamentals), the Backus-Smith puzzle (exchange rates and relative consumption uncorrelated), and the forward premium puzzle (high-interest-rate currencies appreciate).&lt;/strong&gt; The financial shock accounts for the majority of exchange rate variance in each application.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Mussa puzzle&lt;/strong&gt; : the discrete jump in real exchange rate volatility at the Bretton Woods breakdown (1973); resolved in this paper as the change in the central bank&amp;rsquo;s absorption of financial shocks between fixed and floating regimes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;financial shock&lt;/strong&gt; : a disturbance to international portfolio demand orthogonal to goods-market fundamentals; the paper&amp;rsquo;s key mechanism for exchange rate disconnect, the Mussa pattern, and several other exchange rate puzzles.&lt;/p&gt;</description></item><item><title>Oil price fluctuations, US banks, and macroprudential policy</title><link>https://macropaperwarehouse.com/papers/oil-price-fluctuations-us-banks-and-macroprudential-policy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/oil-price-fluctuations-us-banks-and-macroprudential-policy/</guid><description>&lt;p&gt;This paper estimates the effect of oil price fluctuations on US banking variables using a Bayesian SVAR with sign restrictions following Baumeister and Hamilton (2019). Oil market shocks that lead to a contraction in world economic activity are found to unambiguously lower the amount of bank credit to the US economy, tend to decrease US banks&amp;rsquo; net worth, and tend to increase the US credit spread. The effects can be strong and long-lasting or more modest and short-lived, depending on the source of the oil price fluctuation. The effects are found to be stronger for smaller and lower-leveraged banks.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-empirical-strategy"&gt;Q1. What is the empirical strategy?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper extends the state-of-the-art oil market SVAR of Baumeister and Hamilton (2019) to incorporate three US banking variables—banks&amp;rsquo; net worth, the US credit spread, and the amount of bank credit extended—estimated with monthly data over January 1974 through December 2019.&lt;/strong&gt; An agnostic approach is taken on sign restrictions for the US banking block: no restrictions are imposed on banking variables beyond those already imposed by Baumeister and Hamilton (2019) on the oil block, so the results for banking variables are driven primarily by data rather than prior restrictions. This extends earlier work that studied oil prices and credit spreads (Abbritti et al., 2020) or oil prices and stock markets (Kilian and Park, 2009) in isolation.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-main-finding-regarding-the-effect-of-oil-shocks-on-banks"&gt;Q2. What is the main finding regarding the effect of oil shocks on banks?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Oil market shocks that lead to a contraction in world economic activity are found to unambiguously lower the amount of bank credit to the US economy, tend to decrease US banks&amp;rsquo; net worth, and tend to increase the US credit spread.&lt;/strong&gt; &amp;ldquo;Unambiguously&amp;rdquo; reflects that the sign restrictions impose no prior on the direction of credit&amp;rsquo;s response, so the finding that credit falls is driven entirely by data. The paper is the first to characterize the effect of oil market shocks on banks&amp;rsquo; net worth and to estimate the credit effect within the SVAR framework.&lt;/p&gt;
&lt;h3 id="q3-how-do-the-effects-differ-by-the-source-of-oil-price-fluctuations"&gt;Q3. How do the effects differ by the source of oil price fluctuations?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The effects on banking variables can be strong and long-lasting or more modest and short-lived, depending on the underlying source of the oil price change—reflecting the SVAR framework&amp;rsquo;s decomposition of oil price movements into distinct structural shocks.&lt;/strong&gt; The distinction between oil supply shocks, demand shocks driven by global activity, and demand shocks driven by speculative factors implies that shocks of the same sign in the oil price may have different magnitudes and durations of effects on banks, consistent with Kilian (2009)&amp;rsquo;s decomposition.&lt;/p&gt;
&lt;h3 id="q4-which-banks-are-most-affected"&gt;Q4. Which banks are most affected?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The effects of oil market shocks on banking variables are found to be stronger for smaller and lower-leveraged banks.&lt;/strong&gt; Smaller banks may be more exposed to oil-related regional economic downturns through concentrated loan portfolios, while lower-leveraged banks may face different collateral and risk dynamics relative to more highly leveraged peers.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;oil market Bayesian SVAR&lt;/strong&gt; : a structural vector autoregression that uses a Bayesian prior over sign restrictions to identify oil supply shocks, oil demand shocks related to global real activity, and oil-specific demand shocks, following Baumeister and Hamilton (2019); extended here to include US banking variables.
&lt;strong&gt;credit spread&lt;/strong&gt; : the difference between yields on corporate bonds or loans and a risk-free reference rate; used as a measure of the credit risk premium and financial conditions in US credit markets.&lt;/p&gt;</description></item><item><title>On measuring the welfare cost of inflation</title><link>https://macropaperwarehouse.com/papers/on-measuring-the-welfare-cost-of-inflation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/on-measuring-the-welfare-cost-of-inflation/</guid><description>&lt;p&gt;Measuring the welfare cost of inflation requires specifying a money demand function, a definition of money, and an approach to consumer surplus; existing estimates vary widely because these choices are not standardized. This paper advances the literature by applying neoclassical monetary demand theory that integrates the demand for money with the demands for consumption and leisure, using the Normalized Quadratic (NQ) flexible functional form that avoids imposing specific elasticity assumptions. The main contribution is to extend the Serletis and Xu (2021, 2023) framework to derive Hicksian (compensating variation) money demand functions from the NQ model and compare welfare cost estimates based on these against estimates from the Marshallian (consumer surplus) approach—a comparison not previously made within this integrated demand-system framework. The paper uses U.S. CFS Divisia monetary aggregates across multiple levels of monetary aggregation and finds that the two approaches yield internally consistent but quantitatively different welfare cost estimates, with the Hicksian compensating variation approach providing theoretically preferred measures that are robust across specifications.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-neoclassical-demand-system-approach-and-how-does-it-differ-from-earlier-methods"&gt;Q1. What is the neoclassical demand system approach and how does it differ from earlier methods?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The Serletis-Xu framework integrates the demand for money with the demands for consumption goods and leisure in a joint utility maximization problem, estimating a flexible NQ functional form in a systems context rather than fitting a single-equation money demand specification.&lt;/strong&gt; Earlier approaches—such as the log-log specification (Lucas 2000) or semi-log specification (Ireland 2009)—estimate a single money demand equation under a maintained functional form assumption and a fixed interest elasticity (often −0.5 as in the Baumol-Tobin model). The NQ approach, derived from the dual demand system of Diewert (1974), makes no assumption about the functional form of money demand and allows demand interactions among consumption goods, leisure, and money (as recommended by Abbott and Ashenfelter 1976 and Barnett 1979), which is necessary for correct welfare measurement when money is consumed jointly with other goods.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-distinction-between-the-marshallian-and-hicksian-approaches-to-measuring-welfare-cost"&gt;Q2. What is the distinction between the Marshallian and Hicksian approaches to measuring welfare cost?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The Marshallian (Bailey 1956) approach measures the area under the inverse money demand curve between the zero-inflation and positive-inflation nominal interest rates, which corresponds to consumer surplus but does not hold utility constant.&lt;/strong&gt; The Hicksian (compensating variation) approach measures the income that must be given to the consumer to restore the same utility after the inflation increase as before—holding utility constant rather than income. The Hicksian approach is theoretically preferred because it measures the true welfare loss from inflation under standard consumer theory; the Marshallian approach can under- or over-estimate the true cost depending on income effects. The paper&amp;rsquo;s main contribution is to derive the Hicksian demands from the NQ model and compute the compensating variation, previously not done within this flexible-functional-form demand system framework.&lt;/p&gt;
&lt;h3 id="q3-what-role-do-divisia-monetary-aggregates-play"&gt;Q3. What role do Divisia monetary aggregates play?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper uses CFS (Center for Financial Stability) Divisia monetary aggregates—which aggregate monetary assets using economic quantity indices that weight components by their monetary service flows—rather than simple-sum aggregates such as M1 or M2.&lt;/strong&gt; Simple-sum aggregates treat all monetary assets as perfect substitutes regardless of yield differentials, introducing a substitution bias that misrepresents the quantity of monetary services; Divisia aggregates are theoretically consistent with the neoclassical demand system approach used here. The paper reports welfare cost estimates across multiple levels of monetary aggregation to assess sensitivity to the definition of money.&lt;/p&gt;
&lt;h3 id="q4-how-do-the-results-compare-with-the-prior-literature"&gt;Q4. How do the results compare with the prior literature?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper&amp;rsquo;s estimates, while internally consistent with the NQ flexible form and Divisia aggregates, are in the range of prior estimates in the literature; the Hicksian compensating variation estimates differ from Marshallian consumer surplus estimates in ways consistent with theory, providing a more theoretically grounded benchmark.&lt;/strong&gt; The wide range of estimates in the existing literature (discussed in the paper&amp;rsquo;s Table 1)—from the Lucas (2000) log-log model to the Ireland (2009) semi-log model—reflects sensitivity to functional form, money definition, data frequency, and methodology; the paper&amp;rsquo;s NQ framework addresses functional-form sensitivity while comparing the two surplus measures.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;compensating variation (Hicksian welfare cost of inflation)&lt;/strong&gt; : the income required to restore a consumer&amp;rsquo;s utility to its pre-inflation level after an inflation increase, holding utility constant; the paper&amp;rsquo;s main new estimate, derived from Hicksian money demand functions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Normalized Quadratic (NQ) flexible functional form&lt;/strong&gt; : a globally flexible functional form (Diewert and Wales 1988) used to approximate the consumer&amp;rsquo;s cost function without imposing restrictions on substitution elasticities; allows derivation of both Marshallian and Hicksian demand functions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Divisia monetary aggregates&lt;/strong&gt; : theoretically consistent monetary aggregates that weight monetary assets by their monetary service flows (user costs) rather than summing them with equal weights; CFS Divisia aggregates are used here as the measure of money.&lt;/p&gt;</description></item><item><title>On the Nature of Entrepreneurship</title><link>https://macropaperwarehouse.com/papers/on-the-nature-of-entrepreneurship/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/on-the-nature-of-entrepreneurship/</guid><description>&lt;p&gt;This paper uses a novel longitudinal administrative dataset drawn from U.S. Internal Revenue Service (IRS) and Social Security Administration (SSA) records to characterize income dynamics and the determinants of entrepreneurial entry for pass-through business owners — sole proprietors, partners, and S corporation owners — who collectively account for over 50 percent of all U.S. business net income. The sample covers 2000–2015 and includes up to 1.3 billion person-year observations for individuals aged 25–65. The authors construct balanced panels using birth cohorts 1950–1975, impute education (college attainment) and skill (cognitive, interpersonal, manual) via machine-learning classifiers trained on CPS and O*NET data, and estimate life-cycle income profiles using a three-component model that separates individual fixed effects, group-specific time effects, and group-cohort-specific age effects.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central departure from prior work is coverage of the full income distribution, including the high-earning right tail that household surveys such as the CPS misrepresent due to top-coding and small samples. When the IRS and CPS samples are compared on a consistent classification basis, median self-employment income is lower in the IRS data at all ages, consistent with the survey literature&amp;rsquo;s emphasis on the &amp;ldquo;typical&amp;rdquo; self-employed individual. However, mean incomes diverge sharply: the IRS shows mean self-employment income rising from $23 thousand at age 25 to $93 thousand at age 55, whereas the CPS (with incorporated owners reclassified) shows a rise from only $41 thousand to $73 thousand. Roughly 80 percent of self-employment income in the IRS data accrues to individuals above the $100 thousand threshold, compared to 42–53 percent in the CPS. The IRS-CPS gap is dominated by the right tail and concentrated in professional services and health care. For paid-employed individuals, the IRS and CPS medians and means are close at all ages, confirming the discrepancy is specific to self-employment.&lt;/p&gt;
&lt;p&gt;The life-cycle estimation finds that individuals who have &amp;ldquo;tried self-employment&amp;rdquo; — a group earning virtually all self-employment income — start at similar average incomes to primarily paid-employed peers at age 25 but reach $134 thousand by age 55, compared with $79 thousand for paid-employed peers with the same observable characteristics. Age effects for the self-employed are 63 percent higher than for the paid-employed at age 26 and remain elevated until age 55. Time effects show dramatically greater cyclical volatility for the self-employed: income growth declined by $9,655 (2008) and $8,785 (2009) for the self-employed versus $373 and $1,583 for paid-employed in the same years, concentrated in real estate and construction.&lt;/p&gt;
&lt;p&gt;On the determinants of entry, the paper finds: (i) no evidence that house-price appreciation raises entry rates, contra collateral-constraint hypotheses; (ii) most entrants have lower asset incomes than future entrants with the same characteristics, arguing against a liquid-wealth precondition; (iii) most entrants have higher prior labor income than future entrants, consistent with entry being driven by on-the-job experience rather than fallback from low-paid work; (iv) almost all founders report positive individual tax income in their first year of operation despite negative business net income and no external debt financing. Self-employed income growth exhibits greater dispersion — a 10th-to-90th percentile range roughly 2.5 times wider than for the paid-employed — and a Kelly skewness about 0.1 higher. A standard consumption-risk model calibrated with household-finance estimates of risk aversion rationalizes the patterns if individuals are insured against the most adverse downside shocks. Entry and exit rates are stable across the sample period, including the Great Recession, and the entrepreneurship share does not decline.&lt;/p&gt;
&lt;p&gt;The subgroup congruent with non-pecuniary motivation — primarily self-employed individuals earning less than paid-employed peers with matching characteristics — comprises roughly 57 percent of primarily self-employed by count but earns only 16 percent of total self-employment income.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-why-do-irs-and-cps-data-give-such-different-pictures-of-self-employment-income"&gt;Q1. Why do IRS and CPS data give such different pictures of self-employment income?&lt;/h3&gt;
&lt;p&gt;The CPS suffers from top-coding of high incomes and small samples that underrepresent high earners in key industries. The IRS-CPS mean income gap for the self-employed is dominated by the right tail: in the main IRS sample, individuals above the $100 thousand threshold earn roughly 80 percent of all self-employment income, versus 42 percent in the comparable CPS sample. The average income of top earners above $100 thousand is $355 thousand in the IRS versus $218 thousand in the CPS. The gap is concentrated in professional services and health care and persists across all income thresholds and sample definitions tested. No analogous discrepancy exists for paid-employed individuals, where IRS and CPS medians and means are close at all ages.&lt;/p&gt;
&lt;h3 id="q2-what-does-the-comparison-look-like-at-the-median-versus-the-mean"&gt;Q2. What does the comparison look like at the median versus the mean?&lt;/h3&gt;
&lt;p&gt;At the median, IRS self-employment income is lower than both CPS samples at all ages, with the gap largest for younger owners and those with incorporated businesses — a pattern consistent with the survey-based &amp;ldquo;self-employment discount&amp;rdquo; narrative. At the mean, the IRS shows much higher income at older ages: by age 55, IRS mean self-employment income is $93 thousand versus $73 thousand in the CPS sample that includes reclassified incorporated-owner wages. The divergence arises because the mean is sensitive to the right tail, which the CPS systematically underrepresents.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-paper-estimate-life-cycle-income-profiles-while-separating-age-time-and-cohort-effects"&gt;Q3. How does the paper estimate life-cycle income profiles while separating age, time, and cohort effects?&lt;/h3&gt;
&lt;p&gt;Individual income is decomposed into an individual fixed effect (permanent latent ability and preferences), a group-specific time effect (business-cycle fluctuations common to a group), and a group-cohort-specific age effect (life-cycle income growth). Identification exploits the overlapping cohort structure of the 16-year panel: age effects are assumed equal across cohort bins of size at least two, allowing time and age effects to be separately identified. The model is estimated in levels rather than logs to accommodate business losses. Groups are defined as a Cartesian product of 32,256 subgroups based on education, three skill dimensions, industry (21 two-digit NAICS codes), demographics (gender, cohort, marital status, children), and employment-status history.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-headline-life-cycle-income-profile-findings-for-self--versus-paid-employed"&gt;Q4. What are the headline life-cycle income profile findings for self- versus paid-employed?&lt;/h3&gt;
&lt;p&gt;Among the &amp;ldquo;primarily employed&amp;rdquo; group, those who have tried self-employment and those who are primarily paid-employed have similar average incomes at age 25. By age 55 the self-employed reach an estimated $134 thousand (2012 dollars) versus $79 thousand for paid-employed peers with identical observable characteristics. The estimated age effect for the self-employed is 63 percent higher than for the paid-employed at age 26 and remains higher through age 55. These gaps would widen further if incomes were adjusted upward for the BEA-estimated net misreporting rates of 46 percent for unincorporated owners and 14 percent for S corporation owners.&lt;/p&gt;
&lt;h3 id="q5-how-large-is-the-group-consistent-with-non-pecuniary-motivation-and-how-much-income-does-it-earn"&gt;Q5. How large is the group consistent with non-pecuniary motivation, and how much income does it earn?&lt;/h3&gt;
&lt;p&gt;The non-pecuniary subgroup — primarily self-employed individuals (at least 12 years in self-employment) who earn less on average than primarily paid-employed peers matched on gender, education, skills, and other characteristics — is numerically larger, comprising approximately 57 percent of primarily self-employed by count. However, this group earns only 16 percent of total self-employment income. Adjusting for paid-employed fringe benefits and self-employed income misreporting can change the group&amp;rsquo;s size but does not alter the finding that it accounts for a small income share. The paper concludes that non-pecuniary motives may guide occupational choice for many individuals but are not the driver of the typical dollar earned in self-employment.&lt;/p&gt;
&lt;h3 id="q6-how-does-idiosyncratic-income-risk-compare-between-self--and-paid-employed"&gt;Q6. How does idiosyncratic income risk compare between self- and paid-employed?&lt;/h3&gt;
&lt;p&gt;Self-employed income changes are substantially more dispersed: the 10th-to-90th percentile range of income growth is roughly 2.5 times wider for the self-employed than for the paid-employed. Income changes for the self-employed are also more right-skewed, with a Kelly skewness difference of approximately 0.1. When a standard consumption-risk model — augmented with a lower bound on consumption growth to allow for external insurance — is parameterized with risk-aversion estimates from the household finance literature, the observed patterns are rationalized if individuals are insured against the most adverse downside shocks, i.e., the attractive aspect of self-employment is large potential upside with insured downside.&lt;/p&gt;
&lt;h3 id="q7-what-happened-to-self-employed-income-and-exit-rates-during-the-great-recession"&gt;Q7. What happened to self-employed income and exit rates during the Great Recession?&lt;/h3&gt;
&lt;p&gt;Time effects show steep income growth declines for the self-employed of -$9,655 in 2008 and -$8,785 in 2009, compared with much more modest declines of -$373 and -$1,583 for paid-employed peers. The aggregate income declines are concentrated in cyclically sensitive self-employed subgroups in real estate and construction, with their paid-employed counterparts experiencing only modest declines. Despite these large income shocks, exit rates from self-employment showed little change during the Great Recession, either in aggregate or in the cyclically sensitive sectors. Entry rates were likewise stable, and the share of entrepreneurs in the population did not decline over the full sample period.&lt;/p&gt;
&lt;h3 id="q8-does-the-evidence-support-collateral-constraints-as-a-binding-barrier-to-entrepreneurial-entry"&gt;Q8. Does the evidence support collateral constraints as a binding barrier to entrepreneurial entry?&lt;/h3&gt;
&lt;p&gt;No. The paper tests the hypothesis, standard in the liquidity-constraints literature, that entry rates should be higher for homeowners experiencing house-price appreciation (which raises collateral value). The IRS data do not support this prediction. Separately, comparing asset incomes (interest, dividends, capital gains) of current entrants and future entrants with the same characteristics, the paper finds that most current entrants have lower asset incomes and less liquid wealth than those who switch later, which also argues against a liquid-wealth precondition for entry.&lt;/p&gt;
&lt;h3 id="q9-what-does-prior-labor-income-reveal-about-why-people-enter-self-employment"&gt;Q9. What does prior labor income reveal about why people enter self-employment?&lt;/h3&gt;
&lt;p&gt;Current entrants have higher prior labor income than matched future entrants with the same characteristics, indicating they enter with accumulated on-the-job experience rather than being pushed into self-employment as a fallback after failure in paid work. This is consistent with self-employment being a deliberate, experience-driven career transition for most entrants rather than a last resort for low earners. The paper interprets this as positive evidence for the role of experience-based human capital in driving entrepreneurial choice.&lt;/p&gt;
&lt;h3 id="q10-how-do-founders-finance-startup-costs-if-most-have-negative-business-net-income-in-early-years"&gt;Q10. How do founders finance startup costs if most have negative business net income in early years?&lt;/h3&gt;
&lt;p&gt;Almost all founders in the sample report positive income on their personal (individual) tax form in the first year of operation, even though most report negative business net income and carry no external debt financing. This pattern suggests founders rely on personal income sources — prior savings, part-time paid employment, or spousal income — to cover startup costs rather than external debt, implying that formal credit-market financing constraints are not the primary barrier to entry for most entrants in the sample.&lt;/p&gt;
&lt;h3 id="q11-what-are-the-scope-conditions-and-key-limitations"&gt;Q11. What are the scope conditions and key limitations?&lt;/h3&gt;
&lt;p&gt;The sample covers pass-through owners (sole proprietors, partners, S corporation owners) and excludes C corporation shareholders, whose entrepreneurial income does not flow to individual returns until distributed. Income measures exclude most employer fringe benefits; capital gains are excluded from self-employment income, and the authors note their inclusion would strengthen the main findings. The analysis covers 2000–2015 for cohorts born 1950–1975, and income is reported before taxes and transfers. Baseline estimates are not adjusted for misreporting, though BEA-implied adjustments of 46 percent for unincorporated owners and 14 percent for S corporation owners would widen the income gaps further.&lt;/p&gt;
&lt;p&gt;Pass-through business owner: An individual who owns a sole proprietorship, partnership, or S corporation, such that business net income flows directly onto the owner&amp;rsquo;s personal tax return; excludes C corporation shareholders whose income appears only upon dividend or capital-gains distributions.&lt;/p&gt;
&lt;p&gt;Tried self-employment: The paper&amp;rsquo;s primary self-employed comparison group within the &amp;ldquo;primarily employed&amp;rdquo; category — individuals with any years in self-employment (including frequent switchers and those with most years in self-employment) — who collectively earn virtually all self-employment income.&lt;/p&gt;
&lt;p&gt;Group-specific age effect: The paper&amp;rsquo;s estimate of how individual income changes with age within a defined subgroup (determined by education, skill, industry, demographics, and employment history), identified by exploiting overlapping birth cohorts in the 16-year panel and separated from individual fixed effects and business-cycle time effects.&lt;/p&gt;
&lt;p&gt;Primarily employed: Individuals with at least 12 of 16 sample years in either self- or paid-employment, with at most one intermediate year of non-employment; the paper&amp;rsquo;s main analytical focus for life-cycle income comparisons.&lt;/p&gt;
&lt;p&gt;SOI Databank: The Statistics of Income Databank, a de-identified balanced panel combining SSA demographic records with IRS tax filing data for all living U.S. individuals with a Social Security number over 1996–2015; the paper&amp;rsquo;s primary data source providing Schedule C, K-1, W-2, and related filing information.&lt;/p&gt;
&lt;p&gt;Kelly skewness: A robust measure of distributional asymmetry used by the paper to characterize income growth; the paper reports that Kelly skewness of self-employed income changes exceeds that of paid-employed by approximately 0.1, indicating greater right-skewness in self-employment income dynamics.&lt;/p&gt;
&lt;p&gt;Non-pecuniary motivation subgroup: Primarily self-employed individuals who earn less on average than primarily paid-employed peers matched on observable characteristics, taken by the paper as consistent with non-wage job amenities (autonomy, flexibility) driving occupational choice; found to be 57 percent of primarily self-employed by count but earning only 16 percent of total self-employment income.&lt;/p&gt;</description></item><item><title>Optimal Fiscal Policy with Heterogeneous Agents and Capital: Overturning Chamley-Judd</title><link>https://macropaperwarehouse.com/papers/optimal-fiscal-policy-with-heterogeneous-agents-and-capital-overturning-chamley-judd/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/optimal-fiscal-policy-with-heterogeneous-agents-and-capital-overturning-chamley-judd/</guid><description>&lt;p&gt;The Chamley-Judd result (1986) states that the optimal long-run capital income tax rate is zero in representative-agent models. This paper shows that introducing heterogeneous agents — specifically, agents with uninsurable idiosyncratic income risk who use precautionary saving — overturns this result. When agents differ in their wealth and income realizations, a capital income tax serves as a form of insurance that representative-agent models cannot provide. The paper derives a tractable analytical characterization of the optimal capital tax in an Aiyagari-type heterogeneous-agent model and finds that the optimal rate lies in the range of 10–30 percent at the steady state — strictly positive, in direct contradiction to Chamley-Judd. The magnitude of the optimal tax depends on the degree of idiosyncratic risk and the availability of alternative redistribution instruments: when other redistributive tools are limited, the optimal capital tax is higher.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-why-does-heterogeneity-overturn-chamley-judd"&gt;Q1. Why does heterogeneity overturn Chamley-Judd?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In representative-agent models, all agents hold the same capital stock, so a capital tax distorts intertemporal decisions identically and the Ramsey planner finds it optimal to zero out the distortion in the long run. With heterogeneous agents and uninsurable risk, the capital tax has an additional insurance role: taxing capital income and redistributing it reduces consumption variance across agents, generating welfare gains that outweigh the intertemporal distortion costs.&lt;/strong&gt; The insurance benefit makes the optimal tax positive at the steady state because the tax-and-redistribute mechanism provides risk-sharing that incomplete markets cannot.&lt;/p&gt;
&lt;h3 id="q2-how-tractable-is-the-analytical-result"&gt;Q2. How tractable is the analytical result?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper derives closed-form expressions for the optimal tax rate as a function of the degree of idiosyncratic risk, the wealth distribution&amp;rsquo;s spread, and the available redistribution instruments, enabling comparative statics that go beyond what purely computational approaches provide.&lt;/strong&gt; This tractability distinguishes the result from earlier numerical work that demonstrated positive optimal capital taxes without clear analytical structure.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-quantitative-magnitude"&gt;Q3. What is the quantitative magnitude?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The optimal steady-state capital income tax is in the range of 10–30 percent, substantially above zero but well below confiscatory rates, in the paper&amp;rsquo;s benchmark calibration matched to U.S. income and wealth inequality.&lt;/strong&gt; The range reflects the sensitivity to available redistribution instruments: the lower bound applies when the government has a rich set of redistribution tools, the upper bound when capital taxation is the only available instrument.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Chamley-Judd result&lt;/strong&gt; : the proposition that the optimal long-run capital income tax is zero in representative-agent Ramsey taxation models; overturned in this paper once heterogeneous agents with uninsurable risk are introduced.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;insurance role of capital taxation&lt;/strong&gt; : the mechanism by which a capital income tax reduces consumption inequality in a heterogeneous-agent economy, generating welfare gains that outweigh the intertemporal distortion costs and making the optimal capital tax positive.&lt;/p&gt;</description></item><item><title>Optimal monetary policy with uncertain private sector foresight</title><link>https://macropaperwarehouse.com/papers/optimal-monetary-policy-with-uncertain-private-sector-foresight/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/optimal-monetary-policy-with-uncertain-private-sector-foresight/</guid><description>&lt;p&gt;Central banks must set policy under uncertainty about how private-sector expectations form, which changes how monetary policy transmits to output and inflation. This paper studies optimal time-consistent monetary policy using a New Keynesian finite-horizon planning (NK-FHP) model in which households and firms have limited foresight: they solve structural problems only up to a finite horizon, and update their beliefs about longer-run inflation by averaging over past data. In this setting—unlike in standard New Keynesian models—an &amp;ldquo;inflation scares&amp;rdquo; problem can arise: agents&amp;rsquo; longer-run inflation expectations can deviate persistently from the central bank&amp;rsquo;s target, generating costly and prolonged disinflations. The authors formally characterize optimal policy when the planning horizons of private-sector agents are uncertain and a risk of inflation scares is present, showing that risk-management considerations modify the standard &amp;ldquo;leaning against the wind&amp;rdquo; principle with a novel preemptive motive: the optimal policy responds more aggressively to the risk of unanchoring to prevent inflation scares from materializing. An estimated version of the model is used to quantify how much this preemptive motive mattered during the post-pandemic inflation surge.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-nk-fhp-model-and-how-does-it-differ-from-standard-new-keynesian-models"&gt;Q1. What is the NK-FHP model and how does it differ from standard New Keynesian models?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In the NK-FHP model, households and firms are boundedly rational because they evaluate state-contingent paths only up to a finite horizon; beyond that horizon they extrapolate longer-run beliefs by averaging over past data, making those beliefs adaptive rather than anchored at the target.&lt;/strong&gt; When agents have long planning horizons, the model approximates rational expectations: inflation expectations are well anchored, disinflations are relatively costless, and policy transmits quickly. When planning horizons are short, longer-run inflation expectations can become unanchored, disinflations are costly, and policy transmission lags lengthen. The NK-FHP model thus nests both extremes and provides micro-foundations for the &amp;ldquo;inflation scares&amp;rdquo; discussed by Goodfriend (1993).&lt;/p&gt;
&lt;h3 id="q2-what-is-the-inflation-scares-problem-and-why-does-it-matter-for-optimal-policy"&gt;Q2. What is the &amp;ldquo;inflation scares problem&amp;rdquo; and why does it matter for optimal policy?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;An inflation scare occurs when agents&amp;rsquo; longer-run inflation expectations deviate persistently from the central bank&amp;rsquo;s target because agents with short planning horizons update beliefs adaptively, and past high inflation feeds forward into current expectations.&lt;/strong&gt; This creates a welfare-relevant asymmetry: once expectations become unanchored, a disinflation is costly in output because the central bank must build credibility against backward-looking expectations. The standard NK model with rational expectations does not generate this problem—rational agents&amp;rsquo; inflation expectations are pinned to the target irrespective of history—so it cannot address the design of policy specifically to prevent scares from materializing.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-preemptive-motive-and-how-does-it-modify-the-leaning-against-the-wind-principle"&gt;Q3. What is the &amp;ldquo;preemptive motive&amp;rdquo; and how does it modify the leaning-against-the-wind principle?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Optimal time-consistent policy under uncertain private-sector foresight adds a preemptive motive to the standard leaning-against-the-wind (LATW) principle: the central bank contracts demand not just in response to current output-gap and inflation deviations, but also to prevent expectations from becoming unanchored.&lt;/strong&gt; Under the standard NK LATW result (Clarida, Galí, and Gertler 1999), the policymaker responds to the means of output and inflation. Under uncertain and potentially short-horizon foresight, optimal policy also depends on the distribution of output and inflation, as well as agents&amp;rsquo; beliefs about future inflation—specifically, whether those beliefs risk drifting away from target. The preemptive motive implies a more aggressive policy response to the risk of an inflation scare even before the scare has fully materialized.&lt;/p&gt;
&lt;h3 id="q4-how-does-the-paper-relate-to-the-post-pandemic-inflation-experience"&gt;Q4. How does the paper relate to the post-pandemic inflation experience?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Using parameter estimates from an estimated version of the NK-FHP model, the paper applies the optimal policy framework to quantify how much the preemptive motive matters during the recent post-pandemic inflation surge.&lt;/strong&gt; The model—which has been shown in related work (Gust, Herbst, and López-Salido 2022, 2024) to fit macroeconomic time series substantially better than hybrid NK models and to account for initial underreaction and subsequent overreaction of inflation forecasts—is well suited to analyze an episode where longer-run inflation expectations initially remained anchored but later showed signs of drift. The paper&amp;rsquo;s quantification indicates that risk-management considerations, including the preemptive motive, significantly affect the optimal policy path.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;finite-horizon planning (NK-FHP)&lt;/strong&gt; : a bounded-rationality framework (Woodford 2018) in which agents evaluate only those state-contingent paths within a finite planning horizon, updating beliefs about events beyond the horizon adaptively from past data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;inflation scares&lt;/strong&gt; : episodes (Goodfriend 1993) in which agents&amp;rsquo; longer-run inflation expectations deviate persistently from the central bank&amp;rsquo;s target, making disinflation costly; the NK-FHP model provides micro-foundations for such scares.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;preemptive motive&lt;/strong&gt; : the additional incentive for a policymaker to tighten beyond what current output-gap and inflation deviations alone would prescribe, specifically to prevent longer-run inflation expectations from becoming unanchored.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;time-consistent policy under uncertainty&lt;/strong&gt; : optimal policy that does not rely on commitment and hence accounts for future re-optimization; in this model it must also account for the non-additive uncertainty arising from a distribution of planning horizons.&lt;/p&gt;</description></item><item><title>Optimal Resilience in Multitier Supply Chains</title><link>https://macropaperwarehouse.com/papers/optimal-resilience-in-multitier-supply-chains/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/optimal-resilience-in-multitier-supply-chains/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Grossman, Helpman, and Sabal ask what market failures arise in vertical supply chains with multiple production tiers, limited (non-anonymous) supply networks, arms-length transactions, and recurrent risks of disruption at every node. They then ask what government policies would be required to implement the socially efficient (first-best) allocation as a decentralized equilibrium, and — in a second-best environment where subsidies to firm-to-firm transactions are politically infeasible — how optimal policies to promote resilience and network formation differ.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper develops a general-equilibrium model of a closed economy with an arbitrary number S+1 of vertical production tiers (tier 0 through tier S). A finite measure of &amp;ldquo;lead&amp;rdquo; firms in tier S produce differentiated consumer goods under monopolistic competition using labor and a CES bundle of intermediate inputs from tier S-1 suppliers. Firms in each intermediate tier combine labor and inputs from the tier above using a Cobb-Douglas production function. Tier 0 firms produce from labor alone.&lt;/p&gt;
&lt;p&gt;Every firm faces an independent, non-zero probability of a catastrophic disruption (complete inability to produce). Firms may invest labor up front to moderate this risk — endogenous &amp;ldquo;resilience&amp;rdquo; — or may invest to forge relationships with a larger fraction of potential suppliers in the next upstream tier — endogenous &amp;ldquo;network thickness.&amp;rdquo; Each formed relationship costs k units of labor.&lt;/p&gt;
&lt;p&gt;After disruption shocks are realized, surviving firms negotiate quantities and payments bilaterally. Bargaining is sequential (beginning with lead firms negotiating with tier S-1, then tier S-1 with tier S-2, and so on to tier 0), and within each round is governed by Nash-in-Nash equilibrium (Horn and Wolinsky, 1988): each firm takes as given the outcomes of its negotiations with all other partners. The Nash surplus is split with exogenous bargaining weight β_s for the downstream buyer in the s-to-s−1 negotiation.&lt;/p&gt;
&lt;p&gt;The paper solves the planner&amp;rsquo;s direct-control problem and then characterizes the three sets of policy instruments needed to decentralize the first best: subsidies to input transactions between adjacent tiers, subsidies to investments in resilience (agility), and subsidies to network formation (redundancy). It then solves the second-best problem in which transaction subsidies are constrained to zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Transaction subsidies.&lt;/em&gt; In the competitive bargaining equilibrium, each pair of firms undervalues input transactions because the upstream firm anticipates paying a marked-up price when it bargains with its own suppliers. This cascading distortion means the private marginal cost of producing a tier-s good exceeds the social marginal cost. The optimal first-best transaction subsidy on sales by tier s firms (τ*&lt;em&gt;s) equals [γ_s + (1−γ_s)μ&lt;/em&gt;{s−1}]^{−1}, where γ_s is the labor share in tier s production and μ_{s−1} is the endogenous markup factor from bargaining at the s-to-s−1 interface. This subsidy depends only on production function parameters and bargaining weights at the immediately adjacent tier. No subsidy is needed at tier 0 (the most upstream tier), and no subsidy is applied to final-good sales. Under Assumption 1 — inputs become weakly less substitutable as goods proceed downstream — the optimal purchase subsidies rise monotonically as one moves downstream along the supply chain.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Resilience subsidies (first best).&lt;/em&gt; Two offsetting forces govern the optimal subsidy to resilience investments θ*&lt;em&gt;s at intermediate tiers: (i) firms capture only the fraction (1−β&lt;/em&gt;{s+1}) of the joint surplus that their resilience creates for downstream customers, creating underinvestment; (ii) optimal transaction subsidies inflate private profitability, creating a countervailing overinvestment incentive. The net optimal first-best subsidy for intermediate-tier firms is θ*&lt;em&gt;s = (1−β&lt;/em&gt;{s+1}) / τ*_s. This formula depends only on technological and bargaining parameters of tier s and the tier immediately adjacent; it does not depend on conditions elsewhere in the chain. When production parameters and bargaining weights are uniform across tiers, the first-best resilience subsidy is the same at every interior tier. If goods become strictly less substitutable downstream, the first-best subsidy for resilience declines monotonically as one moves downstream, and may turn into an optimal tax for middle tiers where the transaction subsidy is large enough to over-incentivize resilience investment. The first-best resilience subsidy always applies at both extreme ends of the chain.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Network formation subsidies (first best).&lt;/em&gt; Despite firms&amp;rsquo; private incentive to manipulate their number of upstream suppliers to improve bargaining position, the net strategic effect of network formation in general equilibrium exactly cancels the off-equilibrium spillovers to non-partners. As a result, the optimal first-best policy toward network formation at every tier is identical to the optimal policy toward resilience investment.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Second-best policies.&lt;/em&gt; When transaction subsidies are unavailable, uncorrected markups downstream from tier s depress demand for tier-s output, reducing profitability and incentives to invest in resilience below the first-best level. Second-best optimal subsidies for resilience and network formation therefore reflect production function parameters and bargaining weights throughout the entire downstream supply chain, not just at the immediately adjacent tier. Specifically, when buyer bargaining weights are non-increasing along the chain (β_{s+1} ≤ β_s for all s), the second-best subsidy to resilience falls monotonically as one moves downstream. This is the opposite pattern from what might be inferred from the first-best analysis when transaction subsidies are available: with non-increasing bargaining weights, second-best subsidies are larger for upstream producers than for downstream producers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results are derived for a closed economy. Welfare is measured by the CES utility of the representative consumer over differentiated final goods. The sequential bargaining structure assumes contracts are written after disruption shocks are realized. Assumption 1 (σ_1 ≥ σ_2 ≥ … ≥ σ_S &amp;gt; ε, where σ_s is the elasticity of substitution between inputs at tier s and ε is the demand elasticity for final goods) is maintained for sharper monotonicity results on the structure of optimal subsidies across tiers.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-precise-structure-of-the-supply-chain-in-the-model-and-why-does-the-bargaining-take-place-sequentially-rather-than-simultaneously-across-all-tiers"&gt;Q1. What is the precise structure of the supply chain in the model, and why does the bargaining take place sequentially rather than simultaneously across all tiers?&lt;/h3&gt;
&lt;p&gt;A: The economy has S+1 tiers. Tier 0 firms use only labor; tier s firms (s = 1,…,S−1) use labor and a CES bundle of tier s−1 inputs with elasticity of substitution σ_s &amp;gt; 1; tier S firms produce final differentiated goods using labor and tier S−1 inputs under Cobb-Douglas technology. Sequential bargaining is imposed because the vast number of simultaneous negotiations across all tiers makes a grand coalition impractical. The timing is that lead firms (tier S) first negotiate input quantities and payments with their tier S−1 suppliers; those suppliers, now contractually obligated to their downstream customers, then negotiate with tier S−2, and so on up the chain until tier 1 firms contract with tier 0 suppliers.&lt;/p&gt;
&lt;h3 id="q2-how-is-the-markup-factor-defined-and-what-parameters-determine-it"&gt;Q2. How is the markup factor defined, and what parameters determine it?&lt;/h3&gt;
&lt;p&gt;A: The markup factor μ_s is the ratio of the payment per unit made by tier s+1 firms to the production cost of tier s firms. It equals μ_s = (1−β_{s+1}) · [σ_{s+1}/(σ_{s+1}−1)] + β_{s+1}, where β_{s+1} is the exogenous bargaining weight of the downstream (tier s+1) buyer. When the downstream firm has all bargaining power (β_{s+1} = 1), the markup equals unity (competitive outcome). When the upstream firm has all bargaining power (β_{s+1} = 0), the markup equals the standard monopoly markup σ_{s+1}/(σ_{s+1}−1). For intermediate bargaining weights, the markup is a weighted average. The markup enters the optimal transaction subsidy formula by inflating the private marginal cost of producing tier-s inputs above the social marginal cost.&lt;/p&gt;
&lt;h3 id="q3-why-are-no-subsidies-needed-for-the-most-upstream-tier-0-transactions-or-for-final-good-sales"&gt;Q3. Why are no subsidies needed for the most upstream (tier 0) transactions or for final-good sales?&lt;/h3&gt;
&lt;p&gt;A: For tier 0 transactions: when tier 0 and tier 1 firms bargain, the negotiations occur last sequentially and so do not affect any prior agreements. There are no downstream cascading markup effects — tier 0 firms produce from labor alone, so their private marginal cost equals their social marginal cost. The joint surplus maximization by the pair thus aligns with the planner&amp;rsquo;s objective, yielding τ*_0 = 1 (no intervention needed). For final-good sales: final producers do mark up above marginal cost under monopolistic competition, but all varieties are symmetric, so the markup affects all goods equally and does not distort relative consumption choices. Hence τ*_S = 1.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-two-offsetting-forces-that-determine-the-optimal-first-best-subsidy-to-resilience-investments-at-an-intermediate-tier"&gt;Q4. What are the two offsetting forces that determine the optimal first-best subsidy to resilience investments at an intermediate tier?&lt;/h3&gt;
&lt;p&gt;A: First, a firm in tier s captures only the fraction (1−β_{s+1}) of the joint surplus that its survival creates for its downstream customers (the rest is appropriated through bargaining by those customers), leading to underinvestment relative to the social optimum. Second, the optimal transaction subsidy τ*_s &amp;lt; 1 raises the private profitability of firms in tier s above its social value, because public finances bear part of the cost of their input purchases. This inflated private profitability encourages resilience investment beyond what the planner desires. The net optimal policy is θ*&lt;em&gt;s = (1−β&lt;/em&gt;{s+1}) / τ*_s, which may be a subsidy (θ*_s &amp;lt; 1) or a tax (θ*_s &amp;gt; 1) depending on which force dominates.&lt;/p&gt;
&lt;h3 id="q5-why-does-the-first-best-subsidy-for-resilience-at-an-intermediate-tier-depend-only-on-local-parameters-at-tier-s-and-its-immediate-neighbors-even-though-resilience-investments-generate-spillovers-to-firms-throughout-the-network"&gt;Q5. Why does the first-best subsidy for resilience at an intermediate tier depend only on local parameters (at tier s and its immediate neighbors), even though resilience investments generate spillovers to firms throughout the network?&lt;/h3&gt;
&lt;p&gt;A: When optimal transaction subsidies are in place at all tiers, a firm&amp;rsquo;s value becomes independent of the joint surplus in sales that occur between firms in tiers other than its own. That is, the positive spillovers to all firms farther upstream and downstream in a firm&amp;rsquo;s own network are exactly offset by the negative spillovers to firms in rival networks (including rival firms in the same tier). What remains after this general-equilibrium cancellation is only the benefit to the firm&amp;rsquo;s immediate downstream customers and the wedge created by the transaction subsidy. This result implies that the formula θ*&lt;em&gt;s = (1−β&lt;/em&gt;{s+1}) / τ*_s does not involve conditions at tiers other than s and s−1.&lt;/p&gt;
&lt;h3 id="q6-why-does-the-optimal-policy-for-network-formation-supplier-link-investment-equal-the-optimal-policy-for-resilience-investment-despite-the-fact-that-network-formation-also-strategically-improves-a-firms-bargaining-position"&gt;Q6. Why does the optimal policy for network formation (supplier link investment) equal the optimal policy for resilience investment, despite the fact that network formation also strategically improves a firm&amp;rsquo;s bargaining position?&lt;/h3&gt;
&lt;p&gt;A: Firms in intermediate tiers do have a private incentive to form additional supplier links specifically to improve their bargaining position vis-à-vis their upstream suppliers (by improving their outside options) and vis-à-vis their downstream customers (by the same mechanism). However, the authors show by comparing the firm&amp;rsquo;s first-order condition for link formation with the planner&amp;rsquo;s first-order condition that this strategic motivation exactly balances the offsetting general-equilibrium effects from rival firms doing the same. After this cancellation, the residual wedge between private and social incentives for network formation is identical to that for resilience investment. Hence #&lt;em&gt;_s = θ&lt;/em&gt;_s for all tiers.&lt;/p&gt;
&lt;h3 id="q7-how-do-second-best-policies-differ-from-first-best-policies-in-terms-of-both-the-magnitude-of-subsidies-and-the-information-required-to-set-them"&gt;Q7. How do second-best policies differ from first-best policies in terms of both the magnitude of subsidies and the information required to set them?&lt;/h3&gt;
&lt;p&gt;A: In the first best, the subsidy for resilience at tier s depends only on the bargaining weight β_{s+1} and the markup factor μ_{s−1} — parameters relevant to tier s and its immediate neighbors. In the second best, when transaction subsidies are unavailable, the optimal resilience subsidy at tier s is θ†&lt;em&gt;s = J^{−1} · [1 − (cumulative distortion of all downstream tiers)] · (1−β&lt;/em&gt;{s+1}), where J captures aggregate labor-market effects of all markups throughout the chain. This formula requires knowledge of production function parameters (labor shares γ_j, markups μ_j, elasticities σ_j) for every tier j downstream from s. The second-best subsidy may be larger or smaller than the first-best subsidy; it is more likely to exceed the first-best subsidy for upstream tiers, where the cumulative downstream distortions (uncorrected markups contracting demand) produce a larger shortfall in private profitability and hence a larger underinvestment in resilience.&lt;/p&gt;
&lt;h3 id="q8-under-what-condition-do-second-best-subsidies-fall-monotonically-as-one-moves-downstream-and-how-does-this-compare-to-the-first-best-pattern"&gt;Q8. Under what condition do second-best subsidies fall monotonically as one moves downstream, and how does this compare to the first-best pattern?&lt;/h3&gt;
&lt;p&gt;A: The ratio of second-best subsidies at adjacent tiers (θ†_{s−1} / θ†&lt;em&gt;s) equals [(1−β_s) / (1−β&lt;/em&gt;{s+1})] · [τ*&lt;em&gt;s]^{−1}, where τ*&lt;em&gt;s is the first-best transaction subsidy. If buyer bargaining weights are non-increasing along the chain — β&lt;/em&gt;{s+1} ≤ β_s for all s — then (1−β_s) ≤ (1−β&lt;/em&gt;{s+1}) and, combined with τ*&lt;em&gt;s ≤ 1, the second-best subsidy is larger upstream than downstream (θ†&lt;/em&gt;{s−1} ≥ θ†_s). This contrasts with the first-best policy: when parameters are uniform across tiers, first-best resilience subsidies are the same at every interior tier, while second-best subsidies are strictly larger upstream than downstream.&lt;/p&gt;
&lt;h3 id="q9-what-role-does-assumption-1-elasticities-of-substitution-non-increasing-as-goods-move-downstream-play-in-the-results"&gt;Q9. What role does Assumption 1 (elasticities of substitution non-increasing as goods move downstream) play in the results?&lt;/h3&gt;
&lt;p&gt;A: Assumption 1 (σ_1 ≥ σ_2 ≥ … ≥ σ_S &amp;gt; ε) ensures that the operating profit function ~v_s(η) is concave in a firm&amp;rsquo;s network size η, which in turn ensures interior solutions to the network formation problem. It also delivers sharper monotonicity results: under this assumption, if other production parameters and bargaining weights are similar across tiers, the optimal purchase subsidies rise monotonically downstream, and the optimal first-best resilience subsidies decline monotonically downstream (potentially turning into taxes at some interior tiers). The assumption reflects the realistic view that inputs become more differentiated and specialized as they approach the final consumer good.&lt;/p&gt;
&lt;h3 id="q10-what-are-the-limitations-the-authors-identify-regarding-their-model-and-what-extensions-do-they-suggest"&gt;Q10. What are the limitations the authors identify regarding their model, and what extensions do they suggest?&lt;/h3&gt;
&lt;p&gt;A: Three main limitations are identified. First, the model assumes bargaining occurs after disruption shocks are realized, ruling out contingent contracts. Pre-disruption bargaining with contingent payments could mitigate double-marginalization inefficiencies and help internalize resilience externalities, though complex network-wide contingent contracts would likely be needed for full efficiency even in the second-best environment. Second, the model assumes symmetric firms within each tier, so downstream firms cannot sort on upstream firms&amp;rsquo; observable resilience levels; if observable differences existed, downstream firms could seek out more reliable partners, partially internalizing the resilience externality. Third, the model covers only a closed economy with idiosyncratic (uncorrelated) shocks. Extensions to global supply chains, correlated (geographic) shocks, cross-country differences in wages and technologies, and optimal cooperative versus unilateral policy are identified as important directions for future research.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Resilience (agility):&lt;/strong&gt; In the paper&amp;rsquo;s usage, a firm&amp;rsquo;s endogenous investment in reducing the probability of a catastrophic disruption to its own operations. A firm in tier s hires r_s units of labor up front, which raises its survival probability φ_s(r_s), with φ&amp;rsquo;_s &amp;gt; 0 and φ&amp;rsquo;&amp;rsquo;_s &amp;lt; 0. Resilience is a relationship-specific investment in the sense that its payoff is realized only conditional on the firm surviving and then trading with its downstream customers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Network thickness (redundancy):&lt;/strong&gt; The fraction η_s of firms in the next upstream tier with whom a firm in tier s forms a supply relationship prior to the disruption shock. Forming k units of labor per link creates a thicker network that hedges against supplier disruption, increases input variety (and thus CES productivity), and improves bargaining positions vis-à-vis both upstream suppliers and downstream customers. Distinct from resilience: resilience reduces the firm&amp;rsquo;s own probability of disruption; network thickness provides substitutability across suppliers should some fail.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Markup factor (μ_s):&lt;/strong&gt; The ratio of the per-unit payment made by tier s+1 firms to the production cost of tier s firms, as determined by Nash bargaining. Specifically, μ_s = (1−β_{s+1}) · [σ_{s+1}/(σ_{s+1}−1)] + β_{s+1}. The markup distorts private marginal costs above social marginal costs, causing underinvestment in transactions between firms and, transitively, in resilience and network formation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nash-in-Nash equilibrium:&lt;/strong&gt; The bargaining solution concept used in the paper (following Horn and Wolinsky, 1988). Each pair of firms negotiates as if all other bilateral negotiations involving either party proceed at their equilibrium outcomes, both on and off the equilibrium path. This is the appropriate equilibrium concept when grand coalitions across all firms and all tiers are impractical.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sequential bargaining:&lt;/strong&gt; The specific timing structure in which negotiations proceed from the most downstream tier (lead firms bargaining with tier S−1 suppliers) sequentially upstream until tier 1 firms bargain with tier 0 suppliers. Each tier of firms, at the time they bargain with their own suppliers, are already contractually obligated to deliver specified quantities to their downstream customers. This obligation anchors the downstream firm&amp;rsquo;s outside option in any given bilateral negotiation.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;First-best transaction subsidy (τ&lt;/em&gt;_s):&lt;/em&gt;* The fraction of the cost of a tier-s input that, under the optimal policy, the downstream (tier s+1) buyer must pay. Equals [γ_s + (1−γ_s) · μ_{s−1}]^{−1} &amp;lt; 1 for all intermediate tiers, i.e., it is always a subsidy. Designed to align private marginal cost in the bilateral negotiation with the social marginal cost by offsetting the distortion introduced by anticipated markups on the upstream firm&amp;rsquo;s own inputs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Second-best subsidy:&lt;/strong&gt; The optimal policy toward resilience and network formation when subsidizing firm-to-firm transactions is infeasible (constrained to τ_s = 1 for all s). Unlike first-best subsidies — which depend only on local tier parameters — second-best subsidies depend on production function parameters and bargaining weights throughout the entire downstream supply chain due to the uncorrected cumulative markup distortions.&lt;/p&gt;</description></item><item><title>Optimal Taxation and Market Power</title><link>https://macropaperwarehouse.com/papers/optimal-taxation-and-market-power/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/optimal-taxation-and-market-power/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;This paper asks whether and how optimal income taxation should change when firms have market power. The question is motivated by the documented rise in economy-wide markups since 1980, which has compressed the labor share, widened the gap between worker and entrepreneurial income, and generated allocative inefficiency through excessive pricing.&lt;/p&gt;
&lt;p&gt;The authors develop a Mirrleesian optimal taxation framework augmented with three features absent from the canonical literature: (i) oligopolistic intermediate goods markets with endogenous, variable markups, (ii) heterogeneous firm productivities, and (iii) two occupational groups—wage-earning workers and profit-earning entrepreneurs—whose abilities are private information. Entrepreneurs strategically set prices under Cournot competition, which means that the tax system affects profits both through a firm&amp;rsquo;s own behavior and through the responses of its competitors. This strategic interaction is the critical novelty relative to prior work that assumes monopolistic competition.&lt;/p&gt;
&lt;p&gt;The main theoretical contribution is the derivation of optimal tax formulas for both labor income and profit income that decompose into four named components: (i) the Mirrleesian incentive component, which reflects the standard trade-off between redistribution and labor supply distortions; (ii) the Pigouvian component, which corrects for the externality from market power by subsidizing labor and entrepreneurial effort to offset the output shortfall from high markups; (iii) the Reallocation Effect (RE), which shifts the profit tax to redirect labor inputs from low-markup firms to high-markup firms where labor is inefficiently scarce, and which emerges only under heterogeneous markups; and (iv) the Indirect Redistribution Effect (IRE), which uses changes in competitors&amp;rsquo; product prices—a channel present only under oligopolistic (not monopolistic) competition—to redistribute income between entrepreneurs.&lt;/p&gt;
&lt;p&gt;For the labor income tax, the dominant force is the Pigouvian component. As average markups rise, the Pigouvian subsidy to labor supply grows, mechanically reducing optimal labor income tax rates. The profit tax is shaped by all four components in opposing directions; the net quantitative effect is resolved empirically.&lt;/p&gt;
&lt;p&gt;The model is calibrated to match distributions of labor income (from the Current Population Survey), profits (from Compustat-based data in De Loecker, Eeckhout, and Unger 2020), and firm-level markups (also from De Loecker, Eeckhout, and Unger 2020, using the cost-minimization approach) for the US in 1980 and 2019. The cost-weighted average markup rose from 1.25 in 1980 to 1.33 in 2019, with the increase concentrated at the top of the markup distribution.&lt;/p&gt;
&lt;p&gt;The central quantitative prescription is that the optimal labor income tax rate should decline by 7.7 percentage points between 1980 and 2019 (average optimal rate falls from 22.0 percent to 14.3 percent), while the optimal profit tax rate should rise by 2.2 percentage points on average (from 58.4 percent to 60.5 percent) and by 29.1 percentage points at the top. The decline in the labor income tax is driven primarily by the rise in average markups reducing the Pigouvian component. The increase in the profit tax, especially at the top, is driven primarily by the Mirrleesian component operating through the skill gap, which rises because higher markups reduce profit elasticity. The Pigouvian and reallocation components push in the opposite direction on the profit tax, but the Mirrleesian effect dominates.&lt;/p&gt;
&lt;p&gt;The optimal profit tax structure is regressive for large, high-markup firms—reflecting the RE, which requires lower tax rates for high-markup firms to incentivize labor reallocation toward them—but less regressive in 2019 than in 1980, reflecting the distributional tightening from rising markup inequality.&lt;/p&gt;
&lt;p&gt;Robustness checks across parameter values for the social welfare curvature k, the span of control ξ, and the elasticity of substitution σ confirm that the directional results hold: labor income tax rates decrease and profit tax rates increase from 1980 to 2019 across all parameter configurations. Extensions to nonlinear sales taxes and conditioning on markups confirm that even when the planner can observe markups directly, the first-best is not achievable because markups are endogenous to entrepreneurs&amp;rsquo; unobservable decisions.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-fundamental-difference-between-this-papers-model-and-prior-work-on-optimal-taxation-with-market-power"&gt;Q1. What is the fundamental difference between this paper&amp;rsquo;s model and prior work on optimal taxation with market power?&lt;/h3&gt;
&lt;p&gt;Prior work using monopolistic competition (e.g., Gürer 2021; Boar and Midrigan 2019) assumes each entrepreneur holds monopoly power in its own market, so no strategic interaction exists between firms. Under monopolistic competition, entrepreneurs price to maximize utility given competitors&amp;rsquo; choices, and the envelope theorem implies that tax changes have no first-order effect on prices or utility through the pricing channel—the Indirect Redistribution Effect (IRE) disappears. In this paper, entrepreneurs compete in Cournot oligopolistic markets with a finite number of firms I, so each firm&amp;rsquo;s pricing depends on competitors&amp;rsquo; output. A change in one firm&amp;rsquo;s output (induced by taxation) shifts competitors&amp;rsquo; prices, opening a redistribution channel through product markets that is entirely absent in monopolistic competition. Additionally, the Reallocation Effect (RE) emerges only when firm-level markups are heterogeneous, which requires oligopolistic (not perfectly competitive) markets.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-four-components-of-the-optimal-tax-formula-and-how-does-each-relate-to-market-power"&gt;Q2. What are the four components of the optimal tax formula and how does each relate to market power?&lt;/h3&gt;
&lt;p&gt;The optimal tax wedge for both labor and profit income decomposes into four components. First, the Mirrleesian component reflects the standard trade-off between redistribution and the efficiency cost of taxation; in the presence of market power, it is modified because the skill gap for entrepreneurs depends on markups through the profit elasticity. Second, the Pigouvian component corrects the externality from market power, which causes prices to exceed marginal cost and output to be inefficiently low; it implies a subsidy to both worker and entrepreneurial effort, scaled by the reciprocal of the average markup (for the labor tax) or firm-level markup (for the profit tax). Third, the Reallocation Effect (RE) applies only to the profit tax and reflects that labor should be shifted toward high-markup firms where it is inefficiently underemployed; it reduces the tax rate for firms whose markup exceeds the average. Fourth, the Indirect Redistribution Effect (IRE) captures redistribution through competitor price changes under oligopolistic interaction; it can either raise or lower the profit tax rate depending on the distribution of social welfare weights and the cross-inverse demand elasticity.&lt;/p&gt;
&lt;h3 id="q3-what-happens-to-the-labor-income-tax-formula-as-average-markups-rise"&gt;Q3. What happens to the labor income tax formula as average markups rise?&lt;/h3&gt;
&lt;p&gt;The labor income tax formula contains a Pigouvian component equal to the reciprocal of the employment-weighted average markup. As average markups rise, this reciprocal falls, reducing the optimal labor income tax rate. Quantitatively, the optimal average labor income tax rate declines from 22.0 percent in 1980 to 14.3 percent in 2019, a decrease of 7.7 percentage points. In a purely competitive benchmark economy, the top labor income tax rate would be around 60 percent (consistent with Saez 2001); in the calibrated model with market power, it is 34.2 percent in 1980 and 28.7 percent in 2019. The Pigouvian component accounts for essentially the entire difference because the Mirrleesian component, when calibrated to the same labor income distribution, is unchanged.&lt;/p&gt;
&lt;h3 id="q4-how-does-the-mirrleesian-component-cause-the-top-profit-tax-rate-to-rise-with-market-power"&gt;Q4. How does the Mirrleesian component cause the top profit tax rate to rise with market power?&lt;/h3&gt;
&lt;p&gt;The Mirrleesian component of the profit tax is driven by the skill gap, defined as the proportional rate of change in the composite entrepreneur ability measure. The skill gap depends on markups through the profit elasticity: as markups rise, profit elasticity falls (since profit elasticity is approximately the reciprocal of markup minus the span-of-control parameter minus the inverse of the labor supply elasticity term), which increases the skill gap. A higher skill gap amplifies the income divergence across entrepreneur types, increasing the Mirrleesian incentive to redistribute at the top. Quantitatively, Figure 5 shows that the rise in the skill gap from 1980 to 2019 tracks almost exactly the change in the inverse of profit elasticity, confirming that markup changes—not changes in the ability distribution—are the primary driver of increased Mirrleesian pressure on top profit taxes.&lt;/p&gt;
&lt;h3 id="q5-how-does-the-reallocation-effect-influence-the-structure-progressivity-of-the-profit-tax"&gt;Q5. How does the Reallocation Effect influence the structure (progressivity) of the profit tax?&lt;/h3&gt;
&lt;p&gt;The RE term equals the ratio of the average markup to the firm-level markup minus one: RE(θe) = μ/μ(θe) − 1. For firms with markups above the average, RE is negative, reducing their optimal tax rate; for firms below the average, RE is positive, increasing it. This implies that the optimal profit tax should be regressive relative to markup (i.e., high-markup firms face lower marginal tax rates), even though the overall profit tax rises on average. This provides a novel rationale for why the profit tax schedule in practice is less progressive—or even regressive—for large firms. As markups rise across the distribution, the reallocation effect pushes down the top profit tax but does not offset the larger increase from the Mirrleesian component in the quantitative exercise.&lt;/p&gt;
&lt;h3 id="q6-what-is-the-indirect-redistribution-effect-and-why-does-it-disappear-under-monopolistic-competition"&gt;Q6. What is the Indirect Redistribution Effect and why does it disappear under monopolistic competition?&lt;/h3&gt;
&lt;p&gt;The IRE captures the change in entrepreneurial utility that arises because a tax reduction for one entrepreneur increases their output, which reduces the prices of substitute goods produced by competitors, thereby lowering competitors&amp;rsquo; incomes. Under oligopolistic competition with I &amp;gt; 1 firms per market, the cross-inverse demand elasticity is nonzero, so competitor prices are sensitive to any one firm&amp;rsquo;s output decision, and this redistribution channel is open. Under monopolistic competition (I = 1), each entrepreneur is the sole producer in its market; competitors&amp;rsquo; prices do not depend on the firm&amp;rsquo;s output, the cross-inverse demand elasticity is zero, and the IRE vanishes by the envelope theorem. The IRE is also absent in perfectly competitive economies. Empirical evidence for the US suggests the hazard ratio of profits is sufficiently high that the IRE generally pushes toward a lower top profit tax rate, but the Mirrleesian effect dominates in the quantitative results.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-quantitative-effect-of-rising-markups-on-the-optimal-tax-rates-and-what-drives-the-net-change-in-the-profit-tax"&gt;Q7. What is the quantitative effect of rising markups on the optimal tax rates, and what drives the net change in the profit tax?&lt;/h3&gt;
&lt;p&gt;The model calibrated to 1980 and 2019 US data prescribes a decline in the optimal average labor income tax rate of 7.7 percentage points (from 22.0 to 14.3 percent) and an increase in the optimal average profit tax rate of 2.2 percentage points (from 58.4 to 60.5 percent). At the top of the profit distribution, the increase is 29.1 percentage points. The net profit tax increase results from four opposing forces: the Pigouvian component falls (pushing toward lower taxes) and the RE decreases for high-markup firms (also pushing down the top rate), while the IRE and especially the Mirrleesian component rise (pushing up top rates). The Mirrleesian effect is the dominant force, driven by rising markup inequality reducing profit elasticity and widening the skill gap for top entrepreneurs.&lt;/p&gt;
&lt;h3 id="q8-how-does-the-counterfactual-analysis-isolate-the-role-of-markups-from-productivity-changes"&gt;Q8. How does the counterfactual analysis isolate the role of markups from productivity changes?&lt;/h3&gt;
&lt;p&gt;The counterfactual fixes the markup distribution at its 1980 level while holding the 2019 productivity distribution constant, then solves for optimal taxes. The result is that high-profit entrepreneurs would face lower optimal tax rates under 1980 markups than under 2019 markups, while low-profit entrepreneurs would face higher rates. Decomposing the difference, the Pigouvian component and the RE are larger for high incomes under 1980 (lower) markups, making the profit tax more regressive, while the IRE and the Mirrleesian component are smaller under 1980 markups, producing a lower top rate. The increase in the Mirrleesian component due to the markup increase from 1980 to 2019 is identified as the primary reason top profit taxes rise. This isolates the markup channel from the productivity channel in accounting for changes in optimal taxes.&lt;/p&gt;
&lt;h3 id="q9-what-does-the-robustness-analysis-reveal-about-parameter-sensitivity"&gt;Q9. What does the robustness analysis reveal about parameter sensitivity?&lt;/h3&gt;
&lt;p&gt;The main qualitative result—labor income taxes decline and profit taxes rise from 1980 to 2019—holds across a broad parameter space. The optimal profit tax rate is largely insensitive to the social welfare curvature parameter k: across k ∈ {0.77, 1, 3}, the average optimal profit tax rate is approximately 58 percent in 1980 and 61 percent in 2019. The optimal average labor income tax rate is more sensitive to k: for k = 0.7, 1, and 3, the 1980 rates are 20.3, 26.7, and 44.6 percent, and the 2019 rates are 12.5, 19.4, and 39.1 percent, respectively. Changes in the span-of-control parameter ξ and the substitution elasticity σ do not affect the labor income tax wedge schedule directly but do influence it indirectly through the markup distribution. The directional results are confirmed for all tested parameter configurations.&lt;/p&gt;
&lt;h3 id="q10-what-is-the-role-of-the-additivity-property-from-prior-externality-literature-and-why-does-it-fail-here"&gt;Q10. What is the role of the &amp;ldquo;additivity property&amp;rdquo; from prior externality literature, and why does it fail here?&lt;/h3&gt;
&lt;p&gt;The additivity property from the Pigouvian externality literature (see Kopczuk 2003; Sandmo 1975) states that the Pigouvian correction is separable from other components of the optimal tax formula, implying that rising markups would simply decrease the optimal tax rate (since 1/μ falls). This property holds under simplifying assumptions that abstract from the general equilibrium and incentive effects of market power. In the present model, the additivity property does not hold because markups enter all four components of the optimal tax formula—not just the Pigouvian term—through the skill gap (Mirrleesian component), the RE, and the IRE. As a result, rising markups can increase the optimal profit tax rate even though the Pigouvian component falls, because the skill gap and Mirrleesian force dominate.&lt;/p&gt;
&lt;h3 id="q11-can-the-government-attain-the-first-best-by-conditioning-taxes-on-markups"&gt;Q11. Can the government attain the first-best by conditioning taxes on markups?&lt;/h3&gt;
&lt;p&gt;No. The paper demonstrates that even if the planner can observe and condition taxes on firm-level markups, the first-best is not achievable. The reason is that markups are endogenous to the entrepreneurs&amp;rsquo; unobservable decisions: an entrepreneur&amp;rsquo;s markup depends on their privately known type and chosen output. When the planner designs a mechanism that conditions on markup, the incentive constraint facing entrepreneurs remains the same as in the benchmark model, because the promise-keeping constraints are independent of the entrepreneur&amp;rsquo;s true type when markups are observable. The optimal allocation with markup-conditioned taxes is shown to be equivalent to the second-best with nonlinear sales taxes, which still falls short of the first-best.&lt;/p&gt;
&lt;h3 id="q12-what-are-the-policy-implications-for-the-design-of-the-profit-tax-schedule"&gt;Q12. What are the policy implications for the design of the profit tax schedule?&lt;/h3&gt;
&lt;p&gt;The model yields three concrete prescriptions for the joint design of labor and profit income taxes in the context of rising market power. First, labor income taxes should be reduced and top profit taxes should be increased as market power rises. Second, for large, high-productivity firms the profit tax should be designed to be appropriately regressive to enhance allocative efficiency through the Reallocation Effect—this provides a new normative justification for why profit tax schedules observed in practice are often less progressive than labor income taxes. Third, while profit taxes should be regressive for large firms, the degree of regressivity should decrease as market power rises, reflecting the trade-off between efficiency and equality: higher markups increase the Mirrleesian pressure for redistribution at the top, reducing the optimal regressivity.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Mirrleesian component (of the optimal tax formula):&lt;/strong&gt; The standard incentive component of the optimal tax, capturing the trade-off between direct redistribution and the efficiency cost of taxation. In the presence of market power, this component is modified because the skill gap for entrepreneurs depends on markups through the profit elasticity: higher markups reduce profit elasticity, widen the skill gap, and amplify the Mirrleesian force toward higher top profit taxes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pigouvian component:&lt;/strong&gt; The correction in the optimal tax formula for the externality from market power. Because oligopolistic pricing causes output to be inefficiently low, the optimal tax subsidizes both worker and entrepreneurial labor supply. In the labor income tax formula, the Pigouvian component is the reciprocal of the employment-weighted average markup; in the profit tax formula, it is the reciprocal of the firm-level markup. As average markups rise, the Pigouvian component reduces the optimal labor income tax rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reallocation Effect (RE):&lt;/strong&gt; A component of the optimal profit tax formula that captures the efficiency gain from reallocating labor inputs from low-markup firms (where labor&amp;rsquo;s marginal product is high relative to value) to high-markup firms (where labor demand is inefficiently low). It equals the ratio of the average markup to the firm-level markup minus one. It implies a lower optimal marginal tax rate for firms with markups above the average, producing a regressive structure in the profit tax for large firms. This effect is absent under monopolistic competition (uniform markups) and in competitive markets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Indirect Redistribution Effect (IRE):&lt;/strong&gt; A component of the optimal profit tax formula specific to oligopolistic competition, capturing redistribution through competitor prices. Lowering the marginal tax rate of a high-productivity entrepreneur raises their output, which reduces the prices of substitutable goods produced by their competitors, thereby lowering competitors&amp;rsquo; incomes and redistributing toward workers who benefit from lower prices. This effect is present only when the cross-inverse demand elasticity is nonzero—i.e., only under oligopolistic (Cournot) competition with multiple firms per market—and vanishes under monopolistic competition and in the limit as the number of firms grows to infinity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Skill gap (for entrepreneurs):&lt;/strong&gt; The proportional rate of change in the composite entrepreneur ability measure with respect to entrepreneur type, analogous to the Mirrleesian skill gap for workers. Under market power, the entrepreneur skill gap depends on the markup through the profit elasticity: as firm-level markups rise, profit elasticity falls, the skill gap increases, and the income dispersion across entrepreneurs widens, which amplifies the Mirrleesian incentive to redistribute at the top and raises the optimal top profit tax rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Symmetric Cournot Competitive Tax Equilibrium (SCCTE):&lt;/strong&gt; The equilibrium concept used in the paper. It is a combination of a tax system, symmetric allocation, and symmetric price system such that all agents (final goods producer, entrepreneurs of each type, workers) are optimizing, strategic interaction in the intermediate goods market is a Cournot Nash equilibrium within each granular market, and all commodity and labor markets clear. Strategic interaction is restricted to within each granular market (firms in the same market compete), so decisions across markets are taken as given.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Composite ability:&lt;/strong&gt; A combined measure of entrepreneur productivity that determines equilibrium allocations and optimal taxation in the nested-CES economy. It aggregates the entrepreneur&amp;rsquo;s raw ability (affecting output capacity) and the demand parameter (affecting the market-level markup). The markup-relevant component and the quantity-relevant component are not perfect substitutes in the composite, since equilibrium prices depend on their specific composition while equilibrium quantities depend only on their combined value.&lt;/p&gt;</description></item><item><title>Outsourcing bank loan screening: The economics of third-party loan guarantees</title><link>https://macropaperwarehouse.com/papers/outsourcing-bank-loan-screening-the-economics-of-third-party-loan-guarantees/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/outsourcing-bank-loan-screening-the-economics-of-third-party-loan-guarantees/</guid><description>&lt;p&gt;Third-party loan guarantees—in which a fee-charging guarantor formally guarantees a bank loan and conducts its own due diligence on the borrower—are present in approximately 10% of Chinese bank loans and are required for most small and medium enterprise (SME) financing. This paper investigates their economic function using proprietary data from a large private loan guarantee firm and interviews with market participants. The paper systematically tests and rejects two leading alternative hypotheses: that guarantees circumvent interest rate caps via regulatory arbitrage (rejected because total loan payments never approach the cap in the data), and that guarantees primarily induce borrowers to self-select based on creditworthiness. The positive evidence points instead to a &amp;ldquo;second level of delegation of loan evaluation&amp;rdquo;: guarantors have private information about borrower quality beyond hard accounting data and collateral, screen bad loans effectively, and their pricing is consistent with the outsourced screening interpretation. This framework is analogous to Diamond&amp;rsquo;s (1984) delegated monitoring, but prior to loan origination rather than after.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-why-are-third-party-loan-guarantees-prevalent-in-chinas-sme-lending-market"&gt;Q1. Why are third-party loan guarantees prevalent in China&amp;rsquo;s SME lending market?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;During the paper&amp;rsquo;s sample period, third-party loan guarantees are essentially a prerequisite for most SME loans in China, arising because banks face high costs of evaluating small borrowers with limited collateral and opaque financials; guarantors specialize in gathering soft information through site visits and examination of borrower books.&lt;/strong&gt; The loan guarantee industry consists of a few large firms and many smaller ones; in exchange for a fee paid by the borrower and a pledge of collateral to the guarantor, the guarantor provides a formal credit guarantee to the lending bank. This is a transactional (not relationship-based) business.&lt;/p&gt;
&lt;h3 id="q2-how-do-the-data-reject-the-regulatory-arbitrage-and-self-selection-hypotheses"&gt;Q2. How do the data reject the regulatory arbitrage and self-selection hypotheses?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The regulatory arbitrage hypothesis (that guarantees allow total payments to exceed the interest rate cap) is rejected cleanly because none of the loans in the sample have a total payment—interest plus guarantee fee—at or near the interest rate cap.&lt;/strong&gt; The self-selection hypothesis (that guarantees induce only high-quality borrowers to apply, as in Thakor 1982) is rejected because: (i) only a small fraction of applications are accepted, suggesting the guarantor and bank do most of the selection rather than borrowers self-selecting; and (ii) the data show guarantors successfully screen out bad loans using private information, inconsistent with the borrower being the primary information-bearing party.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-positive-evidence-for-the-outsourced-screening-interpretation"&gt;Q3. What is the positive evidence for the outsourced screening interpretation?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The guarantor has information about borrowers beyond hard accounting data and collateral—gathered from site visits and business-record examinations—and this private information is reflected in its risk assessments, which predict loan performance independently.&lt;/strong&gt; Pricing of loans and guarantees in the data is consistent with a model in which the guarantor&amp;rsquo;s fee reflects its risk assessment (its private signal about borrower quality), and the bank&amp;rsquo;s interest rate reflects the residual risk after conditioning on the guarantor&amp;rsquo;s approval. A key empirical finding is that the correlation between bank loan rates and guarantor risk assessment is negative—when rates were higher in the economy, banks lent to safer credits, because high rates correlate with scarce credit in the sample—a pattern consistent with screening rather than adverse selection.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-broader-implications-for-sme-finance-and-financial-intermediation-theory"&gt;Q4. What are the broader implications for SME finance and financial intermediation theory?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper interprets third-party loan guarantees as a &amp;ldquo;second level of delegation of loan evaluation&amp;rdquo;—analogous to Diamond&amp;rsquo;s (1984) delegated monitoring (which occurs after loan origination) but occurring before—suggesting that the guarantor occupies a specialized information-gathering role that banks cannot efficiently internalize.&lt;/strong&gt; This outsourcing of screening is potentially a more efficient organizational form than either direct bank screening (if banks face higher per-borrower costs) or government guarantees (which lack the performance incentives of private guarantors). The contrast with the CDS market (where AIG-style guarantors performed no serious checking or hedging) underscores that private guarantors with proper incentives can perform meaningful screening.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;third-party loan guarantee&lt;/strong&gt; : a contractual arrangement in which a fee-charging private guarantor formally guarantees a bank loan and bears the credit risk if the borrower defaults, having conducted independent due diligence on the borrower; the paper shows this functions as outsourced screening.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;delegated screening (pre-loan)&lt;/strong&gt; : the paper&amp;rsquo;s interpretation of the guarantor&amp;rsquo;s role as a second layer of delegation of loan evaluation before origination, analogous to Diamond&amp;rsquo;s (1984) delegated monitoring after origination; the guarantor has a comparative advantage in gathering borrower-specific soft information.&lt;/p&gt;</description></item><item><title>Permanent Capital Losses after Banking Crises</title><link>https://macropaperwarehouse.com/papers/permanent-capital-losses-after-banking-crises/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/permanent-capital-losses-after-banking-crises/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper investigates two interrelated questions about historical banking crises: (1) whether bank losses during banking crises are primarily temporary or permanent in nature, and (2) whether policy interventions — particularly liquidity-based interventions — are effective at restoring bank capitalization after such crises. The paper positions these questions against a theoretical divide: models stressing temporary price dislocations (binding borrowing constraints, depositor fragility, information frictions) versus models in which crises reflect fundamental and permanent deterioration in the value of bank assets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors construct three new historical datasets spanning 46 economies from 1870 to 2019. The first is a country-level panel of annual and monthly bank and nonfinancial equity index total returns, building on Baron, Verner, and Xiong (2021). The second is an individual-bank-level dataset covering the ten largest banks per country across 17 economies (from Jordà, Schularick, and Taylor 2017), containing equity returns, balance sheet quantities, net income decomposed into write-downs and trading income, and equity issuance within ±5-year windows around each crisis. The third is a new database of the monthly starting dates of policy interventions — extraordinary central bank liquidity support, blanket liability guarantees, and government recapitalizations — extending the databases of Laeven and Valencia (2020) and Metrick and Schmelzing (2024).&lt;/p&gt;
&lt;p&gt;Bank equity crises are identified using a real-time, data-driven indicator requiring: (1) a greater than 30% annual decline in the bank equity index and (2) the failure of a top-20 bank within the country. This definition yields 76 bank equity crises, nearly all of which overlap with prior narrative-based chronologies (Reinhart-Rogoff, JST, Laeven-Valencia), and results are robust to all alternative crisis definitions examined.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Permanent losses.&lt;/em&gt; In the year of a bank equity crisis onset, bank equity experiences average abnormal returns of -68 log-points (or -49% in arithmetic terms), while nonfinancial equity falls by -36 log-points (-30%). Over the subsequent five years, bank equity does not earn elevated returns relative to the country&amp;rsquo;s unconditional average — point estimates are consistently negative, and significantly so in years three and four after crisis onset. Bank equity does not recover to its pre-crisis level. By contrast, nonfinancial equity earns cumulative abnormal returns of roughly 30 log-points (35% arithmetic) over five years, recovering to pre-crisis trend, consistent with a discount-rate-driven decline for nonfinancial firms.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Earnings-driven, not discount-rate-driven.&lt;/em&gt; Panel regressions at both the country and individual-bank level show coefficients of roughly 1 to 2 on the relationship between the initial bank equity return in the crisis year and the subsequent five-year change in real dividends and real earnings. The initial equity decline thus predicts a roughly commensurate long-run decline in banks&amp;rsquo; dividends and earnings, inconsistent with the temporary-loss view&amp;rsquo;s prediction of discount-rate-driven declines that should subsequently reverse.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Short-run bounce-backs are modest and transient.&lt;/em&gt; At the monthly frequency, bank equity does rebound modestly from its trough — the bounce-back averages only about 30% of the initial decline, even assuming perfect market timing. This gain partially reverses after approximately twelve months, so cumulative five-year returns remain not elevated above the unconditional average.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Write-downs, not fire sales, drive losses.&lt;/em&gt; Realized book losses in the first year of crisis onset account for only about 30% of market-value losses — contrary to what fire-sale models predict. By year five, cumulative book losses reach roughly 35% of pre-crisis book equity and approximately 100% of market-value losses. Decomposing net income, write-downs track cumulative book losses closely and fully account for market-value losses by year five. Trading losses (from securities sales and asset dispositions) account for only a small share on average, though for banks in the top quartile of securities-to-assets ratios, immediate accounting losses are larger and more trading-loss-driven — consistent with fire-sale dynamics being important specifically for banks with large tradable securities portfolios.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Nonperforming loans confirm the mechanism.&lt;/em&gt; At the country level, larger bank equity declines are associated with higher peak NPL rates in the subsequent five years (adjusted R² of 0.53 excluding two outliers; 0.606 for the 2008-2010 subsample only). No analogous relationship exists for nonfinancial equity returns.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Policy interventions are insufficient.&lt;/em&gt; Liquidity-based interventions (extraordinary central bank support and blanket guarantees) implemented after bank equity crises are followed by an approximately 20% short-run rebound in bank equity, which reverses between months 12 and 36. No large or permanent increase in bank value follows. Government recapitalization programs have historically been small (averaging 24% of pre-crisis book equity and 43% of realized losses), narrow (65% classified as narrow, median of five banks recapitalized), and delayed. Banks cannot self-recapitalize through high post-crisis profitability.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Crisis type matters.&lt;/em&gt; Panic-only crises (banking panics without large bank equity declines, N=85) exhibit very different dynamics: bank equity recovers to pre-crisis levels within five years, dividends fall only temporarily, liquidity interventions produce large and permanent rebounds, and macroeconomic output losses are smaller. In 75% of bank equity crises, the bank equity decline strictly precedes the banking panic, indicating that fundamental weaknesses — not liquidity shocks escalating into solvency problems — are the primary driver. Only 19 cases (25%), labelled &amp;ldquo;mismanaged banking panics&amp;rdquo; (including the U.S. Great Depression), saw the panic precede the equity decline, mostly in the pre-1945 Gold Standard era. Early liquidity intervention is essentially a necessary condition for averting incipient crises, but it is effective only when a steep bank equity decline has not yet occurred.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-how-do-the-authors-define-a-bank-equity-crisis-and-why-does-the-definition-matter-for-their-empirical-strategy"&gt;Q1. How do the authors define a &amp;ldquo;bank equity crisis&amp;rdquo; and why does the definition matter for their empirical strategy?&lt;/h3&gt;
&lt;p&gt;A bank equity crisis is defined as the first year when (1) the bank equity index declines by more than 30% in annual excess total returns in any year within the past five years, and (2) a top-20 bank (ranked by assets) fails within the country. This purely data-driven, real-time definition avoids the look-ahead bias inherent in narrative-based chronologies. The authors identify 76 such crises. Results are robust to using Reinhart-Rogoff, JST, Laeven-Valencia, and 30%-decline-only definitions, alleviating concerns that the differential bank versus nonfinancial equity dynamics are mechanical artifacts of the crisis identification approach.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-quantitative-magnitude-of-the-initial-equity-shock-to-banks-versus-nonfinancial-firms-at-crisis-onset"&gt;Q2. What is the quantitative magnitude of the initial equity shock to banks versus nonfinancial firms at crisis onset?&lt;/h3&gt;
&lt;p&gt;In the year of a bank equity crisis, the average abnormal cumulative log excess total return is -68 log-points for bank equity and -36 log-points for nonfinancial equity (corresponding to -49% and -30% in arithmetic abnormal returns, respectively). These are relative to the country&amp;rsquo;s unconditional average returns, estimated using country fixed effects in panel regressions.&lt;/p&gt;
&lt;h3 id="q3-do-bank-stocks-earn-elevated-returns-after-banking-crises-as-temporary-loss-models-predict"&gt;Q3. Do bank stocks earn elevated returns after banking crises, as temporary-loss models predict?&lt;/h3&gt;
&lt;p&gt;No. Over the five years following crisis onset, bank equity point estimates of cumulative abnormal returns are consistently negative, and significantly so at years three and four. Bank equity does not recover to its pre-crisis level at any horizon out to five years (and Figure A.9 extends to ten years with similar conclusions). This pattern holds across advanced and emerging economies, before and after 1945, excluding the Global Financial Crisis, and across a variety of methods for computing abnormal returns. Even for surviving banks — excluding those that failed or exited — the pattern holds.&lt;/p&gt;
&lt;h3 id="q4-how-do-the-earnings-and-dividend-dynamics-of-banks-versus-nonfinancial-firms-differ-after-crises"&gt;Q4. How do the earnings and dividend dynamics of banks versus nonfinancial firms differ after crises?&lt;/h3&gt;
&lt;p&gt;For banks, both real dividends per share and real earnings per share remain well below their long-term average five years after crisis onset, with no recovery visible by year five. For nonfinancial firms, dividends and earnings decline at crisis onset but rebound, though only slowly through year five. Panel regressions at both the country and individual-bank level find coefficients of approximately 1 to 2 on the relationship between the crisis-year bank equity return and the five-year-ahead change in real dividends and real earnings — indicating a roughly commensurate earnings-driven decline, not a transitory discount-rate shock.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-magnitude-of-the-short-run-bounce-back-in-bank-equity-and-does-it-represent-a-profit-opportunity"&gt;Q5. What is the magnitude of the short-run bounce-back in bank equity, and does it represent a profit opportunity?&lt;/h3&gt;
&lt;p&gt;Even with perfect knowledge of the crisis trough (which is not available in real time), the rebound in bank equity from trough to peak averages only about 30% of the initial decline. This gain partially reverses within approximately twelve months, so that cumulative five-year abnormal returns remain not elevated above the unconditional average. Trading strategies that account for risk and factor returns (market, value, size, momentum, global equity) yield even lower risk-adjusted returns, strengthening the conclusion that bank equity is not cheap at crisis troughs.&lt;/p&gt;
&lt;h3 id="q6-how-do-write-downs-compare-to-trading-losses-in-explaining-the-accounting-losses-of-banks-during-crises"&gt;Q6. How do write-downs compare to trading losses in explaining the accounting losses of banks during crises?&lt;/h3&gt;
&lt;p&gt;Realized book losses in the first year of crisis onset account for only about 30% of market-value losses. By year five, cumulative book losses reach approximately 35% of pre-crisis book equity and roughly 100% of market-value losses. Decomposing net income, write-downs (revaluations of assets remaining on the balance sheet — loan loss provisions, impairments, goodwill write-downs) track cumulative book losses closely and fully account for market-value losses by year five. Trading losses (realized gains and losses from securities trading and all asset sales) account for only a small share of total losses on average.&lt;/p&gt;
&lt;h3 id="q7-under-what-conditions-do-fire-sales-rather-than-write-downs-dominate-the-accounting-losses"&gt;Q7. Under what conditions do fire sales rather than write-downs dominate the accounting losses?&lt;/h3&gt;
&lt;p&gt;For banks in the top quartile of the ratio of securities to total assets, immediate accounting losses in the first year of crisis onset are substantially larger and driven to a significant extent by trading losses rather than write-downs. The six bank equity crises with the highest securities-to-assets ratios (weighted across banks) all occurred during the 2007-2008 crisis (Belgium, France, Germany, Switzerland, the U.K., and the U.S.), when fire sales of securitized assets were significant. Banks holding mostly loans (bottom quartile of securities-to-assets) show slower-to-materialize book losses driven predominantly by write-downs.&lt;/p&gt;
&lt;h3 id="q8-how-do-nonperforming-loan-rates-relate-to-the-magnitude-of-bank-equity-declines-across-crises"&gt;Q8. How do nonperforming loan rates relate to the magnitude of bank equity declines across crises?&lt;/h3&gt;
&lt;p&gt;At the country level, more negative unlevered bank equity returns at crisis onset are statistically significantly associated with higher peak NPL rates over the subsequent five years. The adjusted R² for the full available sample is 0.233, rising to 0.533 after excluding two outliers (U.S. 1990, Sweden 1991). For the 2008-2010 crisis episodes only, the adjusted R² is 0.606. No analogous association between NPL rates and nonfinancial equity returns is found, suggesting the mechanism is specific to the banking sector&amp;rsquo;s asset-quality deterioration.&lt;/p&gt;
&lt;h3 id="q9-do-liquidity-based-interventions-central-bank-support-or-blanket-guarantees-restore-bank-capitalization-after-bank-equity-crises"&gt;Q9. Do liquidity-based interventions (central bank support or blanket guarantees) restore bank capitalization after bank equity crises?&lt;/h3&gt;
&lt;p&gt;No. Following the implementation of liquidity-based interventions during bank equity crises, bank equity prices initially continue to decline for about two months, then rise by approximately 20%, but this gain reverses between months 12 and 36. Bank equity values remain persistently low thereafter. This is inconsistent with models in which forceful lender-of-last-resort interventions accomplish the same result as direct recapitalizations. The authors caution that interventions are not randomly assigned — deeper crises may receive stronger interventions — so the analysis cannot identify counterfactual outcomes.&lt;/p&gt;
&lt;h3 id="q10-what-are-the-historical-characteristics-of-government-recapitalization-programs"&gt;Q10. What are the historical characteristics of government recapitalization programs?&lt;/h3&gt;
&lt;p&gt;Based on a new database covering all government recapitalization programs across 17 economies since 1870, recapitalizations have historically been small (averaging 24% of pre-crisis book equity and 43% of realized market-value losses), narrow (65% classified as narrow, with a median of five banks recapitalized), and delayed. Total equity issuance (government and private combined) is only a small fraction of realized losses. Government-funded issuance accounts for about one-fourth of total bank equity issuance. The U.S. TARP after 2008 was unusual in being both broad (over 700 banks) and timely (about one month after the Lehman collapse). Japan&amp;rsquo;s crisis of the 1990s is a prominent example of extreme delay, with the first recapitalization program implemented in March 1999, nearly a decade after the real estate collapse began.&lt;/p&gt;
&lt;h3 id="q11-how-do-panic-only-crises-differ-from-bank-equity-crises-in-terms-of-equity-dynamics-and-policy-effectiveness"&gt;Q11. How do &amp;ldquo;panic-only crises&amp;rdquo; differ from bank equity crises in terms of equity dynamics and policy effectiveness?&lt;/h3&gt;
&lt;p&gt;Panic-only crises (N=85) are banking panics without a 30% bank equity decline. They feature significant initial negative returns followed by elevated bank equity returns that bring valuations back to pre-crisis levels within five years. Dividends fall only temporarily. Liquidity interventions during panic-only crises produce a full rebound in bank equity in the month of intervention, contrasting sharply with the modest and transient response observed in bank equity crises. Panic-only crises are also associated with shallower real GDP declines and smaller bank credit contractions than bank equity crises.&lt;/p&gt;
&lt;h3 id="q12-in-what-fraction-of-bank-equity-crises-does-the-bank-equity-decline-precede-the-banking-panic-and-what-does-this-imply-about-the-root-cause"&gt;Q12. In what fraction of bank equity crises does the bank equity decline precede the banking panic, and what does this imply about the root cause?&lt;/h3&gt;
&lt;p&gt;In 57 of the 76 bank equity crises (75%), the bank equity decline strictly precedes the emergence of the banking panic. This timing implies that most bank equity crises are not liquidity shocks that evolved into solvency problems — rather, fundamental weaknesses in the banking system are already present at the early stages of the crisis. Only 19 cases (25%), called &amp;ldquo;mismanaged banking panics,&amp;rdquo; saw the panic precede the equity decline; these occurred predominantly in the pre-1945 period, often in countries on the Gold Standard with limited central bank capacity.&lt;/p&gt;
&lt;h3 id="q13-under-what-conditions-can-early-liquidity-interventions-avert-an-incipient-banking-crisis"&gt;Q13. Under what conditions can early liquidity interventions avert an incipient banking crisis?&lt;/h3&gt;
&lt;p&gt;Of 183 episodes of incipient liquidity shocks in which a prior 30% bank equity decline had not yet occurred, 126 received early liquidity interventions, of which 92 were successfully averted (approximately 50% of the original 183 episodes). The two strongest predictors of a successfully averted crisis — essentially necessary conditions — are: (1) the pre-panic bank equity decline remains below 30%, and (2) liquidity intervention occurs within one month of the panic. War outbreak and single-bank focus of the run are additional factors that substantially increase the probability of aversion. Combining the small-equity-decline and early-intervention conditions predicts averted panics with a true-positive rate of 99% (91/92), though with a 24% false-positive rate.&lt;/p&gt;
&lt;h3 id="q14-does-cross-sectional-heterogeneity-at-the-bank-level-confirm-the-permanent-loss-interpretation"&gt;Q14. Does cross-sectional heterogeneity at the bank level confirm the permanent-loss interpretation?&lt;/h3&gt;
&lt;p&gt;Yes. Sorting the ten largest banks by country into five bins by market-to-book (M/B) ratio at crisis onset shows monotonic relationships with five-year outcomes. The most distressed banks (M/B below 0.2) experience reduced credit growth of 26 percentage points and reduced income-to-book-equity of 87 percentage points (both cumulative over five years) relative to the healthiest banks (M/B above 0.8). The M/B ratio at crisis onset is persistently low in subsequent years, because market values crash permanently while book values are sticky (slow write-down recognition). These results hold with crisis fixed effects, meaning the patterns reflect within-crisis cross-sectional variation, not merely crisis-level heterogeneity.&lt;/p&gt;
&lt;h3 id="q15-do-crises-preceded-by-credit-booms-have-worse-post-crisis-outcomes-for-banks"&gt;Q15. Do crises preceded by credit booms have worse post-crisis outcomes for banks?&lt;/h3&gt;
&lt;p&gt;Yes. Crises preceded by above-median growth in the credit-to-GDP ratio (from pre-crisis trough to peak) are associated with an additional 60 log-point abnormal decline in bank equity excess total returns occurring around year three after crisis onset, persisting through year five. By contrast, crises not preceded by credit booms earn bank equity returns similar to the country&amp;rsquo;s unconditional average after the initial decline. This supports the hypothesis that credit-boom-driven crises involve unexpected future deterioration in asset quality, possibly linked to persistently negative housing returns (which do not recover to pre-crisis levels within five years after banking crises).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Bank equity crisis (paper-specific definition):&lt;/strong&gt; An episode identified in real time when two criteria are jointly met for the first time: (1) the bank equity index declines by more than 30% in annual excess total returns within any year of the past five years, and (2) a top-20 bank (ranked by total assets within the country) fails. This definition is purely data-driven and does not require any look-ahead information. It produces 76 crises across 46 economies from 1870 to 2019.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Permanent-loss view:&lt;/strong&gt; The theoretical interpretation that banking crises primarily reflect fundamental, lasting deterioration in the value of bank assets — arising either from fire sales that permanently destroy value or (more commonly in the authors&amp;rsquo; evidence) from deterioration in asset quality (rising nonperforming loans, loan impairments). Under this view, bank equity declines are earnings-driven rather than discount-rate-driven and do not reverse even after funding and market liquidity are restored.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Temporary-loss view:&lt;/strong&gt; The theoretical interpretation that bank losses during crises are primarily due to temporary price dislocations — assets held by financial intermediaries trade at sharp discounts due to binding borrowing constraints or depositor fragility, but recover their fundamental value once central banks provide liquidity support. Under this view, bank equity should earn elevated future returns after crises, and forceful liquidity interventions should be equivalent to direct recapitalizations in restoring bank value.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Write-downs (paper-specific definition):&lt;/strong&gt; Revaluations of assets that remain on the balance sheet, reflecting expected future reductions in cash flows. They include loan loss provisions, additions to loan loss reserves, write-downs of fixed assets, and goodwill impairments. Distinguished from trading income (realized gains and losses from securities trading and all asset dispositions). Write-downs are subject to accounting discretion and are recognized slowly over multiple years after crisis onset, while equity markets price in expected total losses rapidly at crisis onset.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Trading income (paper-specific definition):&lt;/strong&gt; Realized gains and losses from securities trading and all asset sales, including sales of real estate, loans, and subsidiary divisions. Unlike write-downs, trading losses must be recognized immediately (they are realized transactions), so large trading losses at crisis onset would be evidence consistent with fire-sale dynamics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Panic-only crises:&lt;/strong&gt; Banking panics (sustained bank runs or depositor withdrawals) that do not coincide with a greater-than-30% bank equity decline. Identified as N=85 in the full sample. These episodes are characterized by temporary equity declines, full recovery within five years, large positive responses to liquidity interventions, and smaller macroeconomic output losses — consistent with the temporary-loss view.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mismanaged banking panics:&lt;/strong&gt; The minority of bank equity crises (19 cases, 25%) in which the banking panic occurred first or concurrently with the 30% bank equity decline, rather than the equity decline preceding the panic. Concentrated in the pre-1945 period, often in Gold Standard countries with limited central bank flexibility. The U.S. Great Depression is the prominent example.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Averted crisis:&lt;/strong&gt; An incipient liquidity shock to the banking sector that fully recedes within two months without any bank failures or 30% bank equity declines. Empirically, all averted crises in the sample had not yet experienced a 30% bank equity decline and all received early liquidity interventions (within one month of the incipient panic onset).&lt;/p&gt;</description></item><item><title>Place-Based Redistribution</title><link>https://macropaperwarehouse.com/papers/place-based-redistribution/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/place-based-redistribution/</guid><description>&lt;h2 id="place-based-redistribution-overview"&gt;Place-Based Redistribution: Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Should national governments redistribute income to residents of poor areas through place-based transfers, or should redistribution rely solely on place-blind (income-only) taxes? The longstanding view in urban economics—&amp;ldquo;help poor people, not poor places&amp;rdquo;—holds that place-based aid is inefficient because it channels activity to less productive locations. This paper challenges that view by formalizing the conditions under which place-based redistribution improves on purely income-based transfers, using tools from optimal tax theory embedded in a spatial equilibrium model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper develops a two-location model (&amp;ldquo;Distressed&amp;rdquo; and &amp;ldquo;Elsewhere&amp;rdquo;) with a unit mass of heterogeneous households who differ in skill level (θ) and idiosyncratic preference for living in Distressed (φ). Households choose where to live and how much to earn, facing competitive labor and housing markets in each location. Locations may differ in amenity levels, wage schedules (which may embody skill-specific comparative advantage), and housing costs. A utilitarian planner sets location-specific income tax schedules—observed earnings and location are the only signals of unobserved skill—maximizing a weighted average of household utilities and landlord profits subject to a budget constraint.&lt;/p&gt;
&lt;p&gt;The paper proceeds in three steps. First, it derives closed-form conditions for the optimality of a lump-sum place-based transfer under a fixed income tax. Second, it characterizes fully general optimal nonlinear, location-specific marginal tax rate (MTR) schedules (Proposition 2). Third, it calibrates the model numerically, anchoring to the U.S. Empowerment Zone (EZ) program.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Three Sorting Mechanisms and Their Policy Implications&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper identifies three polar mechanisms that generate sorting of lower-skill households into Distressed:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;em&gt;Skill-taste correlation&lt;/em&gt;: higher-skill households have stronger tastes for Elsewhere, independent of wages or rents.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Comparative advantage&lt;/em&gt;: higher-skill workers are relatively more productive in Elsewhere.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Income-based sorting&lt;/em&gt;: because Elsewhere is more expensive, lower-income households are priced into Distressed.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Under skill-taste correlation, place-based transfers to Distressed are unambiguously welfare-improving even when income taxes are already optimal, because high-skill households prefer Elsewhere for reasons that are orthogonal to income. Under comparative advantage, the direction of the optimal transfer depends on migration elasticities: low migration elasticities favor transfers to Distressed, while high migration elasticities can reverse the sign. Under pure income-based sorting (with homogeneous locational preferences), the conditions for superfluous commodity taxation (Atkinson-Stiglitz 1976) are satisfied, and optimal place-based transfers are zero—though idiosyncratic preference heterogeneity restores non-zero optimal transfers even in this case.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Numerical simulations use Census data and ACS moments calibrated to EZ areas. With high migration responsiveness (κ = 0.5, approximating urban EZs) and skill-taste correlation as the sole sorting driver, the optimal average place-based transfer to Distressed is &lt;strong&gt;$4,805&lt;/strong&gt;, with about 40% ($1,943) arising from lower MTRs rather than a higher demogrant. With low migration responsiveness (κ = 4, approximating rural EZs), the optimal transfer more than doubles to &lt;strong&gt;$10,918&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;When comparative advantage alone drives sorting and migration is low (κ = 4), the optimal transfer to Distressed is &lt;strong&gt;$7,091&lt;/strong&gt;, with a $3,740 larger demogrant. With high migration and comparative advantage, the transfer reverses to &lt;strong&gt;−$2,763&lt;/strong&gt; (i.e., Elsewhere receives the subsidy). For intermediate migration under comparative advantage (e.g., κ ≈ 1), the optimal policy is nonlinear: the poorest Distressed residents receive a place-based transfer of &lt;strong&gt;$1,254&lt;/strong&gt;, while high-skill Distressed residents face a place-based tax of &lt;strong&gt;$12,398&lt;/strong&gt; at the 99th percentile.&lt;/p&gt;
&lt;p&gt;In the empirically calibrated &lt;strong&gt;urban EZ baseline&lt;/strong&gt; (migration elasticity 0.82, rent ratio 0.86, sorting driven by skill-taste correlation and income effects), the optimal average place-based transfer is &lt;strong&gt;$3,143&lt;/strong&gt;, roughly matching the magnitude of actual EZ wage tax credits (~$3,000 for full-time eligible workers). The demogrant advantage for Distressed is &lt;strong&gt;$1,462&lt;/strong&gt;, with just over half of the transfer arising from lower MTRs.&lt;/p&gt;
&lt;p&gt;In the &lt;strong&gt;rural EZ baseline&lt;/strong&gt; (migration elasticity 0.20, rent ratio 0.54, comparative advantage and income effects), the optimal average transfer rises to &lt;strong&gt;$4,329&lt;/strong&gt;, concentrated in lower MTRs rather than a larger demogrant. Halving the migration elasticity from the rural baseline raises the optimal transfer to &lt;strong&gt;$6,906&lt;/strong&gt;, while doubling it reduces the transfer to near zero (&lt;strong&gt;$573&lt;/strong&gt;).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;All results are derived under the assumption of &lt;em&gt;no market failures&lt;/em&gt;; the model deliberately excludes agglomeration spillovers or other Pigouvian motives, attributing the case for place-based redistribution purely to redistributive goals.&lt;/li&gt;
&lt;li&gt;The planner observes only earnings and location, not skill type directly.&lt;/li&gt;
&lt;li&gt;Household Pareto weights are set equal to one across types in the simulations, so redistribution is driven solely by diminishing marginal utility of consumption.&lt;/li&gt;
&lt;li&gt;The model abstracts from interactions with subnational governments, local public services, and endogenous amenities.&lt;/li&gt;
&lt;li&gt;Results on the desirability of transfers to Distressed hinge critically on the motive for sorting, not simply on the existence of spatial income inequality.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-equity-efficiency-tradeoff-formula-for-a-lump-sum-place-based-transfer-and-what-does-it-reveal"&gt;Q1. What is the equity-efficiency tradeoff formula for a lump-sum place-based transfer, and what does it reveal?&lt;/h3&gt;
&lt;p&gt;Lemma 1 shows that the first-order welfare effect of a small per-capita transfer from Elsewhere to Distressed starting from a place-blind tax system is dSWF/dt = (λ̄₁ − λ̄₀) + Eθ{m(0)·[T(z₁*) − T(z₀*)]}. The equity gain (λ̄₁ − λ̄₀) is positive when Distressed households have higher average social marginal welfare weights, which holds when their skill distribution is first-order stochastically dominated by Elsewhere&amp;rsquo;s. The fiscal cost equals the earnings-tax-revenue loss from movers: households induced to migrate to Distressed who earn less there generate lower tax payments. This formula identifies the earnings response to migration as a sufficient statistic for the efficiency cost of place-based policy.&lt;/p&gt;
&lt;h3 id="q2-what-characterizes-the-optimal-lump-sum-transfer-t-in-proposition-1"&gt;Q2. What characterizes the optimal lump-sum transfer t* in Proposition 1?&lt;/h3&gt;
&lt;p&gt;Proposition 1 shows t* = [λ̄₁(t*) − λ̄₀(t*) + Eθ{m(t*)·[T(z₁*) − T(z₀*)]}] / (Eθ[m(t*)] / [L₀(t*)L₁(t*)]). The optimal transfer is larger when (i) the average social marginal welfare weight gap between Distressed and Elsewhere is greater, (ii) migration responses m(t*) are small, and (iii) the earnings difference between locations for marginal movers is small. This formula holds regardless of whether the income tax schedule T(·) is itself set optimally.&lt;/p&gt;
&lt;h3 id="q3-under-skill-taste-correlation-why-are-place-based-transfers-always-welfare-improving-even-under-an-optimal-income-tax"&gt;Q3. Under skill-taste correlation, why are place-based transfers always welfare-improving even under an optimal income tax?&lt;/h3&gt;
&lt;p&gt;When sorting is driven by skill-taste correlation (high-skill households have stronger preferences for Elsewhere despite identical wages and rents), the equity gain λ̄₁ − λ̄₀ is positive because low-skill households concentrate in Distressed. A small positive transfer starting from t = 0 also incurs zero fiscal cost because movers between locations face identical wages and do not change their earnings. Thus, welfare unambiguously increases. The key insight is that skill-taste correlation violates the Atkinson-Stiglitz condition: high earners would still prefer Elsewhere even if forced to earn less, so location serves as a proxy for skill not captured by income taxes alone.&lt;/p&gt;
&lt;h3 id="q4-under-comparative-advantage-why-can-the-sign-of-the-optimal-transfer-reverse-with-migration-elasticity"&gt;Q4. Under comparative advantage, why can the sign of the optimal transfer reverse with migration elasticity?&lt;/h3&gt;
&lt;p&gt;When higher-skill workers are more productive in Elsewhere, movers to Distressed experience wage and earnings reductions, generating a fiscal externality. When migration elasticities are high (low κ), this fiscal cost is large and can dominate the equity gain, making transfers to Elsewhere optimal (simulated optimal transfer of −$2,763 at κ = 0.5). When migration elasticities are low (high κ), the fiscal cost is small and equity considerations dominate, yielding transfers to Distressed ($7,091 at κ = 4). At intermediate elasticities, the optimal policy is nonlinear, redistributing to poor Distressed residents while taxing rich Distressed residents more.&lt;/p&gt;
&lt;h3 id="q5-why-are-place-based-transfers-superfluous-under-pure-income-based-sorting-with-homogeneous-locational-preferences"&gt;Q5. Why are place-based transfers superfluous under pure income-based sorting with homogeneous locational preferences?&lt;/h3&gt;
&lt;p&gt;Example 6 (and its formal proof in Appendix B.3.5) demonstrates that when sorting arises solely from higher rents in Elsewhere and preferences over location are homogeneous (no idiosyncratic φ heterogeneity), the Atkinson-Stiglitz sufficient condition for commodity tax superfluousness is met: hypothetically forcing high earners to earn less would not change their preferred consumption bundle relative to low earners. Hence a place-blind income tax implements optimal redistribution without spatial supplements. As the variance of idiosyncratic location preferences κ shrinks toward zero, Figure 3 confirms that optimal place-based transfers tend toward zero across all three sorting motives.&lt;/p&gt;
&lt;h3 id="q6-what-new-terms-appear-in-the-optimal-location-specific-mtr-formulas-proposition-2-relative-to-a-standalone-economy-optimum"&gt;Q6. What new terms appear in the optimal location-specific MTR formulas (Proposition 2) relative to a standalone-economy optimum?&lt;/h3&gt;
&lt;p&gt;The optimal MTR schedules in Proposition 2 contain two new terms beyond the standard Mirrlees (1971)/Saez (2001) formula. The term Δτ+(θ) captures the fiscal externality from migration: raising Elsewhere&amp;rsquo;s MTR at skill level θ and above induces movers to Distressed who change their tax revenue by T₁(z₁*(s)) − T₀(z₀*(s)). The term (λ_L − 1)Δr+(θ) captures the equilibrium rent effect: MTR changes shift households between locations, altering rents in both communities and redistributing between renters and landlords. When λ_L &amp;lt; 1 (landlords are weighted less than average households), the rent term creates additional motives for spatial redistribution depending on the ratio of rents to housing supply elasticities across locations.&lt;/p&gt;
&lt;h3 id="q7-how-do-housing-supply-elasticities-affect-the-optimal-spatial-transfer-and-why-does-the-sign-differ-between-urban-and-rural-settings"&gt;Q7. How do housing supply elasticities affect the optimal spatial transfer, and why does the sign differ between urban and rural settings?&lt;/h3&gt;
&lt;p&gt;The rent redistribution term Δr+(θ) has sign determined by r₁/ϱ₁ − r₀/ϱ₀. For urban EZs, where Distressed has lower rents but also lower housing supply elasticity than Elsewhere (ϱ₁ = 0.24, ϱ₀ = 0.34 in the baseline), this ratio is positive, meaning transfers to Distressed shift households into relatively inelastic markets, raising rents there and generating landlord income. When λ_L &amp;lt; 1, this reduces the desirability of transfers to Distressed. For rural EZs, Distressed has higher housing supply elasticity (ϱ₁ = 0.60), so the ratio is negative: transfers shift households to more elastic markets where rents rise minimally. When λ_L &amp;lt; 1, this actually motivates more transfers to rural Distressed areas. In the 75%-landlord-weight sensitivity, optimal urban transfers fall by ~$1,000 while rural transfers rise by ~$1,000, illustrating this asymmetry.&lt;/p&gt;
&lt;h3 id="q8-what-does-the-urban-ez-baseline-calibration-find-about-optimal-transfers-and-how-does-it-compare-to-actual-ez-policy"&gt;Q8. What does the urban EZ baseline calibration find about optimal transfers and how does it compare to actual EZ policy?&lt;/h3&gt;
&lt;p&gt;The urban baseline targets a migration elasticity of 0.82 (from Busso et al. 2013), a Distressed-to-Elsewhere rent ratio of 0.86, and 56% of Distressed residents earning under $50,000. The calibrated κ is 0.44. At the optimum, Distressed residents receive an average place-based transfer of $3,143, with $1,462 as a higher demogrant and the remainder from lower MTRs. By comparison, actual EZs provide a wage tax credit of approximately $3,000 per eligible full-time worker. The paper concludes that the magnitude—but not the capped, flat structure—of EZ transfers approximates the optimal level.&lt;/p&gt;
&lt;h3 id="q9-what-does-the-rural-ez-calibration-find-and-how-sensitive-are-results-to-migration-assumptions"&gt;Q9. What does the rural EZ calibration find, and how sensitive are results to migration assumptions?&lt;/h3&gt;
&lt;p&gt;The rural baseline targets a migration elasticity of 0.20 (from Sprung-Keyser et al. 2022), a rent ratio of 0.54, and 60% of Distressed residents earning under $50,000, with sorting attributed to comparative advantage and income effects. The calibrated κ is 4.06. The optimal average transfer is $4,329, primarily arising from lower MTRs rather than a higher demogrant ($532). Doubling the migration elasticity reduces the optimal transfer to near zero ($573); halving it raises it to $6,906. The direction and magnitude of optimal transfers are therefore highly sensitive to the assumed level of migration responsiveness, highlighting the empirical importance of estimating migration elasticities—particularly heterogeneity in migration by income level and earnings changes for marginal movers.&lt;/p&gt;
&lt;h3 id="q10-do-within-income-transfers-arising-from-differences-in-marital-and-parental-status-across-communities-effectively-constitute-place-based-redistribution"&gt;Q10. Do within-income transfers arising from differences in marital and parental status across communities effectively constitute place-based redistribution?&lt;/h3&gt;
&lt;p&gt;Online Appendix A investigates this by estimating the implicit place-based transfer induced by marital and parental status differences between EZ communities and the rest of the country. Using ACS tract-level data merged with Piketty-Saez-Zucman distributional national accounts (DINA), the authors find that marital status and parental status have offsetting effects: marital status raises taxes on single households (common in Distressed), while parental status increases transfers to households with children (also common in Distressed). Across all preferred CPS-adjusted estimates, net within-earnings transfers are below $1,000 in magnitude, and the two factors essentially cancel. The authors conclude that marital and parental status differences do not yield substantial de facto place-based redistribution within income levels.&lt;/p&gt;
&lt;h3 id="q11-what-does-the-mtr-decomposition-table-3-reveal-about-why-sorting-motives-generate-different-mtr-patterns"&gt;Q11. What does the MTR decomposition (Table 3) reveal about why sorting motives generate different MTR patterns?&lt;/h3&gt;
&lt;p&gt;The decomposition separates the optimal MTR into a within-community component (standard equity-efficiency tradeoff) and a between-community component (fiscal externality from migration). Under skill-taste correlation with high migration (κ = 0.5), both components contribute positively to the Distressed MTR (0.246 within + 0.234 between = 0.479), yielding lower MTRs in Distressed (0.479) than in Elsewhere (0.510). Under comparative advantage with high migration, the within-community component is negative (−0.111) because high MTRs at the optimum reduce the concentration of high-skill types in Distressed, depressing the standard revenue-raising benefit of MTRs. The large positive between-community component (0.655) reflects the large fiscal externality from movers and overcomes this, yielding higher Distressed MTRs (0.544 vs. 0.509 in Elsewhere). With low migration (κ = 4), between-community components shrink substantially, and MTRs in Distressed fall below Elsewhere in all sorting scenarios.&lt;/p&gt;
&lt;h3 id="q12-what-does-the-crosswalk-from-urban-to-rural-baseline-reveal-about-which-assumptions-drive-the-change-in-optimal-transfers"&gt;Q12. What does the crosswalk from urban to rural baseline reveal about which assumptions drive the change in optimal transfers?&lt;/h3&gt;
&lt;p&gt;Table 5 traces the urban-to-rural transition step by step. Starting from the urban baseline ($3,143 average transfer), replacing the migration elasticity target with the rural value of 0.20 triples the optimal transfer to $9,870. Subsequently replacing skill-taste correlation with comparative advantage as the sorting mechanism reduces the transfer by roughly half ($6,402). Adjusting rent to match the rural ratio (0.54) reduces it further to $2,780, as lower Distressed rent reduces the marginal utility of consumption at the bottom and increases income-based sorting. Targeting the rural income share (60% below $50K) raises it back to $4,140, and incorporating rural housing supply elasticities yields the rural baseline result of $4,329. This decomposition reveals that lower migration responsiveness is the single largest driver of higher optimal transfers in rural settings.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Place-based redistribution&lt;/strong&gt;: Transfer schemes in which economic benefits or tax burdens are conditioned on the geographic location of residence, as distinct from place-blind income taxes that condition only on earned income. In this paper, modeled as location-specific tax schedules T_j(z) that may differ across communities j.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Skill-taste correlation&lt;/strong&gt;: A source of spatial sorting in which households with higher skill levels (θ) have systematically stronger preferences for the &amp;ldquo;Elsewhere&amp;rdquo; location, independently of wage or rent differences. Formally, the conditional distribution G_θ(φ) of locational tastes given skill is weakly increasing in θ. This correlation breaks the Atkinson-Stiglitz sufficient condition for commodity tax superfluousness and generates unambiguously positive optimal transfers to Distressed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Comparative advantage (spatial)&lt;/strong&gt;: A sorting mechanism in which higher-skill workers are disproportionately more productive in Elsewhere than in Distressed, captured by the wage elasticity with respect to skill being higher in Elsewhere (γ₀(θ) &amp;gt; γ₁(θ)). Households with skill above a threshold sort into Elsewhere even with homogeneous locational preferences. The existence of spatial comparative advantage means that migrants to Distressed earn less, creating a fiscal externality for place-based transfers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Income-based sorting&lt;/strong&gt;: Sorting of lower-income, lower-skill households into Distressed arising purely from the higher cost of living in Elsewhere, without any systematic skill-taste correlation or comparative advantage. Because high-skill households are less sensitive to rent differences, they sort into Elsewhere when rents there are higher. When this is the sole sorting mechanism and locational preferences are homogeneous, the Atkinson-Stiglitz commodity tax superfluousness conditions are satisfied and optimal place-based transfers are zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fiscal externality (migration)&lt;/strong&gt;: The change in income tax revenue caused by migration responses to place-based policy changes, not by changes in incentives for stayers. When movers from Elsewhere to Distressed earn less in their new location, they generate lower tax payments, imposing a first-order cost on the government budget. This externality is measured by Δτ+(θ) in the optimal MTR formulas and equals the earnings-tax-revenue loss from movers across all skill levels above θ. This term is a &amp;ldquo;sufficient statistic&amp;rdquo; for the efficiency cost of place-based transfers in the sense of Chetty (2009).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Demogrant (∆₀)&lt;/strong&gt;: The difference in lump-sum transfers provided to zero-earners across the two locations (−T₀(0) − (−T₁(0)) = T₀(0) − T₁(0)). A positive ∆₀ means Distressed provides a larger transfer to non-earners. It represents the place-based redistribution that occurs at the bottom of the earnings distribution, independently of MTR differences. In the paper&amp;rsquo;s decomposition, total optimal place-based redistribution (∆_z) exceeds ∆₀ when Distressed also has lower MTRs, meaning redistribution grows with income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Income-constant average tax difference (∆_z)&lt;/strong&gt;: The paper&amp;rsquo;s preferred summary measure of the average place-based transfer, defined as an equally weighted average of two tax-difference indices: the tax difference evaluated at Elsewhere earnings levels and the tax difference evaluated at Distressed earnings levels. This measure isolates tax schedule differences from productivity differences across locations, avoiding conflation of tax policy and wage effects on measured income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Landlord welfare weight (λ_L)&lt;/strong&gt;: The social marginal welfare weight assigned to landlords relative to the multiplier on the government budget constraint. When λ_L &amp;lt; 1, the planner values a marginal dollar of public funds more than a marginal dollar to landlords, creating a motive to use place-based taxes to shift rent incidence. The rent redistribution effect on optimal MTRs operates through the term (λ_L − 1)Δr+(θ), which has opposite signs in urban (positive) and rural (negative) distressed areas because of their different housing supply elasticities.&lt;/p&gt;</description></item><item><title>Policy Diffusion and Polarization across U.S. States</title><link>https://macropaperwarehouse.com/papers/policy-diffusion-and-polarization-across-u.s.-states/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/policy-diffusion-and-polarization-across-u.s.-states/</guid><description>&lt;p&gt;DellaVigna and Kim study the innovation and diffusion of policies across U.S. states using a dataset of over 700 state laws spanning seven decades. The central question is what predicts whether a state adopts a policy — and how those predictors have changed over time. The paper draws on two primary data sources: the State Policy Innovation and Diffusion (SPID) Database (Boehmke et al., 2020), covering 676 policies, and a hand-collected sample of 57 policies from 91 NBER working papers (April 2012–September 2021) that feature state-level policy variation. The combined dataset covers 733 policies adopted from the 1950s onward across the contiguous 48 states.&lt;/p&gt;
&lt;p&gt;On policy innovation, the paper finds that state capacity plays only a small role: larger and richer states are only slightly more likely to introduce new policies, innovation originates from both Republican and Democratic states, and the patterns are largely idiosyncratic with respect to observable state characteristics. California is the most frequent innovator, but large states like Florida and Texas rank in the middle.&lt;/p&gt;
&lt;p&gt;For policy diffusion, the paper employs both a static Geary&amp;rsquo;s C clustering statistic (measuring whether the first 10 adopting states cluster geographically or politically relative to a random-diffusion benchmark) and a dynamic logit hazard model estimated separately by decade. The hazard model identifies three similarity channels — geographic, demographic, and political — and allows their coefficients to vary over time.&lt;/p&gt;
&lt;p&gt;The central finding is a structural break in diffusion patterns around 2000. From the 1950s to the 1990s, geographic proximity is the dominant predictor of policy adoption: the coefficient on geographic similarity is 0.34 in the 1970s and remains roughly constant at 0.33 in the most recent decade. Demographic similarity is consistently positive and stable (approximately 0.20 in the 1980s, 0.22 in the 2010s). Political similarity — measured by closeness in Republican vote-share from the most recent presidential election — is a modest predictor before 2000, with coefficients between 0.14 (1970s) and 0.17 (1990s). Since 2000, the political similarity coefficient triples: 0.46 in the 2000s and 0.52 in the 2010s, making it by far the strongest predictor. The overall pseudo R-squared rises from 0.13 in the 1970s to 0.19 in the 2010s.&lt;/p&gt;
&lt;p&gt;These patterns are more pronounced for policies studied by economists: in the NBER subsample, the political similarity coefficient reaches 0.66 (s.e.=0.09) in the most recent two decades, versus 0.42 (s.e.=0.04) in the SPID sample.&lt;/p&gt;
&lt;p&gt;The paper tests whether the increased role of political similarity reflects correlated voter preferences, learning, or competition versus party discipline. Against correlated-preferences explanations: adding cross-state migration flows as a similarity measure reduces geographic predictive power but leaves the political similarity coefficient entirely unchanged; and typical policy-outcome variables (poverty rate, opioid mortality, income) have not become more correlated among politically similar states over time. In favor of party discipline: similarity in unified state government has zero predictive power through the 1990s but a coefficient of 0.42 (s.e.=0.06) in the 2000s–2010s. An event study of switches to unified party control confirms this causally for 1991–2020: switching to unified government raises the probability of passing ideologically aligned laws by approximately 2 percentage points in the four years following the switch, with no pre-trends and no effect on neutral-leaning laws; the same event study for 1950–1990 yields no detectable effect.&lt;/p&gt;
&lt;p&gt;COVID policies (77 state laws since October 2019) show strong political similarity in adoption; historical vaccination mandate policies (28 laws since 1975) show no political similarity effect. The paper concludes that rising party polarization at the state level — detectable from the 2000s onward, lagging the Congressional trend by roughly four to five decades — is the primary driver of the shift in diffusion patterns. The authors additionally classify each of the 57 NBER-sample policies by type of diffusion as an input for difference-in-differences research design assessment.&lt;/p&gt;
&lt;p&gt;Q: What data do the authors use and what is its scope?
A: The main source is the SPID Database (Boehmke et al., 2020), covering 676 policies over seven decades. The authors supplement this with 57 policies hand-collected from 91 NBER working papers (2012–2021) that use state-level policy variation. The combined sample covers 733 policies adopted from the 1950s onward in the contiguous 48 states, with the SPID sample averaging 23 adopting states per policy and the NBER sample averaging 29.&lt;/p&gt;
&lt;p&gt;Q: Do states with more resources or larger populations systematically innovate more policies?
A: The evidence for a state-capacity hypothesis is weak. There is only suggestive evidence that higher per-capita income predicts being in the top-20% of innovators, and no clear difference in population between the top and bottom innovators. Innovations arise from both Republican and Democratic states. One consistent correlate is urban population share, but overall innovation is largely idiosyncratic with respect to observable characteristics.&lt;/p&gt;
&lt;p&gt;Q: What was the dominant predictor of policy diffusion before 2000?
A: Geographic proximity was the dominant predictor. The coefficient on geographic similarity in the hazard model is 0.34 in the 1970s and remains stable at approximately 0.33 in the 2010s. Demographic similarity contributes consistently at approximately 0.20. Political similarity before 2000 is modest, ranging from 0.14 in the 1970s to 0.17 in the 1990s — roughly one-third to one-half the magnitude of the geographic coefficient.&lt;/p&gt;
&lt;p&gt;Q: How dramatically does political similarity change after 2000, and is this finding robust?
A: The political similarity coefficient triples, rising from 0.17 in the 1990s to 0.46 in the 2000s and 0.52 in the 2010s, making it the largest single predictor in recent decades. This pattern is robust across linear probability models, alternative measures of political similarity, alternative thresholds for &amp;ldquo;closest&amp;rdquo; states (closest fifth, fourth, third, or half all yield comparable coefficients), and alternative ways of computing adoption counts.&lt;/p&gt;
&lt;p&gt;Q: Is the shift toward political diffusion stronger for policies economists study?
A: Yes. In the NBER subsample, the political similarity coefficient reaches 0.66 (s.e.=0.09) in the 2000s–2010s, compared to 0.42 (s.e.=0.04) in the SPID sample. Geographic similarity also has somewhat higher coefficients in the NBER sample throughout the period. This implies that the policies most studied for difference-in-differences evaluation are also those most subject to politically-driven diffusion.&lt;/p&gt;
&lt;p&gt;Q: What does the Medicaid case study illustrate about political polarization?
A: ACA Medicaid expansion spread almost exclusively along partisan lines, with Republican vote-share accurately predicting the year of adoption. Crucially, the states that delayed or declined adoption — higher Republican vote-share states — had a higher share of population that would benefit from the expansion and therefore face a worse policy-need match. By contrast, the original 1966 Medicaid rollout showed no relationship between state political leaning and timing of adoption, and neither did the 1960s–1970s food stamp program expansion.&lt;/p&gt;
&lt;p&gt;Q: How do the authors distinguish party discipline from correlated voter preferences as the mechanism?
A: Two tests point away from correlated preferences: (1) cross-state migration flows, when added as a similarity measure, absorb geographic predictive power but leave the political similarity coefficient entirely unaffected; (2) typical policy-outcome variables (opioid mortality, poverty rate, income, etc.) have not become more correlated among politically similar states over time, contradicting the hypothesis that local needs or environments have become politically correlated.&lt;/p&gt;
&lt;p&gt;Q: What is the direct evidence for party discipline as the operative mechanism?
A: The authors construct a measure of similarity based on unified party control (governor and both chambers of the same party). This variable has zero predictive power through the 1990s (point estimate near zero). In the 2000–2020s, the coefficient for unified-government similarity is 0.42 (s.e.=0.06), making it the strongest single predictor of adoption in those decades. States with divided governments show no predictive power of adoption by other divided-government states, further isolating the role of party control.&lt;/p&gt;
&lt;p&gt;Q: What does the event-study of switches to unified party control show?
A: Switches to unified party control in 1991–2020 produce a statistically significant increase of approximately 2 percentage points in the probability of adopting ideologically aligned laws within four years of the switch, relative to the year before. The effect emerges in year t+1 and is persistent, with no pre-trends, and the effect on neutral-leaning laws is zero, ruling out a simple reduced-gridlock story. The same event study for 1950–1990 detects no effect.&lt;/p&gt;
&lt;p&gt;Q: How do COVID state policies compare to historical vaccination policies in terms of political diffusion?
A: COVID policies (77 state laws, October 2019–August 2021) show significant political similarity in adoption, consistent with the recent-decade patterns. Vaccination mandate laws (28 policies since 1975) show no political similarity effect whatsoever, with demographic and modest geographic similarity being the relevant predictors. This contrast underscores that political polarization in policy adoption is a recent phenomenon that has spread even to policy areas without prior partisan patterning.&lt;/p&gt;
&lt;p&gt;Q: How does partisan polarization at the state level compare temporally to polarization in Congress?
A: Congressional polarization (measured by DW-NOMINATE) has been rising since the 1950s. State-level policy polarization, as documented here, does not emerge until the 2000s — a lag of roughly four to five decades. The paper notes it has risen rapidly and has already reached policy domains (such as COVID mandates) that showed no political patterning historically.&lt;/p&gt;
&lt;p&gt;Q: Does the diffusion pattern vary across policy types?
A: Yes. For economic policies, geography and demographics decline in importance over time with a smaller increase in political predictors. For non-economic (social) policies, geographic importance remains stable while political polarization is especially strong. Political polarization is strongest in Republican-leaning and Democratic-leaning states, and weaker among battleground states, consistent with a party-driven model where ideologically extreme states adopt from each other.&lt;/p&gt;
&lt;p&gt;Q: How do the authors classify individual NBER-sample policies by diffusion type?
A: Using Geary&amp;rsquo;s C statistics computed separately for geographic and political clustering for each of the 57 NBER policies, the authors identify three approximate clusters: (1) primarily politically-clustered (e.g., Medicaid expansion); (2) jointly geographically and politically clustered (e.g., ban on asking about past salary history); and (3) largely idiosyncratic, neither geographically nor politically clustered (e.g., anti-bullying laws). This classification has direct implications for assessing identification threats in difference-in-differences designs.&lt;/p&gt;
&lt;p&gt;Q: What does the overall predictability of policy adoption look like over time?
A: The pseudo R-squared from the logit hazard model rises from 0.13 in the 1970s to 0.19 in the 2010s. The increase in political similarity is large enough not only to surpass geographic similarity as a predictor but to make the overall process of state policy adoption more predictable over time.&lt;/p&gt;
&lt;p&gt;Policy diffusion: The process by which a policy adopted in one state subsequently spreads to other states; measured here along geographic, demographic, and political dimensions using a logit hazard model estimated by decade.&lt;/p&gt;
&lt;p&gt;Geary&amp;rsquo;s C statistic: A ratio of weighted to unweighted average pairwise squared differences in adoption status, adapted from spatial statistics (Geary, 1954). Values below 1 indicate clustering; values above 1 indicate anti-clustering. The paper reports 1−C so higher values mean more clustering among similar states.&lt;/p&gt;
&lt;p&gt;Policy innovation: First-year adoption of a law in any state; a state is an &amp;ldquo;innovator&amp;rdquo; if it adopts in the first year the policy appears anywhere. The paper distinguishes innovation (origination) from diffusion (spread).&lt;/p&gt;
&lt;p&gt;Logit hazard model: A discrete-time logit model estimated at the state-year-policy level for all states that have not yet adopted a given policy, with policy-decade fixed effects as a baseline hazard and three time-varying similarity measures (geographic, demographic, political) as key predictors.&lt;/p&gt;
&lt;p&gt;Political similarity: Closeness of two states&amp;rsquo; Republican vote-shares from the most recent presidential election; the closest third of states in this dimension are used to construct the diffusion measure. Shown to be independent of — and to have grown far more predictive than — geographic similarity since 2000.&lt;/p&gt;
&lt;p&gt;Unified party control: A state government in which the governor and both state legislative chambers belong to the same party. The paper shows this is the variable most predictive of politically-driven policy diffusion in the 2000s–2020s, with a coefficient of 0.42 where it was effectively zero before 2000.&lt;/p&gt;
&lt;p&gt;Party discipline / party polarization: The paper&amp;rsquo;s preferred explanation for post-2000 patterns: state politicians increasingly vote and adopt policies along party lines beyond what voter preferences alone would predict, with the effect detectable since the 2000s at the state level, lagging the Congressional polarization trend by roughly four decades.&lt;/p&gt;</description></item><item><title>Politics at Work</title><link>https://macropaperwarehouse.com/papers/politics-at-work/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/politics-at-work/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Do individual political views shape firm behavior and labor market outcomes in the private sector? Specifically, do business owners sort copartisan workers into their firms, and does employers&amp;rsquo; political discrimination drive this sorting?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Setting&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper studies the complete Brazilian formal labor market over 2002–2019, assembling a novel longitudinal worker-firm-owner-party matched dataset from three administrative sources: (1) RAIS (Relação Anual de Informações Sociais), the universe of formal-sector workers (87 million unique workers, 7.6 million unique firms); (2) the Receita Federal do Brazil (RFB) and Cadastro Nacional de Empresas (CNE), containing business ownership structures for all registered firms; and (3) the Tribunal Superior Eleitoral (TSE) registry of all party members (19.3 million individuals) over 2002–2019. Matching these sources yields political affiliation for 11.4% of all private-sector owners and 7.8% of all private-sector workers in the sample. Party affiliation in Brazil requires an active registration step and is interpreted as a signal of strong and visible political views, distinguishing affiliated from unaffiliated individuals who likely hold milder views. The 35 parties in the sample are highly fragmented; the top 7 account for nearly 70% of all party members.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Political assortative matching.&lt;/em&gt; Using a likelihood ratio index (Eika et al., 2019; Chiappori et al., 2020), the paper finds that workers and owners belonging to the same party are on average about twice as likely to match in the labor market relative to random matching. Once within-municipality geographical sorting is accounted for, this figure falls to approximately 55% excess probability of copartisan matching, and increases over time: from 1.41 in 2002–2006 to 1.67 in 2016–2019. A dyadic regression approach — constructing all worker-firm dyads within industry-municipality labor markets and controlling for shared gender, race, age, and education — confirms the result: across all years, a politically affiliated worker is between 41% and 75% more likely to be employed by a copartisan owner than by an owner affiliated with a different party. Political assortative matching is driven both by higher hiring probabilities (range: 32%–59% more likely for copartisans, hiring margin only) and by longer tenure: copartisan workers stay in the firm roughly 5.5% longer than otherwise comparable workers of a different party, even within the same firm and hire-year (column 3 of Table 2). In every year and by every method, the degree of political assortative matching exceeds that of gender (15%–31% excess probability under dyadic approach) and race (approximately 3.4%), which are themselves both positive and significant.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Mechanisms: political discrimination.&lt;/em&gt; Three sets of evidence point to employer political discrimination as a relevant driver. First, in the administrative micro-data: assortative matching decreases strongly with firm size — it is more than twice as large in firms with up to 10 employees than in medium firms and more than six times as large as in firms with more than 50 employees — and is stronger for higher occupational layers and for jobs requiring above-median social skills or interpersonal relationships. Political assortative matching is, if anything, larger for parties not in power locally, inconsistent with a patronage mechanism. An event study of 5,262 owners who switched party finds a sharp increase of about 0.2 standard deviations in hires from the new party and a corresponding drop in hires from the old party at the time of the switch, with the share of workers from the new party rising by roughly 5 percentage points persistently. Second, an incentivized resume rating (IRR) field experiment (150 business owners; nondeceptive design) shows that owners rate copartisan resumes 0.213 points higher on a 1–7 Likert scale (a 7.4% increase relative to the mean rating for different-party resumes, statistically significant at p &amp;lt; 0.05), with no significant effect on perceived candidate acceptance probability. Third, a representative survey of 891 owners and 1,003 workers finds that belief-based and taste-based discrimination are ranked as the leading explanations by both groups; 47% of owners and 58% of workers agree with the belief-based discrimination statement. Additionally, 29% of surveyed owners (22% say &amp;ldquo;Yes&amp;rdquo; and 7% &amp;ldquo;In some cases&amp;rdquo;) explicitly reveal that political views affect their hiring decisions.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Real consequences.&lt;/em&gt; Conditional on employment, copartisan workers are promoted faster: they are 0.448 percentage points more likely to be promoted from white-collar to managerial positions (against a base rate of 2.58%) and 0.44 percentage points more likely to be promoted from blue-collar to white-collar positions (base rate 2.98%). Workers from a different party than the owner face a promotion penalty of 0.104–0.180 percentage points for white-collar-to-manager promotions. On wages, copartisan workers earn 3.9% more than unaffiliated coworkers within the same firm and year (firm-year FE specification); the effect is 2.8% when restricting to the same occupation within the firm. Workers from a different party earn 1.6% less. Decomposing by tier: managers (copartisan premium 1.6%), white-collar workers (3.4%), blue-collar workers (1.5%). Despite better outcomes, copartisan workers are 2.1 percentage points (2.3% relative to the mean) less likely to be educationally qualified for their occupation, conditional on firm-year and controlling for a full set of demographics. Finally, a higher share of copartisan workers in the prior year is associated with lower firm employment growth (estimated β = −0.071), corresponding to approximately a 1 percentage point gap in annual growth rate for a one-standard-deviation difference in copartisan share — substantial relative to an average annual growth rate of 10%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;All findings pertain to the formal private sector in Brazil over 2002–2019. Political affiliation in the Brazilian system requires an active step and signals strong views; results apply to the approximately 7.8%–11.4% of workers and owners who are party-registered. The field experiment sample is limited to 150 business owners affiliated with major Brazilian parties who were actively seeking to hire. The firm growth result is explicitly characterized as suggestive, without a source of exogenous variation.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-likelihood-ratio-index-and-what-does-it-show-for-political-matching-in-brazil"&gt;Q1. What is the likelihood ratio index and what does it show for political matching in Brazil?&lt;/h3&gt;
&lt;p&gt;The likelihood ratio index measures how many times more likely a match between a worker and owner of the same party is, relative to the expected frequency under random matching (conditional on the population shares of each party). Across 2002–2019, the unconditional index ranges from 1.56 to 1.85, implying workers and employers of the same party are on average about twice as likely to match as under random matching. After accounting for geographic sorting within municipalities, the index ranges from approximately 1.41 (2002–2006 average) to 1.67 (2016–2019 average), showing a clear increasing trend. The corresponding gender and race indexes average about 1.2 and 1.35, respectively, in the basic specification, both significantly lower than the party index in every year of the sample.&lt;/p&gt;
&lt;h3 id="q2-how-do-the-dyadic-regression-estimates-control-for-omitted-characteristics-and-what-do-they-find"&gt;Q2. How do the dyadic regression estimates control for omitted characteristics, and what do they find?&lt;/h3&gt;
&lt;p&gt;The dyadic regression constructs all possible worker-firm pairs within each municipality-industry labor market in a given year. The dependent variable is an indicator for whether worker i is employed by firm f. The key coefficient of interest is the differential probability of employment for a copartisan pair relative to a different-party pair, controlling for indicators for shared gender, race, age bracket, and education level, as well as worker occupation fixed effects and experience. This controls for the concern that politically affiliated individuals share non-political traits that correlate with employment choices. After these controls, a politically affiliated worker is 41%–75% more likely (depending on year) to be employed by a copartisan owner than by a different-party owner. The effect stems primarily from copartisan workers being preferentially hired (not just from unaffiliated owners preferring any affiliated worker indiscriminately). The analogous dyadic estimate for shared gender is 15%–31% and for shared race is approximately 3.4%, both lower than the party estimate in all years.&lt;/p&gt;
&lt;h3 id="q3-how-is-political-assortative-matching-decomposed-into-hiring-versus-retention-margins"&gt;Q3. How is political assortative matching decomposed into hiring versus retention margins?&lt;/h3&gt;
&lt;p&gt;To isolate the hiring margin, the authors estimate the dyadic regression restricting to newly hired workers (not present in the firm in year t-1). They find that the probability of being hired by a copartisan owner is 32%–59% higher than by a different-party owner across years. The retention (tenure) margin is estimated by regressing the share of subsequent years a worker remains at the firm on partisan alignment at the time of hire. In the most stringent specification (year-of-hire × firm fixed effects), copartisan hires stay 5.5 percentage points longer (as a share of post-hire years) than different-party hires from the same firm and hire-year cohort. Both margins are significant, and both exhibit stronger political sorting than equivalent estimates for gender or race.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-evidence-against-political-patronage-as-the-primary-driver-of-political-assortative-matching"&gt;Q4. What is the evidence against political patronage as the primary driver of political assortative matching?&lt;/h3&gt;
&lt;p&gt;If political patronage (parties pressuring owners to hire copartisans) were the main driver, we would expect political assortative matching to be stronger when the owner&amp;rsquo;s party is in power locally, as those parties have greater leverage over business owners. The authors estimate a modified dyadic regression distinguishing between cases where the owner&amp;rsquo;s party is in the ruling coalition of the municipal mayor or state governor versus not in power. The results show that political assortative matching is, if anything, larger for parties not in power. This is inconsistent with patronage being the dominant mechanism and consistent with the discrimination channel being driven by owner preferences rather than external political pressure.&lt;/p&gt;
&lt;h3 id="q5-what-does-the-event-study-of-owner-party-changes-show"&gt;Q5. What does the event study of owner party changes show?&lt;/h3&gt;
&lt;p&gt;The event study tracks 5,262 owners who switch party affiliation during 2002–2019, comparing their firms to control firms in the same market whose owners remain affiliated to the original party. At the time of the switch, there is a sharp increase of approximately 0.2 standard deviations in hires from the owner&amp;rsquo;s new party and a corresponding sharp decrease in hires from the old party. Hires from other parties and unaffiliated hires also decline modestly. The share of the workforce affiliated with the new party increases by roughly 5 percentage points and remains elevated in subsequent years. Because nonpolitical network ties (shared school, neighborhood, sports team) are unlikely to dissolve abruptly when an owner changes party, this design provides additional evidence that the change in hiring is driven by a direct change in the owner&amp;rsquo;s political preferences rather than by network overlap.&lt;/p&gt;
&lt;h3 id="q6-what-was-the-design-of-the-incentivized-resume-rating-experiment-and-why-does-it-identify-political-discrimination"&gt;Q6. What was the design of the incentivized resume rating experiment and why does it identify political discrimination?&lt;/h3&gt;
&lt;p&gt;The experiment was conducted with 150 Brazilian business owners recruited from the administrative data (who are already known to be affiliated with one of six major parties), targeting owners with active hiring interest through a leading job platform. Owners rated 20 synthetic resumes with fully randomized features (education, experience, training, skills, formatting). Sixteen resumes had no partisan cues; two contained cues signaling copartisanship with the rating owner; two signaled a party from the opposite side of the political spectrum. Incentives were provided by committing to send respondents real job-seeker profiles from the platform chosen by machine learning based on revealed preferences. Because all resume features other than the partisan cue were randomized, the experiment shuts down shared nonpolitical networks and patronage as explanations; the only channel is the employer&amp;rsquo;s direct preference for the candidate&amp;rsquo;s partisan affiliation. The response rate was 11% and the survey was conducted March–May 2022.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-quantitative-magnitude-of-the-field-experiment-result"&gt;Q7. What is the quantitative magnitude of the field experiment result?&lt;/h3&gt;
&lt;p&gt;Owners rate copartisan resumes 0.213 points higher on the 1–7 Likert scale relative to resumes from the opposite side of the political spectrum (statistically significant at p &amp;lt; 0.05), representing a 7.4% increase relative to the mean rating of different-party resumes (2.950). When resume-level controls (gender, high-skill experience flag, years of experience, programming skills, training) are added, the estimate is 0.254. There is no statistically significant effect on owners&amp;rsquo; perceived likelihood that a candidate would accept a job offer (coefficient 0.150–0.158, not significant), suggesting that the observed difference in interest ratings reflects a genuine direct preference for copartisans, not an expectation that copartisans are more likely to accept.&lt;/p&gt;
&lt;h3 id="q8-what-do-the-survey-findings-add-about-mechanisms-and-the-prevalence-of-political-discrimination"&gt;Q8. What do the survey findings add about mechanisms and the prevalence of political discrimination?&lt;/h3&gt;
&lt;p&gt;The survey of 891 owners and 1,003 workers (response rate 26.84%) presents five candidate mechanisms and asks respondents to evaluate each. Both groups rank belief-based discrimination (owners believe copartisans would be more productive) as the most likely explanation: 47% of owners and 58% of workers partially or strongly agree. Taste-based discrimination is second (36% owners, 52% workers agree), followed by networks (39% owners, 49% workers). Patronage and workers&amp;rsquo; preferences attract little agreement from either group. Among owners ranked by single strongest agreement, 29.7% most strongly agree with belief-based discrimination and 22.0% with taste-based, while 29% of all surveyed owners explicitly stated that political views do affect their hiring decisions. These patterns are broadly similar regardless of the respondent&amp;rsquo;s own political affiliation status.&lt;/p&gt;
&lt;h3 id="q9-how-large-are-the-political-promotion-and-wage-premia-and-how-do-they-compare-to-gender-and-race-effects"&gt;Q9. How large are the political promotion and wage premia, and how do they compare to gender and race effects?&lt;/h3&gt;
&lt;p&gt;For promotions, copartisan white-collar workers are 0.448 percentage points more likely to be promoted to manager (relative to unaffiliated co-workers hired in the same firm-year), against a base promotion rate of 2.58% — an effect of approximately 17% of the mean. For blue-collar-to-white-collar promotion, the copartisan premium is 0.44 percentage points against a base rate of 2.98%. For wages, copartisans earn 3.9% more than unaffiliated co-workers within the same firm and year; restricting to the same occupation within the firm, the premium is 2.8%. The political wage premium (3.9%) exceeds the gender wage premium (1.5%) and the race wage premium (1.0%) in the same specification. Workers from a different party than the owner earn 1.6% less than unaffiliated co-workers within the same firm-year.&lt;/p&gt;
&lt;h3 id="q10-are-copartisan-workers-better-qualified-than-those-they-displace-and-what-does-this-imply-for-firm-performance"&gt;Q10. Are copartisan workers better qualified than those they displace, and what does this imply for firm performance?&lt;/h3&gt;
&lt;p&gt;Copartisan workers are significantly less qualified in terms of education relative to their occupation: they are 2.1 percentage points less likely to be educationally qualified for their position than their unaffiliated co-workers within the same firm-year (2.3% relative to the mean qualification rate of 93.2%), with the largest effects for managers. Workers of a different party show only a small and economically negligible qualification gap. The fact that copartisans are paid more, promoted faster, and yet are less qualified is consistent with political discrimination substituting for competence in personnel decisions. The qualification shortfall is specifically attributed to copartisanship and not to shared gender, race, age, or education between owner and worker, as those coefficients are economically small.&lt;/p&gt;
&lt;h3 id="q11-what-is-the-evidence-on-firm-growth-and-what-are-the-limitations-of-that-evidence"&gt;Q11. What is the evidence on firm growth and what are the limitations of that evidence?&lt;/h3&gt;
&lt;p&gt;Firms with a higher share of copartisan workers in the prior year grow less. The estimated coefficient β = −0.071, and a one-standard-deviation difference in the copartisan share is associated with approximately a 1 percentage point gap in annual employment growth, relative to a mean growth rate of 10%. The specification compares firms of the same size and with the same number of affiliated workers in the same year. The result is robust to adding municipality and municipality-industry fixed effects. The authors explicitly characterize this evidence as suggestive, noting the absence of an exogenous source of variation in political discrimination. The negative association is more consistent with taste-based discrimination (Becker, 1957) — in which politically homogeneous firms sacrifice productivity for the owners&amp;rsquo; amenity of employing copartisans — than with accurate belief-based discrimination.&lt;/p&gt;
&lt;h3 id="q12-how-is-political-assortative-matching-distributed-across-parties-and-does-it-depend-on-party-ideology"&gt;Q12. How is political assortative matching distributed across parties and does it depend on party ideology?&lt;/h3&gt;
&lt;p&gt;The likelihood ratio index shows large assortative matching across the entire political spectrum. For most years, relatively more ideologically extreme parties — on the left (PT, PDT) and on the right (PP, DEM) — display higher assortative matching than more centrist parties (PMDB, PSDB). This pattern is consistent with stronger partisan identity at the extremes leading to stronger preferences for copartisan workers, but the paper does not formally model the mechanism behind this heterogeneity.&lt;/p&gt;
&lt;h3 id="q13-what-is-the-role-of-workers-preferences-as-opposed-to-employers-discrimination-and-how-can-wages-distinguish-them"&gt;Q13. What is the role of workers&amp;rsquo; preferences as opposed to employers&amp;rsquo; discrimination, and how can wages distinguish them?&lt;/h3&gt;
&lt;p&gt;If workers have a preference for working with copartisan owners (treating this as a job amenity), compensating differentials theory would predict a negative wage premium for copartisan workers — they would accept lower wages in exchange for working with like-minded owners. The data show the opposite: copartisan workers earn significantly more, not less, than their unaffiliated co-workers. This evidence is inconsistent with workers&amp;rsquo; preferences being the primary driver of political assortative matching, and is instead consistent with employers&amp;rsquo; discrimination. The survey evidence corroborates this: both owners and workers assign low priority to the &amp;ldquo;workers&amp;rsquo; preferences&amp;rdquo; mechanism.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Political assortative matching&lt;/strong&gt;: The phenomenon by which workers and business owners belonging to the same political party are matched in the labor market at rates significantly exceeding what would occur under random matching within the local labor market. Measured via the likelihood ratio index and dyadic regressions that control for shared demographic characteristics. In this paper, political assortative matching is larger in magnitude than assortative matching along gender or racial lines.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Likelihood ratio index (S)&lt;/strong&gt;: A measure of assortative matching defined as the weighted sum of the ratios of observed same-party co-occurrence probabilities to their expected probabilities under random matching. S &amp;gt; 1 indicates positive assortative matching. The paper uses both a basic version and a geography-adjusted version that computes the index within municipalities to control for geographic concentration of party membership.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dyadic regression&lt;/strong&gt;: A regression approach that constructs all possible worker-firm pairs within a defined labor market (municipality × 2-digit industry) to estimate the differential probability that a worker is employed by a copartisan firm relative to a different-party firm. The key advantage is the ability to control simultaneously for multiple shared demographic characteristics between worker and owner, accounting for the correlation of assortative criteria.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Incentivized resume rating (IRR) experiment&lt;/strong&gt;: A nondeceptive field experiment design (following Kessler et al., 2019) in which business owners rate synthetic resumes with fully randomized characteristics. Truthful rating is incentivized because respondents are told that their revealed preferences will be used to select real job-seeker profiles sent to them by a partner platform via machine learning. This design allows direct identification of employer preference for copartisan candidates while ruling out alternative channels such as shared nonpolitical networks or patronage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Political wage premium&lt;/strong&gt;: The percentage wage difference earned by copartisan workers relative to unaffiliated co-workers within the same firm-year (and occupation), after controlling for a full set of socio-demographic characteristics. A positive political wage premium is the paper&amp;rsquo;s primary piece of evidence that workers&amp;rsquo; compensating differentials cannot explain political assortative matching, since amenity-based sorting would predict a negative premium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Political promotion premium&lt;/strong&gt;: The differential probability that a copartisan worker is promoted to a higher organizational layer (blue-collar to white-collar, or white-collar to manager) relative to an unaffiliated co-worker hired in the same firm and year, net of demographic controls.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Educational mismatch (Qualified)&lt;/strong&gt;: An indicator variable equal to one if a worker&amp;rsquo;s educational level meets or exceeds the educational level required by their specific occupation in the CBO (Classificação Brasileira de Ocupações) classification. Used to assess whether politically favored (copartisan) workers are less competent along this observable dimension.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Belief-based discrimination vs. taste-based discrimination&lt;/strong&gt;: Two distinct theoretical channels for employer political discrimination. Belief-based discrimination (Phelps, 1972; Arrow, 1973) occurs when employers perceive copartisans to be more productive — e.g., because shared political views reduce intra-firm conflict. Taste-based discrimination (Becker, 1971) occurs when employers have a direct utility-affecting preference for copartisan workers, independent of productivity beliefs. The paper treats these as observationally distinct from patronage and network overlap, and uses the negative correlation between political homogeneity and firm growth as suggestive evidence favoring the taste-based channel.&lt;/p&gt;</description></item><item><title>Present Bias Amplifies the Household Balance-Sheet Channels of Macroeconomic Policy</title><link>https://macropaperwarehouse.com/papers/present-bias-amplifies-the-household-balance-sheet-channels-of-macroeconomic-policy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/present-bias-amplifies-the-household-balance-sheet-channels-of-macroeconomic-policy/</guid><description>&lt;h2 id="layer-1--summary"&gt;Layer 1 — Summary&lt;/h2&gt;
&lt;p&gt;Maxted, Laibson, and Moll study fiscal and monetary policy in a partial-equilibrium heterogeneous-agent model in which homeowners have present-biased time preferences (Instantaneous Gratification preferences, the continuous-time limit of quasi-hyperbolic discounting) and naive beliefs, alongside a liquid savings account, an illiquid home, and access to credit card and mortgage debt. Because present bias substantially increases households&amp;rsquo; marginal propensity to consume — in the calibrated model the quarterly MPC rises from 4% under exponential discounting to 14% under present bias, and the quarterly marginal propensity for expenditure (MPX) rises from 13% to 30% — present bias powerfully increases the effect of fiscal stimulus. Present bias also amplifies the overall effect of expansionary monetary policy, but at the same time slows down the speed of monetary transmission: interest rate cuts incentivize households to conduct cash-out refinances, which become targeted liquidity injections to households near the liquidity constraint who have especially high MPCs, but present bias with naive beliefs also introduces a motive for households to procrastinate on refinancing their mortgage, which substantially slows the speed at which this channel operates. A noteworthy feature of the model is that present bias amplifies the direct effect of monetary policy on household consumption while simultaneously delivering larger MPCs — a combination that is in contrast to standard heterogeneous-agent models, where modeling choices that amplify MPCs typically deliver smaller consumption responses to interest rate changes. The calibrated present-biased economy also replicates several empirical regularities that are difficult to match with exponential discounting: high-cost credit card borrowing by homeowners, empirically plausible cash-out behavior and loan-to-value ratios, and refinancing inertia.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-q-what-is-the-core-modeling-innovation-and-why-is-it-needed"&gt;Q1. Q: What is the core modeling innovation and why is it needed?&lt;/h3&gt;
&lt;p&gt;A: The paper introduces naive Instantaneous Gratification (IG) preferences — the continuous-time limit of quasi-hyperbolic (beta-delta) discounting — into a two-asset heterogeneous-agent model with a liquid savings account and illiquid home equity accessible via mortgage refinancing. The naivete assumption (households do not foresee their own future present bias) is essential because it generates procrastination: naive households perpetually intend to refinance &amp;ldquo;soon&amp;rdquo; but keep delaying. A model with exponential discounting that merely sets parameters to match empirical MPCs would not generate procrastination behavior, and would require implausible interest rate calibrations (very low credit card rates or very high illiquid asset returns) to simultaneously match low liquid wealth accumulation and high credit card borrowing. Present bias with interest rates taken from the data resolves both issues.&lt;/p&gt;
&lt;h3 id="q2-q-what-are-the-key-quantitative-mpc-results-and-why-do-they-matter-for-fiscal-policy"&gt;Q2. Q: What are the key quantitative MPC results and why do they matter for fiscal policy?&lt;/h3&gt;
&lt;p&gt;A: In the exponential discounting benchmark, the quarterly MPC is 4% and the quarterly MPX (which includes nondurables and durables) is 13%. Under the present-bias benchmark, the MPC rises to 14% and the MPX rises to 30%. The empirical literature estimates quarterly nondurable spending responses on the order of 15%–25%, and total expenditure responses typically two to three times larger, so the present-biased model is substantially more consistent with the data. Because fiscal stimulus (modeled as an unexpected one-time lump-sum payment, financed by a flow income tax) operates through household spending propensities, the higher MPCs and MPXs under present bias directly and powerfully increase the aggregate consumption response to fiscal policy relative to the exponential benchmark.&lt;/p&gt;
&lt;h3 id="q3-q-how-does-present-bias-amplify-the-effect-of-monetary-policy"&gt;Q3. Q: How does present bias amplify the effect of monetary policy?&lt;/h3&gt;
&lt;p&gt;A: Interest rate cuts incentivize households to conduct cash-out refinances — they borrow against accumulated home equity, converting illiquid home equity into liquid wealth. Because this liquidity is targeted to households who are near their borrowing constraint (and thus have especially high MPCs), the aggregate consumption response to a given rate cut is amplified. Crucially, present bias amplifies this channel beyond the exponential benchmark precisely because higher MPCs mean each dollar of liquidity injected generates more consumption. This stands in contrast to the standard result in the heterogeneous-agent literature (Auclert 2019; Olivi 2017; Kaplan, Moll, and Violante 2018) that MPC-amplifying modeling choices reduce the consumption response to interest rate changes because MPC enters the substitution effect with a negative sign in standard one-asset models. The two-asset structure with home equity and the cash-out refinance channel breaks this trade-off.&lt;/p&gt;
&lt;h3 id="q4-q-how-does-present-bias-slow-the-speed-of-monetary-transmission"&gt;Q4. Q: How does present bias slow the speed of monetary transmission?&lt;/h3&gt;
&lt;p&gt;A: Present bias with naive beliefs introduces a motive for households to procrastinate on refinancing their mortgage. Refinancing is an immediate-cost, delayed-reward task: it requires the borrower to spend weeks gathering documents, filling out paperwork, and negotiating with lenders, with benefits (lower mortgage payments or extracted home equity) accruing afterward. Naive present-biased households discount current effort costs very heavily relative to future benefits, so they delay, all the while (counterfactually) believing they will complete the task in the near future. This procrastination substantially slows down the speed at which the cash-out refinance channel of monetary policy operates: even though a rate cut eventually incentivizes households to refinance and extract equity, the timing of that response is stretched out relative to what exponential discounters would do.&lt;/p&gt;
&lt;h3 id="q5-q-what-is-the-role-of-naive-beliefs-versus-sophisticated-partially-or-fully-aware-present-bias"&gt;Q5. Q: What is the role of naive beliefs versus sophisticated (partially or fully aware) present bias?&lt;/h3&gt;
&lt;p&gt;A: Naivete is necessary to generate procrastination from small effort costs. A fully sophisticated present-biased household (one who correctly anticipates its own future self-control problems) would not indefinitely defer a task it correctly anticipates will keep being deferred. The paper extends the analysis to partial and full sophistication in Online Appendix D.5. The key takeaway is that procrastination — and thus the speed-reduction effect on monetary transmission — is driven by at least partial naivete. The MPC-amplification and fiscal-policy amplification results are more robust across sophistication levels.&lt;/p&gt;
&lt;h3 id="q6-q-what-empirical-regularities-does-the-present-biased-calibration-match-that-the-exponential-model-cannot-easily-match"&gt;Q6. Q: What empirical regularities does the present-biased calibration match that the exponential model cannot easily match?&lt;/h3&gt;
&lt;p&gt;A: The present-biased economy replicates: (1) empirically plausible levels of high-cost credit card debt held simultaneously with home equity (a puzzle under exponential discounting); (2) cash-out behavior and loan-to-value ratios consistent with data; (3) a buildup of liquidity-constrained households consistent with empirical propensities to spend out of credit card limit increases (Gross and Souleles 2002; Agarwal et al. 2018); (4) consumption function discontinuities at the borrowing constraint consistent with Ganong and Noel (2019); (5) MPCs and MPXs that remain elevated for large shocks (Fagereng, Holm, and Natvik 2021); (6) the intertemporal MPC profile consistent with Auclert, Rognlie, and Straub (2018); (7) differential MPCs out of liquid versus illiquid transfers (Ganong and Noel 2020); and (8) refinancing inertia — the proclivity for households to delay refinancing when financially optimal (Keys, Pope, and Pope 2016; Johnson, Meier, and Toubia 2019; Andersen et al. 2020).&lt;/p&gt;
&lt;h3 id="q7-q-what-is-the-models-scope--what-does-it-abstract-from"&gt;Q7. Q: What is the model&amp;rsquo;s scope — what does it abstract from?&lt;/h3&gt;
&lt;p&gt;A: The model is set in partial equilibrium, so general equilibrium effects (e.g., endogenous interest rate responses, aggregate demand externalities) are not captured; the authors describe their results as inputs for a fuller general equilibrium analysis. The model focuses on homeowners (two-thirds of U.S. housing units), abstracting from renters. House prices are fixed (consistent with their slow movement over short horizons), with an extension to house price shocks in Online Appendix D.2.1. The model does not allow for home equity lines of credit, second mortgages, or reverse mortgages, because these products are more commonly used when interest rates are rising, and the paper focuses on the stimulative effect of rate cuts. The interest rate in the model is a long rate (e.g., 10-year TIPS), with the implicit assumption that the Federal Reserve implements the necessary short-rate adjustments.&lt;/p&gt;
&lt;h3 id="q8-q-how-does-the-present-biased-model-compare-to-the-standard-hank-picture-on-the-monetary-mpc-trade-off"&gt;Q8. Q: How does the present-biased model compare to the standard HANK picture on the monetary-MPC trade-off?&lt;/h3&gt;
&lt;p&gt;A: In standard one-asset heterogeneous-agent models, a household&amp;rsquo;s MPC is a sufficient statistic that enters the substitution effect of interest rate changes with a negative sign — so modeling choices that raise MPCs reduce monetary policy effectiveness. The present-biased two-asset model breaks this result: because interest rate cuts trigger cash-out refinances that inject liquidity targeted to high-MPC households near the constraint, higher MPCs translate into larger, not smaller, aggregate consumption responses to monetary policy. Present bias therefore simultaneously amplifies fiscal policy (via higher MPCs) and amplifies the overall effect of monetary policy (via the targeted liquidity channel), while introducing the procrastination-driven speed reduction as the offsetting cost.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Present bias (Instantaneous Gratification preferences):&lt;/strong&gt; The paper uses &amp;ldquo;present bias&amp;rdquo; to refer to quasi-hyperbolic discounting. In the continuous-time limit (Instantaneous Gratification, or IG, preferences, following Harris and Laibson 2013), the current self discounts all future selves by factor β &amp;lt; 1, while exponential discounting of the future (rate ρ) applies from any future vantage point. This creates a discontinuity in the discount function at t = 0 whenever β &amp;lt; 1. Setting β = 1 recovers standard exponential discounting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Naive beliefs:&lt;/strong&gt; Households do not foresee their own future present bias. The current self believes all future selves will be exponential discounters (β = 1), even though this belief is incorrect. Naivete is what transforms present bias into procrastination: the household perpetually expects its future self to complete effortful tasks, but each future self faces the same bias.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cash-out refinance channel:&lt;/strong&gt; When market interest rates fall, households have an incentive to refinance their fixed-rate mortgage, locking in a lower interest rate. If the household has accumulated home equity (illiquid), it can simultaneously borrow against that equity — a cash-out refinance — converting illiquid home equity into liquid wealth. In the model, this acts as a targeted liquidity injection to households near their borrowing constraint (who have high MPCs), amplifying the aggregate consumption response to rate cuts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Procrastination motive:&lt;/strong&gt; Present bias introduces a motive to procrastinate on immediate-cost, delayed-reward tasks such as mortgage refinancing. The effort and paperwork costs of refinancing are borne immediately, while the financial benefits accrue over time. A naive present-biased household heavily discounts the current effort cost relative to future benefits, leading it to defer refinancing repeatedly. This substantially slows the speed at which the cash-out refinance channel of monetary policy operates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Marginal propensity to consume (MPC) vs. marginal propensity for expenditure (MPX):&lt;/strong&gt; The paper distinguishes the quarterly MPC (response of nondurable consumption to a one-unit cash transfer) from the quarterly MPX (which also includes durables). Under exponential discounting, MPC = 4% and MPX = 13%; under the present-bias benchmark, MPC = 14% and MPX = 30%. The higher MPXs are more consistent with empirical estimates (quarterly nondurable responses of 15%–25%; total spending responses two to three times larger).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Refinancing inertia:&lt;/strong&gt; The empirical regularity that households delay mortgage refinancing even when it is financially optimal to do so. The paper provides a theoretical foundation for this behavior through the procrastination motive generated by naive present bias combined with the small effort cost of refinancing.&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&lt;em&gt;Summary based on LSE Research Online published version. AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;</description></item><item><title>Private Information and Price Regulation in the US Credit Card Market</title><link>https://macropaperwarehouse.com/papers/private-information-and-price-regulation-in-the-us-credit-card-market/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/private-information-and-price-regulation-in-the-us-credit-card-market/</guid><description>&lt;p&gt;The 2009 Credit Card Accountability Responsibility and Disclosure (CARD) Act barred credit card lenders from discretionarily raising borrowers&amp;rsquo; interest rates in response to new information. Using the near-universe of US credit card account data (covering roughly 90% of outstanding general-purpose balances across 17–19 large and midsize issuers) together with a large panel of consumer credit reports, the paper documents that the class of rate increases restricted by the Act affected over 50% of borrowing accounts annually before the Act; incidence dropped to nearly zero afterward, and the interquartile range of interest rates on newly-mature accounts compressed immediately by one-third. The paper then estimates a structural model of the credit card market featuring differentiated lenders competing à la Bertrand, consumers with dynamic discrete choice over lenders and borrowing status, private information types identified from equilibrium pricing, and flexible correlation between demand and risk. Imposing the Act&amp;rsquo;s restrictions in the estimated model reveals that consumer surplus rises at all credit scores — by roughly $600 for subprime and over $1,000 for prime and superprime consumers — despite partial market unraveling among the deepest subprime accounts. The net surplus gains are driven by two forces: (1) a fall in lender markups (pre-CARD-Act median-risk subprime markups exceeded 40 percent), and (2) the insurance value of rate lock-in for borrowers whose credit risk deteriorates over time. Counterfactual analysis shows that if pre-CARD-Act markets had been perfectly competitive (zero markups), the Act would have induced complete market unraveling and consumer surplus would have fallen by $100–$600 per consumer.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-did-the-card-act-restrict-and-how-large-was-the-pre-act-practice-it-curtailed"&gt;Q1. What did the CARD Act restrict, and how large was the pre-Act practice it curtailed?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Before the CARD Act, lenders raised borrowers&amp;rsquo; interest rates in response to new information about risk and demand on 48–54% of borrowing accounts at least once per year; the Act&amp;rsquo;s repricing restrictions drove this incidence to nearly zero and immediately compressed the interquartile range of interest rates across accounts within a credit-score tier from about 7.5 percentage points to about 5 percentage points — a one-third reduction in price dispersion.&lt;/strong&gt; The pricing of emergent risk (risk revealed after account opening) was nearly indistinguishable from the pricing of origination risk before the Act (both approximately 30 basis points per 10-point FICO difference); after the Act, emergent risk was priced at only about 7 basis points per 10 FICO points — less than one-third as much — while origination risk remained priced at 26 basis points.&lt;/p&gt;
&lt;h3 id="q2-how-are-private-information-types-identified-and-what-do-they-reveal-about-adverse-selection"&gt;Q2. How are private information types identified and what do they reveal about adverse selection?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Private information types ψ are identified from the equilibrium pricing of mature accounts: the paper exploits portfolio-wide repricing events (where a lender raises rates on all accounts in a segment simultaneously) as quasi-experimental price variation to estimate price sensitivities γ via 2SLS, then inverts the lender first-order conditions to recover residual default-risk types orthogonal to observed credit scores.&lt;/strong&gt; The Chiappori–Salanié bivariate probit test on simulated borrowing choices and default outcomes confirms adverse selection on new accounts: the estimated correlation of unobservables is 0.104, and within each credit-score group the correlation between private type and borrowing utility is at least 0.4 — meaning the highest-risk private types also have the strongest demand for credit.&lt;/p&gt;
&lt;h3 id="q3-what-does-the-structural-model-say-about-markups-and-their-source"&gt;Q3. What does the structural model say about markups and their source?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Estimated pre-CARD-Act markups exceed 40% for the median-risk subprime consumer; the high markups reflect a combination of switching costs (setup costs κ averaging $86.5–$89.7 per account for the lowest FICO groups), high borrowing utility δ among high-risk types (averaging 22.9 at FICO 580–599, falling to 3.3 at FICO 780–799), and lenders&amp;rsquo; exploitation of private information about which borrowers have low price sensitivity.&lt;/strong&gt; Setup costs are substantially larger than liquidity costs (costs of paying off existing balances), consistent with prior findings on adjustment costs in financial markets; new-account acquisition costs are increasing in credit score, consistent with lenders making larger offers to attract higher-quality borrowers.&lt;/p&gt;
&lt;h3 id="q4-what-happens-to-prices-under-the-card-act-restrictions-and-why-does-partial-unraveling-occur-only-for-deep-subprime"&gt;Q4. What happens to prices under the CARD Act restrictions, and why does partial unraveling occur only for deep subprime?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Among deep subprime (FICO 580–599), the Act induces near-complete pooling at roughly 50% APR annualized for all private types; the safest private types exit the market as they are pooled with riskier peers, whose higher costs push prices further up — a textbook Akerlof unraveling spiral — and over 30% of the privately safest subprime borrowers face prices that newly exceed their willingness to pay.&lt;/strong&gt; At higher credit scores (e.g., FICO 680–699), nearly all private types face lower prices because the mark-up compression dominates; at FICO 780+, all private types face lower or unchanged prices. Average traded prices fall at all credit score levels because: (1) consumers who exit were paying lower prices than those who stay, and (2) borrowers locked into favorable rates retain them as their types worsen.&lt;/p&gt;
&lt;h3 id="q5-what-are-the-consumer-surplus-gains-and-what-drives-them"&gt;Q5. What are the consumer surplus gains, and what drives them?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Consumer surplus rises at all credit scores — by approximately $600 for subprime consumers and by over $1,000 for prime and superprime consumers — with the gain coming from two sources of roughly equal magnitude: markup compression (a transfer from lender profits to consumer surplus) and the insurance value of rate lock-in for consumers whose default risk deteriorates.&lt;/strong&gt; The insurance channel is most important for superprime borrowers, who are most likely to lock in favorable pricing and to experience type migration over the life of a relationship; the direct pecuniary markup gains dominate for subprime borrowers. Even absent any insurance value, the surplus gains from markup compression alone (a few hundred dollars) are comparable to prior reduced-form estimates of the Act&amp;rsquo;s price effects.&lt;/p&gt;
&lt;h3 id="q6-why-were-high-pre-card-act-markups-necessary-for-the-surplus-gains"&gt;Q6. Why were high pre-CARD-Act markups necessary for the surplus gains?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In a counterfactual where pre-CARD-Act markets are perfectly competitive (marginal-cost pricing), imposing the CARD Act restrictions causes complete market unraveling: all prices exceed 150% APR, virtually no consumers borrow, and surplus per consumer falls by $100–$600 depending on credit score.&lt;/strong&gt; With only modestly higher price sensitivity (one bootstrapped standard error larger than the point estimate), unraveling occurs at all credit score tiers and total surplus falls throughout the market. The intuition is that with zero markups, lenders have no cushion to absorb adverse selection costs; restricting risk-based repricing immediately makes lending unprofitable for any pooled price, triggering exit cascades that the pre-CARD-Act markup buffer prevented.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-adverse-retention-finding-and-its-magnitude"&gt;Q7. What is the adverse retention finding and its magnitude?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;After the Act, lenders face adverse retention on mature accounts: for every 100 basis points by which emergent risk is priced below origination risk, the quarterly hazard of attrition from borrowing falls by 0.7 percentage points — meaning newly risky borrowers are less likely to leave while newly safe borrowers are more likely to leave.&lt;/strong&gt; This is precisely the dynamics consistent with Akerlof adverse selection: the Act&amp;rsquo;s restriction on emergent-risk pricing reduces the signal lenders can use to retain safe borrowers selectively, degrading the quality of each lender&amp;rsquo;s continuing portfolio.&lt;/p&gt;
&lt;h3 id="q8-what-private-information-rents-existed-before-the-act"&gt;Q8. What private information rents existed before the Act?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Before the CARD Act, lenders exploited private information about borrowers&amp;rsquo; demand characteristics: accounts that engaged in over-limit transactions or brief delinquencies (less than 30 days late) received median price increases of 6.9 and 15.5 percentage points annualized respectively, and 12-month revenue yields on these accounts rose by over 50% relative to baseline accounts, generating sustained higher returns despite higher charge-off risk.&lt;/strong&gt; The revenue yield increase is not transitory; it persists for at least 12 months, and the returns figures confirm these behaviors reveal not just higher costs but also lower price sensitivity — enabling lenders to extract information rents.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;private information type (ψ)&lt;/strong&gt; : in the model&amp;rsquo;s notation, a consumer&amp;rsquo;s residual risk and demand characteristic that is known to the consumer and revealed to the incumbent lender over time through account behavior, but not observed by competing lenders or at the time of account opening; identified from equilibrium pricing using the assumption of price-invariant default.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;emergent risk&lt;/strong&gt; : default risk that becomes observable to the lender after account origination, in contrast to origination risk visible at account opening; the CARD Act restricted repricing in response to emergent risk while leaving origination-risk pricing unrestricted.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;adverse retention&lt;/strong&gt; : the post-CARD-Act phenomenon in which borrowers who become higher-risk are less likely to attrite (because their pricing is not raised) while borrowers who become lower-risk are more likely to leave (because their favorable pricing is no longer reinforced); the paper estimates a 0.7 percentage point drop in quarterly attrition hazard per 100 basis points of under-pricing of emergent risk.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;insurance value of rate lock-in&lt;/strong&gt; : the welfare benefit to consumers from knowing that an adverse draw of their risk type will not trigger a price increase; quantified by comparing actual surplus gains with gains in a counterfactual where types are perfectly persistent (eliminating the demand for insurance); roughly equal in magnitude to the direct markup compression gains.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;market unraveling&lt;/strong&gt; : the Akerlof-style exit spiral in which pooling raises prices, inducing exit by low-risk types, which raises the average cost of the remaining pool, which raises prices further; in the paper&amp;rsquo;s estimates this is severe only among deep subprime (FICO below 620) and would have been universal had pre-CARD-Act markups been zero.&lt;/p&gt;</description></item><item><title>Production and Financial Networks in Interplay</title><link>https://macropaperwarehouse.com/papers/production-and-financial-networks-in-interplay/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/production-and-financial-networks-in-interplay/</guid><description>&lt;p&gt;This paper provides the first integrated empirical analysis of how bank credit supply shocks propagate through both the production network and the financial network simultaneously, using the universe of firm-to-firm VAT transactions and bank-firm credit register data for Spain during the 2008-09 global financial crisis. The theoretical framework, following Bigio and La&amp;rsquo;O (2016), links credit supply shocks to price distortions in the real economy and derives network-mediated propagation effects. The central empirical finding is that propagation through the production network triples the impact of direct bank credit shocks: a negative bank shock induces a 0.98 percentage point reduction in the directly affected firm&amp;rsquo;s purchases and sales growth, while first-order network effects add another 0.91 pp and higher-order network effects add 1.07 pp, for a combined indirect effect equal to twice the direct effect. Both upstream and downstream propagation are economically significant and of similar magnitude at the first-order level. Market concentration amplifies all propagation effects, and firms that are simultaneously central in both the production and financial networks generate disproportionately large aggregate contractions.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a published paper based on the CREI working paper full text, AI-assisted, pending human review. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Huremovic, Jimenez, Moral-Benito, Peydro, and Vega-Redondo study how financial shocks originating in the banking sector propagate through interlinked production and financial networks, exploiting Spain&amp;rsquo;s administrative registers covering essentially the complete production and credit networks of the Spanish economy during the 2008-09 crisis. The Spanish data are unique: approximately 4.3 million VAT firm-to-firm transactions (above a €3,005 threshold) covering 245,000 firms, matched with 1.68 million bank-firm loans from 206 active banks. Bank credit supply shocks are identified using the Khwaja-Mian (2008) / Amiti-Weinstein (2018) approach — isolating bank-level credit supply variation by conditioning on firm-time fixed effects across firms with multiple bank relationships — and cross-validated using banks&amp;rsquo; pre-crisis interbank market exposure. The paper&amp;rsquo;s main contribution is to show that treating production and financial networks separately understates the real effects of financial shocks by a factor of three: the combined direct and indirect (network-mediated) effects are three times the direct bank shock effect alone. First-order and higher-order downstream effects are both quantitatively significant, while upstream propagation is strong at first order but attenuates at higher orders.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-how-does-the-paper-identify-bank-credit-supply-shocks-and-what-makes-spains-administrative-data-unusual"&gt;Q1. How does the paper identify bank credit supply shocks, and what makes Spain&amp;rsquo;s administrative data unusual?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Bank credit supply shocks are identified using within-firm variation across bank relationships — the Khwaja-Mian/Amiti-Weinstein approach — which partials out all firm-level credit demand variation by including firm-time fixed effects, isolating the supply component of each bank&amp;rsquo;s credit change during the 2008-09 crisis.&lt;/strong&gt; Spain is particularly suited for this analysis for two reasons. First, it is a bank-dominated economy with minimal shadow banking, so bank credit is the primary external financing channel and the credit register is comprehensive (capturing all loans above €6,000). Second, around 75% of credit comes from firms with at least two banking relationships, enabling the within-firm identification. A complementary shock measure based on banks&amp;rsquo; pre-crisis reliance on interbank funding — a market sharply disrupted by the Lehman failure — yields similar results and does not require multi-bank relationships. Crucially, both shock measures show effects that are significant during the 2008-09 crisis but not in the pre-crisis year 2007, consistent with the shocks being crisis-specific supply disruptions rather than pre-existing trends.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-direct-effects-of-bank-credit-supply-shocks-on-firm-level-real-outcomes"&gt;Q2. What are the direct effects of bank credit supply shocks on firm-level real outcomes?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;At the link (firm-to-firm) level, a direct negative bank credit supply shock to a supplier reduces the purchasing firm&amp;rsquo;s growth in purchases from that supplier by 3.7 percentage points (29% of the median purchase growth), while a shock to a customer reduces the supplier&amp;rsquo;s sales growth to that customer by 5.1 percentage points (37% of median sales growth).&lt;/strong&gt; At the firm level, aggregating across all suppliers and customers, direct bank shocks reduce employment growth by 0.41 percentage points (41% of the median) and investment growth by 0.55 percentage points (9% of the median), consistent with the existing bank lending channel literature. Negative bank shocks also affect total credit availability at the firm level, including trade credit, indicating that the transmission operates through multiple channels and not only through the reduction in direct bank credit.&lt;/p&gt;
&lt;h3 id="q3-how-large-are-the-first-order-and-higher-order-network-propagation-effects-and-how-do-they-compare-to-direct-effects"&gt;Q3. How large are the first-order and higher-order network propagation effects, and how do they compare to direct effects?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The first-order indirect effects — propagation from direct customers and suppliers — are of comparable magnitude to the direct bank shock effects: a negative bank shock to all direct suppliers generates a 2.3 pp reduction in firm purchases, while a shock to all direct customers generates a 1.9 pp reduction in sales, both comparable to the 0.98 pp direct effect on purchases and sales combined.&lt;/strong&gt; Higher-order downstream effects (shocks to suppliers of suppliers) are also quantitatively important at approximately 2.0 pp, similar in magnitude to first-order downstream effects. In contrast, higher-order upstream propagation is weak — only first-order customer shocks matter for upstream transmission. This asymmetry is consistent with the theoretical model&amp;rsquo;s prediction that upstream propagation is non-linear in shock magnitude, attenuating more rapidly at higher orders than downstream propagation. In aggregate, the combined direct plus first-order plus higher-order effects triple the direct effect: the overall reduction in purchases and sales growth is approximately three times the direct bank shock effect alone.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-symmetric-finding-on-upstream-versus-downstream-propagation-and-why-does-it-matter"&gt;Q4. What is the symmetric finding on upstream versus downstream propagation, and why does it matter?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Upstream and downstream propagation at the first-order level are of similar magnitude — a negative bank shock induces a 3.7 pp contraction in purchases (downstream, from the shocked supplier to the buying firm) and a 5.1 pp contraction in sales (upstream, from the shocked customer to the selling firm) — challenging the prior literature&amp;rsquo;s assumption that production network propagation is predominantly downstream.&lt;/strong&gt; The comparable magnitudes of upstream and downstream propagation imply that financial shocks hitting customers matter for suppliers almost as much as financial shocks hitting suppliers matter for customers. The model provides the analytical basis for this result: downstream propagation is linear in shock magnitude (input supply contraction is passed through proportionally), while upstream propagation is non-linear (demand shortfalls at the customer do not fully translate into supply contraction from the supplier if the customer can be substituted). The near-symmetry at first order, however, means that both channels must be modeled for accurate aggregate impact assessment.&lt;/p&gt;
&lt;h3 id="q5-how-does-market-concentration-amplify-financial-shock-propagation"&gt;Q5. How does market concentration amplify financial shock propagation?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Firms operating in more concentrated markets — proxied by sectoral market concentration — experience stronger propagation both upstream and downstream; firm-to-firm propagation is also amplified when the two connected firms are mutual trading partners (both buyer and seller of each other), and for downstream propagation specifically when firms are geographically distant and share no common bank.&lt;/strong&gt; The market concentration amplification is consistent with the theory: concentrated markets have fewer substitution possibilities for inputs and outputs, so firms cannot easily re-route around a shocked partner, forcing the shock to transmit more fully along the existing network link. The amplification from mutual trading ties reflects that the combined demand-and-supply shock through a reciprocal link creates compound effects. The attenuation of downstream propagation when firms share a common bank is consistent with the bank internalizing the financial interdependence of borrowers connected in a supply chain.&lt;/p&gt;
&lt;h3 id="q6-what-is-the-contribution-of-combining-production-and-financial-network-analysis-jointly-beyond-studying-either-separately"&gt;Q6. What is the contribution of combining production and financial network analysis jointly, beyond studying either separately?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The joint analysis reveals that the real effects of financial shocks are massively understated when production and financial networks are studied in isolation: the overall impact triples the direct bank shock effect, a result that only emerges when both network structures are mapped and their interaction is quantified.&lt;/strong&gt; The paper also shows that aggregating to the firm level — rather than analyzing only link-level effects — is essential: firms minimize shocks from individual connections by adjusting across multiple suppliers or customers, so link-level estimates do not translate directly to firm-level outcomes. The joint network analysis further reveals a &amp;ldquo;dual centrality&amp;rdquo; amplification: firms that are central both in the production network (high customer-supplier centrality) and in the financial network (large credit relationships with strongly-shocked banks) generate disproportionately large aggregate output contractions. A standard deviation increase in a firm&amp;rsquo;s customer centrality is associated with a 3 pp decrease in its purchase growth, while the same increase in supplier centrality is associated with a 0.6 pp decrease in sales growth.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;upstream propagation&lt;/strong&gt; : the transmission of a bank credit supply shock from a directly shocked customer to that customer&amp;rsquo;s suppliers, operating through the demand channel — a customer facing tighter credit reduces its purchases, contracting the supplier&amp;rsquo;s sales; the paper shows first-order upstream effects (5.1 pp reduction in sales growth) are of similar magnitude to first-order downstream effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;downstream propagation&lt;/strong&gt; : the transmission of a bank credit supply shock from a directly shocked supplier to that supplier&amp;rsquo;s customers, operating through the supply channel — a supplier facing tighter credit reduces its output, contracting the availability of inputs to customers; both first-order (2.3 pp) and higher-order (2.0 pp) downstream effects are quantitatively large.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;dual centrality amplification&lt;/strong&gt; : the finding that firms simultaneously central in the production network (many supplier-customer relationships) and in the financial network (large credit from banks that receive large supply shocks) generate disproportionately large aggregate output contractions when hit by financial shocks, because the shock propagates through both network channels simultaneously.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Khwaja-Mian identification&lt;/strong&gt; : the strategy of isolating bank credit supply shocks by exploiting within-firm variation across banks — conditional on firm-time fixed effects, changes in credit from different banks to the same firm reflect supply rather than demand — originally proposed by Khwaja and Mian (2008) and extended by Amiti and Weinstein (2018).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;credit network shock&lt;/strong&gt; : a bank-level credit supply shock derived from the Khwaja-Mian/Amiti-Weinstein methodology, capturing the component of each bank&amp;rsquo;s credit contraction attributable to bank-level supply factors rather than firm-level demand; the paper uses both this measure and an interbank-market-exposure measure to cross-validate identification.&lt;/p&gt;</description></item><item><title>Professional survey forecasts and expectations in DSGE models</title><link>https://macropaperwarehouse.com/papers/professional-survey-forecasts-and-expectations-in-dsge-models/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/professional-survey-forecasts-and-expectations-in-dsge-models/</guid><description>&lt;p&gt;This paper asks whether Survey of Professional Forecasters (SPF) data can be efficiently integrated into medium-scale DSGE models, and whether models with imperfectly rational expectations based on Adaptive Learning (AL) outperform the standard Rational Expectations (RE) hypothesis when survey forecasts are used as observables. The authors work with quarterly US data spanning 1981q2–2019q2, using the Philadelphia Fed Real-Time Data Set (first and second releases) alongside SPF nowcasts for inflation, consumption, investment, and output growth. The SPF nowcast is defined as a prediction formed in the middle of period t+1 for period t+1 given information for period t, making it a suitable proxy for the model-based expectation E_t y_{t+1}.&lt;/p&gt;
&lt;p&gt;The core methodological contribution is a re-specification of structural shocks into persistent (AR) and transitory (i.i.d.) components. For the risk premium, investment-specific technology, government spending, and markup shocks, each shock is decomposed into two independent innovations, yielding 12 total structural innovations. A reduced-form VAR exercise motivates this: SPF nowcast innovations explain 19–33% of the 5-year forecast error variance of the macro variables and 44–71% of the variance of the nowcasts themselves. The 1-quarter RMSFE of the baseline RE model without SPF is 1.10 for inflation, 1.26 for consumption, 1.19 for investment, and 1.26 for GDP — all significantly exceeding the SPF RMSFEs of 0.21, 0.43, 1.49, and 0.35.&lt;/p&gt;
&lt;p&gt;Log marginal likelihood improves monotonically as shocks are progressively re-specified: baseline RE (–577.37), RE with two-component markups (RE_mu, –536.63), adding real shocks stepwise (–473.29, –410.84), and finally all shocks (RE_all, –385.07). RE_all matches or beats SPF 1-quarter forecast accuracy (RMSFE ratio to SPF of 1.00 for inflation and investment; beats SPF for consumption growth), and Diebold-Mariano tests show no significant difference from SPF up to 5 quarters ahead. The paper further shows that once this two-component structure is imposed, exogenous sentiment shocks become unnecessary: RE_all (–385.07) outperforms RES_all (–388.17), and the RE model with all real shocks re-specified but without sentiment decisively dominates.&lt;/p&gt;
&lt;p&gt;Three AL belief specifications are then estimated: MSVflex (full RE information set with an independently and rapidly updating constant, posterior autocorrelation 0.9937 — nearly a random walk), RBflex (restricted information set augmented with shock innovations, with meaningful time-variation of belief coefficients at rho_AL = 0.87), and HBflex (agents switch between MSV and RB based on past forecasting performance; average RB weight 0.34, weight sensitivity delta = 4.77). All AL models outperform RE_all: MSVflex (–381.38), HBflex (–355.09), RBflex (–351.59), with RB and HB yielding the largest gains particularly during and after the Great Financial Crisis.&lt;/p&gt;
&lt;p&gt;AL models address three specific RE limitations. First, trend breaks: the ALM constant tracks persistent deviations, with ALM constants for consumption and investment successfully picking up rising macroeconomic trends in earlier sub-periods, yielding superior long-term forecasts. Second, time-varying transmission: the RB model generates cyclical volatility that stays lower in normal times and rises during distress, reducing reliance on large persistent investment-technology shocks relative to RE. Third, predictability of forecast errors: the RE model&amp;rsquo;s investment forecast inherits the SPF underreaction (b-coefficient 0.72, p &amp;lt; 0.001), while RBflex and HBflex reduce this to 0.17 and 0.34 respectively, both statistically insignificant.&lt;/p&gt;
&lt;p&gt;On an extended sample including the Covid recession, the RBflex model underperforms because its restricted information set cannot handle abrupt complex dynamics; MSVflex and HBflex continue to perform well, with the MSV regime dominating in the HB model during Covid and post-Covid periods. Scope conditions: the dataset is US, 1981q2–2019q2 for baseline estimation; the predictability (underreaction) problem is confirmed only for investment SPF, not for inflation, consumption, or GDP growth in this sample.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-spf-nowcast-and-why-do-the-authors-treat-it-as-a-proxy-for-model-based-expectations"&gt;Q1. What is the SPF nowcast, and why do the authors treat it as a proxy for model-based expectations?&lt;/h3&gt;
&lt;p&gt;The SPF nowcast is defined as a prediction formed in the middle of quarter t+1 for the value of a variable in quarter t+1, conditional on information available through quarter t. Because agents are assumed to make decisions for period t and form expectations for t+1 based on information through t, this timing aligns precisely with the model-based conditional expectation E_t y_{t+1}. The authors use first-release data (r1) and the SPF nowcast (f0) both published in the course of t+1 as measurement variables, with the Kalman filter recovering implied structural shocks.&lt;/p&gt;
&lt;h3 id="q2-how-large-is-the-informational-content-of-spf-nowcasts-in-reduced-form-analysis"&gt;Q2. How large is the informational content of SPF nowcasts in reduced-form analysis?&lt;/h3&gt;
&lt;p&gt;A 7-variable Cholesky VAR places each SPF series last, so the survey innovation is orthogonal to standard macro variables by construction. The 5-year forecast error variance decompositions show SPF nowcast shocks explain 19% of inflation variance, 33% of consumption variance, 33% of investment variance, and 29% of GDP variance (Table 1). The nowcasts themselves are explained 44–71% by their own innovations. SPF nowcasts also substantially outperform the baseline RE model: the RE model without SPF produces RMSFE ratios of 1.10 for inflation, 1.26 for consumption, 1.19 for investment, and 1.26 for GDP relative to SPF (all statistically significant by Diebold-Mariano test).&lt;/p&gt;
&lt;h3 id="q3-what-is-the-shock-re-specification-and-why-is-it-necessary-to-exploit-survey-data"&gt;Q3. What is the shock re-specification, and why is it necessary to exploit survey data?&lt;/h3&gt;
&lt;p&gt;The Smets-Wouters (2007) ARMA(1,1) shock structure conflates the transitory and persistent innovation into a single disturbance, making it impossible for the Kalman filter to separately attribute high-frequency and low-frequency movements. The re-specification splits each shock b_t into a persistent component b_t^ar (driven by epsilon^bar with persistence rho_b) and an i.i.d. transitory component b_t^iid (driven by epsilon^biid), yielding 12 total structural innovations. This allows survey nowcasts — which are forward-looking — to identify the persistent component separately from the transitory one. Without this, marginal likelihood improvements are far smaller (RE: –577 vs. RE_all: –385).&lt;/p&gt;
&lt;h3 id="q4-does-re-specification-of-real-shocks-render-exogenous-sentiment-shocks-redundant"&gt;Q4. Does re-specification of real shocks render exogenous sentiment shocks redundant?&lt;/h3&gt;
&lt;p&gt;Yes. Models with standard real shock processes but exogenous sentiment shocks (RES: –477.88; RES_mu: –488.96) do fit substantially better than models without sentiment (RE: –577.37; RE_mu: –536.63), confirming Milani&amp;rsquo;s (2017) result. However, once the two-component real shock structure is introduced, RE_all (–385.07) outperforms RES_all (–388.17) and the estimated sentiment shocks become small and explain little of the business cycle. The fundamental shock re-specification subsumes what sentiment shocks were previously capturing.&lt;/p&gt;
&lt;h3 id="q5-how-do-al-models-compare-to-re-in-terms-of-model-fit"&gt;Q5. How do AL models compare to RE in terms of model fit?&lt;/h3&gt;
&lt;p&gt;All three AL models outperform RE_all: MSVflex (–381.38, improvement of 3.69 log-likelihood units), HBflex (–355.09, improvement of 29.98 units), RBflex (–351.59, improvement of 33.48 units). The RB and HB specifications, which assume more severe deviation from RE with restricted information sets and time-varying transmission, achieve the largest gains. The MSV improvement accumulates gradually, concentrating in the late 1990s and 2000s, while RB shows sustained improvement in the 1980s and mid-1990s and performs exceptionally well during and after the GFC.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-al-mechanism-handle-macroeconomic-trend-shifts"&gt;Q6. How does the AL mechanism handle macroeconomic trend shifts?&lt;/h3&gt;
&lt;p&gt;Under RE with fixed coefficients, expectations anchor around a constant steady state, so persistent deviations from trend generate systematic forecast errors. Under AL, the ALM constant mu_t in the Actual Law of Motion evolves over the business cycle. In the MSVflex model, the autocorrelation parameter for the constant is estimated at 0.9937 (posterior mean), making it nearly a random walk that can track long-lasting trends. ALM constants for consumption and investment in the MSV setup successfully pick up rising macroeconomic trends in earlier sub-periods, translating into superior longer-term forecast performance relative to RE.&lt;/p&gt;
&lt;h3 id="q7-how-does-the-rb-model-generate-time-varying-volatility-and-why-does-this-matter-for-investment-dynamics"&gt;Q7. How does the RB model generate time-varying volatility, and why does this matter for investment dynamics?&lt;/h3&gt;
&lt;p&gt;In RBflex, as beliefs are revised via the Kalman filter, the sensitivity of expectations and realized variables to shocks changes over the business cycle. The model generates cyclical volatility that remains lower in normal times and rises during distress — a realistic pattern absent from RE models. Consequently, RB does not need to rely as heavily on large persistent risk premium and investment-specific technology shocks: average volatility of these processes in the RB model does not increase in the last sub-period and remains generally lower across the whole sample, in contrast to RE&amp;rsquo;s behavior during the GFC. The RB model also shows a 3-times-smaller estimated measurement error in the investment SPF equation relative to the AL specification without restricted beliefs.&lt;/p&gt;
&lt;h3 id="q8-what-happens-to-predictability-of-model-based-forecast-errors-under-al-versus-re"&gt;Q8. What happens to predictability of model-based forecast errors under AL versus RE?&lt;/h3&gt;
&lt;p&gt;Using the Coibion-Gorodnichenko (2015) regression of forecast errors on forecast revisions, the RE model&amp;rsquo;s investment forecast shows a b-coefficient of 0.72 (p &amp;lt; 0.001), inheriting the underreaction documented in SPF investment data (b = 0.49, p = 0.006). AL models break this inheritance: RBflex ALM b-coefficient for investment is 0.17 (not statistically significant) and HBflex is 0.34 (not statistically significant). AL models achieve this because they relax the RE constraint of internal consistency between agents&amp;rsquo; and model forecasts, allowing the ALM to generate efficient forecasts even when agent PLMs display sluggish adjustment.&lt;/p&gt;
&lt;h3 id="q9-how-do-the-models-perform-during-the-covid-recession"&gt;Q9. How do the models perform during the Covid recession?&lt;/h3&gt;
&lt;p&gt;The RBflex model does not perform optimally on the extended sample including the Covid recession. The authors attribute this to the restricted information set in the RB PLM being insufficient to describe the abrupt, complex macroeconomic dynamics of the Covid crisis. The MSVflex and HBflex models continue to perform well. In the HBflex model, the MSV regime naturally dominates during the Covid and post-Covid periods, while the RB regime had been more prominent between recessions in the pre-Covid sample.&lt;/p&gt;
&lt;h3 id="q10-what-is-the-role-of-heterogeneous-beliefs-and-how-do-agents-switch-between-plms"&gt;Q10. What is the role of heterogeneous beliefs, and how do agents switch between PLMs?&lt;/h3&gt;
&lt;p&gt;In HBflex, expectations are a weighted average of MSV and RB predictions with weights evolving as a function of past belief forecast errors. The weight sensitivity parameter is estimated at delta = 4.77, indicating weights are relatively sensitive to fitness. The average estimated weight on the RB PLM is 0.34 (MSV receives 0.66 on average). The RB weight tends to increase and reach its highest values between recessions, consistent with the restricted model being more parsimonious and useful in stable periods, while the fuller MSV model dominates in high-volatility episodes such as the Covid recession.&lt;/p&gt;
&lt;h3 id="q11-what-are-the-out-of-sample-forecasting-results"&gt;Q11. What are the out-of-sample forecasting results?&lt;/h3&gt;
&lt;p&gt;The out-of-sample evaluation covers 2008q1–2019q2. The RB model outperforms the RE model in predicting investment and interest rate dynamics, and for investment it also outperforms professional forecasters during this period. At longer horizons (up to 5 quarters ahead), RE model forecasts are generally not statistically significantly different from SPF predictions once SPF nowcasts are included as observables, suggesting that observing the SPF data is sufficient to capture the most informative content from surveys for longer-horizon predictions.&lt;/p&gt;
&lt;h3 id="q12-what-is-the-relationship-to-milani-2017-and-the-prior-literature-on-sentiment-shocks"&gt;Q12. What is the relationship to Milani (2017) and the prior literature on sentiment shocks?&lt;/h3&gt;
&lt;p&gt;Milani (2017) found that exogenous sentiment shocks orthogonal to fundamentals were needed to fit SPF forecasts alongside an AL model and explained a significant portion of US business cycle fluctuations. The current paper shows this result is not robust to re-specifying fundamental shocks into persistent and transitory components: once the two-component structure is introduced, sentiment shocks become small and economically unimportant (RES_all at –388.17 versus RE_all at –385.07). What Milani attributed to sentiment was largely capturing the inability of single-innovation shocks to separately account for high-frequency and low-frequency variance.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;SPF Nowcast as proxy for model expectations: The Survey of Professional Forecasters&amp;rsquo; nowcast is defined as a prediction formed in the middle of quarter t+1 for the value of a variable in that same quarter, conditional on information available through quarter t. This timing makes it directly comparable to the model-based conditional expectation E_t y_{t+1}, so the SPF nowcast can be added to the DSGE model&amp;rsquo;s observable set with a straightforward measurement equation linking it to model expectations plus i.i.d. measurement error.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Shock re-specification into persistent and transitory components: Each structural shock (risk premium, investment-specific technology, government spending, and markup shocks) is decomposed into an AR(1) persistent component driven by epsilon^bar and an i.i.d. transitory component driven by epsilon^biid, replacing the ARMA(1,1) specification in Smets-Wouters (2007) that conflates both into a single innovation. This decomposition is the key technical device enabling survey data to separately identify low-frequency and high-frequency sources of volatility.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Adaptive Learning (AL): An expectation-formation mechanism in which agents do not know true model parameters and instead estimate linear forecasting models (PLMs) that are updated each period via a Kalman filter algorithm. This produces a time-varying Actual Law of Motion — transmission parameters mu_t, T_t, R_t all evolve with beliefs — enabling endogenous trend drift and time-varying shock responses absent from RE models with fixed coefficients.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Minimum State Variable (MSV) beliefs with flexible constant: An AL specification in which agents use the same endogenous state variables and shocks as in the RE solution but with the constant term updated at an independent, more rapid rate. The constant&amp;rsquo;s autocorrelation is estimated at 0.9937, making it nearly a random walk capable of tracking persistent macroeconomic trend deviations from the deterministic steady state.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Restricted Beliefs (RB): An AL specification in which each agent&amp;rsquo;s PLM uses a reduced information set — autoregressive terms of the forward-looking variable augmented with selected shock innovations — rather than the full RE state space. This more severe departure from RE yields the largest marginal-likelihood gain over RE_all, generates realistic cyclical volatility amplification, and produces a 3-times-smaller measurement error for investment SPF, but underperforms during the Covid recession due to the restricted set&amp;rsquo;s inability to handle abrupt complex dynamics.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Heterogeneous Beliefs (HB): An AL specification in which agents may switch between MSV and RB PLMs as a weighted average, with weights evolving as a function of past belief forecast errors. The average weight on RB is 0.34 and the weight sensitivity delta is estimated at 4.77; the RB weight tends to be highest between recessions and lowest during high-volatility episodes such as the Covid recession when the fuller MSV information set dominates.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;FIRE predictability test (Coibion-Gorodnichenko regression): Under Full Information Rational Expectations, the regression of forecast errors on forecast revisions should yield a b-coefficient of zero. A positive and significant b indicates systematic underreaction to news. The paper confirms b = 0.49 (p = 0.006) for investment SPF — but not for inflation, consumption, or GDP — and shows the RE model inherits this inefficiency (b = 0.72, p &amp;lt; 0.001 for investment), while AL models reduce it to insignificance (RBflex: 0.17; HBflex: 0.34).&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;</description></item><item><title>Quality Adjustment at Scale: Hedonic versus Exact Demand-Based Price Indices</title><link>https://macropaperwarehouse.com/papers/quality-adjustment-at-scale-hedonic-versus-exact-demand-based-price-indices/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/quality-adjustment-at-scale-hedonic-versus-exact-demand-based-price-indices/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper implements and evaluates methods for constructing quality-adjusted price indices from item-level retail scanner data at scale — across hundreds of product categories, heterogeneously encoded product attributes, and rapid product turnover. Using proprietary NPD Group data covering five general merchandise categories (2014–2018) and NielsenIQ scanner data covering 50+ food product groups (2006–2015), the paper compares hedonic superlative indices (using the Erickson-Pakes EP-TV methodology and a novel machine-learning extension) against exact demand-based indices (the Feenstra 1994 lambda-ratio adjustment and the Redding-Weinstein 2020 CES Unified Price Index, CUPI). The central finding is that quality adjustment is quantitatively large: the hedonic Tornqvist index shows roughly 2.5–2.9 percentage points per year faster price decline than the matched-model Tornqvist in high-tech categories (headphones, memory cards) and 4–5 percentage points of cumulative additional disinflation relative to matched-model indices for food. The Feenstra index agrees closely in magnitude with the hedonic approach, but the CUPI is highly sensitive to the choice of common-goods rule (CGR) and in some specifications shows 20–40 percentage points more cumulative disinflation than the Feenstra, a gap the paper attributes to the CUPI&amp;rsquo;s extreme sensitivity to low-share goods. The paper establishes that hedonic superlative price indices are feasible to implement at scale, including with machine learning on sparse text descriptions, and recommends them as the practical benchmark for re-engineering official price statistics.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a published paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-problem-in-price-index-construction-does-the-paper-address"&gt;Q1. What problem in price index construction does the paper address?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper addresses the long-standing challenge of simultaneously accounting for consumer substitution and quality change due to product entry and exit in official price statistics, and shows these two corrections are now feasible to implement in real time from item-level retail transactions data.&lt;/strong&gt; Standard official statistics (CPI, PCE) use an arithmetic Laspeyres index that holds spending weights fixed — the Boskin Commission documented this overstates the true cost of living due to substitution bias. Scanner data permit superlative indices (Tornqvist, Fisher) that correct for substitution at the item level, but further correcting for quality change requires dealing with the millions of products that enter and exit the market each quarter. The paper&amp;rsquo;s contribution is to show that both corrections can be combined at scale.&lt;/p&gt;
&lt;h3 id="q2-what-data-infrastructure-does-the-paper-use-and-what-does-it-reveal-about-product-turnover"&gt;Q2. What data infrastructure does the paper use, and what does it reveal about product turnover?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The NPD Group data cover five product groups with quarterly turnover rates of 4.5–13.5 percent per quarter (both entry and exit), and exhibit a characteristic life-cycle pattern: prices peak at entry and decline steadily thereafter while market shares follow a hump shape — rising as products gain distribution and then declining as newer products displace them.&lt;/strong&gt; Memory cards, for example, show approximately a 50 percent price decline over their life cycle and a 200 percent increase in market share in the first year after entry. These interrelated price-quantity dynamics mean that any matched-model index that ignores entering and exiting goods misses substantial quality improvement. The NielsenIQ data cover over 2.6 million UPCs across 40,000 stores, with nominal food sales closely tracking BEA PCE food expenditures, validating its representativeness.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-ep-tv-hedonic-method-and-why-does-the-paper-prefer-it"&gt;Q3. What is the EP-TV hedonic method and why does the paper prefer it?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The Erickson-Pakes time-varying unobservables (EP-TV) method estimates hedonic price indices from item-level transactions data using a two-step procedure: first predict log price levels from observable characteristics to recover item-level residuals, then predict log price changes from characteristics plus the lagged residual, which allows the model to track changing valuations of unobservable attributes over time.&lt;/strong&gt; This approach outperforms the simpler log-level hedonic and the EP-F (fixed unobservables) approach in model fit for price changes: EP-TV achieves R² of 0.13–0.50 versus 0.05–0.24 for log-level models across product groups. The paper extends EP-TV to the NielsenIQ data using deep neural networks to decode sparse abbreviated product descriptions (e.g., &amp;ldquo;ZR DT LN/LM CF NBP CT&amp;rdquo; for a diet soft drink), achieving out-of-sample R² of roughly 75% for price-level predictions and above 50% in-sample for price changes.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-main-quantitative-findings-for-the-hedonic-indices"&gt;Q4. What are the main quantitative findings for the hedonic indices?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The EP-TV hedonic Tornqvist index indicates price declines approximately 2.9 percentage points per year faster than the matched-model Tornqvist for memory cards, 2.5 pp/year for headphones, 1.3 pp/year for boys&amp;rsquo; jeans, 0.7 pp/year for coffee makers, and 0.4 pp/year for occupational footwear; for NielsenIQ food categories, the hedonic Tornqvist is approximately 4 percentage points lower cumulatively over 2006–2015 than the matched-model Tornqvist.&lt;/strong&gt; These gaps represent the quality improvement embedded in product turnover — the fact that new memory cards at a given price embody more storage than their predecessors, new headphones better sound quality, etc. The paper also shows that the hedonic Laspeyres, which only adjusts for exiting goods (following the standard Pakes 2003 bounding result), is substantially lower than the matched-model Laspeyres, confirming that the selection bias from ignoring exiting goods in official statistics is quantitatively important.&lt;/p&gt;
&lt;h3 id="q5-how-do-the-demand-based-exact-price-indices-compare-with-the-hedonic-approach"&gt;Q5. How do the demand-based exact price indices compare with the hedonic approach?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The Feenstra (1994) lambda-ratio index, which adjusts the Sato-Vartia index for product entry and exit via the ratio of entering to exiting expenditure shares scaled by (1/(σ−1)), shows cumulative disinflation approximately 2 percentage points beyond the matched-model Sato-Vartia across all five NPD product groups and approximately 5 percentage points for NielsenIQ food, broadly comparable in magnitude to the hedonic adjustment.&lt;/strong&gt; The estimated substitution elasticities (σ) range from about 5.2 to 7.8 across NPD product groups and have a median of about 6 across food product groups, consistent with the literature. The Feenstra-hedonic agreement is a reassuring finding: two methodologically distinct approaches based on different identifying assumptions yield similar magnitudes of quality correction.&lt;/p&gt;
&lt;h3 id="q6-what-is-the-cupi-and-why-is-it-problematic"&gt;Q6. What is the CUPI and why is it problematic?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;The Redding-Weinstein (2020) CES Unified Price Index (CUPI) generalizes the Feenstra index by adding a taste-shock correction (S&lt;/em&gt; ratio) and a Jevons index (P&lt;/em&gt; ratio), both of which are unweighted geometric means across common goods — making the CUPI extremely sensitive to products with tiny expenditure shares, because any product with a low share is inferred to have low appeal, which the model translates into a large quality-adjustment downward.** Without a common-goods rule (CGR), the CUPI shows 30–40 percent per year price declines for high-tech goods and boys&amp;rsquo; jeans, 10–30 percentage points below the Feenstra; with a 25th-percentile market-share CGR, the CUPI is still more than 40 percentage points below the Feenstra for food in 2015 on a cumulative basis. The key concern is that very low expenditure shares for entering or exiting products can reflect search frictions, limited distribution, or clearance-rack effects rather than genuinely low consumer appeal, so the CUPI&amp;rsquo;s unweighted components may conflate these factors with quality change. The paper concludes that more research on the appropriate CGR is needed before the CUPI can be recommended for official statistics.&lt;/p&gt;
&lt;h3 id="q7-is-the-hedonic-approach-robust-to-missing-or-omitted-product-attributes"&gt;Q7. Is the hedonic approach robust to missing or omitted product attributes?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Yes — omitting key observable characteristics (memory size for memory cards, major brand dummies for apparel and footwear) from the EP-TV estimation has only minimal effect on the resulting hedonic price index, in contrast to log-level hedonic models where such omissions produce much larger distortions.&lt;/strong&gt; For memory cards, the baseline EP-TV Tornqvist index produces an average annual cumulative chained price change of −20.12%; excluding entering products whose size or speed is outside the range of continuing products changes this to −20.09%, even though roughly 50% of entering products (accounting for 25% of entering-product sales) are excluded. This robustness reflects the EP-TV design: the first-stage residual absorbs time-varying unobservable characteristics that would otherwise confound the hedonic mapping.&lt;/p&gt;
&lt;h3 id="q8-what-are-the-implications-for-official-statistics"&gt;Q8. What are the implications for official statistics?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper argues that adopting hedonic superlative price indices from real-time scanner data would produce official CPI and PCE price measures that simultaneously correct for substitution bias and quality change, resulting in meaningfully lower measured inflation in categories with high product turnover — likely understating quality-adjusted price declines in current official statistics by several percentage points per year in high-tech consumer goods and by roughly half a percentage point per year in food.&lt;/strong&gt; The practical case for adoption is that the EP-TV approach can be implemented across heterogeneously encoded data (both structured NPD attributes and unstructured NielsenIQ text), is feasible in real time with transaction data, is relatively insensitive to chain drift (full-imputation hedonic indices avoid the transitory price volatility that contaminates matched-model chained indices), and satisfies bounding properties under general conditions established by Pakes (2003). The paper thus operationalizes long-standing recommendations of the Boskin Commission (1998) that called for exactly these corrections.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;EP-TV hedonic index&lt;/strong&gt; : the Erickson-Pakes time-varying unobservables hedonic price index, which imputes quality-adjusted price changes for entering and exiting products using a two-step regression that includes lagged residuals from a log-level hedonic to control for products whose unobservable attributes change their market valuations over time; the paper&amp;rsquo;s preferred hedonic approach.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Feenstra (1994) lambda-ratio adjustment&lt;/strong&gt; : a correction to the Sato-Vartia CES price index that accounts for product entry and exit by multiplying by (λ_{t,t-1}/λ_{t-1,t})^{1/(σ-1)}, where the lambda terms are the expenditure shares of continuing goods relative to all goods in each period; larger entry shares relative to exit shares produce a downward adjustment reflecting quality improvement from new products.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;CES Unified Price Index (CUPI)&lt;/strong&gt; : the Redding-Weinstein (2020) extension of the Feenstra index that additionally incorporates time-varying product appeal shocks via an unweighted Jevons index (P* ratio) and an unweighted expenditure-share ratio (S* ratio); the paper finds this index is highly sensitive to the common-goods rule and may overstate quality adjustment in practice due to sensitivity to low-share goods.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;common-goods rule (CGR)&lt;/strong&gt; : a threshold rule that restricts the set of goods entering the CUPI&amp;rsquo;s unweighted components to those with sufficiently large or long-duration market shares, introduced by Redding and Weinstein (2020) to limit the influence of fringe products; the paper finds CUPI results are sensitive to the CGR specification in a way that varies across product groups.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;full-imputation hedonic index&lt;/strong&gt; : a hedonic price index that uses the hedonic mapping to impute price changes for all goods (including continuing goods), rather than only entering and exiting goods; reduces chain drift relative to partial-imputation approaches because imputed prices are less volatile than observed prices.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;product turnover&lt;/strong&gt; : the quarterly entry and exit of products from the market, ranging from 4.5% to 13.5% per quarter in NPD data; the primary mechanism through which quality change is embedded in item-level scanner data and the main source of mismeasurement in matched-model price indices.&lt;/p&gt;</description></item><item><title>Quantifying the allocative efficiency of capital: The role of capital utilization</title><link>https://macropaperwarehouse.com/papers/quantifying-the-allocative-efficiency-of-capital-the-role-of-capital-utilization/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/quantifying-the-allocative-efficiency-of-capital-the-role-of-capital-utilization/</guid><description>&lt;p&gt;Standard measures of capital allocative efficiency—based on the dispersion of the average revenue product of capital (ARPK)—are severely biased when capital utilization is endogenous. When utilization is flexible, firms can bypass physical adjustment constraints by varying intensity, so that the correct efficiency measure requires the dispersion of average revenue product of capital services (ARPKS), defined as the log difference between revenue and utilized capital, not of ARPK. Contrary to the standard view that higher ARPK dispersion signals lower allocative efficiency, the paper demonstrates that when efficiency improvements arise from greater utilization flexibility, ARPK dispersion can increase alongside efficiency gains. An application to India&amp;rsquo;s capital market liberalization reform shows that the standard approach (ignoring utilization) predicts allocative efficiency gains of 5.25% (statistically significant), while the corrected approach accounting for utilization finds gains of only 0.04% (not statistically significant).&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on a working paper version, AI-assisted and human-reviewed. See the linked published article for the authoritative version.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-wrong-with-arpk-dispersion-as-a-measure-of-allocative-efficiency"&gt;Q1. What is wrong with ARPK dispersion as a measure of allocative efficiency?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;ARPK dispersion conflates two conceptually distinct quantities—the dispersion of physical factor allocation and the dispersion of utilization intensity—so it is not a monotone proxy for allocative efficiency when utilization is endogenous.&lt;/strong&gt; In the Hsieh-Klenow (2009) framework, higher ARPK dispersion is interpreted as higher misallocation. But ARPK is simply revenue over capital inputs, so when firms with too little (too much) capital relative to their productivity simply utilize their capital more (less) intensely, the variance of ARPK rises even as the efficiency of factor services allocation improves. The standard interpretation therefore has the causality backwards in economies with flexible utilization.&lt;/p&gt;
&lt;h3 id="q2-what-is-arpks-and-why-does-it-correctly-measure-capital-allocative-efficiency"&gt;Q2. What is ARPKS and why does it correctly measure capital allocative efficiency?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;ARPKS—the average revenue product of capital services, defined as the log difference between revenue and utilized capital—is the theoretically correct sufficient statistic for capital allocative efficiency when utilization is endogenous, because it measures the dispersion in the productivity of factor services rather than the dispersion of physical factor inputs.&lt;/strong&gt; The paper embeds endogenous utilization into a neoclassical investment model and shows formally that the variance of log ARPKS is zero in the efficient equilibrium, while the variance of log ARPK is not. ARPK dispersion is a combination of ARPKS dispersion and utilization variation, and the latter is not a sign of misallocation.&lt;/p&gt;
&lt;h3 id="q3-can-higher-arpk-dispersion-accompany-higher-allocative-efficiency"&gt;Q3. Can higher ARPK dispersion accompany higher allocative efficiency?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Yes: when allocative efficiency improvements arise from greater flexibility in capital utilization, ARPK dispersion increases even though allocative efficiency improves.&lt;/strong&gt; Greater utilization flexibility generates more variation in how intensively different firms use their capital, raising the ratio of revenue to physical capital input for high-utilization firms. A researcher using ARPK dispersion would therefore mistakenly conclude that allocative efficiency fell when it actually rose. The paper provides counterfactual simulations illustrating this phenomenon.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-empirical-application-and-what-do-the-results-show"&gt;Q4. What is the empirical application and what do the results show?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;An application to Bau and Matray (2023)&amp;rsquo;s Indian capital market liberalization reform finds that: the standard approach (ignoring utilization) predicts allocative efficiency gains of 5.25% (statistically significant); accounting for utilization, the corrected estimate is only 0.04% (not statistically significant).&lt;/strong&gt; The analysis uses firm-level panel data for India including capacity utilization rates and capital maintenance expenses. Maintenance expenses serve as a proxy for capital utilization rates in the model, bridging the gap between observed capacity and the theoretically relevant utilization measure.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;average revenue product of capital (ARPK)&lt;/strong&gt; : the ratio of revenue to physical capital input; commonly used as a proxy for firm-level distortions in the Hsieh-Klenow (2009) framework, but shown to be a biased measure of allocative efficiency when capital utilization is endogenous.
&lt;strong&gt;average revenue product of capital services (ARPKS)&lt;/strong&gt; : the ratio of revenue to utilized capital; the theoretically correct measure of capital allocative efficiency when utilization is endogenous; its dispersion is zero in the efficient equilibrium.
&lt;strong&gt;capital utilization&lt;/strong&gt; : the intensity with which a firm deploys its physical capital stock; endogenous in the model, allowing firms to partially bypass adjustment constraints; the omission of utilization is the source of the bias in standard ARPK-based efficiency estimates.&lt;/p&gt;</description></item><item><title>Racial disparities in crime and wealth</title><link>https://macropaperwarehouse.com/papers/racial-disparities-in-crime-and-wealth/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/racial-disparities-in-crime-and-wealth/</guid><description>&lt;p&gt;This paper asks whether racial differences in labor income can simultaneously explain both the crime gap and the wealth gap between Black and White individuals in the United States. The authors build a large-scale overlapping generations (OLG) model in which property crime is endogenously determined — agents choose whether to steal alongside their consumption and savings decisions — while drug-related incarcerations are treated as exogenous, reflecting evidence that racial profiling distorts enforcement independently of offending behavior. The model is calibrated to match several well-documented racial disparities: Black individuals comprise 12.36% of the adult population but 33.8% of the incarcerated population; 42.7% of Black individuals fall in the bottom wealth quintile (below $3,400 in assets) versus 15.1% of White individuals; the median Black-White wealth gap is 89.5% (SCF 2019). Data sources include the Survey of Consumer Finances (SCF 2019), Uniform Crime Reports (UCR 1996–2011), NLSY79, PSID (1968–2021), and MORG (2000–2019).&lt;/p&gt;
&lt;p&gt;The model incorporates four dimensions of labor market disparity between Black and White agents: educational attainment, unemployment risk and duration, age-earnings profiles, and idiosyncratic income shock processes. It also incorporates race-skill-specific survival probabilities (life expectancy at birth: 73 years for Black, 78 years for White), scarring effects from incarceration on future labor income, a progressive income tax, means-tested transfers, and accidental bequests distributed within race groups.&lt;/p&gt;
&lt;p&gt;The benchmark model successfully replicates key data moments. Black individuals constitute 34.3% of the incarcerated population (data: 33.8%). The model-generated median wealth gap is 83.6% (data: 89.5%). The share of Black individuals in the bottom wealth quintile is 37.7% in the model versus 42.7% in the data. The model does not match the average wealth gap: the model-generated gap is 58.9% versus 84.4% in the SCF.&lt;/p&gt;
&lt;p&gt;The main counterfactual experiments yield three findings. First, equalizing labor market conditions — particularly age-earnings profiles — is the dominant driver of both racial wealth and crime disparities. When all labor market conditions are equalized, the Black crime rate falls by 66.25% (from 11.97% to 4.04%), the median wealth gap declines by 69.6% (from 83.58% to 25.4%), and the share of Black individuals in the bottom wealth quintile falls from 37.73% to 20.75%. Equalizing age-earnings profiles alone accounts for the largest single-factor effect: the median wealth gap declines from 83.58% to 44.16% and the Black crime rate from 11.97% to 7.59%. The resource cost of equalizing age-earnings profiles is estimated at 3.29% of GDP for the No-HS group and 22.2% of GDP for the HS group.&lt;/p&gt;
&lt;p&gt;Second, higher crime and incarceration rates among Black individuals do not significantly contribute to their lower wealth. When crime is entirely eradicated, the share of Black individuals in the bottom quintile barely moves (37.73% to 37.61%), and the median wealth gap falls only from 83.58% to 82.6%. The mechanism is that most crimes are committed by young, already-poor individuals who are not saving in any case; income loss during incarceration is not large enough to affect wealth accumulation meaningfully.&lt;/p&gt;
&lt;p&gt;Third, equalizing life expectancy generates a 25.39% reduction in the median wealth gap and a 12.4% decline in the share of Black individuals in the bottom wealth quintile, with negligible effect on crime rates.&lt;/p&gt;
&lt;p&gt;The paper also validates the model against Cesarini et al. (2023), who find a small, statistically insignificant effect of lottery wealth on criminal behavior in Sweden. The model replicates this finding: a $150,000 windfall reduces incarceration risk over seven years by 0.81 percentage points. The mechanism is that lottery winnings displace means-tested transfers, winnings gradually dissipate as low income persists, and individuals eventually return to poverty and resume criminal activity.&lt;/p&gt;
&lt;p&gt;Q: What is the central research question and why does the paper treat property crime and drug crime differently?
A: The paper asks whether racial labor income differences can simultaneously account for both crime and wealth disparities. Property crimes are modeled endogenously because offending behavior responds rationally to economic incentives. Drug crime incarcerations are exogenous to capture evidence that racial profiling in enforcement — rather than differential offending alone — drives racial disparities in drug arrests: Beck and Blumstein (2018) show differential offending explains only about 52% of the drug imprisonment gap, versus over 70% for overall imprisonment.&lt;/p&gt;
&lt;p&gt;Q: What are the benchmark model&amp;rsquo;s key calibration targets and how well does it fit the data?
A: The benchmark targets Black individuals as 34.3% of the incarcerated population (data: 33.8%), a median Black-White wealth gap of 83.6% (data: 89.5%), and 37.7% of Black individuals in the bottom wealth quintile (data: 42.7%). The model does not match the average wealth gap: the model-generated gap is 58.9% versus 84.4% in the SCF, which the authors acknowledge explicitly.&lt;/p&gt;
&lt;p&gt;Q: What is the quantitative effect of equalizing all labor market conditions?
A: Experiment 5 (equalize educational attainment, unemployment risk, and age-earnings profiles jointly) reduces the Black crime rate by 66.25% (from 11.97% to 4.04%), the median wealth gap by 69.6% (from 83.58% to 25.4%), and the share of Black individuals in the bottom quintile from 37.73% to 20.75%. Equalizing all factors including life expectancy drives the median wealth gap to 0%, with the bottom-quintile share for Black individuals at 19.31%.&lt;/p&gt;
&lt;p&gt;Q: Which single labor market factor matters most for the wealth gap and crime rate?
A: Equalizing age-earnings profiles (Experiment 3) is the single most important factor, reducing the median wealth gap from 83.58% to 44.16% and the Black crime rate from 11.97% to 7.59%. By contrast, equalizing educational attainment or unemployment risk each reduces the median wealth gap only to 76.71%, with smaller crime effects.&lt;/p&gt;
&lt;p&gt;Q: Does education-group heterogeneity matter for interpreting the age-earnings equalization effect?
A: Yes, substantially. Equalizing age-earnings profiles for the No-HS group reduces the Black crime rate by 21% with little effect on the median wealth gap. Equalizing profiles for the HS group reduces the median wealth gap by approximately 40% with a much smaller effect on crime rates. The earnings channel to crime operates primarily at the bottom of the education distribution, while the earnings channel to wealth accumulation operates more strongly in the high school group.&lt;/p&gt;
&lt;p&gt;Q: Why does crime have so little effect on the wealth distribution?
A: Criminals are predominantly young and already-poor individuals who are not accumulating savings. Because these individuals have minimal assets and rely heavily on means-tested transfers for consumption, the income loss during incarceration does not reduce their wealth meaningfully. When crime is completely eradicated, the share of Black individuals in the bottom quintile falls only from 37.73% to 37.61% and the median wealth gap declines from 83.58% to only 82.6%.&lt;/p&gt;
&lt;p&gt;Q: What is the effect of eliminating drug-related incarcerations on the Black wealth distribution?
A: Experiment 4 (eliminating drug crime incarcerations) reduces the share of Black individuals in the bottom quintile only slightly, from 37.73% to 37.30%. Eliminating the scarring effect of all incarcerations likewise has negligible effects on the bottom-quintile share (37.53% versus 37.73% in the benchmark) and the zero-assets share (32.56% versus 33.24%). Neither the direct incarceration penalty nor its labor market scarring meaningfully affects wealth accumulation.&lt;/p&gt;
&lt;p&gt;Q: What happens to crime and wealth when the property crime clearance rate changes?
A: Doubling the clearance rate from 17.2% to 34.4% reduces the Black crime rate from 11.97% to 1.72% and the White rate from 3.05% to 0.52%, with minimal change in the wealth distribution (Blacks in bottom quintile: 37.83%). Halving the clearance rate to 8.6% more than doubles Black crime to 27.53% and White crime to 9.55%, and increases the share of Black individuals in the bottom quintile by about 11% to 42.01%. This asymmetry — crime reduction barely helps wealth but crime increase does hurt — is consistent with the poverty-trap mechanism.&lt;/p&gt;
&lt;p&gt;Q: How does the model validate against the Cesarini et al. (2023) Swedish lottery study?
A: Cesarini et al. find a small, statistically insignificant negative effect of a $150,000 lottery windfall on conviction rates. The model replicates this: simulating 34,709 individuals per skill-race group, a $150,000 windfall reduces incarceration risk over the following seven years by 0.81 percentage points. When the authors use model-generated property crime records rather than incarceration records as the dependent variable, they find a statistically significant effect more than twice as large, suggesting incarceration data systematically understates the crime-reducing effect of wealth shocks.&lt;/p&gt;
&lt;p&gt;Q: What is the mechanism by which lottery winnings have minimal persistent effects on crime?
A: Lottery winners in the model are disproportionately drawn from low-income, low-wealth individuals who also receive means-tested transfers. After winning, these individuals lose transfer eligibility, so winnings substitute for lost transfers rather than being invested. With income levels remaining low, winnings dissipate over time, individuals return to poverty, and resume criminal activity. Larger lottery prizes extend the crime-free interval but do not permanently alter behavior.&lt;/p&gt;
&lt;p&gt;Q: What is the role of life expectancy differences in racial wealth and crime gaps?
A: Equalizing survival probabilities generates a 25.39% reduction in the median wealth gap and a 12.4% reduction in the share of Black individuals in the bottom quintile, with virtually no change in crime rates. The channel operates through savings incentives: a shorter expected lifetime (73 years for Black versus 78 for White) reduces the return to wealth accumulation independently of income.&lt;/p&gt;
&lt;p&gt;Q: What are the fiscal resource requirements implied by the income equalization experiments?
A: Implementing equalized age-earnings profiles for the No-HS group would require resources equal to 3.29% of total GDP, while equalization for the HS group would require 22.2% of GDP. These figures reflect the scale of redistribution needed to close earnings profiles and serve as a benchmark for assessing policy feasibility.&lt;/p&gt;
&lt;p&gt;Q: How does incarceration scarring affect lifetime income in the benchmark, and how does this validate against external data?
A: A Black high school graduate who experiences at least one incarceration earns 16.8% less over his lifetime than one who is never incarcerated; for White high school graduates the gap is 28.7%. Gordon et al. (2023) report corresponding empirical estimates of 18.6% for Black and 32.7% for White high school graduates, closely validating the model&amp;rsquo;s scarring calibration.&lt;/p&gt;
&lt;p&gt;Endogenous property crime: A rational choice by working-age agents who weigh the expected gain from stealing (fraction γ = 6.4% of average labor income y) against the probability of apprehension (clearance rate πa = 17.2%), the loss of means-tested transfers, scarring of future labor income, and the minimum consumption floor in jail. Retired agents face no such choice.&lt;/p&gt;
&lt;p&gt;Exogenous drug incarceration: Incarceration for drug possession modeled as an exogenous shock with race-age-specific probabilities, not responsive to individual optimization, capturing the possibility that racial profiling in enforcement generates disparities in drug arrests independently of offending behavior.&lt;/p&gt;
&lt;p&gt;Scarring effect: Post-incarceration labor income penalty modeled as a higher probability of drawing a lower idiosyncratic income shock state upon labor market re-entry, calibrated so the model reproduces lifetime income gaps between ever-incarcerated and never-incarcerated individuals by race-skill group (18.6% for Black HS, 32.7% for White HS per Gordon et al. 2023).&lt;/p&gt;
&lt;p&gt;Age-earnings profile (ε^{i,ζ}_j): The deterministic, skill-race-age-specific component of labor income estimated from PSID data for each of six race-education groups. The gap between Black and White age-earnings profiles is identified as the dominant driver of both the racial wealth gap and racial crime disparities, accounting for the largest single-factor reduction in both outcomes across all counterfactual experiments.&lt;/p&gt;
&lt;p&gt;Means-tested transfer floor: A consumption support program that fills the gap between an agent&amp;rsquo;s post-tax income plus assets and a minimum threshold κ (5.8% of average net tax income and assets). This transfer is a critical mechanism linking wealth shocks to crime: lottery winnings and other wealth gains displace transfer eligibility, causing winnings to be consumed rather than saved, and eventually exhausted.&lt;/p&gt;
&lt;p&gt;Median wealth gap: The percentage difference between median White and median Black wealth — 89.5% in the 2019 SCF, 83.6% in the benchmark model — used as the primary scalar summary of racial wealth disparity, chosen because the model does not match the average wealth gap (model: 58.9%, data: 84.4%).&lt;/p&gt;
&lt;p&gt;Victimization probability (πv(Y)): A step-wise decreasing function of taxable income capturing spatial concentration of property crime in low-income neighborhoods; in equilibrium this equals the aggregate property crime rate χp, ensuring market clearing in the crime sector and implying that poorer agents face higher victimization risk.&lt;/p&gt;</description></item><item><title>Real Credit Cycles</title><link>https://macropaperwarehouse.com/papers/real-credit-cycles/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/real-credit-cycles/</guid><description>&lt;p&gt;This paper incorporates diagnostic expectations — beliefs that overweight the representativeness of recent data, formalized as $E_t^\theta(A_{t+1}) = E_t(A_{t+1}) + \theta[E_t(A_{t+1}) - E_{t-1}(A_{t+1})]$ with θ &amp;gt; 0 — into a workhorse real business cycle model with heterogeneous firms and risky defaultable debt, to assess whether non-rational belief overreaction can account for boom-bust credit cycles without requiring large fundamental shocks. The diagnosticity parameter θ is structurally estimated via simulated method of moments, targeting moments including forecast-error predictability from the IBES manager guidance database, and yields θ ≈ 0.991, consistent with prior estimates from financial analysts and professional forecasters. The estimated DE model generates several untargeted results that the rational-expectations (RE) benchmark cannot: countercyclical credit spreads, predictable firm-level bond returns, and investment fragility in good times — specifically, a one-standard-deviation negative TFP shock causes a much larger investment decline when the previous period had good TFP news than in normal times. The model also shows that the 2008-09 spread increase can be generated by mere disappointment of overoptimistic beliefs, not requiring a large negative TFP shock. These findings establish diagnostic expectations as a parsimonious and empirically disciplined mechanism for producing financial reversals in business cycle models.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a published paper based on the NBER working paper full text (w28416), AI-assisted, pending human review. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Bordalo, Gennaioli, Shleifer, and Terry modify a standard heterogeneous-firm RBC model with risky defaultable debt by a single behavioral parameter — the diagnosticity θ governing belief overreaction to TFP news — to assess whether non-rational beliefs can quantitatively account for boom-bust credit cycles. The model departs from the rational expectations (RE) benchmark only in that firms and lenders form expectations diagnostically: after good TFP news, agents become excessively optimistic about future TFP, causing too much investment and debt issuance; when TFP growth disappoints relative to those optimistic expectations (even without an outright TFP decline), agents sharply revise down their beliefs, causing credit spreads to spike and investment to collapse. The diagnosticity parameter θ ≈ 0.991 is estimated by structural SMM targeting 16 moments — including 3 moments from IBES manager guidance data directly measuring the predictability of forecast errors — and is consistent with independent estimates from analyst forecasts (θ ≈ 0.9, Bordalo et al. 2019), professional macroeconomic forecasters (θ ≈ 0.5, Bordalo et al. 2020), and bond-price-implied beliefs (θ = 1.0, D&amp;rsquo;Arienzo 2020). The paper shows that the estimated DE model, unlike the RE benchmark, delivers countercyclical spreads, predictable firm-level bond returns, investment nonlinearity (fragility in good times), and an account of the 2008-09 spread episode requiring only a modest TFP disappointment.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-are-diagnostic-expectations-and-how-does-the-single-parameter-θ-govern-their-departure-from-rational-expectations"&gt;Q1. What are diagnostic expectations, and how does the single parameter θ govern their departure from rational expectations?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Diagnostic expectations (DE) are beliefs that overweight outcomes that are representative of recent news relative to their true base rate, formalized as $E_t^\theta(A_{t+1}) = E_t(A_{t+1}) + \theta[E_t(A_{t+1}) - E_{t-1}(A_{t+1})]$, where $\theta \geq 0$ is the diagnosticity parameter: when $\theta = 0$ beliefs are rational, and when $\theta &amp;gt; 0$ agents exaggerate the persistence of current news shocks.&lt;/strong&gt; The mechanism is grounded in the psychology of selective recall: good news makes good future outcomes top-of-mind and thus overweighted. In the context of an AR(1) TFP process, DE agents effectively behave as if TFP follows an ARMA(1,1) with an additional moving-average term that boosts the perceived response to current shocks. The parameter θ has a clean measurement interpretation: θ ≈ 1 means that for every unit of incoming news, agents&amp;rsquo; beliefs overshoot by approximately one additional unit (forecast errors are roughly equal in magnitude to the news that generated them). DE are forward-looking (unlike adaptive expectations) and hence not mechanically subject to the Lucas critique, since agents&amp;rsquo; beliefs respond to news in a structured way.&lt;/p&gt;
&lt;h3 id="q2-how-is-θ-identified-and-estimated-and-what-disciplines-the-models-departure-from-rationality"&gt;Q2. How is θ identified and estimated, and what disciplines the model&amp;rsquo;s departure from rationality?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The diagnosticity parameter θ is identified from three moments that directly exploit the predictability of future forecast errors from current firm-level investment and debt issuance growth — moments that are positive under DE and exactly zero under RE — drawn from the IBES manager guidance database covering 1999-2018.&lt;/strong&gt; The key identification equation is: $\text{cov}(\Delta \text{Forecast Error}&lt;em&gt;{t+1}, \Delta x_t) = a&lt;/em&gt;\pi a_x \rho \theta (1+\theta)$ where $x$ is investment or debt, positive if and only if θ &amp;gt; 0. In the data, a one-standard-deviation increase in the firm&amp;rsquo;s investment rate predicts approximately 10 percentage points stronger disappointment in next-year earnings, and a one-standard-deviation increase in debt issuance predicts about 5 percentage points stronger disappointment — robust to within-firm estimation that controls for heterogeneity in optimism across firms. The estimated θ ≈ 0.991 (s.e. 0.074) is precisely estimated and falls well within the range [0.5, 1.5] implied by independent estimates from other datasets. The RE model (constrained to θ = 0) cannot generate any comovement between future forecast error growth and current firm fundamentals, offering a falsifiable restriction that the data reject.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-investment-fragility-finding-and-why-can-the-re-model-not-replicate-it"&gt;Q3. What is the investment fragility finding, and why can the RE model not replicate it?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The DE model generates a strong nonlinearity in investment: the same one-standard-deviation negative TFP shock causes a much larger investment decline when it follows a period of good TFP news (good times) than when it follows average or bad news; the RE model produces essentially no such nonlinearity, with investment responses roughly flat across initial conditions.&lt;/strong&gt; The mechanism is as follows: after a positive TFP shock, firms and lenders become overoptimistic, driving high investment and low credit spreads. The aggregate investment response to the subsequent negative shock is therefore large — overoptimism has boosted the capital stock and the debt level beyond what fundamentals warrant, so the negative shock both lowers true productivity and triggers a sharp correction in beliefs. Under RE, agents correctly anticipate mean reversion of TFP and do not overbuild, so the same negative shock hits a less-leveraged economy and generates a smaller correction. This fragility-in-good-times mechanism is consistent with empirical evidence from Bachmann et al. (2013), Winberry (2017), and Bloom et al. (2018) that investment is more sensitive to shocks during booms.&lt;/p&gt;
&lt;h3 id="q4-how-does-the-de-model-account-for-countercyclical-spreads-and-why-does-the-re-model-predict-the-wrong-sign"&gt;Q4. How does the DE model account for countercyclical spreads, and why does the RE model predict the wrong sign?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Under DE, credit spreads are countercyclical because lenders become excessively optimistic about future TFP in good times, driving down perceived default risk and hence spreads below their rational counterpart; when optimism wanes, spreads spike beyond what fundamental deterioration alone would warrant.&lt;/strong&gt; Under RE with constant required returns (as modeled), the supply of capital tracks fundamentals; in good times with high TFP, default risk is genuinely lower, so spreads fall — a qualitatively correct prediction. But the RE model also generates a positive correlation between spreads and investment in the cross-section of firms, while the data show a strong negative correlation (Column 10 of Table 5: Corr(Investment, Spread) = -0.057 in data, -0.054 in DE model, +0.083 in RE model). The DE mechanism driving this: overoptimistic lenders simultaneously over-supply credit (reducing spreads) and firms over-invest, creating the negative comovement. The paper links this formally to the concept of &amp;ldquo;financial shocks&amp;rdquo; in Jermann and Quadrini (2012) and Gilchrist and Zakrajšek (2012): in the DE framework, waning optimism produces inward shifts in the supply of capital that appear as exogenous financial shocks in reduced-form analyses.&lt;/p&gt;
&lt;h3 id="q5-how-does-the-model-account-for-the-2008-09-spread-episode-and-what-shock-size-is-required"&gt;Q5. How does the model account for the 2008-09 spread episode, and what shock size is required?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The DE model generates a spread increase consistent in magnitude with the 2008-09 episode from a modest moderation in TFP growth — not an outright TFP decline, but merely disappointment relative to the optimistic expectations formed during the preceding boom — while the RE model requires a large negative TFP shock of implausible size.&lt;/strong&gt; During 2005-2007, a sequence of positive TFP shocks made firms and lenders excessively optimistic; when TFP growth merely slowed in 2007-08 (below the high level agents had been projecting), their beliefs corrected sharply, spreading up and investment down. In the DE model, the deceleration of TFP growth is sufficient to produce spread increases matching the observed magnitude during 2008-09, along with quantitatively consistent declines in aggregate investment, credit, and earnings forecast revisions. The RE model cannot match this because rational agents, correctly anticipating mean reversion, would not have built up the overoptimistic base to correct from.&lt;/p&gt;
&lt;h3 id="q6-how-do-the-microeconomic-boom-bust-predictions-of-the-model-perform-out-of-sample"&gt;Q6. How do the microeconomic boom-bust predictions of the model perform out-of-sample?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The model-simulated data replicate the firm-level boom-bust cycles documented in the paper&amp;rsquo;s Section 2: current overoptimism (proxied by high investment or debt issuance) predicts next-year spread increases, lower realized bond returns, and subsequent investment declines, with magnitudes that quantitatively match the data regressions; the RE model generates none of these predicted cycles.&lt;/strong&gt; Specifically, in model-simulated firm-level regressions: higher current investment predicts 1-year-ahead spread increases; current spread increases predict negative future bond returns (the diagnostic model implies bonds are overpriced during booms, consistent with predictable low returns); and current high investment predicts future investment declines (mean reversion amplified by DE correction). All three predictions are also confirmed in the data and at the sectoral level, providing multiple out-of-sample validation tests. The diagnosticity parameter θ = 1 estimated from forecast errors simultaneously fits these untargeted dynamics.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;diagnostic expectations&lt;/strong&gt; : beliefs that overweight outcomes representative of recent news, with the single deparature parameter θ ≥ 0 governing the degree of overreaction; in the AR(1) TFP context, agents act as if TFP follows an ARMA(1,1) with over-weighted current shocks; estimated at θ ≈ 1 from firm-level forecast error data, consistent with independent estimates from multiple other datasets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;fragility in good times&lt;/strong&gt; : the paper&amp;rsquo;s key qualitative finding that the investment response to a given negative TFP shock is much larger when the shock follows a period of positive TFP news; arises because DE agents have built up excessive optimism, inflated capital stocks, and stretched leverage during the boom, making the correction larger; absent in the RE model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;diagnosticity parameter (θ)&lt;/strong&gt; : the single behavioral parameter governing the degree to which agents overweight representative recent outcomes; θ = 0 is rational expectations; θ ≈ 1 is the structural SMM estimate, implying that forecast errors are roughly as large as the news that generated them; identified from the covariance between future forecast-error growth and current investment/debt changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;financial shocks as waning optimism&lt;/strong&gt; : the paper&amp;rsquo;s interpretation of &amp;ldquo;financial shocks&amp;rdquo; — inward shifts in the supply of capital generating spread spikes — as the endogenous waning of previously excessive diagnostic optimism, rather than exogenous disturbances to lender preferences or required returns; provides microfoundations for the Jermann-Quadrini and Gilchrist-Zakrajšek empirical findings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;countercyclical credit spreads&lt;/strong&gt; : the empirical regularity that credit spreads fall in good times and rise in bad times, a moment the DE model matches (through overoptimistic lenders compressing spreads in booms) but the RE model with constant required returns fails to match in the cross-section (predicting a positive correlation between investment and spreads).&lt;/p&gt;</description></item><item><title>Religion, Education, and the State</title><link>https://macropaperwarehouse.com/papers/religion-education-and-the-state/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/religion-education-and-the-state/</guid><description>&lt;p&gt;This paper studies how Indonesia&amp;rsquo;s Islamic education sector responded to one of the largest state-driven mass schooling expansions in history — the SD INPRES program (Sekolah Dasar Presidential Instruction) launched in 1973 — and whether that program achieved its secular nation-building objectives. The research question is three-part: Did Islamic schools enter or exit markets where the state built more primary schools? How did religious school choice shift across cohorts? And did the program advance secular identity formation among exposed individuals?&lt;/p&gt;
&lt;p&gt;The empirical setting is Indonesia in the 1970s onward. Under SD INPRES, the government used windfall oil revenues to build more than 61,000 primary schools between 1973 and 1980, allocating construction across districts proportional to the non-enrolled primary-school-age population. Because Islamic schools were historically more prevalent in underserved areas, this rule produced a strong positive correlation between SD INPRES intensity and pre-existing Islamic school density — the same markets where the state expanded were precisely those with the greatest Islamic education presence.&lt;/p&gt;
&lt;p&gt;The authors use several novel data sources: administrative registries covering nearly 220,000 secular and 160,000 Islamic schools with establishment dates; six rounds of the National Socioeconomic Survey (Susenas) from 2012–18; the Indonesia Family Life Survey (IFLS, 1993–2014); a 2018–19 curriculum timetable registry (SIAP) covering nearly 20% of madrasa; and a 2016 political/religious attitudes survey. Identification relies on difference-in-differences (DID) exploiting cross-district variation in SD INPRES intensity, the synthetic DID approach of Arkhangelsky et al. (2021) for robustness to violations of parallel trends, and a staggered village-level event study using the Borusyak et al. (2024) estimator.&lt;/p&gt;
&lt;p&gt;The main findings are as follows. First, Islamic schools did not exit markets where the state expanded — they entered in greater numbers. A one standard deviation increase in SD INPRES construction led to 1.4 additional Islamic school entries per district above a mean of 1.9 per district in 1972. Entry was competitive at the primary level, where new madrasa (MI) entered at twice the baseline annual rate in the years immediately following INPRES construction, and strategic at the secondary level, where Islamic junior secondary schools (MTs) peaked 6–9 years after INPRES entry as graduates sought continued education. The Islamic sector financed this expansion through waqf (inalienable religious endowments), informal taxation (infaq, zakat), and revenues from a concurrent rice price spike; entry responses were stronger in villages with above-median waqf endowments and above-median potential rice yields.&lt;/p&gt;
&lt;p&gt;Second, rather than converging toward secular curricula, newly established Islamic schools in high-INPRES districts devoted more time to religious content. Each additional SD INPRES is associated with a 1.2 percentage point increase in the religious curriculum share among newly created madrasa, with increases of 1.3 and 2.4 percentage points at the primary and junior secondary levels respectively — the latter equaling 82% of the cross-school standard deviation. Some of this increase came at the expense of Pancasila/civic education and national language instruction.&lt;/p&gt;
&lt;p&gt;Third, while SD INPRES reduced Islamic primary school enrollment by roughly 7%, it increased overall Islamic school attendance: each additional SD INPRES increased the likelihood of attending any Islamic school by approximately 5%, as demand for secondary education outweighed substitution at the primary level. Female students exhibited stronger secondary-level demand effects, amplified in districts with a concurrent state ban on the Islamic veil in public schools.&lt;/p&gt;
&lt;p&gt;Fourth, SD INPRES did not advance its ideological objectives. In the 1977 and 1982 elections, Golkar&amp;rsquo;s vote share fell and the Islamic PPP&amp;rsquo;s rose by 0.5–1.0 percentage points per SD INPRES school in high-INPRES districts. Among exposed cohorts, SD INPRES did not increase Pancasila proficiency, national language use at home, or support for secular governance, but did increase Arabic literacy by approximately 3% per additional SD INPRES. Exposed cohorts also prayed more frequently, fasted more during Ramadan, gave more to charity, and expressed greater pilgrimage intentions. These religious patterns were transmitted to children of exposed cohorts, who were more likely to attend Islamic schools themselves.&lt;/p&gt;
&lt;p&gt;Q: What was the allocation rule for SD INPRES and why did it create confrontation with Islamic schools?
A: Presidential Instruction No. 10/1973 allocated school construction across districts proportional to the non-enrolled primary-school-age population in 1971. Because Islamic schools historically served underserved populations, this rule meant the state built more schools precisely where Islamic education was most prevalent. The paper shows graphically and in Table 1 that the number of SD INPRES schools built is strongly correlated with the pre-existing stock of Islamic schools, conditional on district population and enrollment.&lt;/p&gt;
&lt;p&gt;Q: How large was the Islamic sector&amp;rsquo;s entry response to SD INPRES at the district level?
A: In the standard DID specification (Table 2, panel a), a one standard deviation increase in SD INPRES schools led to 0.013 more Islamic schools per district-year per 1,000 children, equivalent to 1.4 additional Islamic school entries in the average district relative to a mean of 1.9 Islamic schools per district in 1972. The synthetic DID (panel b) delivers positive and slightly larger estimates, indicating the result is not an artifact of diverging pre-trends.&lt;/p&gt;
&lt;p&gt;Q: What was the timing of the Islamic sector entry response at the village level?
A: Using the Borusyak et al. (2024) estimator on a balanced panel from 1960 to 1999, the paper finds (Figure 4) that INPRES construction is followed by a jump in Islamic school entry. Primary madrasa (MI) entered at twice the baseline annual rate in the years immediately following INPRES construction and this elevated rate persisted for six years before reverting to baseline. Islamic junior secondary entry (MTs) peaked around years 6–9 after SD INPRES construction, consistent with newly graduated primary students seeking continued schooling.&lt;/p&gt;
&lt;p&gt;Q: How did the Islamic sector finance its expansion?
A: The sector relied on waqf endowments (inalienable religious land assets), informal faith-based contributions (infaq), and obligatory alms (zakat). Fortuitously, the initial year of SD INPRES coincided with a large spike in the global price of rice, Indonesia&amp;rsquo;s main agricultural commodity, boosting harvest revenues channeled through informal Islamic taxation. Table 3 shows that entry responses were significantly stronger in villages with above-median waqf endowments and above-median potential rice yields, and these heterogeneous effects did not arise in non-INPRES periods or for non-Islamic private schools. Survey data from 2007–13 further show higher rates of informal taxation in villages with Islamic schools built during this period.&lt;/p&gt;
&lt;p&gt;Q: Did Islamic schools converge toward secular curricula under competitive pressure from SD INPRES?
A: No. Table 4 shows that madrasa established in high-INPRES districts after 1972 devote more time to religious content, not less. Each additional SD INPRES is associated with a 1.2 percentage point increase in the share of classroom time devoted to religious subjects among newly created Islamic schools, with increases of 1.3 percentage points at the primary level and 2.4 percentage points at the junior secondary level — the latter equal to 82% of the cross-school standard deviation. Similar patterns hold for Arabic instruction, and the junior secondary increase comes partially at the expense of Pancasila/civic education and national language instruction.&lt;/p&gt;
&lt;p&gt;Q: Did curriculum differentiation responses vary with local religious ideology?
A: Yes. Appendix Table A.14 shows a stronger curriculum differentiation response in markets with greater historical support for conservative Islam, proxied by Islamic political party vote shares in the 1950s elections. The paper also constructs a school-name-based predicted ideology index using a ridge shrinkage estimator and finds (Appendix Table A.15) that madrasa entering high-INPRES districts after the program onset have a more religious ideology on this measure.&lt;/p&gt;
&lt;p&gt;Q: What happened to the formalization of the Islamic sector?
A: Figure 5 and Appendix Table A.6 show that formal madrasa entry increased as a share of all new school entry, while informal Islamic schools (pesantren, diniyah) declined as a share of all new schools and all new Islamic schools. This formalization mirrors the organizational structure of state schools (primary-to-secondary progression), facilitating switching between public and religious schools and providing option value to moderate but still religious families. Crucially, the newly entering formal madrasa introduced more religious curriculum than incumbent madrasa, so formalization did not reduce religious instruction.&lt;/p&gt;
&lt;p&gt;Q: What was the net effect of SD INPRES on Islamic school attendance?
A: Table 5 shows that SD INPRES reduced the likelihood of attending Islamic primary school by roughly 7% per additional SD INPRES school but increased Islamic secondary attendance, with the net effect being a roughly 5% increase in the likelihood of attending any Islamic school (column 4). This finding holds in both DID and synthetic DID. The IFLS validation (Appendix Table A.18) confirms decreased Islamic elementary attendance and increased Islamic junior secondary attendance, consistent with the Susenas results.&lt;/p&gt;
&lt;p&gt;Q: How does selection into secondary education affect the religious schooling results?
A: The authors address selection using parametric (Heckman 1976) and semiparametric (Newey 2009) selection-correction procedures, using exposure to a 1960s pilot compulsory schooling program as an exclusion restriction. Table 6, panels (c) and (d), show that selection-adjusted estimates are broadly consistent with unadjusted estimates, with similar signs and magnitudes. The selection-corrected estimates approximately identify a local average treatment effect among compliers: those induced to attend elementary school were less likely to attend Islamic elementary; those induced to continue to secondary were more likely to attend Islamic secondary.&lt;/p&gt;
&lt;p&gt;Q: How did gender shape the effects of SD INPRES on religious school choice?
A: Table 7 shows that SD INPRES had more limited impacts on total schooling for women than men (consistent with Duflo 2001) but that the secondary-level demand effect toward Islamic schools was stronger for women. Table 8 shows that within high-INPRES areas, the SD INPRES-induced increase in Islamic secondary education is three times larger for women in districts with greater exposure to the 1982 state ban on the Islamic veil in public schools, and this differential is specific to Islamic schooling rather than total schooling.&lt;/p&gt;
&lt;p&gt;Q: Did SD INPRES strengthen or weaken the secular ruling regime&amp;rsquo;s political standing?
A: It weakened it. Table 10 shows that in the 1977 and 1982 elections, Golkar&amp;rsquo;s vote share decreased and the Islamic PPP&amp;rsquo;s vote share increased in high-INPRES districts, in the range of 0.5–1.0 percentage points per SD INPRES school. This represents a 1.5–3.0% change in PPP vote share and a 0.5–1.0% change in Golkar vote share per standard deviation in SD INPRES intensity. The PPP gained most in areas where SD INPRES had the greatest potential to draw students away from Islamic schools.&lt;/p&gt;
&lt;p&gt;Q: Did SD INPRES produce a secular ideological shift among exposed cohorts?
A: No. Table 11 shows that SD INPRES did not increase self-reported Pancasila proficiency, national language use at home, national language literacy, or attitudes in favor of secular governance. By contrast, Arabic literacy increased by approximately 3% per additional SD INPRES among exposed cohorts, indicating that Islamic schooling exposure rather than secular schooling drove literacy gains in that language.&lt;/p&gt;
&lt;p&gt;Q: Did SD INPRES increase religiosity among exposed cohorts?
A: Yes. Table 12 shows that SD INPRES increased prayer frequency, fasting during Ramadan, charitable giving, and pilgrimage intentions among exposed cohorts. These effects on prayer and fasting are stronger among women, consistent with the stronger shift toward Islamic secondary schooling found in Table 7. These outcomes are consistent with greater exposure to Islamic education increasing religiosity rather than the secular curriculum reducing it.&lt;/p&gt;
&lt;p&gt;Q: Were the effects on religious identity and Arabic literacy transmitted to the next generation?
A: Yes. Table 13 shows that SD INPRES increased Arabic literacy among the children of exposed cohorts, and that children of exposed cohorts were more likely to attend Islamic schools themselves. These intergenerational results confirm that the preference for Islamic education instilled during the SD INPRES era persisted into the next generation rather than converging toward secular norms over time.&lt;/p&gt;
&lt;p&gt;Q: What are the key robustness checks for the school entry results?
A: Several checks support causal interpretation. The Roth and Rambachan (2022) procedure finds no systematic pre-trends in the standard DID. Historical Podes data from 1980, 1983, 1990, and 1993 confirm the post-1973 increase in Islamic school entry, addressing survival bias in the 2019 registry. Results are robust to allowing differential trends in waqf endowments, Muslim population share, Islamic party vote shares, historical Arab immigration, Islamist insurgency, and Transmigration resettlement. The heterogeneous entry responses by waqf and rice yield do not appear in non-INPRES periods or for non-Islamic schools.&lt;/p&gt;
&lt;p&gt;Q: What do the results imply for the political economy of education reform more broadly?
A: The paper argues that state capacity to homogenize culture through education is limited when strong non-state actors can mobilize their own resources and provide differentiated alternatives. Rather than crowding out religious schools, state expansion triggered competitive entry, curriculum differentiation, and formalization in the religious sector, producing an equilibrium where both sectors expanded simultaneously with distinct clienteles. The findings imply that the long-run cultural effects of education programs cannot be evaluated without accounting for equilibrium responses by competing non-state providers.&lt;/p&gt;
&lt;p&gt;SD INPRES (Sekolah Dasar Presidential Instruction): Indonesia&amp;rsquo;s 1973 mass public primary school construction program, financed by oil windfalls, which built more than 61,000 elementary schools between 1973 and 1980 by allocating schools to districts proportional to the non-enrolled child population; the program&amp;rsquo;s explicitly secular nation-building objectives brought it into direct confrontation with the Islamic education sector.&lt;/p&gt;
&lt;p&gt;Waqf: Inalienable Islamic religious endowments — of land, agricultural assets, or other property — that under Islamic law can only be used for religious or charitable purposes and cannot be seized or repurposed by the state; in this paper, the pre-existing waqf base in a village serves both as a long-run financing mechanism for Islamic school construction and as an index of Islamic sector organizational capacity.&lt;/p&gt;
&lt;p&gt;Madrasa: Formal day Islamic schools operating at the same primary-to-secondary grade levels as secular state schools, teaching standard academic subjects alongside a religious curriculum (including Islamic law, doctrine, ethics, Qur&amp;rsquo;an, Arabic, and history of the Prophets) that averages 26% of total instruction hours; distinct from the more informal pesantren (boarding schools) and madrasa diniyah (afternoon Qur&amp;rsquo;anic study schools).&lt;/p&gt;
&lt;p&gt;Curriculum differentiation: The strategy by which newly entering madrasa in high-INPRES districts increased the share of classroom time devoted to religious and Arabic instruction rather than converging toward the secular state curriculum; measured as classroom hours devoted to Islamic subjects, Arabic, Pancasila/civic education, and national language instruction from 2018–19 SIAP timetable data.&lt;/p&gt;
&lt;p&gt;Pancasila: The official secular nationalist ideology of the Indonesian state, consisting of five principles (monotheism, humanitarianism, national unity, democracy, and social justice) intended to transcend ethnic and religious divisions; SD INPRES sought to transmit Pancasila through civic education and national language instruction as part of its homogenizing nation-building agenda.&lt;/p&gt;
&lt;p&gt;Synthetic difference-in-differences (SDID): The Arkhangelsky et al. (2021) estimator used throughout the paper, which reweights and matches pre-INPRES trends in Islamic school construction across high- and low-INPRES exposure districts to deliver estimates more robust than standard DID to violations of parallel trends; applied with a binary treatment indicator (districts above the 51st percentile in INPRES intensity).&lt;/p&gt;
&lt;p&gt;Formalization: The documented shift in the composition of the Islamic sector after SD INPRES, whereby formal madrasa (organized along the same grade-level progression as state schools) increased as a share of all new Islamic school entry while informal pesantren and diniyah declined as a share; interpreted as a competitive response that expanded parental option value without sacrificing religious instruction intensity.&lt;/p&gt;</description></item><item><title>Republican Support and Economic Hardship: The Enduring Effects of the Opioid Epidemic</title><link>https://macropaperwarehouse.com/papers/republican-support-and-economic-hardship-the-enduring-effects-of-the-opioid-epidemic/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/republican-support-and-economic-hardship-the-enduring-effects-of-the-opioid-epidemic/</guid><description>&lt;p&gt;This paper establishes a causal connection between the opioid epidemic and the political realignment toward the Republican Party in the United States from the mid-2000s through 2022. The authors—Carolina Arteaga and Victoria Barone—exploit rich geographic variation in Purdue Pharma&amp;rsquo;s initial marketing strategy for OxyContin, drawn from unsealed litigation records, to construct a quasi-exogenous measure of community-level exposure to the epidemic.&lt;/p&gt;
&lt;p&gt;The identification strategy rests on a documented feature of OxyContin&amp;rsquo;s 1996 launch: Purdue initially targeted the established cancer pain market—physicians and patients already using MS Contin—as an entry point into the much larger noncancer pain market. Areas with higher cancer mortality in 1996 received disproportionate pharmaceutical marketing, leading to outsized opioid prescription growth that spilled over from cancer patients to the broader population through shared physicians. The authors use 1996 commuting-zone (CZ) cancer mortality rates as a proxy for this initial targeting, interacted with year fixed effects in an event-study specification with CZ and state-year fixed effects. The sample covers 625 CZs across the continental United States from 1982 to 2022.&lt;/p&gt;
&lt;p&gt;The empirical chain runs through three stages. First, the instrument strongly predicts opioid supply: by 2012, a one-standard-deviation higher 1996 cancer mortality rate led to an additional 0.97 opioid doses prescribed per capita, 65% above the baseline mean. Second, the resulting epidemic caused measurable mortality and economic hardship. A one-standard-deviation increase in 1996 cancer mortality caused drug-induced deaths in 2017 to be 46% above the pre-epidemic average; by 2012 the same increase caused prescription opioid deaths to be 61% higher. Excess mortality was concentrated among individuals under age 55, with no significant effects for those aged 55 and older. The epidemic also raised disability applications: SSDI applications rose by 12% and SSI applications by 7.6% by 2012, effects that persisted through 2020. SNAP enrollment in exposed CZs was 8% higher by 2022, equivalent to a 0.14 standard deviation increase.&lt;/p&gt;
&lt;p&gt;Third, and centrally, the communities that endured these health and economic shocks shifted persistently toward the Republican Party. By the 2022 House elections, a one-standard-deviation increase in 1996 cancer mortality increased the Republican two-party vote share by 4.5 percentage points. Effects of similar magnitude appear in presidential elections (4.6 percentage points) and gubernatorial elections (4.3 percentage points). The vote-share shift is consistent across gender, age, race, and education, with no detectable change in voter turnout. The shift translates into actual seat gains: beginning in 2012, exposed areas consistently elected more Republican House members, moving the chamber&amp;rsquo;s roll-call voting in a more conservative direction. The effect is not driven by anti-incumbent sentiment—results hold regardless of which party held the seat at the time.&lt;/p&gt;
&lt;p&gt;The paper identifies three reinforcing mechanisms. First, the Republican Party repositioned itself during this period as the advocate of &amp;ldquo;forgotten America&amp;rdquo; and working-class economic hardship, a message that resonated acutely in opioid-devastated communities. Second, conservative-leaning newspapers covered the epidemic at higher rates, and their coverage tracked local mortality; liberal-leaning outlets showed no such correlation. Fox News covered opioid stories at 1.5 times the rate of CNN and 1.7 times the rate of MSNBC, emphasizing crime, trafficking, and cartels at twice the frequency of liberal outlets. Third, exposed communities expressed stronger preferences for Republican-favored policy responses: higher police presence, greater sense of safety around law enforcement, and lower support for marijuana legalization on state ballot initiatives.&lt;/p&gt;
&lt;p&gt;Pre-trend tests show no relationship between 1996 cancer mortality and outcomes before OxyContin&amp;rsquo;s launch. Out-of-sample exercises using 1976 cancer mortality find no analogous pattern in the pre-epidemic period (1982–1994). Placebo instruments based on unrelated causes of death yield null results. The baseline findings are robust to controlling for the China import shock, NAFTA, the 1994 Republican Revolution, the 2001 and 2008–2009 recessions, declining unionization, robot adoption, Fox News introduction, deaths of despair, and Southern and rural political realignment.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s central research question?
A: The paper asks whether the opioid epidemic causally increased Republican vote share in communities most severely affected by the crisis. It documents a causal chain from pharmaceutical marketing through drug mortality and economic hardship to political realignment, contributing the first causal estimate of a major public health crisis&amp;rsquo;s effect on partisan voting.&lt;/p&gt;
&lt;p&gt;Q: What is the identification strategy, and why is 1996 cancer mortality a valid instrument?
A: Purdue Pharma explicitly targeted physicians in the cancer pain market at OxyContin&amp;rsquo;s 1996 launch, then used those established relationships to expand into the noncancer pain market. CZs with higher cancer mortality in 1996 received disproportionate marketing, generating differential opioid prescription growth unrelated to pre-existing political or economic trends. Pre-trend tests confirm no differential patterns before 1996, out-of-sample tests using 1976 cancer mortality find no relationship with pre-epidemic outcomes, and placebos using unrelated causes of death yield null results.&lt;/p&gt;
&lt;p&gt;Q: How strong is the first stage linking 1996 cancer mortality to opioid prescriptions?
A: The relationship between 1996 cancer mortality and opioid prescriptions is positive and statistically significant from 1998 through 2020. By 2012—the year prescription rates peaked nationally at 81.3 per 100 persons—a one-standard-deviation higher cancer mortality rate led to an additional 0.97 morphine-equivalent doses prescribed per capita, 65% above the baseline mean. CZs in the highest cancer mortality quartile experienced a 1,800% increase in grams of oxycodone per capita between 1997 and 2010, compared to less than half that in the lowest quartile.&lt;/p&gt;
&lt;p&gt;Q: What are the effects on drug-induced mortality?
A: Drug-induced mortality (a broad measure covering deaths from prescription opioids, heroin, and fentanyl) rose continuously in exposed CZs after 1996. By 2017, a one-standard-deviation increase in 1996 cancer mortality caused drug-induced deaths to be 46% above the pre-epidemic average. By 2012, the same increase caused prescription opioid deaths specifically to be 61% higher relative to the pre-epidemic average. Excess mortality was concentrated among individuals under age 55, with no statistically significant effects for those aged 55 and older.&lt;/p&gt;
&lt;p&gt;Q: What are the effects on disability program take-up?
A: Applications for Social Security Disability Insurance (SSDI) rose by 12% and Supplemental Security Income (SSI) applications rose by 7.6% by 2012 for a one-standard-deviation increase in 1996 cancer mortality. These effects persisted: SSDI recipients grew by 15% and SSI recipients by 3.2% by 2020 in similarly exposed CZs. The increases in disability were concentrated among individuals under age 55, paralleling the mortality effects.&lt;/p&gt;
&lt;p&gt;Q: What are the effects on SNAP enrollment?
A: Exposed CZs showed a continuous increase in SNAP enrollment over two decades following the epidemic&amp;rsquo;s onset. By 2020, a one-standard-deviation increase in 1996 cancer mortality corresponded to an 8% increase in the share of the population receiving SNAP benefits, equivalent to 0.14 standard deviations. By 2022, the corresponding figure remains 8%, indicating persistent economic strain in exposed communities.&lt;/p&gt;
&lt;p&gt;Q: What is the magnitude of the political effects in House elections?
A: A one-unit increase in the 1996 cancer mortality rate yielded a 7.9-percentage-point increase in the 2022 Republican House vote share relative to 1996. Scaled to one standard deviation (0.58 units), this corresponds to a 4.5-percentage-point increase in the Republican two-party vote share by the 2022 midterms. The vote-share shift became statistically significant beginning in 2006, but only translated into consistent seat-level Republican gains starting in 2012.&lt;/p&gt;
&lt;p&gt;Q: When did opioid exposure start winning Republicans additional House seats?
A: Although the Republican vote share in exposed areas began increasing around 2006, actual seat flips did not become consistent until 2012. The paper explains this lag by noting that initial vote-share gains were concentrated in communities with low baseline Republican support, where additional votes did not immediately cross the winning threshold. Starting in 2010, median-baseline-Republican CZs also began shifting, enabling additional seat changes.&lt;/p&gt;
&lt;p&gt;Q: How large are the presidential and gubernatorial election effects?
A: In presidential elections, a one-standard-deviation increase in 1996 cancer mortality raised the Republican vote share by 4.6 percentage points. In gubernatorial elections, the same increase raised the Republican vote share by 4.3 percentage points after approximately six election cycles (corresponding to 2017–2020). These effects are described as comparable in magnitude to the difference in Republican vote share between the top and bottom quartiles of NAFTA vulnerability.&lt;/p&gt;
&lt;p&gt;Q: Does the political shift reflect increased polarization toward extremist candidates?
A: No. The paper finds no increase in the probability of electing candidates at the extremes of the Nokken-Poole ideological scale in any given election year. The ideological shift in the House composition arises from changes in which party wins seats rather than from the election of more extreme Republicans. Campaign donations to Republican candidates did not increase; rather, donations to Democratic candidates declined (in 2016, a one-standard-deviation increase in cancer mortality widened the Republican-Democrat donation gap by 0.44 standard deviations). The shift is interpreted as a change in voting preferences in previously Democratic-leaning areas, not heightened polarization.&lt;/p&gt;
&lt;p&gt;Q: Is the shift driven by anti-incumbent sentiment?
A: The authors test this by splitting the sample by the incumbent&amp;rsquo;s party at the time of each election and by redefining the outcome as the incumbent&amp;rsquo;s vote share. Neither exercise produces evidence of a systematic anti-incumbent response. The changes in Republican vote share are not statistically distinguishable based on whether the incumbent was a Republican or Democrat. If anything, after 2016 there is a slight increase in the likelihood of incumbents retaining their seats.&lt;/p&gt;
&lt;p&gt;Q: Where geographically are the Republican gains largest?
A: Using state-level treatment effects estimated from an in-differences model interacting cancer mortality with state-year indicators, the paper finds a strong positive correlation between the magnitude of the epidemic&amp;rsquo;s effect on economic hardship (measured by SNAP participation) and the magnitude of the Republican vote-share increase. This correlation is strongest with a lag: SNAP effects measured in 2006 are most predictive of vote-share shifts in 2022, indicating that deterioration in community economic fabric preceded and predicted the political realignment.&lt;/p&gt;
&lt;p&gt;Q: How did conservative and liberal media differ in covering the opioid epidemic?
A: Republican-leaning local newspapers covered the opioid epidemic more extensively than Democratic-leaning papers throughout the epidemic period, and their coverage tracked local opioid mortality rates; Democratic-leaning coverage showed no such correlation with local incidence. Fox News covered opioid stories at 1.5 times the rate of CNN and 1.7 times the rate of MSNBC. In terms of content, Republican-leaning newspapers showed 23% higher frequency of economic hardship keywords, 19% higher frequency of illegal activity and crime keywords, and 22% higher frequency of rehabilitation and treatment keywords relative to Democratic-leaning papers. Fox News emphasized crime, drug trafficking, and cartels at double the frequency of more liberal outlets.&lt;/p&gt;
&lt;p&gt;Q: How did voter policy preferences align with Republican versus Democratic platforms?
A: Using 2020 CCES data, the authors find that higher 1996 cancer mortality predicts a greater expressed preference for increasing the number of police officers on the street and a greater reported sense of safety around law enforcement—both consistent with the Republican Party&amp;rsquo;s law enforcement approach. Conversely, exposure to the epidemic predicts lower support for marijuana legalization on state ballot initiatives across 18 states from 2012 to 2023, indicating opposition to a key Democratic harm-reduction policy.&lt;/p&gt;
&lt;p&gt;Q: What role did political actors themselves play in driving the realignment?
A: Relatively little. The opioid epidemic was largely absent from House floor speeches until 2015 and from campaign advertising until 2020. Neither party took a clear legislative lead on the issue during the first two decades of the crisis. The authors interpret the political realignment as driven primarily by the Republican Party&amp;rsquo;s broader repositioning as the champion of working-class economic hardship and by differential media framing, rather than by active legislative competition over opioid policy.&lt;/p&gt;
&lt;p&gt;Q: What major confounds are ruled out?
A: The authors control for exposure to the China import shock, NAFTA, the 1994 Republican Revolution, the 2001 and 2008–2009 recessions, declining unionization, robot adoption, Fox News entry, deaths of despair (which include but are not limited to opioid deaths), and the political realignment of the South, rural areas, evangelicals, and the population over 65. Results remain robust across all these specifications. Placebo instruments using unrelated causes of death yield null results.&lt;/p&gt;
&lt;p&gt;Q: Could the vote-share effects be mechanically driven by opioid-related deaths removing Democratic voters from the electorate?
A: The authors perform a back-of-the-envelope calculation and estimate that even if all opioid-related deaths would have been Democratic votes, the mechanical effect on the Republican vote share is at most 0.22 percentage points relative to the observed 2020 vote share—far smaller than the estimated 4.5-percentage-point shift by 2022. The result is also inconsistent with a turnout mechanism, as voter turnout shows no meaningful change with epidemic exposure.&lt;/p&gt;
&lt;p&gt;Opioid epidemic exposure instrument: The paper measures community-level exposure to the opioid epidemic using cancer mortality rates in 1996, the year OxyContin launched. This instrument is grounded in Purdue Pharma&amp;rsquo;s documented marketing strategy of targeting the cancer pain market first; areas with more cancer patients received disproportionate pharmaceutical marketing, generating differential opioid prescription growth that extended well beyond cancer patients to the broader noncancer population through shared physicians.&lt;/p&gt;
&lt;p&gt;Commuting zone (CZ): The paper&amp;rsquo;s unit of geographic analysis, defined to capture local economic markets. There are 720 CZs in the US, encompassing all metropolitan and nonmetropolitan areas. The authors use 625 CZs with more than 20,000 residents, which account for more than 99% of all opioid deaths and total population.&lt;/p&gt;
&lt;p&gt;Two-party Republican vote share: The ratio of votes for Republican candidates to the total votes for both Republican and Democratic candidates in a given election. The paper tracks this measure for House, presidential, and gubernatorial elections from 1976 or 1982 through 2020 or 2022, depending on data availability.&lt;/p&gt;
&lt;p&gt;Drug-induced mortality: The paper&amp;rsquo;s broadest mortality measure, covering deaths from poisoning and medical conditions caused by legal or illegal drugs, including prescription opioids, heroin, and synthetic opioids such as fentanyl. It is distinguished from the narrower measures of prescription opioid deaths and all opioid deaths.&lt;/p&gt;
&lt;p&gt;Issue ownership: The political science concept, used in the paper to describe how the Republican Party repositioned itself during the epidemic period as the voice of working-class economic hardship, &amp;ldquo;forgotten America,&amp;rdquo; and &amp;ldquo;America left behind.&amp;rdquo; The paper contrasts this with Democratic ownership of income inequality and argues that Republican ownership of the hardship narrative made the party&amp;rsquo;s message especially salient in heavily opioid-affected communities.&lt;/p&gt;
&lt;p&gt;Path dependency in pharmaceutical marketing: Purdue&amp;rsquo;s strategy of concentrating initial OxyContin promotion in cancer-market areas, then later focusing on top-prescribing physicians (the highest three deciles of the distribution), meant that areas receiving high initial cancer-market promotion continued to receive disproportionate promotion as the company expanded to the noncancer market. This created a persistent targeting advantage for high-cancer CZs throughout the epidemic&amp;rsquo;s first wave.&lt;/p&gt;
&lt;p&gt;Nokken-Poole ideological measure: A roll-call-based measure of elected House members&amp;rsquo; ideology along the liberal-conservative dimension. The paper uses this measure to show that the epidemic shifted the composition of the House toward more conservative members, not by electing more extreme candidates in any given election, but by changing which party won seats over time.&lt;/p&gt;</description></item><item><title>Revolutionary Transition: Inheritance Change and Fertility Decline</title><link>https://macropaperwarehouse.com/papers/revolutionary-transition-inheritance-change-and-fertility-decline/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/revolutionary-transition-inheritance-change-and-fertility-decline/</guid><description>&lt;p&gt;Gay, Gobbi, and Goñi test Le Play&amp;rsquo;s (1875) hypothesis that the French Revolution contributed to France&amp;rsquo;s early fertility decline by abolishing impartible inheritance. In 1793, a series of decrees culminating in the Loi de Nivôse (January 6, 1794) abolished testamentary rights and imposed equal partition of assets among all children — partible inheritance — across France, overriding the mosaic of local customs and written laws that had governed inheritance in the Ancien Régime.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central argument is that this reform reduced the economic incentive to have children through indivisibility constraints in agricultural land. Under impartible inheritance, land passed to a single heir undivided, keeping plots above the subsistence productivity threshold even at high fertility. Under partible inheritance, each additional child fragments the land further, potentially pushing plots below the minimum productive size, so households face a strong incentive to limit fertility. A Stone-Geary production function with a minimum land threshold L̄ formalizes this mechanism: when landholdings fall in the binding range (L̄ &amp;lt; L &amp;lt; L̃), fertility is strictly higher under impartible than under partible inheritance.&lt;/p&gt;
&lt;p&gt;The authors construct the first complete map of inheritance rules across France&amp;rsquo;s 435 judicial districts as of 1789, classifying each along two dimensions: partible versus impartible, and whether women were included or excluded. This atlas draws on Brette&amp;rsquo;s (1904) Atlas des Bailliages and the Nouveau Coutumier Général (Bourdot de Richebourg 1724), covering 141 distinct customs. Treatment is defined as municipalities under impartible inheritance before 1793 whose system was altered by the reforms; control municipalities were already under partible inheritance.&lt;/p&gt;
&lt;p&gt;The main identification strategy is a difference-in-differences (DD) design comparing women with varying lengths of remaining fertile years after 1793 — from 0 for women aged 40+ at the reform to 25 for women aged 15 or younger — across treated and untreated municipalities. This is augmented by a regression-discontinuity difference-in-differences (RD-DD) design exploiting sharp discontinuities at judicial district borders. Two independent datasets are used: the Enquête Louis Henry (34,812 women in 39 rural municipalities, family-reconstitution method) and Geni.com crowdsourced genealogies (11,649 women across 2,966 locations after the Blanc 2023 horizontal restriction).&lt;/p&gt;
&lt;p&gt;Each additional fertile year of exposure to the 1793 reforms reduced completed fertility by approximately 1 percent. Over the full 25-year fertile cycle, this corresponds to a reduction of roughly 0.7 children, or 24 percent relative to the pre-reform mean of 2.92 surviving children in treated areas. This magnitude equals the entire pre-reform fertility gap between impartible- and partible-inheritance areas (2.9 versus 2.2 children), meaning the reforms closed this gap entirely. DD and RD-DD estimates are similar and not statistically distinguishable from each other, and results replicate across both datasets. Results hold on both the extensive margin (childlessness) and intensive margin (fertility of mothers).&lt;/p&gt;
&lt;p&gt;The mechanism is most relevant where smallholder landownership is widespread. France — where 40–80 percent of households owned land at the eve of the Revolution — meets this condition. England and Prussia, with more concentrated landownership, would not be expected to show the same response because the indivisibility constraint would not bind even after partition.&lt;/p&gt;
&lt;p&gt;Q: What was France&amp;rsquo;s inheritance system before the Revolution, and how heterogeneous was it?
A: Before 1793, inheritance was governed by 141 distinct customary and written laws applied within 435 judicial districts. The country was broadly divided between the customary-law north (Pays de droit coutumier) and the Roman written-law south (Pays de droit écrit), with substantial local variation within regions. Systems ranged from strictly partible (equal division among all offspring) to impartible (primogeniture, ultimogeniture, or unigeniture). Systems also varied in whether women could inherit or received only a dowry. This geographic variation — rooted in the laws of Germanic peoples after the fall of Rome in 476 CE — is exogenous to late eighteenth-century economic conditions and provides the identifying variation for the paper.&lt;/p&gt;
&lt;p&gt;Q: What exactly did the 1793 reforms change, and were they enforced?
A: The Loi de Nivôse an II (January 6, 1794) abolished testamentary rights entirely and mandated equal partition of assets among all children, including women, throughout France. The reforms came unexpectedly — only 8 of 571 cahiers de doléances analyzed by Goy (1988) mentioned inheritance — and were motivated by the equality principle, legal unification, and the fear that revolutionary sympathizers would be disinherited (Lataste et al. 1901). Offspring quickly asserted their new rights, and by the late 1790s inheritance disputes were the most common cases before family tribunals (Desan 1997; Poumarède 2011).&lt;/p&gt;
&lt;p&gt;Q: What is the model&amp;rsquo;s core mechanism linking inheritance reform to fertility decline?
A: The model uses a Stone-Geary production function with a minimum land threshold L̄ below which output falls to zero. Under impartible inheritance, land passes undivided to a single heir, keeping the farm above L̄ regardless of family size. Under partible inheritance, each child receives an equal share, so adding children risks fragmenting plots below L̄ — a powerful incentive to limit family size. The fertility gap between impartible and partible households is at its maximum when landholdings fall in the intermediate range (L̄ &amp;lt; L &amp;lt; L̃) where the constraint is binding. As land size increases, the constraint becomes less binding but the positive fertility differential persists.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s main quantitative estimate of the reform&amp;rsquo;s effect on completed fertility?
A: Each additional fertile year of exposure to the 1793 reforms reduced completed fertility by approximately 1 percent. Over the full 25-year fertile cycle (ages 15–40), this implies a reduction of roughly 0.7 children, or 24 percent relative to the pre-reform mean of 2.92 surviving children in treated areas. This is nearly identical to the pre-existing fertility gap between impartible- and partible-inheritance areas (0.7 children: 2.9 versus 2.2 surviving children), implying the reforms effectively eliminated the fertility differential.&lt;/p&gt;
&lt;p&gt;Q: Are the DD and RD-DD estimates consistent with each other, and do both datasets agree?
A: Yes. The DD and RD-DD estimates are similar and not statistically different from each other. The RD-DD design compares women born close to judicial district borders where inheritance rules differed, before and after 1793, exploiting the sharp spatial discontinuity at those borders. Consistency across these two designs — which rely on different identifying assumptions — strengthens causal interpretation. Results are also consistent across the Enquête Louis Henry (family-reconstitution) and Geni.com (crowdsourced genealogies) datasets, which are produced by fundamentally different methodologies.&lt;/p&gt;
&lt;p&gt;Q: How do the authors verify the parallel trends assumption?
A: Figure 6 shows that for cohorts who completed their fertile cycle before 1793, fertility trended downward in parallel across partible- and impartible-inheritance areas: a constant gap of approximately 0.7 children was maintained from women born in the early 1700s (3 versus 2.3 children) through women born in the early 1750s (2.7 versus 2.0 children), the last cohorts to complete fertility before the reforms. The convergence — from 0.7 to 0 children — only begins among cohorts fertile after 1793. The authors also include flexible trend controls interacted with municipality-level religiosity, political support for the Revolution, proximity to administrative centers, and wheat prices, and confirm the main estimate is robust.&lt;/p&gt;
&lt;p&gt;Q: What role did the extension of inheritance rights to women play?
A: The extension of rights to women was a companion mechanism distinct from abolishing impartible inheritance. Beyond increasing the number of heirs (which directly reduces land per heir), the right to inherit improves a woman&amp;rsquo;s outside option and postpones entry into marriage, following de Moor and van Zanden (2010). The DD and RD-DD estimates suggest that including women in inheritance and abolishing impartible inheritance had similar effects on fertility. The paper treats these as separate but reinforcing channels.&lt;/p&gt;
&lt;p&gt;Q: How do the authors address potential confounders — mortality, migration, and economic conditions?
A: On mortality: child mortality did not evolve differently after 1793 across areas with different inheritance rules (Appendix Table A3), and baseline adult mortality (age at death, probability of dying before completing the fertile cycle) was balanced across treated and control areas (Table 1). On migration: the authors explicitly rule out that results are driven by migration. On economic conditions: municipality-specific decade-average wheat prices (Ridolfi 2019) are included as controls for local Malthusian dynamics, and results are robust to their inclusion.&lt;/p&gt;
&lt;p&gt;Q: What do the balance tests show?
A: Panel A of Table 1 shows that before the reforms, areas with impartible versus partible inheritance were balanced on 9 of 11 individual-level characteristics — including husband and wife age at death, probability of dying before completing the fertile cycle, probability that parents-in-law were alive at marriage, literacy, data accuracy, and age at marriage. The only systematic pre-reform difference was fertility itself (0.7 children). Municipality-level climatic variables, soil suitability, and proxies for mortality uncertainty were also balanced. This is consistent with the origins of these systems in post-Roman Germanic law, which are unrelated to late eighteenth-century economic conditions.&lt;/p&gt;
&lt;p&gt;Q: What robustness checks are reported?
A: The authors report: (1) permutation tests reshuffling treatment exposure across women and municipalities; (2) non-linear treatment effects across cohorts, showing the heterogeneity required to explain away the baseline estimate is implausibly large per de Chaisemartin and d&amp;rsquo;Haultfoeuille (2020); (3) exclusion of outlier municipalities; (4) a placebo test for cohorts who completed their fertile cycle before 1793; (5) robustness to alternative sample definitions, treatment definitions, outcome variables, and control groups; (6) Cummins (2020) first-name repetition technique to correct for under-reported child deaths in Henry; (7) terrain characteristics including climatic and soil suitability (Galor and Özak 2016) and ruggedness (Nunn and Puga 2012); (8) for RD-DD: alternative bandwidths, running variable specifications, kernel functions, samples, and border-segment fixed effects. All checks support the main finding.&lt;/p&gt;
&lt;p&gt;Q: Why did France experience a fertility decline from inheritance reform while other countries with similar reforms did not?
A: The model rationalizes this through landownership structure. The fertility-reducing mechanism operates through indivisibility constraints that bind only when landholdings are small and fragmented — as in France, where 40–80 percent of households owned their land and plots were small. Where landownership is concentrated (England, Prussia), land per heir remains above L̄ even after partible division, so the indivisibility constraint is non-binding and fertility is unaffected by the reform. This provides a structural reason why France&amp;rsquo;s particular agrarian structure made it uniquely susceptible to this mechanism.&lt;/p&gt;
&lt;p&gt;Q: What is the broader historical significance for understanding France&amp;rsquo;s early demographic transition?
A: France&amp;rsquo;s fertility decline began roughly 50 years before industrialization, making it anomalous relative to standard quantity-quality tradeoff theories linking fertility decline to technological progress and rising returns to human capital. The 1793 reforms provide a legal-institutional explanation for the sharp post-Revolution acceleration visible in Figure 1, which is difficult to attribute to slowly-evolving cultural factors or human capital considerations not yet operative. The estimates imply the reforms brought large impartible-inheritance areas to the low-fertility regime that already characterized partible-inheritance areas, thus sharply accelerating the national transition.&lt;/p&gt;
&lt;p&gt;Impartible inheritance: A system under which parents could designate a single heir (through primogeniture, ultimogeniture, or unigeniture) to receive the bulk of the family estate, preventing fragmentation of wealth; in pre-revolutionary France this was associated with extended family households and higher fertility (2.9 surviving children on average) relative to partible areas (2.2).&lt;/p&gt;
&lt;p&gt;Partible inheritance: A system under which family wealth was divided equally among all offspring upon death; in the paper&amp;rsquo;s model this creates an incentive to limit fertility to prevent land fragmentation below the subsistence productivity threshold L̄.&lt;/p&gt;
&lt;p&gt;Indivisibility constraint (land threshold L̄): In the Stone-Geary production function, a minimum land input below which agricultural output falls to zero; this is the mechanism through which partible inheritance generates fertility-limiting incentives, since dividing a small plot among many heirs risks crossing L̄ into zero production.&lt;/p&gt;
&lt;p&gt;Difference-in-differences (DD) exposure design: The paper&amp;rsquo;s main identification strategy, using remaining fertile years after 1793 as a continuous treatment-intensity variable (0 for cohorts past fertility at the reform date, up to 25 for cohorts entirely within their fertile years), compared between treated municipalities (impartible → partible) and control municipalities (already partible).&lt;/p&gt;
&lt;p&gt;Regression-discontinuity difference-in-differences (RD-DD): An augmented design exploiting the sharp geographic discontinuity at borders between judicial districts with different pre-reform inheritance rules, comparing outcomes on both sides before and after 1793, to address smooth unobserved confounders.&lt;/p&gt;
&lt;p&gt;Completed fertility (net): The number of children surviving to age six, preferred over total births because child mortality before 1800 was high (1–1.5 children per mother did not survive to age six per Houdaille 1984), making net fertility the more economically meaningful measure for inheritance and bequest decisions.&lt;/p&gt;
&lt;p&gt;Horizontal restriction: A sampling correction applied to crowdsourced genealogical data (Blanc 2023a) that retains an observation only if at least one of the four preceding generations has more than one recorded offspring, correcting for the over-representation of single-child families that arises because Geni users tend to record direct ancestors rather than collateral relatives.&lt;/p&gt;</description></item><item><title>Robot adoption and inflation dynamics</title><link>https://macropaperwarehouse.com/papers/robot-adoption-and-inflation-dynamics/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/robot-adoption-and-inflation-dynamics/</guid><description>&lt;h2 id="robot-adoption-and-inflation-dynamics"&gt;Robot Adoption and Inflation Dynamics&lt;/h2&gt;
&lt;h3 id="research-question"&gt;Research Question&lt;/h3&gt;
&lt;p&gt;Basso and Rachedi investigate how robot adoption influences inflation dynamics — specifically, whether the surge in automation during the 2000s and 2010s can explain the muted sensitivity of inflation to unemployment (the &amp;ldquo;flat Phillips curve&amp;rdquo;) observed in advanced economies prior to the Covid pandemic, and whether the same framework can account for the subsequent resurgence of steep inflation-unemployment co-movement.&lt;/p&gt;
&lt;h3 id="data-and-methodology"&gt;Data and Methodology&lt;/h3&gt;
&lt;p&gt;The empirical analysis uses an annual panel covering 384 U.S. metropolitan statistical areas (MSAs) from 2008 to 2018. The dependent variables are non-tradable goods inflation (log-difference of services prices excluding rents and utilities, from BEA regional price parities) and wage inflation (log-difference of average compensation per job). Robot adoption at the MSA-year level is constructed following Acemoglu and Restrepo (2020a): industry-level robots per employee at the U.S. national level are weighted by industry employment shares in each MSA, yielding an MSA-year robot-per-employee ratio.&lt;/p&gt;
&lt;p&gt;The regression specification extends Hazell et al. (2022) by adding an interaction term between the lagged unemployment rate and the (demeaned) robot-per-employee ratio, along with MSA and year fixed effects. Year fixed effects absorb common inflation expectations and the endogenous response of monetary policy to aggregate demand shocks. To address endogeneity, unemployment is instrumented with a Bartik shift-share variable of tradable demand spillovers, and robot adoption is instrumented with average industry-level robot penetration in the five largest European economies — under the identifying assumption that robot demand shocks are weakly correlated across advanced countries.&lt;/p&gt;
&lt;p&gt;The theoretical framework is a New Keynesian model augmented with (i) directed search frictions in the labor market, and (ii) producer-level automation decisions in the spirit of Acemoglu and Restrepo (2020a). Producers pay a fixed entry cost, draw idiosyncratic efficiency for employing labor, and then choose between a robot technology (certain output at low efficiency) and a labor technology (uncertain hiring, higher potential efficiency). This generates an automation threshold: low-efficiency producers install robots, displacing low-wage jobs. A Taylor rule closes the model. Quantitative exercises compare two steady states calibrated to robot-per-employee ratios of 0.2% (low automation, targeting the U.S. in the early 2000s) and 0.6% (high automation, calibrated to one standard deviation of robot penetration variation across MSAs).&lt;/p&gt;
&lt;h3 id="main-findings"&gt;Main Findings&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Empirical.&lt;/strong&gt; In the baseline IV regression, a one standard deviation increase in robot adoption reduces the sensitivity of price inflation to unemployment by 17%, and the sensitivity of wage inflation to unemployment by 9%, relative to a MSA with the average robot penetration. The larger flattening effect on price inflation than on wage inflation implies that robot adoption also diminishes the pass-through from wages to prices. All three effects are statistically significant at the 5% level, and are robust to controls for demographic structure (age composition, gender/race/education participation rates, MPC heterogeneity), occupational structure (abstract, routine, manual, and offshorable occupations), and import competition exposure (Chinese and Mexican import shares).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model quantification.&lt;/strong&gt; Comparing the high-automation to the low-automation steady state, the model generates a 14% reduction in the slope of the price Phillips curve and a 13% reduction in the slope of the wage Phillips curve, conditional on the same-sized demand shocks in both economies. The price Phillips curve result accounts for 82% of the empirical estimate (17%). The model overstates the flattening of the wage Phillips curve (13% vs. 9% in the data), and therefore understates the reduction in the wage-to-price pass-through.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanisms.&lt;/strong&gt; Automation flattens the Phillips curve through two primary channels. First, the outside option of automating production reduces workers&amp;rsquo; bargaining power and dampens the elasticity of wages to unemployment (the &amp;ldquo;Wage Setting Effect&amp;rdquo;). Second, a higher share of robot firms reduces the aggregate labor share, muting the pass-through from wages into prices (the &amp;ldquo;Steady State Effect&amp;rdquo;). A third channel — firms cyclically substituting workers for machines in response to a shock (the &amp;ldquo;Cyclical Effect&amp;rdquo;) — operates during the transition but the Wage Setting Effect accounts for the bulk of the flattening.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Non-linearity and the post-Covid resurgence.&lt;/strong&gt; When robot-production is subject to convex adjustment costs, the threat of automation that underlies the Wage Setting Effect becomes inoperative during large expansionary shocks. When investment in machines surges, the marginal cost of producing robots rises sharply, raising the price of machines and pushing the automation threshold downward — more firms must use labor. Workers then negotiate higher wages, which pass into prices. Conditional on small demand shocks, the high-automation economy still exhibits a flatter Phillips curve than the low-automation economy. Conditional on large demand shocks (simulated as a 2 percentage point drop in unemployment), there is no difference in the inflation response between the low- and high-automation economies, so the Phillips curve reverts to steep.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-exact-empirical-specification-and-how-does-it-map-to-a-structural-object"&gt;Q1. What is the exact empirical specification and how does it map to a structural object?&lt;/h3&gt;
&lt;p&gt;The regression is: non-tradable goods inflation = β × lagged unemployment + γ × (lagged unemployment × demeaned robot adoption) + ζ × lagged robot adoption + χ × relative non-tradable price + MSA fixed effects + year fixed effects + error. In a multi-region model without automation, Hazell et al. (2022) show that the coefficient β identifies the aggregate slope of the Phillips curve because year fixed effects absorb both common inflation expectations and the endogenous monetary policy response to aggregate demand shocks. Adding the interaction term extends this logic: γ identifies how robot adoption causally shifts the slope of the local Phillips curve, which maps into changes in the aggregate slope.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-first-stage-instruments-and-why-are-they-valid"&gt;Q2. What are the first-stage instruments and why are they valid?&lt;/h3&gt;
&lt;p&gt;Unemployment is instrumented with local tradable demand spillovers — a Bartik variable weighting national industry value-added growth (excluding each MSA&amp;rsquo;s own contribution) by each MSA&amp;rsquo;s average industry value-added shares, so national supply disturbances uncorrelated with MSA-level heterogeneity generate plausibly exogenous unemployment variation. Robot adoption is instrumented with the implied robot-per-employee ratio obtained by replacing U.S. industry robot installations with the average across the five largest European economies, weighted by U.S. industry employment shares; this isolates the supply-side efficiency improvements in robot technology that drove global adoption, conditional on robot demand shocks being weakly correlated across countries. The correlation between the two instruments in the sample is 0.2, ensuring they do not strongly co-move.&lt;/p&gt;
&lt;h3 id="q3-what-are-the-point-estimates-and-their-magnitudes-in-the-baseline-iv-regression"&gt;Q3. What are the point estimates and their magnitudes in the baseline IV regression?&lt;/h3&gt;
&lt;p&gt;For price inflation (Panel A, Column 4), the base sensitivity β = −0.5069 (SE 0.1381, significant at 1%), and the interaction coefficient γ = 0.0066 (SE 0.0030, significant at 5%). For wage inflation (Panel B, Column 4), β = −0.9580 (SE 0.2450, significant at 1%), and γ = 0.0049 (SE 0.0024, significant at 5%). A one standard deviation increase in robot adoption reduces price inflation sensitivity by 17% and wage inflation sensitivity by 9% relative to the average-automation MSA.&lt;/p&gt;
&lt;h3 id="q4-what-does-the-difference-in-flattening-magnitudes-17-for-prices-vs-9-for-wages-imply-about-the-wage-price-pass-through"&gt;Q4. What does the difference in flattening magnitudes (17% for prices vs. 9% for wages) imply about the wage-price pass-through?&lt;/h3&gt;
&lt;p&gt;Because automation reduces the price Phillips curve slope by proportionally more than the wage Phillips curve slope, each percentage-point change in wages translates into a smaller percentage-point change in prices in higher-automation areas. This indicates that robot adoption diminishes the influence of wage changes on price changes — i.e., it reduces the wage-to-price pass-through. In the model, this operates through the Steady State Effect: a larger share of production carried out by robot firms means that a given change in average wages applies to a smaller portion of total marginal costs, weakening the price response.&lt;/p&gt;
&lt;h3 id="q5-how-is-the-automation-threshold-determined-in-the-theoretical-model-and-what-economic-forces-govern-it"&gt;Q5. How is the automation threshold determined in the theoretical model, and what economic forces govern it?&lt;/h3&gt;
&lt;p&gt;A producer opts for the labor technology if and only if the expected value of a labor firm (= job-filling probability × (producer price × labor efficiency − posted wage) − entry cost) exceeds the value of a robot firm (= producer price × robot efficiency − machine price − entry cost). Since the value of a labor firm increases in labor efficiency, there is a unique cut-off efficiency level γ* at which a producer is indifferent. Producers with labor efficiency above γ* post vacancies; those below γ* install robots. The cut-off rises (more automation) when wages rise relative to machine prices, and falls (less automation) when machine prices rise due to costly robot production during large expansionary shocks.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-wage-posting-equilibrium-under-directed-search-generate-the-wage-setting-effect-of-automation"&gt;Q6. How does the wage-posting equilibrium under directed search generate the Wage Setting Effect of automation?&lt;/h3&gt;
&lt;p&gt;Under directed search, each labor firm posts a wage to maximize its expected value, and workers sort into sub-markets offering higher wages but lower job-finding probabilities. The equilibrium posted wage for a firm with labor efficiency γj is Wγj,t = PP,t × γj × (1 − η), where η is the elasticity of matches to vacancies. The option to install a robot — available at any time — limits how much any individual firm needs to offer workers. When automation increases, the outside option becomes more attractive to more firms, which constrains wage offers industry-wide, reducing the elasticity of average wages to unemployment fluctuations.&lt;/p&gt;
&lt;h3 id="q7-how-is-the-slope-of-the-price-phillips-curve-characterized-analytically"&gt;Q7. How is the slope of the price Phillips curve characterized analytically?&lt;/h3&gt;
&lt;p&gt;Log-linearizing the model around the steady state and substituting labor market and wholesaler equilibrium conditions into the pricing equation yields: inflation = −[(ε−1)/φ] × Ψ(γ*; Θ) × unemployment gap + β × expected future inflation, where Ψ(γ*; Θ) is a function of the automation cut-off γ*, the elasticity of substitution ε, the matching function elasticity η, the efficiency bounds γM and γH, and the distribution shape parameter α. In contrast to standard New Keynesian models where the slope depends only on markup and nominal rigidity parameters, this expression depends directly on the degree of automation through the steady-state threshold γ*.&lt;/p&gt;
&lt;h3 id="q8-across-different-structural-parameter-configurations-does-automation-always-flatten-the-phillips-curve"&gt;Q8. Across different structural parameter configurations, does automation always flatten the Phillips curve?&lt;/h3&gt;
&lt;p&gt;Yes. Numerical analysis of the closed-form Phillips curve expression (Figure 1) shows that robot adoption unambiguously decreases the slope of the price Phillips curve across all combinations of the key structural parameters — the distribution shape parameter α, the matching elasticity η, the upper bound of labor efficiency γH, and the steady-state unemployment rate ū. The flattening effect is more pronounced when η is low, when α implies a larger fraction of low-efficiency producers, and when the steady-state unemployment rate is low.&lt;/p&gt;
&lt;h3 id="q9-how-do-the-three-mechanism-channels-cyclical-wage-setting-steady-state-compare-quantitatively"&gt;Q9. How do the three mechanism channels (Cyclical, Wage Setting, Steady State) compare quantitatively?&lt;/h3&gt;
&lt;p&gt;The paper isolates channels by comparing alternative model specifications: (i) Baseline directed search with endogenous automation, (ii) Directed search with fixed automation (removing Cyclical and Wage Setting Effects, leaving only the Steady State Effect), (iii) Random search with τ = 0.5 (efficient bargaining, retaining both the Cyclical and Wage Setting Effects), (iv) Random search with τ = 0.01 (near-zero worker bargaining power, removing the Wage Setting Effect but retaining the Cyclical Effect). Figure 5 shows that the Steady State Effect alone accounts for only a small portion of the total inflation differential between low- and high-automation economies. The Wage Setting Effect — isolated by comparing τ = 0.01 and τ = 0.5 economies with endogenous automation — accounts for the bulk of the flattening. The Cyclical Effect (isolated by comparing fixed and endogenous automation with τ = 0.01) contributes an intermediate amount.&lt;/p&gt;
&lt;h3 id="q10-what-is-the-quantitative-exercise-comparing-low--and-high-automation-steady-states"&gt;Q10. What is the quantitative exercise comparing low- and high-automation steady states?&lt;/h3&gt;
&lt;p&gt;The low-automation economy targets the U.S. robot-per-employee ratio of 0.2% in the early 2000s (Acemoglu and Restrepo, 2020a), calibrated with robot-specific technological change ζ = 2. The high-automation economy features a 200% higher robot-per-employee ratio of 0.6%, calibrated to replicate one standard deviation of cross-MSA dispersion in robot penetration in the data. Both economies are simulated with 10,000 realizations of preference shocks, and the slopes of the price and wage Phillips curves are estimated from simulated inflation and unemployment outcomes. The price Phillips curve flattens by 14% and the wage Phillips curve by 13% moving from low to high automation, conditional on the same-sized shock in both economies.&lt;/p&gt;
&lt;h3 id="q11-how-does-the-model-account-for-the-covid-era-resurgence-of-high-inflation-despite-high-automation"&gt;Q11. How does the model account for the Covid-era resurgence of high inflation despite high automation?&lt;/h3&gt;
&lt;p&gt;The paper extends the machine manufacturer&amp;rsquo;s production function to include an asymmetric convex adjustment cost that activates when investment deviates more than 5% from its steady-state level (parameterized with δ = 0.0015 and ϱ = 100). Under a small expansionary shock (0.25 percentage point decrease in unemployment), inflation rises less in the high-automation economy, consistent with a flat Phillips curve. Under a large expansionary shock (2 percentage point decrease in unemployment), the surge in robot investment triggers sharply rising machine prices, eliminating the automation outside option for marginal producers and fully restoring workers&amp;rsquo; bargaining power — so the inflation response is identical in the low- and high-automation economies, consistent with a steep Phillips curve. The paper interprets this as a proof-of-concept consistent with post-Covid wage compression evidence for low-wage workers documented by Autor, Dube, and McGrew (2023).&lt;/p&gt;
&lt;h3 id="q12-what-do-the-robustness-checks-establish-regarding-alternative-explanations"&gt;Q12. What do the robustness checks establish regarding alternative explanations?&lt;/h3&gt;
&lt;p&gt;The interaction of unemployment and robot adoption remains statistically significant at the 5% level across all the robustness checks (Appendix A). These include controlling for: (i) demographic heterogeneity — shares of young (below 30) and old (above 60) individuals, female/Black/Asian labor market participation, low-education attainment shares, overall participation, and MSA-level average marginal propensity to consume (MPC); (ii) occupational structure — shares of abstract, routine, manual, and offshorable occupations; and (iii) import competition — MSA exposure to Chinese and Mexican import competition. The coefficient on the robot-unemployment interaction term is stable across specifications, with the magnitude remaining close to that in the baseline (approximately 0.0140 across all demographic robustness columns in Table A.1).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Automation threshold (γ&lt;/em&gt;):&lt;/em&gt;* The paper-specific level of idiosyncratic labor efficiency at which a producer is indifferent between installing a robot and posting a vacancy. Producers with labor efficiency below γ* choose the machine technology; those above choose the labor technology. The threshold is determined by the relative profitability of the two technologies, and it shifts endogenously with wages, machine prices, and job-filling probabilities. A higher γ* means more of the production sector is automated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage Setting Effect of automation:&lt;/strong&gt; The channel through which the existence of the outside option to install robots reduces workers&amp;rsquo; bargaining power and dampens the elasticity of wages to unemployment fluctuations. Under directed search, firms&amp;rsquo; ability to substitute machines for labor at a lower cost constrains the wage offers they need to post, so that a given decline in unemployment generates a smaller increase in average wages in higher-automation economies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Steady State Effect of automation:&lt;/strong&gt; The channel through which a larger steady-state fraction of robot firms reduces the aggregate labor share, so that even a given change in wages translates into a smaller change in aggregate marginal costs and prices. This channel operates even when automation cannot change upon a shock (fixed automation baseline).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cyclical Effect of automation:&lt;/strong&gt; The channel through which firms actively replace workers with machines in response to expansionary shocks that raise wages, generating an endogenous dampening of labor demand and putting downward pressure on the wage increase itself. This channel requires endogenous automation choices at business-cycle frequencies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Robot-specific technological change (ζ):&lt;/strong&gt; In the paper&amp;rsquo;s model, the parameter governing the efficiency with which machine manufacturers transform final goods into robots. A higher ζ reduces the relative price of machines (PM/P = 1/ζ), making automation more attractive to lower-efficiency producers and raising the automation threshold γ*. In quantitative exercises, variation in ζ across steady states drives differences in the degree of automation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Price Phillips curve slope (Ψ):&lt;/strong&gt; In the paper&amp;rsquo;s log-linearized model, the structural coefficient linking inflation to the unemployment gap. Unlike in standard New Keynesian models — where the slope depends only on the markup and nominal rigidity — Ψ is a function of the automation threshold γ*, the matching elasticity η, the efficiency distribution parameters (γM, γH, α), and the elasticity of substitution ε. Robot adoption shifts γ* and thereby changes Ψ.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Asymmetric investment adjustment cost:&lt;/strong&gt; An extension of the machine manufacturer&amp;rsquo;s production function that imposes convex costs when robot investment deviates above 5% from its steady-state level (parameterized by δ and ϱ). This specification makes it increasingly costly to rapidly scale up automation in response to large demand shocks, causing the machine price to spike and the automation outside option to cease being effective for marginal producers, thereby restoring workers&amp;rsquo; bargaining power and steepening the Phillips curve during large expansionary episodes.&lt;/p&gt;</description></item><item><title>Robust Real Rate Rules</title><link>https://macropaperwarehouse.com/papers/robust-real-rate-rules/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/robust-real-rate-rules/</guid><description>&lt;p&gt;The paper proposes and analyzes &lt;strong&gt;real rate rules&lt;/strong&gt; — monetary policy rules of the form i_t = r_t + φπ_t (φ &amp;gt; 1), where r_t is the current-period real interest rate observed via TIPS yields or inflation swap markets. The central analytical result is that combining this rule with the Fisher equation i_t = r_t + E_t[π_{t+1}] immediately yields E_t[π_{t+1}] = φπ_t, whose unique non-explosive solution is π_t = 0 for all t. This proof uses only the Fisher equation — not the aggregate Euler equation — making the determinacy result robust to household heterogeneity, hand-to-mouth consumers, non-rational household or firm expectations, active fiscal policy, missing transversality conditions, and any specification of intertemporal or nominal-real links. The Fisher equation itself requires only two deep-pocketed, fully-informed, rational agents to arbitrage between nominal and real bonds — a much weaker assumption than aggregate Euler equation rationality. Under the real rate rule, &lt;strong&gt;inflation is decoupled from the Phillips curve&lt;/strong&gt;: causation runs monetary policy → inflation, then inflation → output gap, not the reverse; the Phillips curve determines the output gap residually given already-determined inflation. In a three-equation New Keynesian model with a mark-up shock ζ_t and cost-push shock ω_t, the output gap satisfies x_t = −(ζ_t/(κ(φ − ρ_ζ))) − (ω_t/κ), where the Euler equation plays no role in inflation determination. The rule is &lt;strong&gt;globally stable under learning&lt;/strong&gt; via a contraction argument using Gautschi&amp;rsquo;s inequality: even if financial market participants hold incorrect prior beliefs, the learning process converges to the target inflation. With a &lt;strong&gt;time-varying inflation target&lt;/strong&gt; π*_t, the modified rule i_t = r_t + φ(π_t − π*_t) implements any target path determinately — π_t = π*_t for all t, including optimal Ramsey paths — making real rate rules observationally equivalent to any other monetary policy specification. The Taylor principle (φ_π &amp;gt; 1) is neither necessary nor sufficient for determinacy in richer models (Bilbiie 2008 TANK; Leeper-Leith 2016 FTPL); the real rate rule achieves determinacy without invoking Euler equation structure. An additional result: with long-maturity government debt, a stable inflation equilibrium always exists under the real rate rule regardless of whether fiscal policy is active or passive — the fiscal theory of the price level fails to produce unique outcomes in this setting.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-real-rate-rule-and-why-does-it-achieve-determinacy-without-requiring-the-aggregate-euler-equation"&gt;Q1. What is the real rate rule, and why does it achieve determinacy without requiring the aggregate Euler equation?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The real rate rule i_t = r_t + φπ_t (φ &amp;gt; 1) combined with the Fisher equation i_t = r_t + E_t[π_{t+1}] immediately gives E_t[π_{t+1}] = φπ_t, whose unique non-explosive solution is π_t = 0 for all t; the proof is complete at this step, requiring no information about how households form expectations or optimize intertemporally.&lt;/strong&gt; Standard Taylor-rule determinacy proofs rely on the aggregate Euler equation to close the system — the IS curve determines aggregate demand as a function of the real interest rate; deviation from determinacy arises when the Euler equation-Phillips curve system allows self-fulfilling expectation spirals. The real rate rule bypasses this entirely: the Fisher equation alone pins down the inflation path. The Fisher equation is a no-arbitrage condition between nominal and real bonds; it holds as long as two &amp;ldquo;deep-pocketed, fully-informed, rational agents&amp;rdquo; can trade both types of bonds — a condition that does not require aggregate household rationality, representative agent assumptions, or any specific consumption theory. Hand-to-mouth households, heterogeneous expectations, learning dynamics, and non-Ricardian fiscal regimes all leave the Fisher equation intact as long as some agents are pricing both asset classes. The consequence is that the Euler equation in the three-equation NK model becomes residual under the real rate rule: it determines the path of real interest rates given already-determined inflation and output gap, but plays no role in choosing among inflation equilibria.&lt;/p&gt;
&lt;h3 id="q2-what-does-the-real-rate-rule-imply-about-causation-between-inflation-and-the-output-gap"&gt;Q2. What does the real rate rule imply about causation between inflation and the output gap?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Under the real rate rule, the Phillips curve operates in reverse relative to standard models: inflation is determined first (by the Fisher equation and the monetary rule), and the Phillips curve then determines the output gap as a residual; cost-push and demand shocks cannot amplify or dampen inflation variance under the rule.&lt;/strong&gt; In the standard three-equation NK model with a mark-up shock ζ_t (law of motion ζ_t = ρ_ζ ζ_{t-1} + ε_{ζ,t}) and cost-push shock ω_t, the output gap under the real rate rule is x_t = −ζ_t/(κ(φ − ρ_ζ)) − ω_t/κ — a closed-form solution determined entirely by shocks, where the Euler equation does not appear. Inflation is π_t = 0 at all t (zero target): shocks affect the output gap but not inflation. Under an augmented rule that also responds to the output gap (i_t = r_t + φ_π π_t + φ_x x_t), determinacy still holds as long as a Phillips curve linking inflation and the output gap exists and the Taylor principle φ_π &amp;gt; 1 holds — providing additional policy degrees of freedom without sacrificing robustness. The decoupling of inflation from the Phillips curve is consistent with the empirical finding of Dotsey, Fujita, and Stark (2018) that the Phillips curve ceased to forecast inflation after 1984 — compatible with the hypothesis that the Fed&amp;rsquo;s post-Volcker behavior moved toward more real-rate-rule-like rules, giving the Fisher equation stronger anchor over inflation.&lt;/p&gt;
&lt;h3 id="q3-how-does-global-stability-under-learning-extend-the-determinacy-result-beyond-local-uniqueness"&gt;Q3. How does global stability under learning extend the determinacy result beyond local uniqueness?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Equilibrium determinacy is a local result (unique bounded solution near the target); the real rate rule additionally provides global stability under learning — even if financial market participants start with prior beliefs far from zero, the learning process converges to π_t = 0, preventing self-fulfilling sunspot equilibria from taking hold in the first place.&lt;/strong&gt; The proof (Appendix D, using Gautschi&amp;rsquo;s inequality) establishes that the mapping from current beliefs to future beliefs is a contraction in the appropriate norm: since E_t[π_{t+1}] = φπ_t with φ &amp;gt; 1 drives realized inflation to zero, agents who update beliefs based on observed prices will progressively correct any initial error. This contrasts with Taylor rules, which are only locally determinate — an economy that starts at a non-zero sunspot inflation level may remain there if the sunspot is self-fulfilling. The global stability result also provides a response to the Cochrane (2022) critique that indeterminate equilibria under standard Taylor rules are &amp;ldquo;everywhere&amp;rdquo;: under the real rate rule, the only globally stable equilibrium is the target. The interest rate smoothing variant (Section 1.5) — fully smoothed real rate rule, θ &amp;gt; 0 — provides additional robustness: it requires agents to believe only that the central bank responds positively to inflation (not that φ &amp;gt; 1 specifically), and still generates identical inflation dynamics; this is more credible as a commitment device because the specific magnitude of φ cannot be directly observed.&lt;/p&gt;
&lt;h3 id="q4-how-can-the-real-rate-rule-implement-arbitrary-inflation-dynamics-including-optimal-policy"&gt;Q4. How can the real rate rule implement arbitrary inflation dynamics, including optimal policy?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;With a time-varying inflation target π&lt;/em&gt;_t, the modified rule i_t = r_t + φ(π_t − π&lt;/em&gt;&lt;em&gt;t) implements any target inflation path determinately: the Fisher equation gives E_t[π&lt;/em&gt;{t+1} − π*_{t+1}] = φ(π_t − π*_t), whose unique solution is π_t = π*_t for all t, so realized inflation tracks the announced target exactly.** The CB must announce π*_t each period; this announcement may respond to the output gap, cost-push shocks, or any other variable. For example, to stabilize inflation while accommodating a cost-push shock, the CB sets π*&lt;em&gt;t as a function of ω_t; realized inflation then follows this target, and the Phillips curve determines the output gap residually. There are two constraints: (1) the CB must be able to compute a reasonable approximation to E_t[π*&lt;/em&gt;{t+1}] — achievable via inflation futures, inflation swap markets, or an internal forecasting model; (2) the target path itself must not be explosive (a target that amplifies its own past realizations would generate explosive equilibria). Under these constraints, the paper formally proves (Appendix E.5) that real rate rules with time-varying targets can replicate the outcomes of any other monetary regime. This implies: (a) real rate rules can implement Ramsey-optimal policy, attaining the highest possible welfare; (b) it is empirically impossible to test whether a central bank is following a general real rate rule — any observed inflation and interest rate dynamics are consistent with some choice of π*_t. The Smets-Wouters (2007) estimated rule for the US illustrates: at the posterior mode, the correlation between the rule component z_t and the real interest rate r_t is 0.63, with both variables having standard deviation 0.46%, suggesting the Fed is already approximately two-thirds of the way toward a simple robust real rate rule.&lt;/p&gt;
&lt;h3 id="q5-why-does-the-taylor-principle-fail-in-richer-models-and-how-does-the-real-rate-rule-avoid-those-failures"&gt;Q5. Why does the Taylor principle fail in richer models, and how does the real rate rule avoid those failures?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The Taylor principle (φ_π &amp;gt; 1) is sufficient for determinacy in the benchmark three-equation NK model with a representative rational agent, but it is neither necessary nor sufficient in richer environments: Bilbiie (2008) shows that with enough hand-to-mouth consumers, higher φ_π can destabilize the economy; Leeper-Leith (2016) shows that following the Taylor principle can generate explosive inflation under the fiscal theory when nominal debt is present.&lt;/strong&gt; Bilbiie (2008, 2019) inverts the Euler equation for the representative rational household when hand-to-mouth agents dominate: the aggregate consumption Euler equation has a negative intertemporal substitution sign, making the system&amp;rsquo;s eigenvalues switch. With enough hand-to-mouth agents, φ_π &amp;gt; 1 actually generates explosive equilibria (indeterminacy flips). Under the real rate rule, the Euler equation is disconnected from inflation determination entirely — Bilbiie&amp;rsquo;s mechanism cannot operate because the inflation equation relies only on the Fisher equation, not on whether the Euler equation has positive or negative sign. Similarly, the paper&amp;rsquo;s Section 2 result on fiscal robustness: with long-maturity government debt (Appendix B), a stable inflation equilibrium always exists under the real rate rule regardless of whether fiscal policy is active or passive. This implies the fiscal theory of the price level (FTPL) cannot uniquely determine inflation under the real rate rule — there is always a stable solution — so FTPL determinations are not unique, which may be of independent theoretical interest. The proof uses the contracting property of the non-linear real rate rule in the fully non-linear model, showing the target gross inflation Π* is always a solution of the bond-pricing fixed-point equation and that it is approached from all starting points via iteration.&lt;/p&gt;
&lt;h3 id="q6-how-is-the-real-rate-rule-implemented-in-practice-and-what-are-the-policy-implications-for-central-bank-design"&gt;Q6. How is the real rate rule implemented in practice, and what are the policy implications for central bank design?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Implementation uses TIPS yields (Treasury Inflation-Protected Securities) or inflation swap markets as real-time signals for r_t; the central bank sets i_t = TIPS_yield_t + φπ_t without estimating the natural rate (r&lt;/em&gt;) or output gap, eliminating the key measurement error in standard rules.&lt;/em&gt;* The key operational advantage over standard Taylor-type rules: standard rules require estimating the natural rate r* (now known to be mismeasured; Holston-Laubach-Williams 2017 revisions) and the output gap (subject to large real-time revisions); the real rate rule bypasses both because r_t is directly observable from financial markets (it equals the TIPS yield to a risk premium). The CB must also compute E_t[π*_{t+1}] to set the time-varying target; inflation futures or swap markets provide a forward-looking market price for this purpose. The paper discusses Hall and Reis (2016) &amp;ldquo;indexed payment on reserve&amp;rdquo; rules, which use a different mechanism (central bank liability indexation) to achieve similar robustness goals but do not rely on the Fisher equation as directly. Adão, Correia, and Teles (2011) achieve related results via complete nominal bond indexation. The real rate rule is more transparent and simpler to communicate: the CB says &amp;ldquo;we will raise the policy rate one-for-one with the real rate plus respond to inflation with coefficient φ.&amp;rdquo; For a smoothed version, communicating &amp;ldquo;we respond positively to inflation&amp;rdquo; — without specifying exactly how much — is sufficient for determinacy, and arguably more credible as a commitment. Section 4 (not covered here) develops a ZLB-adapted version of the rule for zero lower bound episodes that rules out explosive inflation equilibria at the bound.&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;real rate rule&lt;/strong&gt; : the monetary policy rule i_t = r_t + φπ_t (φ &amp;gt; 1), where r_t is the current real interest rate observed from TIPS or inflation swap markets; achieves equilibrium determinacy via the Fisher equation alone, without invoking the aggregate Euler equation, making it robust to heterogeneous agents, hand-to-mouth consumers, non-rational expectations, and active fiscal policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fisher equation&lt;/strong&gt; : the no-arbitrage condition i_t = r_t + E_t[π_{t+1}] linking the nominal policy rate, real rate, and expected inflation; in the context of the real rate rule, it is the only structural equation needed for determinacy; requires only two deep-pocketed rational agents to arbitrage between nominal and real bonds — not aggregate household rationality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;inflation decoupling&lt;/strong&gt; : the property under the real rate rule that the Phillips curve determines the output gap residually given already-determined inflation, rather than operating as a transmission mechanism for cost-push or demand shocks into inflation; implies that only monetary policy shocks and Fisher equation shocks can move inflation — cost-push and demand shocks affect the output gap but not the price level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Taylor principle failure&lt;/strong&gt; : the result (Bilbiie 2008) that standard Taylor rules can fail to deliver determinacy in models with hand-to-mouth consumers or heterogeneous agents — because the inverted aggregate Euler equation can flip eigenvalue signs — and (Leeper-Leith 2016) that following the Taylor principle can generate explosive inflation under the fiscal theory of the price level with nominal debt; the real rate rule avoids both failures by relying on the Fisher equation rather than the Euler equation for inflation determination.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;global stability under learning&lt;/strong&gt; : the property that even if financial market participants start with beliefs far from the inflation target, the learning process converges to the target under the real rate rule, proven via a contraction argument using Gautschi&amp;rsquo;s inequality; stronger than local determinacy (which only guarantees uniqueness near the target), ruling out self-fulfilling sunspot equilibria from any starting point.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;fiscal theory robustness&lt;/strong&gt; : the paper&amp;rsquo;s finding that with long-maturity government debt, the real rate rule always implies a stable inflation equilibrium regardless of whether fiscal policy is active (non-Ricardian) or passive (Ricardian); equivalently, the fiscal theory of the price level cannot uniquely determine inflation under the real rate rule because a stable solution always coexists with any fiscal regime.&lt;/p&gt;</description></item><item><title>Running Primary Deficits Forever in a Dynamically Efficient Economy: Feasibility and Optimality</title><link>https://macropaperwarehouse.com/papers/running-primary-deficits-forever-in-a-dynamically-efficient-economy-feasibility-and-optimality/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/running-primary-deficits-forever-in-a-dynamically-efficient-economy-feasibility-and-optimality/</guid><description>&lt;h2 id="running-primary-deficits-forever-in-a-dynamically-efficient-economy-feasibility-and-optimality"&gt;Running Primary Deficits Forever in a Dynamically Efficient Economy: Feasibility and Optimality&lt;/h2&gt;
&lt;h3 id="research-question"&gt;Research Question&lt;/h3&gt;
&lt;p&gt;The paper addresses two questions about government debt rollover. First, a positive question: what is the maximum ratio of government bonds to capital that can be sustained forever without any primary budget surpluses? Second, a normative question: among sustainable bond-capital ratios along a balanced growth path, which one maximizes the welfare (steady-state utility) of consumers? The analysis is motivated by Blanchard&amp;rsquo;s (2019) AEA presidential address and the fiscal responses to the COVID-19 pandemic.&lt;/p&gt;
&lt;h3 id="setting-and-mechanism"&gt;Setting and Mechanism&lt;/h3&gt;
&lt;p&gt;The baseline environment is a standard two-generation (young and old) overlapping-generations model. Young consumers earn labor income and save; old consumers live off portfolio returns. The production function is Cobb-Douglas, Yt = (GtN)^(1−α) K^α, where G = 1+g is the gross growth rate of labor-augmenting productivity. Uncertainty enters exclusively through a stochastic i.i.d. durability shock ε_t to the depreciation rate of capital (δ − ε_t), so the rate of return on capital r = αk^(α−1) − δ + ε is stochastic even though the capital stock per unit of effective labor k is deterministic along a balanced growth path. Consumers have Epstein-Zin-Weil utility with an intertemporal elasticity of substitution equal to one. Because IES = 1 and labor income is earned only when young, aggregate saving of young consumers is a constant fraction β of their wage income, making total assets (capital plus bonds) non-stochastic.&lt;/p&gt;
&lt;p&gt;This structure creates a key wedge: the expected rate of return on capital R can exceed the growth rate g (dynamic efficiency) while the riskfree interest rate rf — determined by the portfolio equilibrium between risky capital and riskless bonds — can remain below g. In deterministic economies these two rates coincide, so dynamic efficiency and the infeasibility of permanent debt rollover always go together. In this stochastic model they can be decoupled.&lt;/p&gt;
&lt;h3 id="main-findings"&gt;Main Findings&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Positive finding.&lt;/strong&gt; The maximum sustainable bond-capital ratio, Bmax, is attained precisely when rf = g (equivalently, when the adjusted gross riskfree rate Rf = 1). Starting from a bond-less economy with rf &amp;lt; g (which may itself be dynamically efficient), introducing government bonds crowds out capital, raises the marginal product of capital and the constellation of returns, and drives rf upward toward g. Once rf = g is reached, any further increase in bonds would require rf &amp;gt; g, making rollover infeasible without primary surpluses. The maximum sustainable ratio Bmax is characterized as the unique root of f(Bmax, 1) = 0, and it is finite.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Normative finding.&lt;/strong&gt; The welfare-maximizing sustainable bond-capital ratio equals Bmax. Proposition 6 establishes that u′(B) ≥ 0 for all B ∈ [0, Bmax] whenever Rf ≤ 1, with strict inequality unless Rf = 1. Proposition 7 therefore concludes that the welfare-maximizing B is the corner solution Bmax. Intuitively, increasing B reduces capital and wages but raises the rate of return on capital. When rf ≤ g, the welfare gain from a higher return on capital in old age dominates the welfare loss from a lower wage when young (via the factor-price frontier and the intertemporal optimality condition E{uo′(co)} ≥ uy′(cy)). When rf = g (at Bmax), a marginal increase in bonds also provides no additional welfare improvement if all seignorage is transferred to young consumers (ζ = 1), but still raises welfare if some seignorage is wasted (ζ &amp;lt; 1). In either case, Bmax is the optimum. Critically, at the optimum the economy is dynamically efficient — even though the government is running permanent primary deficits.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dual role of bonds.&lt;/strong&gt; At the optimal bond-capital ratio, government bonds serve two purposes simultaneously: (1) they crowd out any dynamically inefficient overaccumulation of capital that might prevail without bonds, and (2) they supply riskfree assets to risk-averse consumers who would otherwise hold only risky capital, improving risk sharing.&lt;/p&gt;
&lt;h3 id="quantitative-illustration"&gt;Quantitative Illustration&lt;/h3&gt;
&lt;p&gt;The paper calibrates a 30-year-period OLG model with α = 0.33, β = 0.353 (annual discount rate 2%), annual productivity growth g = 1% (G = 1.35), and target mean return on unlevered equity m = 3% per year. Risk aversion γ ∈ {1, 3, 8, 10} and annualized standard deviation of capital returns s ∈ {0.02, …, 0.22}. Key results (ζ = 0): at γ = 10 and s = 0.22, Bmax = 0.478 and B∗ (the bond-capital ratio needed just to eliminate dynamic inefficiency) = 0.083, so there is a wide interval [0.083, 0.478] of dynamically efficient, permanently rollable bond-capital ratios. For a capital-output ratio of 2, the debt-GDP ratio corresponding to Bmax = 0.478 is approximately 0.956. Bmax is strictly increasing in both γ and s, and is invariant to ζ (the share of seignorage transferred rather than wasted).&lt;/p&gt;
&lt;h3 id="scope-conditions"&gt;Scope Conditions&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Results hold along balanced growth paths with constant g and constant rf; the sustainability characterization is more complex if either rate is stochastic.&lt;/li&gt;
&lt;li&gt;The key sufficient condition for Rf to be increasing in B (Proposition 1) is that risk aversion γ &amp;lt; Λ, a model-dependent upper bound that is always positive. All subsequent propositions assume R′f(B) &amp;gt; 0, which is satisfied for a potentially larger set of γ.&lt;/li&gt;
&lt;li&gt;The paper focuses on welfare along the balanced growth path; it does not study transition dynamics or welfare during convergence from an initial state.&lt;/li&gt;
&lt;li&gt;The No Ponzi Game (NPG) condition is violated by design in the feasible-rollover region (rf ≤ g); the value of government bonds is positive even though the present value of all future primary surpluses is non-positive.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-why-can-an-economy-be-both-dynamically-efficient-and-able-to-roll-over-government-bonds-forever-when-this-is-impossible-in-deterministic-models"&gt;Q1. Why can an economy be both dynamically efficient and able to roll over government bonds forever, when this is impossible in deterministic models?&lt;/h3&gt;
&lt;p&gt;In a deterministic economy, the riskfree rate rf and the rate of return on capital r are equal, so the conditions rf &amp;lt; g (feasibility of rollover) and r &amp;lt; g (dynamic inefficiency) are identical. In a stochastic economy, aggregate uncertainty drives a wedge between rf and the expected return on capital. Risk-averse consumers require a premium to hold risky capital over riskless bonds, so rf &amp;lt; E{r}. It is therefore possible that E{ln R} &amp;gt; 0 (the Zilcha sufficient condition for dynamic efficiency holds) while Rf &amp;lt; 1, i.e., rf &amp;lt; g. This decoupling is the central theoretical contribution of the paper.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-formal-criterion-the-paper-uses-for-dynamic-efficiency-and-how-does-it-relate-to-the-amsz-criterion"&gt;Q2. What is the formal criterion the paper uses for dynamic efficiency, and how does it relate to the AMSZ criterion?&lt;/h3&gt;
&lt;p&gt;Abel, Mankiw, Summers, and Zeckhauser (AMSZ, 1989) show that if the rate of return on capital exceeds g in all states (R &amp;gt; 1 always), the economy is dynamically efficient, and since rf &amp;lt; r, the economy has rf &amp;gt; g so rollover is infeasible; conversely if r &amp;lt; g always, the economy is dynamically inefficient. The AMSZ criteria are silent when R sometimes exceeds and sometimes falls short of one. Building on Zilcha (1991), the paper uses E{ln R} ≥ 0 as a sufficient condition for dynamic efficiency. In the five-region diagram (Figure 1), Region E satisfies E{ln R} &amp;gt; 0 (Zilcha-efficient) and Rf &amp;lt; 1 (rollover feasible simultaneously), which is the case of central interest.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-model-achieve-a-deterministic-capital-stock-despite-stochastic-capital-returns"&gt;Q3. How does the model achieve a deterministic capital stock despite stochastic capital returns?&lt;/h3&gt;
&lt;p&gt;The durability shock ε_t affects depreciation but is additively separable from the production function. Because (1) IES = 1 and (2) consumers earn income only when young, aggregate saving is the fixed fraction β of wage income, which depends only on capital k (itself non-stochastic). Total assets At+1 = Kt+1 + Bt+1 = St are thus non-stochastic. The stochastic shock to depreciation makes the rate of return on capital r = αkα−1 − δ + ε stochastic even though k is deterministic. Online Appendix B establishes that this model is isomorphic to a model with production function shocks, extending the scope of the results.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-financial-market-equilibrium-condition-that-pins-down-the-riskfree-rate"&gt;Q4. What is the financial market equilibrium condition that pins down the riskfree rate?&lt;/h3&gt;
&lt;p&gt;Young consumers optimally choose the portfolio share λ in riskfree bonds. The first-order condition for this portfolio problem along a balanced growth path is E{(λRf + (1−λ)R)^(−γ)(Rf − R)} = 0 (equation 20). In equilibrium, λ = B/(1+B) (the bond-capital ratio determines the portfolio share), so the equilibrium riskfree rate Rf satisfies the implicit equation f(B, Rf) = 0 (equation 21). Lemma 1 establishes that Rf = E{R^(1−γ)_a}/E{R^(−γ)_a}, a ratio-of-moments formula analogous to an Euler equation.&lt;/p&gt;
&lt;h3 id="q5-why-is-the-riskfree-rate-rf-an-increasing-function-of-the-bond-capital-ratio-b-and-what-is-the-sufficient-condition-for-this"&gt;Q5. Why is the riskfree rate Rf an increasing function of the bond-capital ratio B, and what is the sufficient condition for this?&lt;/h3&gt;
&lt;p&gt;Lemma 2 shows ∂f/∂B &amp;gt; 0; intuitively, more bonds reduce capital, raise the marginal product of capital, and raise R, inducing consumers to demand more capital and less bonds, pushing Rf up to restore equilibrium. Lemma 3 provides a sufficient condition for ∂f/∂Rf &amp;lt; 0, namely γ &amp;lt; Λ (where Λ is a positive parameter-dependent bound). Under this condition, the implicit function theorem implies Rf′(B) &amp;gt; 0 (Proposition 1). The condition γ &amp;lt; Λ is sufficient but not necessary, so the results of all downstream propositions hold potentially for a wider parameter range.&lt;/p&gt;
&lt;h3 id="q6-what-is-the-maximum-sustainable-bond-capital-ratio-bmax-and-how-is-it-characterized"&gt;Q6. What is the maximum sustainable bond-capital ratio Bmax, and how is it characterized?&lt;/h3&gt;
&lt;p&gt;By definition, a bond-capital ratio B is sustainable if and only if Rf(B) ≤ 1. If Rf(0) ≥ 1, then Bmax = 0 (no positive amount of bonds is sustainable). If Rf(0) &amp;lt; 1, Bmax is the unique positive root of Rf(B) = 1, i.e., f(Bmax, 1) = 0 (Proposition 4). At Bmax, the riskfree rate exactly equals the growth rate: rf = g. The paper also shows Bmax ≤ (1−α)β/α − 1, an upper bound that depends only on production and preference parameters. Notably, Bmax is invariant to the parameter ζ (the share of seignorage transferred to young consumers rather than wasted), because at Bmax transfers are always zero regardless of ζ.&lt;/p&gt;
&lt;h3 id="q7-why-does-the-welfare-maximizing-sustainable-bond-capital-ratio-equal-bmax-rather-than-some-interior-value"&gt;Q7. Why does the welfare-maximizing sustainable bond-capital ratio equal Bmax rather than some interior value?&lt;/h3&gt;
&lt;p&gt;Proposition 6 shows that u′(B) ≥ 0 for all B ∈ [0, Bmax] whenever Rf ≤ 1, with strict inequality unless Rf = 1 and (1−ζ)B = 0. Since utility is weakly increasing throughout the feasible set, the optimum is the corner solution Bmax (Proposition 7). The mechanism: increasing B reduces k, lowering wages (bad for utility when young) but raising the marginal product of capital and hence the rates of return on capital and bonds (good for utility when old). The factor-price frontier ensures that the wage reduction equals the income gain accruing to initial capital, and the intertemporal optimality condition uy′(cy) = Rf E{uo′(co)} implies that when Rf ≤ 1 (so E{uo′(co)} ≥ uy′(cy)/Rf ≥ uy′(cy)), the welfare gain in old age dominates.&lt;/p&gt;
&lt;h3 id="q8-how-does-proposition-5-square-with-the-optimality-of-bmax-does-reducing-expected-consumption-not-reduce-welfare"&gt;Q8. How does Proposition 5 square with the optimality of Bmax? Does reducing expected consumption not reduce welfare?&lt;/h3&gt;
&lt;p&gt;Proposition 5 shows that when ζ = 1, a marginal increase in B at Bmax reduces expected aggregate consumption (dE{c}/dB &amp;lt; 0). However, welfare is not simply expected aggregate consumption: it also depends on the distribution of consumption across states. At Bmax, even though expected consumption falls, the increased risk sharing from holding more riskfree bonds — which smooth consumption between the high-return and low-return states of capital depreciation — is large enough to leave welfare unchanged (u′(Bmax) = 0 when ζ = 1) or to increase it (u′(Bmax) &amp;gt; 0 when ζ &amp;lt; 1). This illustrates that in stochastic economies, the welfare criterion diverges from the aggregate consumption criterion that characterizes dynamic inefficiency in deterministic economies.&lt;/p&gt;
&lt;h3 id="q9-how-does-the-papers-welfare-analysis-relate-to-the-no-ponzi-game-npg-condition-and-the-fiscal-theory-of-the-price-level"&gt;Q9. How does the paper&amp;rsquo;s welfare analysis relate to the No Ponzi Game (NPG) condition and the fiscal theory of the price level?&lt;/h3&gt;
&lt;p&gt;The standard NPG condition requires that the value of government debt equals the present value of future primary surpluses. In the paper&amp;rsquo;s feasible-rollover region (rf ≤ g), the NPG condition is violated by design: the present value of future primary surpluses is non-positive (all primary balances are deficits or zero), yet the market value of outstanding bonds is strictly positive. This is possible because, as Santos and Woodford (1997) show, when the present value of aggregate consumption is infinite, the NPG can fail. The market value of the capital stock remains finite (it is the value of profits on a depreciating capital stock approaching zero), but the bubble value of government bonds is positive.&lt;/p&gt;
&lt;h3 id="q10-what-does-the-quantitative-calibration-reveal-about-the-range-of-dynamically-efficient-permanently-rollable-bond-capital-ratios"&gt;Q10. What does the quantitative calibration reveal about the range of dynamically efficient, permanently rollable bond-capital ratios?&lt;/h3&gt;
&lt;p&gt;With α = 0.33, β = 0.353, g = 1% per year, G = 1.35, target mean equity return m = 3% per year, and risk aversion γ = 10 with annualized return standard deviation s = 0.22, the paper finds Bmax = 0.478 and B∗ = 0.083 (ζ = 0, Table 1). The interval [B∗, Bmax] = [0.083, 0.478] is the range of bond-capital ratios for which the economy is both dynamically efficient and able to roll over bonds permanently. For an economy with a capital-output ratio of 2, these bond-capital ratios correspond to debt-GDP ratios of up to 0.956. Both Bmax and B∗ are increasing in risk aversion γ and in the standard deviation of capital returns s; Bmax is independent of γ in any given column of the table for the ζ = 0 case (since R is independent of γ there), but rises substantially with γ in the ζ = 1 case.&lt;/p&gt;
&lt;h3 id="q11-what-is-the-role-of-the-parameter-ζ-the-share-of-seignorage-transferred-vs-wasted"&gt;Q11. What is the role of the parameter ζ (the share of seignorage transferred vs. wasted)?&lt;/h3&gt;
&lt;p&gt;The parameter ζ governs what the government does with seignorage revenue: transfer it to young consumers (ζ = 1) or waste it (ζ = 0), or some mix. Corollary 1 shows that Bmax is completely invariant to ζ, because at Bmax, rf = g so seignorage (g − rf)Bt = 0 in any case. The value ζ does affect u′(Bmax): if ζ &amp;lt; 1, u′(Bmax) &amp;gt; 0; if ζ = 1, u′(Bmax) = 0. Both configurations yield Bmax as the welfare-maximizing level. The parameter ζ matters for welfare levels and for B∗ (only in the ζ = 1 case, where transfers are positive and boost saving capacity), but not for the main positive or normative results.&lt;/p&gt;
&lt;h3 id="q12-in-what-sense-is-the-model-tractable-and-what-are-its-key-limitations"&gt;Q12. In what sense is the model tractable, and what are its key limitations?&lt;/h3&gt;
&lt;p&gt;Tractability comes from three design choices: (i) the durability shock is additively separable from the production function, so labor income and aggregate saving are non-stochastic; (ii) IES = 1 with Epstein-Zin-Weil preferences, making saving a constant fraction of income; (iii) along balanced growth paths, g and rf are constant, so sustainability reduces to comparing two constants. Limitations acknowledged by the authors: the paper analyzes only balanced growth paths and does not characterize transition dynamics; the framework does not directly address economies where g or rf are stochastic; and the two-period OLG structure is stylized. The authors pose as an open question whether the result that optimal borrowing equals maximal borrowing generalizes to settings with random g.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Bond-capital ratio (B):&lt;/strong&gt; The ratio of outstanding government bonds to the capital stock, Bt/Kt. This is the paper&amp;rsquo;s central state variable and policy instrument. A value B is &amp;ldquo;sustainable&amp;rdquo; if the government can roll over its debt forever at the riskfree interest rate without any primary budget surpluses. The paper distinguishes B from the more commonly reported debt-GDP ratio (which equals B times the capital-output ratio).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Adjusted gross rate of return / riskfree rate (R, Rf):&lt;/strong&gt; R ≡ (1+r)/G and Rf ≡ (1+rf)/G, where r is the net return on capital, rf is the riskfree interest rate on bonds, and G = 1+g is the gross growth rate. Expressing returns in these &amp;ldquo;adjusted&amp;rdquo; gross units scales out balanced growth and simplifies the sustainability condition to Rf ≤ 1 (equivalently, rf ≤ g).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dynamic efficiency (Zilcha criterion):&lt;/strong&gt; In the paper&amp;rsquo;s stochastic setting, the relevant criterion for dynamic efficiency is E{ln R} ≥ 0 (Zilcha 1991, as amended by Rangazas-Russell 2005 and Barbie-Kaul 2009), meaning the geometric mean of the adjusted gross return on capital is at least one. This differs from the deterministic condition r ≥ g. The paper&amp;rsquo;s Region E in Figure 1 is the key zone where E{ln R} &amp;gt; 0 (dynamically efficient) and Rf &amp;lt; 1 (rollover feasible) simultaneously.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bmax (maximum sustainable bond-capital ratio):&lt;/strong&gt; The largest value of B for which the bond-capital ratio is sustainable, defined as the unique root of Rf(B) = 1. At Bmax, the riskfree rate exactly equals the growth rate (rf = g). The paper proves Bmax is finite, invariant to ζ, and equals the welfare-maximizing sustainable bond-capital ratio.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;B∗ (dynamic efficiency threshold):&lt;/strong&gt; The bond-capital ratio at which the economy crosses from Zilcha-inefficiency into Zilcha-efficiency, defined by E{ln R} = 0. For B ∈ [B∗, Bmax], the economy is dynamically efficient and debt rollover is feasible. B∗ &amp;lt; Bmax when risk aversion γ or return volatility s is large enough, defining a non-trivial interval of dynamically efficient, permanently rollable bond levels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Durability shock (ε):&lt;/strong&gt; An i.i.d. random variable with mean zero that enters the capital depreciation rate as δ − ε_t. This shock makes the rate of return on capital r = αkα−1 − δ + ε stochastic while leaving the capital stock per unit of effective labor, aggregate wages, and aggregate saving non-stochastic. It is the only source of aggregate uncertainty in the model and is the mechanism that drives a wedge between rf and E{r}.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;No Ponzi Game (NPG) condition:&lt;/strong&gt; The condition that the present discounted value of government debt converges to zero (equivalently, debt equals the present value of future primary surpluses). Standard fiscal sustainability analyses assume this condition holds. The paper explicitly violates it: in the feasible-rollover region rf ≤ g, the present value of aggregate consumption is infinite and the NPG fails, yet government bond values are positive and debt rollover is sustainable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Seignorage (ζ):&lt;/strong&gt; The revenue the government obtains by issuing new bonds in excess of interest payments on existing bonds, equal to (g − rf)Bt when rf &amp;lt; g. The parameter ζ ∈ [0,1] governs the share transferred to young consumers (as lump-sum transfers τt) versus wasted (captured by the government but yielding no utility). A key finding is that Bmax is invariant to ζ, since seignorage is zero at rf = g regardless of ζ.&lt;/p&gt;</description></item><item><title>Self-Fulfilling Debt Crises with Long Stagnations</title><link>https://macropaperwarehouse.com/papers/self-fulfilling-debt-crises-with-long-stagnations/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/self-fulfilling-debt-crises-with-long-stagnations/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper asks whether sovereign debt crises can be self-fulfilling — triggered by lenders&amp;rsquo; expectations of default rather than by weak fiscal fundamentals alone — and whether such crises are empirically plausible. Following the mechanism of Calvo (1988), high expected default probabilities require high interest rates to compensate lenders, but high interest rates in turn raise the cost of debt service and the probability of default, making the pessimistic expectations self-confirming. The key theoretical contribution is to show that this multiplicity of equilibria is state-dependent: it arises only in periods of stagnation, when the endowment process is in a persistent low-growth regime. The paper modifies a standard infinite-horizon sovereign default model (in the spirit of Eaton-Gersovitz and Arellano 2008) by introducing a two-state Markov regime-switching process for trend growth and by having the borrower choose current debt rather than debt at maturity — a timing assumption that is essential for multiplicity. Calibrating the output process to Argentina, Brazil, Italy, Portugal, and Spain using 1980–2017 data, the paper finds that for intermediate levels of debt and in low-growth states, interest rates can be either low (around 4%) or high (around 46%) depending on the coordination of lenders&amp;rsquo; beliefs — a self-fulfilling crisis range that reproduces the qualitative features of the European sovereign debt crisis of 2010–2012 and the Argentine crisis of 1998–2002. In high-growth states, the multiplicity region is negligibly small or absent entirely.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-calvo-1988-mechanism-and-why-does-it-require-a-bimodal-endowment-process"&gt;Q1. What is the Calvo (1988) mechanism, and why does it require a bimodal endowment process?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The Calvo mechanism generates multiple equilibrium interest rates for a given level of debt: lenders&amp;rsquo; expectation that the borrower will default in the low-output state forces them to charge a high interest rate to break even, but the high rate raises the debt service burden and makes default more likely, validating the pessimistic expectation.&lt;/strong&gt; For this self-confirming loop to sustain multiple stable equilibria, the interest rate correspondence — mapping debt levels to possible interest rates — must have an upward-sloping region at both the low and high rate. A unimodal endowment distribution generates a correspondence with a downward-sloping high-rate segment (higher debt → lower high interest rate), which is inadmissible and eliminates multiplicity. A bimodal distribution with well-separated high and low growth states, as observed empirically in crisis-prone countries, generates an upward-sloping correspondence at both rates, creating a region of intermediate debt levels where either rate is an equilibrium.&lt;/p&gt;
&lt;p&gt;The second key model feature is the timing of moves: the borrower chooses current debt (amount borrowed today) rather than debt at maturity (the repayment obligation). When the borrower chooses debt at maturity, it implicitly pins down the default probability and therefore the interest rate, eliminating multiplicity. When the borrower chooses current debt, the interest rate is determined by lenders and can take either the high or the low value consistent with break-even pricing, given the chosen debt level.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-paper-calibrate-the-endowment-process-and-what-does-the-estimation-reveal"&gt;Q2. How does the paper calibrate the endowment process and what does the estimation reveal?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper estimates a two-state Markov regime-switching model for annual GDP per capita growth for Argentina, Brazil, Italy, Portugal, and Spain using data from 1980 to 2017, and finds clear evidence of a bimodal distribution with persistent high- and low-growth regimes across all five countries.&lt;/strong&gt; Estimated using a Bayesian MCMC algorithm with the Kim (1994) filter, the posterior means for the benchmark cross-country calibration are: low-growth rate gL = −1.0% per year, high-growth rate gH = 3.0% per year, persistence of low-growth state pL = 0.60, persistence of high-growth state pH = 0.80, and standard deviation of transitory shocks σ = 0.015. The average gap between gL and gH across countries is approximately 6 percentage points, more than three times the standard deviation of the transitory shock — confirming the bimodal structure that is essential for multiplicity. Both growth regimes are persistent, with the low-growth state having 60–80% persistence across countries.&lt;/p&gt;
&lt;p&gt;The quantitative model uses these estimates together with standard parameters: risk-free rate R* = 3.5%, recovery rate κ = 75%, discount factor β = 0.75, and risk aversion γ = 3. The sunspot process governing equilibrium selection is i.i.d. with a 5% probability of the bad (high-rate) sunspot in each period.&lt;/p&gt;
&lt;h3 id="q3-what-does-the-calibrated-model-predict-for-interest-rates-and-when-do-self-fulfilling-crises-occur"&gt;Q3. What does the calibrated model predict for interest rates and when do self-fulfilling crises occur?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In the calibrated quantitative model, the multiplicity region is present only in the low-growth state and for intermediate debt levels; in the high-growth state, the multiplicity region is either empty or negligibly small.&lt;/strong&gt; In the low-growth state with intermediate debt, the interest rate schedule features two admissible equilibria: a low-rate equilibrium consistent with a low probability of default (1.7% in the benchmark simulation) and a high-rate equilibrium consistent with a high probability of default (60.1%). Both are sustained by self-confirming expectations. The scenario simulation illustrates this starkly: two economies starting from identical wealth and facing identical growth-shock sequences but different sunspot realizations in period t=2 (when they are in the low-growth state) face interest rates of 4.0% versus 46.4% and next-period default probabilities of 1.7% versus 60.1%, respectively, with no difference in fundamentals.&lt;/p&gt;
&lt;p&gt;The model also generates endogenous austerity: borrowers optimally refrain from increasing debt to avoid discrete jumps in interest rates, both at the fundamental threshold (driven by the growth regime) and at the expectations-driven threshold (driven by the sunspot). In low-growth states facing the bad sunspot, the borrower either bunches at a low debt level below the multiplicity region or makes a discrete jump above it, echoing the binary fiscal-adjustment dynamics observed in crisis episodes.&lt;/p&gt;
&lt;h3 id="q4-how-does-the-model-interpret-the-european-debt-crisis-and-the-role-of-the-ecb"&gt;Q4. How does the model interpret the European debt crisis and the role of the ECB?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The model provides a direct interpretation of the European sovereign debt crisis: the southern European economies (Italy, Spain, Portugal) entered a low-growth state around 2009–2010, which created conditions for the Calvo mechanism to operate; spreads jumped to high-rate equilibria driven by expectations rather than by fundamentals alone.&lt;/strong&gt; The ECB&amp;rsquo;s announcement of the Outright Monetary Transactions (OMT) program in September 2012 — the commitment to purchase sovereign bonds in secondary markets — shifted lenders&amp;rsquo; beliefs from the bad-sunspot equilibrium to the good-sunspot equilibrium, collapsing spreads substantially even without actual intervention. In the model&amp;rsquo;s language, a credible lender of last resort can eliminate the bad equilibrium by committing to lend at the low-rate schedule, rendering the high-rate self-fulfilling expectations non-viable. The Argentine crisis of 1998–2002 fits the model as an alternative trajectory: Argentina entered the low-growth state with a 7% spread on 35% debt-to-GDP and, without a lender of last resort intervention, eventually defaulted in 2002 — consistent with the bad-sunspot equilibrium path.&lt;/p&gt;
&lt;h3 id="q5-what-role-does-persistence-of-the-low-growth-state-play"&gt;Q5. What role does persistence of the low-growth state play?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The persistence of the low-growth state (pL) is the key parameter governing the severity of self-fulfilling crises: higher pL generates higher equilibrium interest rates in the bad-sunspot equilibrium and a larger multiplicity region.&lt;/strong&gt; Intuitively, if the economy is likely to remain in the low-growth state for a long time, the probability of default conditional on entry into the bad equilibrium is very high, requiring lenders to charge very high interest rates to break even. The higher the interest rate, the more debt service costs compress fiscal space, making default even more likely and potentially sustainable at even lower debt levels. The paper shows in robustness exercises that the multiplicity result is robust to reasonable perturbations in pL, κ (recovery rate), σ (transitory shock standard deviation), gL, and gH, with the key ingredient being the bimodal structure of the endowment process rather than any single parameter value.&lt;/p&gt;
&lt;h3 id="q6-what-are-the-policy-implications-for-lenders-of-last-resort"&gt;Q6. What are the policy implications for lenders of last resort?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The central policy implication is that a lender of last resort — such as the ECB or the IMF — is justified precisely when fundamentals are weak, not because fundamentals alone cause the crisis but because weak fundamentals create conditions in which expectations can trigger a self-fulfilling crisis.&lt;/strong&gt; Intervening in the bad-sunspot equilibrium by committing to supply funds at low-rate terms makes the high-rate equilibrium infeasible: lenders cannot expect default because the lender of last resort ensures the borrower can always roll over at low rates. The model thus rationalizes the design of the OMT: a credible commitment with no limit on size is sufficient to rule out the bad equilibrium without necessarily requiring actual asset purchases. The paper notes that such interventions will also have effects on the economy outside the period of crisis, since the availability of backstop financing may affect the equilibrium path more broadly.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;dl&gt;
&lt;dt&gt;&lt;strong&gt;Calvo (1988) mechanism&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the self-fulfilling loop in sovereign debt markets in which high lender expectations of default require high interest rates for break-even pricing, which raise the actual default probability and thereby confirm the initial pessimistic expectations; generates multiple equilibrium interest rates for a given debt level.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;state-dependent multiplicity&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the feature of the model in which multiple interest rate equilibria arise only in periods of low and persistent growth (stagnation), not in high-growth regimes; the central quantitative finding that self-fulfilling crises are empirically plausible only when growth fundamentals are weak.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;endogenous austerity&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the borrower&amp;rsquo;s optimal choice to hold debt below the multiplicity region to avoid discrete jumps in interest rates triggered by either fundamentals or expectations; reflected in the flat portions of the debt policy function and consistent with fiscal consolidation patterns observed in crisis episodes.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;sunspot variable&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the exogenous coordination device that selects among the multiple equilibrium interest rate schedules; takes a bad or good realization each period according to an i.i.d. process, with the bad sunspot selecting the high-rate schedule and the good sunspot selecting the low-rate schedule in the low-growth state.&lt;/dd&gt;
&lt;/dl&gt;</description></item><item><title>Skilled immigration frictions as a barrier for young firms</title><link>https://macropaperwarehouse.com/papers/skilled-immigration-frictions-as-a-barrier-for-young-firms/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/skilled-immigration-frictions-as-a-barrier-for-young-firms/</guid><description>&lt;p&gt;High-skilled immigration policy frictions—particularly the H-1B visa lottery, fixed annual cap, and associated compliance costs—impose well-known burdens on firms, but their disproportionate impact on young, technology-intensive companies has received less attention. This paper provides the first study combining firm-level panel data on H-1B outcomes with a general equilibrium model of firm dynamics to quantify these effects. Using the random allocation of H-1B visas in the FY 2014 and FY 2015 lotteries as quasi-random variation, the authors find that lower H-1B visa win rates significantly reduce the survival of young firms (aged 0–5) in technology-intensive sectors, while the impact for older firms is not statistically significant. A general equilibrium model with endogenous firm entry and exit, skilled foreign labor, and H-1B-style policy frictions matches the age distribution of high-tech firms and shows that eliminating major immigration policy frictions would increase average productivity in the high-tech sector primarily by enabling more young firms to enter and survive, which in turn drives the exit of older, less productive firms.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-quasi-random-identification-strategy-and-what-does-it-identify"&gt;Q1. What is the quasi-random identification strategy, and what does it identify?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper uses H-1B visa lottery win rates in fiscal years 2014 and 2015 as quasi-random variation in access to skilled foreign workers, combining National Establishment Time Series (NETS) data on firm survival with Labor Condition Application (LCA) and H-1B petition data.&lt;/strong&gt; Because the lottery randomly allocates visas among firms that applied (when the cap is binding), the fraction of applications that result in approvals is plausibly exogenous to firm characteristics, conditional on applying. The empirical finding—that lower lottery win rates significantly reduce survival of young firms (0–5 years old) in tech-intensive sectors but not of older firms—is identified off this lottery variation. The age-heterogeneity result is central: large incumbent firms have alternative channels (offshore hiring, internal labor markets, H-1B cap-exempt hires) that small, young firms lack.&lt;/p&gt;
&lt;h3 id="q2-why-are-young-high-tech-firms-more-exposed-to-h-1b-frictions-than-older-firms"&gt;Q2. Why are young high-tech firms more exposed to H-1B frictions than older firms?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Young firms in high-tech sectors depend heavily on specialized skilled foreign workers because they compete in rapidly changing technology fields where the domestic talent pool may not supply the precise skills needed at the pace required; they cannot easily substitute with a second-choice candidate or offshore to a foreign affiliate as large multinationals can.&lt;/strong&gt; The paper cites GAO (2011) survey evidence that in years when the H-1B cap bound, most large firms found alternative (often costly) ways to hire their preferred candidates, while small firms were more likely to fill positions with different candidates, incurring delays and economic losses.&lt;/p&gt;
&lt;h3 id="q3-what-does-the-general-equilibrium-model-predict-about-aggregate-productivity"&gt;Q3. What does the general equilibrium model predict about aggregate productivity?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The model shows that eliminating major H-1B-style immigration policy frictions would increase average productivity in the high-tech sector through the entry and survival channel: fewer frictions allow more young firms to enter and survive, which raises competitive pressure and leads to the exit of older, less productive firms (a selection effect via creative destruction).&lt;/strong&gt; This mechanism implies that the productivity gain from liberalizing skilled immigration comes not primarily from incumbent firms hiring more foreign workers, but from the change in the firm age and productivity distribution—a general equilibrium effect that partial-equilibrium analyses based on incumbent firms would miss.&lt;/p&gt;
&lt;h3 id="q4-how-does-the-model-match-the-data-on-firm-dynamics"&gt;Q4. How does the model match the data on firm dynamics?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The model is calibrated to match the age distribution of firms in high-technology sectors in the data, including the stylized fact that the share of young (0–5 year old) high-tech firms has declined since the early 2000s concurrent with a period of more restrictive skilled immigration policy (the H-1B cap fell from 195,000 in 2003 to 85,000 in 2005 and has remained constant).&lt;/strong&gt; The model also captures the pattern—documented in the data—that the entry of younger firms leads to a greater exit of older firms, consistent with the Hopenhayn-Rogerson (1993) model of firm dynamics.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;H-1B visa frictions&lt;/strong&gt; : the set of costs and constraints associated with the H-1B temporary skilled worker visa program in the US, including per-firm application costs, a fixed aggregate cap of 85,000 visas for private firms per year, and random lottery allocation when applications exceed the cap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;firm survival channel&lt;/strong&gt; : the mechanism through which immigration policy frictions reduce the probability that young high-tech firms survive to maturity, as distinct from the hiring channel (whether incumbent firms hire foreign workers); the paper argues the former is the quantitatively relevant margin.&lt;/p&gt;</description></item><item><title>Spread too thin: The impact of lean inventories</title><link>https://macropaperwarehouse.com/papers/spread-too-thin-the-impact-of-lean-inventories/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/spread-too-thin-the-impact-of-lean-inventories/</guid><description>&lt;p&gt;This paper investigates the macroeconomic consequences of widespread just-in-time (JIT) inventory management, documenting a fundamental trade-off: JIT raises firm profitability and reduces micro-level volatility in normal times, but renders the economy significantly more vulnerable to large unanticipated shocks.&lt;/p&gt;
&lt;p&gt;The empirical analysis draws on a novel dataset of approximately 200 publicly listed U.S. manufacturing firms for which the author identifies JIT adoption years using narrative records from SEC filings and historical news archives. Firm-level balance sheet data come from Compustat Fundamentals Annual (1980–2018), merged with county-level weather event data from NOAA. Four empirical facts are documented. First, JIT adoption is associated with a 13% decrease in inventory-to-sales ratios and a 9% increase in sales. Second, JIT adopters experience a roughly 7% decline in sales and employment growth volatility. Third, JIT adopters are approximately 25–30% more cyclical than non-adopters: a 1% increase in GDP growth predicts an additional 0.47 percentage point increase in JIT firm sales growth above the non-adopter baseline of roughly 1.6%. Fourth, a weather disaster predicts an additional 3% decline in JIT firm sales and employment relative to non-JIT firms.&lt;/p&gt;
&lt;p&gt;To explain and quantify these facts, the author builds and structurally estimates a dynamic general equilibrium model with a distribution of heterogeneous final goods firms that differ in idiosyncratic productivity, inventory holdings, and JIT adoption status. Materials must be drawn from inventory stocks; new orders are subject to stochastic fixed order costs. JIT producers draw from a first-order stochastically dominated order cost distribution relative to non-JIT firms. Adopting JIT requires an upfront sunk cost and a smaller continuation cost thereafter. The model is estimated via simulated method of moments (SMM) targeting 11 moments (adoption frequency, inventory-to-sales ratios, covariances, and spike frequencies for both firm types), with nine parameters to be estimated.&lt;/p&gt;
&lt;p&gt;In the estimated model steady state, JIT adoption delivers a 9–10% increase in output, a 40% decline in the aggregate inventory-to-sales ratio (close to the observed 35% decline in nonfarm inventories-to-final-sales from 1980 to 2018), a 1.3% increase in firm value, a 1.3% increase in measured TFP, and a welfare gain of 1.43% in consumption equivalent terms. These gains arise because lower order costs allow firms to better align material input use with realized productivity, smoothing inventory cycles.&lt;/p&gt;
&lt;p&gt;The vulnerability side is quantified through an unanticipated supply disruption calibrated to match the 3.4% drop in real U.S. GDP between 2019 and 2020. In response, the JIT economy experiences an approximately 0.40 percentage point excess output contraction relative to the no-JIT counterfactual, amounting to roughly 13–15% more output lost. The mechanisms are stockouts — firms that fully exhaust their inventories and cannot produce — and hoarding behavior, whereby firms that retain some inventory draw stocks down more slowly to preserve buffers, reducing material input use. Both channels reduce production relative to the counterfactual. The excess output loss is estimated at approximately $100 billion, comparable to state and local government allocations under the CARES Act.&lt;/p&gt;
&lt;p&gt;JIT nevertheless remains welfare-improving even under this shock. For a social planner to prefer a no-JIT world, the negative productivity shock to the intermediate goods sector would need to be nearly 14% — an order of magnitude larger than the calibrated COVID-19 shock. The trade-off is robust across alternative order cost distributions, parameterizations, partial anticipation scenarios, and stockout cost specifications.&lt;/p&gt;
&lt;p&gt;Q: What is the central trade-off identified by the paper?
A: JIT adoption reduces fixed order costs, enabling firms to place smaller and more frequent orders, which raises sales, reduces micro-level volatility, and increases firm value and welfare in normal times. However, because JIT firms hold fewer inventories, an unexpected aggregate shock increases the likelihood of stockouts and hoarding behavior, producing a deeper aggregate output contraction relative to an economy without JIT. Firms do not internalize the prospect of large shocks when making their private adoption decisions, generating the externality at the heart of the trade-off.&lt;/p&gt;
&lt;p&gt;Q: How does the paper measure JIT adoption, and how large is the sample?
A: The author constructs an adoption dummy for approximately 200 publicly listed manufacturing firms by exhaustively reviewing SEC filings and historical news archives for keywords including &amp;ldquo;JIT,&amp;rdquo; &amp;ldquo;just-in-time,&amp;rdquo; &amp;ldquo;lean manufacturing,&amp;rdquo; &amp;ldquo;pull system,&amp;rdquo; and &amp;ldquo;zero inventory.&amp;rdquo; Each document is individually analyzed to confirm the adoption year and to ensure it refers to the firm itself rather than its suppliers. More than half of observed adopters in the sample adopt prior to 1990, and nearly all adopt before 2000. The final Compustat-linked sample covers about 5,017 unique manufacturing firms from 1980 to 2018.&lt;/p&gt;
&lt;p&gt;Q: What are the firm-level efficiency gains from JIT adoption?
A: JIT adoption is associated with a 13% decrease in inventory-to-sales ratios and a 9% increase in sales; the corresponding standard deviation changes are –16% and +4%, respectively. Adopters also experience a roughly 7% decline in both sales and employment growth volatility, and a 5% increase in sales per worker relative to non-JIT firms. JIT firms additionally show a roughly 20% standard deviation reduction in squared forecast errors, indicating improved predictability of profitability.&lt;/p&gt;
&lt;p&gt;Q: How much more cyclical are JIT firms relative to non-JIT firms?
A: A 1% increase in GDP growth is associated with approximately a 1.6% increase in sales growth for non-adopters; JIT adopters experience an additional 0.47 percentage point increase above this baseline, making them roughly 25–30% more cyclical. This elevated cyclicality is estimated from variation external to the firm and reflects the heightened sensitivity of lean producers to aggregate demand fluctuations.&lt;/p&gt;
&lt;p&gt;Q: How are JIT firms affected by local weather disasters?
A: On average, a weather disaster predicts an additional 3% decline in JIT firm sales and employment relative to non-JIT firms. Using upstream supply chain linkages from Compustat Segment files, a unit increase in the average number of disasters hitting a firm&amp;rsquo;s suppliers predicts a 7–8% decline in firm sales and employment, with a similar excess decline for JIT firms. These results parallel the strategy in Barrot and Sauvagnat (2016).&lt;/p&gt;
&lt;p&gt;Q: What is the model structure, and how does the JIT adoption decision work?
A: The model features a representative household, a representative intermediate goods firm producing materials with capital and labor, and a continuum of heterogeneous final goods firms that differ in idiosyncratic productivity (AR(1) in logs), inventory holdings, and JIT adoption status. Each period has three stages: adoption decision, order decision (conditional on stochastic fixed order cost draw), and production decision. JIT producers draw order costs from a distribution first-order stochastically dominated by the non-JIT distribution, meaning JIT firms face systematically lower expected order costs. Adoption requires an upfront sunk cost c_s; maintaining JIT requires a smaller continuation cost c_f (estimated at slightly more than one-third of c_s), generating hysteresis: conditional on being an adopter, the probability of remaining one is estimated at 94%.&lt;/p&gt;
&lt;p&gt;Q: What moments are targeted in the SMM estimation, and how well does the model fit?
A: Eleven moments are targeted to identify nine parameters: the empirical adoption frequency, plus five moments each for JIT and non-JIT firms (mean inventory-to-sales ratio, the covariance matrix of inventory-to-sales ratios and log sales delivering three moments, and the frequency of positive inventory-to-sales ratio spikes exceeding 0.20). The model successfully fits targeted moments; non-targeted regression coefficients reproduce a quantitatively similar reduction in inventory-to-sales ratios after adoption, a comparable increase in sales among adopters, and reductions in firm volatility of 4–5% versus 6–7% in the data.&lt;/p&gt;
&lt;p&gt;Q: What are the estimated key structural parameters?
A: The upper support of the order cost distribution among non-adopters is estimated to be an order of magnitude larger than that of adopters, implying JIT firms place orders about 45% smaller than non-JIT firms. The estimated carrying cost is about 20% of inventory value. The estimated share of non-adopters in the model&amp;rsquo;s steady state implies a mass of JIT establishments of approximately 0.40. The technology parameters for the idiosyncratic productivity process are consistent with prior estimates in the structural firm dynamics literature.&lt;/p&gt;
&lt;p&gt;Q: What are the steady-state aggregate gains from JIT adoption in the model?
A: Relative to a counterfactual economy with no JIT option, the estimated model delivers a 9–10% increase in output, a 40% decline in the aggregate inventory-to-sales ratio (close to the observed 35% decline from 1980 to 2018), a 1.3% increase in firm value, a 1.3% increase in measured TFP, and a welfare gain of 1.43% in consumption equivalent terms. The TFP gain arises because lower order costs reallocate resources toward high marginal product producers at the aggregate level.&lt;/p&gt;
&lt;p&gt;Q: How is the unanticipated disaster calibrated, and what are its effects in the JIT versus no-JIT economies?
A: The disaster is an unanticipated negative shock to aggregate productivity in the intermediate goods sector, calibrated to match the 3.4% drop in real U.S. GDP between 2019 and 2020. In response, the JIT economy experiences approximately a 0.40 percentage point excess output contraction relative to the no-JIT counterfactual, amounting to roughly 13–15% more output lost. This excess loss equals approximately $100 billion, comparable to CARES Act allocations to state and local governments.&lt;/p&gt;
&lt;p&gt;Q: What are the two mechanisms through which JIT amplifies the disaster shock?
A: The first mechanism is stockouts: because JIT firms hold fewer inventories, an unexpected spike in order costs makes them more likely to fully exhaust their existing stocks, leaving them with no material inputs and forcing them to forgo production entirely. The second mechanism is hoarding: firms that do not fully stock out face a higher shadow value of inventories and cut back on material input use to draw inventories down more slowly, reducing output even without a full stockout. Both mechanisms reduce material input utilization in the JIT economy, causing a sharper drop in sales relative to the counterfactual.&lt;/p&gt;
&lt;p&gt;Q: Is JIT still welfare-improving when the COVID-19 shock is accounted for?
A: Yes. A social planner comparing welfare across steady states would not prefer to eliminate JIT even accounting for the deeper crisis it generates. For the planner to prefer a no-JIT world, the negative productivity shock to the intermediate goods sector would need to be nearly 14% — an order of magnitude larger than the calibrated 3.4% shock. This implies that the welfare gains from JIT in normal times substantially outweigh the welfare costs of the deeper recession under a COVID-19-scale shock.&lt;/p&gt;
&lt;p&gt;Q: How does the paper relate to the Great Moderation literature?
A: JIT adoption is credited in prior work (McConnell and Perez-Quiros, 2000; Blanchard and Simon, 2001; Kahn et al., 2002) as contributing to the roughly 35% reduction in the aggregate inventory-to-sales ratio between 1980 and 2018 and to the broader decline in macroeconomic volatility. The estimated model is consistent with this: JIT adoption reduces firm-level volatility and, in the steady state, implies a reduction in aggregate inventory-to-sales ratios close to the observed magnitude. However, the paper documents that the same forces that smooth normal-times fluctuations amplify unanticipated large shocks.&lt;/p&gt;
&lt;p&gt;Q: What robustness checks does the paper conduct?
A: The paper considers alternate parameterizations (all robustly show the micro-macro trade-off), larger disaster sizes calibrated to UK and France 2020 contractions (JIT economy contracts ~10% vs. ~8.7%, a ~15% larger contraction), partial anticipation (a sizable excess output drop persists because the left tail of firm outcomes is truncated at zero profits), stockout costs (trade-off remains with ~1.2% firm value gain and ~10% excess contraction), and an alternative right-skewed beta order cost distribution (firm value gain rises to 1.8%, trade-off remains). An alternative CUSUM-based measure of JIT adoption identifying approximately 560 firms produces qualitatively similar empirical results.&lt;/p&gt;
&lt;p&gt;Q: What is the subsample estimation finding on adoption costs over time?
A: Comparing 1980–1989 and 1990–2018 subsamples, the upfront sunk cost of JIT adoption estimated from the 1980s sample is about 26% higher than in the later subsample, implying it has become easier to initiate JIT production over time. Steady-state output rises by about 3.4% in the 1990–2018 period relative to 1980–1989, and the excess output contraction under the disaster shock is about 15% relative to the 1980s counterfactual, close to the baseline estimate.&lt;/p&gt;
&lt;p&gt;Just-in-Time (JIT) Production: A lean inventory management philosophy that minimizes the time between orders by committing to smaller and more frequent orders from suppliers, reducing costs of managing large material purchases and storing idle stocks; in the model, JIT is operationalized as drawing order costs from a distribution first-order stochastically dominated by the non-JIT distribution.&lt;/p&gt;
&lt;p&gt;Stockout: The condition in which a final goods firm enters a period with no inventories (s = 0) and chooses not to place an order, leaving it without any material inputs and forcing it to forgo production entirely for that period.&lt;/p&gt;
&lt;p&gt;Hoarding (in the disaster context): The behavior of firms that, facing a higher shadow value of inventories during an unexpected shock, cut back on material input use in order to draw down existing inventory stocks more slowly, preserving buffers at the cost of reduced current production.&lt;/p&gt;
&lt;p&gt;Fixed Order Cost: A stochastic, labor-denominated cost that a firm must pay each period in which it places a materials order; JIT adopters face a systematically lower distribution of these costs, enabling more frequent ordering at smaller quantities.&lt;/p&gt;
&lt;p&gt;Adoption Sunk Cost: The one-time upfront cost c_s a non-adopter must pay to initiate JIT status, which exceeds the continuation cost c_f paid by existing JIT firms to maintain their status; the gap between these costs generates hysteresis in the adoption decision.&lt;/p&gt;
&lt;p&gt;Simulated Method of Moments (SMM): The structural estimation procedure used to identify model parameters by minimizing the weighted distance between model-simulated moments and their empirical counterparts; here applied with 11 targeted moments to identify 9 parameters in an overidentified system.&lt;/p&gt;
&lt;p&gt;Micro-Macro Trade-off: The paper&amp;rsquo;s central finding that individual firms rationally adopt JIT for private profitability gains (1.3% increase in firm value, 1.43% welfare gain), while the aggregate economy becomes more fragile to unanticipated shocks (roughly 13–15% deeper output contraction) because firms do not internalize the systemic vulnerability created by economy-wide lean inventories.&lt;/p&gt;</description></item><item><title>Staffing agencies and in-house bargaining</title><link>https://macropaperwarehouse.com/papers/staffing-agencies-and-in-house-bargaining/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/staffing-agencies-and-in-house-bargaining/</guid><description>&lt;p&gt;This paper asks whether a labor market with search-and-matching frictions and firms producing under decreasing returns to labor is better characterized by in-house hiring with intra-firm wage bargaining (Stole-Zwiebel) or by an alternative arrangement in which intermediaries — &amp;ldquo;staffing agencies&amp;rdquo; — search for and employ workers and then rent them to producing firms on a frictionless, perfectly competitive market. The paper&amp;rsquo;s second and central question is what happens when firms can choose their optimal combination of the two arrangements simultaneously.&lt;/p&gt;
&lt;p&gt;The model is static. There are Z homogeneous firms with production function F(n) satisfying F&amp;rsquo;&amp;rsquo;(n) &amp;lt; 0, N homogeneous workers, and a standard concave constant-returns-to-scale matching function M = m(V, N). Firms can post vacancies, workers search, and Nash bargaining with worker bargaining weight β determines wages. The analysis is conducted with fully general production and matching functions throughout, deviating to specific functional forms (Cobb-Douglas matching, power production function F(n) = An^α) only when needed to illustrate a particular efficiency result. All main results hold for both directed and random search.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Comparing the two polar arrangements (Theorem 3.1).&lt;/strong&gt; When all hiring is in-house (Stole-Zwiebel), equilibrium firm size n^SZ, aggregate employment Zn^SZ, labor market tightness θ^SZ, and the equilibrium wage w^SZ are all strictly higher than their counterparts under full staffing-agency employment (n^SA, Zn^SA, θ^SA, w^SA). The mechanism is that under in-house hiring with decreasing returns, a worker&amp;rsquo;s threat to leave raises the marginal product — and hence the wage — of remaining workers, giving workers additional bargaining leverage. Firms respond by over-employing in-house hires to dilute each worker&amp;rsquo;s marginal product and thus moderate wages. This over-employment raises vacancy posting and tightness, which in general equilibrium bids up wages despite each firm&amp;rsquo;s individual wage-moderation motive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Efficiency (Theorem 3.2).&lt;/strong&gt; Under the standard Hosios condition — worker bargaining weight β equals the elasticity η(θ) of the matching function with respect to vacancies — the staffing-agency equilibrium achieves the social planner&amp;rsquo;s optimum (θ^SA = θ*), while the in-house equilibrium posts strictly too many vacancies (θ^SZ &amp;gt; θ^SA = θ*). The in-house arrangement can be optimal for some β &amp;gt; η when workers&amp;rsquo; bargaining power is sufficiently high (Theorem 3.3, proved for Cobb-Douglas matching and power production function), because the over-employment incentive then counteracts the externality from underprovision of vacancies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The main result: staffing agencies dominate (Theorem 3.4).&lt;/strong&gt; When firms choose their profit-maximizing combination of in-house hires n^SZ and rented staffers n^SA, the unique equilibrium has n^SZ = 0 and n^SA &amp;gt; 0 — firms use only staffers. The key is Lemma 3.1: renting one additional staffer reduces the wage paid to in-house workers by more than does hiring one additional in-house worker (formally, ∂w^SZ/∂n^SA &amp;lt; ∂w^SZ/∂n^SZ). This asymmetry arises because staffers cannot leave during intra-firm bargaining breakdowns — they remain regardless — so each additional staffer tightens the firm&amp;rsquo;s fallback position more effectively than an additional in-house hire. With continuous labor, any positive mass of in-house workers leaves residual scope for further wage moderation through staffers, so the firm always finds it profitable to convert the last in-house hire to a staffer. The corner solution n^SZ = 0 is thus the unique equilibrium. With discrete labor, a firm would be indifferent between exactly one and zero in-house workers.&lt;/p&gt;
&lt;p&gt;The paper also notes that this staffing-agency arrangement is formally equivalent to the &amp;ldquo;labor packer&amp;rdquo; or intermediate-good setup widely used in applied macroeconomics (e.g., Gertler, Sala, and Trigari 2008) to avoid Stole-Zwiebel complications, providing a micro-foundation for that modeling convention.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why does in-house hiring with decreasing returns to labor generate higher wages and employment than the staffing-agency arrangement?&lt;/strong&gt;
A: Under decreasing returns, if a worker&amp;rsquo;s wage negotiation breaks down and the worker leaves, the marginal product of the remaining n−1 workers rises. This gives each in-house worker additional bargaining leverage beyond the standard β parameter. To counteract this, firms over-employ in-house hires to keep the marginal product low. In general equilibrium this raises tightness θ^SZ &amp;gt; θ^SA, which in turn raises wages w^SZ &amp;gt; w^SA even though each individual firm&amp;rsquo;s motive was wage moderation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the formal basis for the in-house wage equation, and what does it depend on?&lt;/strong&gt;
A: Following Stole and Zwiebel&amp;rsquo;s stability condition, the continuous-labor wage for a firm with n workers is w(n) = (1−β)b + n^(−1/β) ∫₀ⁿ z^((1−β)/β) F&amp;rsquo;(z) dz. The wage depends on the entire distribution of marginal products over [0, n], not merely on the marginal product at n. In the special case of a power production function, the integral yields an explicit power function in n.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: When is the staffing-agency equilibrium socially efficient?&lt;/strong&gt;
A: Under the standard Hosios condition β = η(θ*), the staffing-agency equilibrium attains exactly the planner&amp;rsquo;s tightness (θ^SA = θ*), because bargaining in staffing agencies is standard — the worker&amp;rsquo;s outside option does not affect other workers&amp;rsquo; wages and so the usual efficiency characterization applies. The in-house equilibrium then strictly over-posts vacancies (θ^SZ &amp;gt; θ*).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Can the in-house equilibrium ever be socially optimal?&lt;/strong&gt;
A: Yes, but only under specific parameter conditions. Theorem 3.3 shows that with Cobb-Douglas matching (η constant) and a power production function F(n) = An^α, there exists a threshold β̂ ∈ (η, 1) at which θ^SZ = θ*. The intuition is that strong worker bargaining power creates a vacancy-underprovision problem; the over-employment incentive under in-house hiring then partially corrects it. The functional form restriction is made for expositional convenience; the core logic is general.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is Lemma 3.1 and why is it the key to the main result?&lt;/strong&gt;
A: Lemma 3.1 states that, given any positive number of in-house hires n^SZ &amp;gt; 0, renting one additional staffer reduces the wage paid to in-house workers by more than does hiring one additional in-house worker: ∂w^SZ/∂n^SA &amp;lt; ∂w^SZ/∂n^SZ. This is proved by showing the relevant integral in the difference (∂w^SZ/∂n^SA − ∂w^SZ/∂n^SZ) is negative for n^SZ &amp;gt; 0 given F&amp;rsquo;&amp;rsquo; &amp;lt; 0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why does renting an additional staffer moderate in-house wages more than hiring an in-house worker?&lt;/strong&gt;
A: An in-house worker who is hired can, in principle, leave during a bargaining breakdown, triggering renegotiation all the way down to zero in-house workers and driving the firm&amp;rsquo;s fallback to zero profit. A rented staffer cannot leave; at minimum, all rented staffers remain in production regardless of in-house bargaining outcomes. Each additional staffer thus raises the firm&amp;rsquo;s floor payoff in bargaining by more than an additional in-house hire does, generating stronger wage moderation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why does Theorem 3.4 produce a corner solution rather than an interior mix?&lt;/strong&gt;
A: Because labor is treated as a continuous input, any strictly positive mass n^SZ &amp;gt; 0 of in-house workers leaves the marginal in-house worker with positive bargaining leverage through the threat-to-leave mechanism. The firm can always improve its bargaining position by converting that marginal in-house worker to a staffer. This margin is present no matter how small n^SZ is, so the only equilibrium is n^SZ = 0. In discrete labor the firm would be indifferent between exactly one and zero in-house hires.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What happens to labor market tightness when both arrangements coexist?&lt;/strong&gt;
A: In the mixed equilibrium the tightnesses for in-house and staffer jobs must be equal in equilibrium (θ^SZ = θ^SA). If one tightness were higher, workers would prefer that job type (higher wage and higher probability of finding it), but firms would reduce vacancy posting there (costlier to fill), automatically equalizing tightness. This equilibration occurs even though in equilibrium vacancy posting for in-house jobs goes to zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Do the results require directed search or hold under random search as well?&lt;/strong&gt;
A: The results hold under both directed and random search. Appendix 3.C establishes that with random search and a single pooled matching function M = m(V^SZ + V^SA, N), the unique equilibrium also features n^SZ = 0. The directed-search assumption is made without loss of generality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does this paper imply for the applied macroeconomics literature&amp;rsquo;s &amp;ldquo;labor packer&amp;rdquo; modeling convention?&lt;/strong&gt;
A: The paper provides a formal micro-foundation for the labor-packer or intermediate-good approach used in New Keynesian DSGE models (e.g., Gertler, Sala, and Trigari 2008) to sidestep Stole-Zwiebel bargaining. In that literature, a &amp;ldquo;wholesale firm&amp;rdquo; or &amp;ldquo;packer&amp;rdquo; searches for workers and sells their services to final-goods firms under perfect competition — formally identical to the staffing-agency arrangement in this paper. Theorem 3.4 shows this arrangement is the unique equilibrium outcome of rational firm choice, so the shortcut is not merely convenient but theoretically grounded.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the empirical literature say about wage differentials between in-house and agency workers?&lt;/strong&gt;
A: Drenik et al. (2023), using Argentine administrative data linking temp agencies to user firms, estimate a significant wage premium for in-house hires relative to temp workers. This is consistent with the paper&amp;rsquo;s theoretical prediction that w^SZ &amp;gt; w^SA in the polar-case comparison (Theorem 3.1), though the paper itself presents no empirical estimation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the implications of the staffing-agency arrangement for measured labor shares?&lt;/strong&gt;
A: The paper notes that costs for staffers typically appear in firm accounts as intermediate input costs rather than labor costs. A shift from in-house hires to staffers therefore reduces measured labor costs and, because it also reduces value added (by more than the labor-cost reduction), lowers the measured labor share at the firm even when actual labor input and output are unchanged.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What future extensions do the authors identify as priorities?&lt;/strong&gt;
A: The authors flag three main extensions: (i) heterogeneous workers and firms, which could generate predictions about which firms use each hiring mode; (ii) worker effort/loyalty differences between in-house and agency workers that could make in-house hiring attractive ex post; and (iii) a frictional rental market for staffers or heterogeneous tasks within the firm, where insufficient staffer supply in certain sub-markets could restore a role for in-house hiring.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Staffing agency (in this paper&amp;rsquo;s sense):&lt;/strong&gt; An intermediary that posts vacancies on the frictional labor market, employs workers through standard Nash bargaining, and rents those workers one-for-one to producing firms on a frictionless, perfectly competitive market. The staffing agency is separated from the firm&amp;rsquo;s production decisions; its search activity has constant returns to scale.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;In-house hiring with Stole-Zwiebel bargaining:&lt;/strong&gt; A market arrangement in which the producing firm itself posts vacancies, employs workers, and conducts intra-firm Nash bargaining. Under decreasing returns to labor, the bargaining outcome for worker i depends on the firm&amp;rsquo;s payoff if that worker left, which in turn depends on wages paid to the remaining n−1 workers — generating a system of interdependent bargaining problems captured by the differential equation w(n) = (1−β)b + n^(−1/β) ∫₀ⁿ z^((1−β)/β) F&amp;rsquo;(z) dz.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage-moderation incentive (over-employment):&lt;/strong&gt; Under in-house hiring, a firm has an incentive to hire more workers than a social planner would recommend, because additional workers reduce each worker&amp;rsquo;s marginal product and hence the wage the firm must pay. This incentive is present because decreasing returns mean a departing worker raises the marginal product of remaining workers, giving each in-house worker leverage over the firm.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Differential wage-moderation effect (Lemma 3.1):&lt;/strong&gt; The finding that, given any positive mass of in-house hires, renting one additional staffer reduces in-house wages by more than hiring one additional in-house worker (∂w^SZ/∂n^SA &amp;lt; ∂w^SZ/∂n^SZ). The asymmetry arises because staffers cannot leave during intra-firm bargaining breakdowns, so they provide a more effective floor to the firm&amp;rsquo;s fallback payoff.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Hosios condition (as applied here):&lt;/strong&gt; The standard efficiency condition β = η(θ), where β is the worker&amp;rsquo;s Nash bargaining weight and η(θ) is the elasticity of the job-offer arrival rate with respect to tightness. When this condition holds, the staffing-agency equilibrium is socially optimal (θ^SA = θ*) and the in-house equilibrium is inefficient (θ^SZ &amp;gt; θ*).&lt;/p&gt;</description></item><item><title>Stock market participation and macro-financial trends</title><link>https://macropaperwarehouse.com/papers/stock-market-participation-and-macro-financial-trends/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/stock-market-participation-and-macro-financial-trends/</guid><description>&lt;p&gt;This paper documents a puzzle for canonical limited-participation models: when U.S. stock market participation rose from 31.6% to 53% between 1989 and 2007—a period also characterized by the Great Moderation—the equity premium and stock return volatility increased rather than fell as those models would predict. The paper resolves this puzzle using an RBC model with concentrated capital ownership in which capitalists have external habit utility with a habit stock that depends on aggregate per capita consumption. As participation rises, the representative capitalist&amp;rsquo;s consumption converges to aggregate consumption, shrinking the surplus-consumption ratio and raising endogenous average risk-aversion; this risk-aversion channel dominates the conventional risk-sharing channel (which predicts a lower equity premium under higher participation). The model implies that higher participation generates a sizeable rise in both the equity premium and stock return volatility while reducing the risk-free rate and aggregate consumption volatility—jointly explaining the observed U.S. macro-financial patterns. Household-level data from the Consumption Expenditure Survey (1984–2017) and cross-state variation support the model&amp;rsquo;s mechanism.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on a working paper version, AI-assisted and human-reviewed. See the linked published article for the authoritative version.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-puzzle-the-paper-addresses"&gt;Q1. What is the puzzle the paper addresses?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Existing limited-participation models predict that higher stock market participation should reduce the equity premium (by improving risk-sharing), yet the U.S. experienced a rising equity premium and higher stock return volatility precisely during the period of sharp participation growth (1989–2007), at the same time as the Great Moderation.&lt;/strong&gt; The standard channel predicts that as more households access financial markets, the representative capitalist&amp;rsquo;s risk burden falls and the covariance between capitalists&amp;rsquo; consumption and equity returns declines, lowering the equity premium. The data contradict this prediction, motivating the paper&amp;rsquo;s novel mechanism.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-novel-risk-aversion-channel"&gt;Q2. What is the novel risk-aversion channel?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;As stock market participation rises, the representative capitalist&amp;rsquo;s consumption converges toward aggregate per capita consumption, shrinking the surplus-consumption ratio and thereby raising the endogenous effective risk-aversion of the economy—this risk-aversion channel dominates the conventional risk-sharing channel.&lt;/strong&gt; The key assumption is that capitalists&amp;rsquo; habit stock depends on aggregate per capita consumption. The surplus-consumption ratio (the gap between the capitalist&amp;rsquo;s consumption and the habit level) determines risk-aversion in the external habit utility framework. As participation rises, the capitalist&amp;rsquo;s consumption approaches the habit level, increasing risk-aversion and the equity premium, even as aggregate consumption volatility falls.&lt;/p&gt;
&lt;h3 id="q3-what-are-the-models-predictions-for-macro-financial-variables"&gt;Q3. What are the model&amp;rsquo;s predictions for macro-financial variables?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In the model economy, an increase in stock market participation generates a sizeable rise in both the equity premium and the volatility of stock returns, a moderate increase in the price-dividend ratio, and a fall in the average risk-free rate and aggregate consumption volatility—jointly accounting for the U.S. macro-financial experience since the 1980s.&lt;/strong&gt; The rise in equity premium and stock volatility produced by higher participation substantially counteracts the shrinking effect due to lower aggregate uncertainty from the Great Moderation, providing a unified explanation for the co-movement of these macro-financial trends.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-empirical-evidence-supporting-the-mechanism"&gt;Q4. What is the empirical evidence supporting the mechanism?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Household-level data from the U.S. Consumption Expenditure Survey (1984–2017) show that the model-implied average risk-aversion for the representative stockholder trended upward over time closely tracking the rate of participation, while the stockholder-to-aggregate consumption ratio trended downward; cross-state data document a negative relationship between participation and the stockholder-to-aggregate consumption ratio.&lt;/strong&gt; Both the time-series and cross-sectional patterns are consistent with the model&amp;rsquo;s prediction that higher participation compresses the gap between stockholder and aggregate consumption, the key driver of the risk-aversion channel.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;participation puzzle&lt;/strong&gt; : the empirical regularity that only a fraction of the population participates in the stock market; exploited in asset pricing models to explain the equity premium with plausible average risk-aversion; this paper studies the consequences of the upward trend in participation since the 1980s.
&lt;strong&gt;surplus-consumption ratio&lt;/strong&gt; : the gap between the capitalist&amp;rsquo;s consumption and their habit level, normalized by consumption; the key state variable in external habit utility models; determines endogenous risk-aversion so that a shrinking surplus-consumption ratio raises risk-aversion.
&lt;strong&gt;risk-aversion channel&lt;/strong&gt; : the novel mechanism introduced in this paper: as stock market participation rises, the capitalist&amp;rsquo;s consumption converges to aggregate consumption, shrinking the surplus-consumption ratio and raising endogenous risk-aversion and thus the equity premium; dominates the conventional risk-sharing channel in the model.
&lt;strong&gt;risk-sharing channel&lt;/strong&gt; : the conventional channel in limited-participation models: higher participation improves risk-sharing, reducing the covariance between stockholder consumption and equity returns and tending to depress the equity premium; present in the model but dominated by the risk-aversion channel.&lt;/p&gt;</description></item><item><title>Supply, Demand, Institutions, and Firms: A Theory of Labor Market Sorting and the Wage Distribution</title><link>https://macropaperwarehouse.com/papers/supply-demand-institutions-and-firms-a-theory-of-labor-market-sorting-and-the-wage-distribution/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/supply-demand-institutions-and-firms-a-theory-of-labor-market-sorting-and-the-wage-distribution/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; How do workforce composition (labor supply), labor demand, and minimum wage policy jointly determine the wage distribution in imperfectly competitive labor markets, and what were the quantitative contributions of each force to the dramatic decline in Brazilian wage inequality between 1998 and 2012?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Motivation.&lt;/strong&gt; Brazil&amp;rsquo;s formal-sector wage inequality fell sharply over this period. Three candidate shocks are well-documented: (1) a large increase in educational attainment — the share of adults completing at least secondary school rose by 20 percentage points (a 68 percent increase) between 1998 and 2012; (2) labor demand shocks, primarily the commodities boom of the 2000s; and (3) a 93.7 percent (66.1 log point) real increase in the federal minimum wage. Existing frameworks analyze these shocks separately — competitive supply/demand models on one side and imperfectly competitive minimum wage models on the other — and therefore cannot detect interactions or jointly explain all observed patterns, including the novel finding that assortative matching between high-wage workers and high-wage establishments rose in 104 out of 151 microregions, a fact inconsistent with the predictions of leading minimum wage models.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The paper uses the RAIS (Relação Anual de Informações Sociais), a confidential linked employer-employee dataset covering the Brazilian formal sector, together with Brazilian Census data for 1991, 2000, and 2010. Statistics are computed for 151 microregions (analogous to US commuting zones) with at least 15,000 workers in RAIS in both base years and at least 1,000 formal workers per educational group. The final sample covers 73 percent of the adult population. Firm wage premiums and assortative matching are measured via AKM two-way fixed effects regressions using the bias-corrected KSS (Kline, Saggio, Sølvsten 2018) estimator, run separately for each microregion and period on three-year panels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Theoretical framework.&lt;/strong&gt; The paper develops a unified general-equilibrium model featuring: (i) a task-based production function with distance-dependent complementarity between worker types; (ii) monopsony power arising from idiosyncratic worker preferences for firms, generating constant firm-level labor supply elasticity β (calibrated at 4, implying markdowns of 20 percent); (iii) heterogeneous firms differentiated by their production &amp;ldquo;blueprints&amp;rdquo; (the complexity of tasks they require), with blueprint shape parameterized as a Gamma distribution; and (iv) free firm entry, endogenous participation, and goods market general equilibrium with CES consumer preferences (elasticity σ). A key result is that firms with different blueprints exhibit different within-firm substitution patterns: worker types that are substitutes at low-skill, low-wage firms may be complements at high-skill, high-wage firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Estimation.&lt;/strong&gt; A parsimonious parameterization is estimated by simultaneous-equation nonlinear least squares, targeting 26 endogenous outcomes per region (13 per period) including between- and within-group wage inequality, variance of establishment effects, covariance of worker and establishment effects, formal employment rates by education, and minimum wage bindingness. The model requires solving for equilibrium more than 15,000 times per optimization step (151 regions × 2 periods × 53 Jacobian columns). The elasticity of substitution between goods is estimated at σ = 8.36 (significantly above 1), and the aggregate labor supply parameter λ implies formal-sector elasticities of approximately 0.6–0.7 for college workers and around 1.1 for less-than-secondary workers. The model fits the data well, with R² above 0.5 for most targeted moments and perfect fit for the six moments used in the inversion procedure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Demand shocks and the minimum wage are the primary drivers of falling inequality.&lt;/strong&gt; In counterfactual simulations, the minimum wage alone (a 66.1 log point increase) reduces the variance of log wages by 0.13. Demand shocks reduce it by a further 0.18. Supply shocks (rising education) increase the variance by 0.04, leaving their net inequality-reducing contribution negligible.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Supply shocks increase assortative matching despite compressing within-firm skill premiums.&lt;/strong&gt; Within-firm task reassignment would reduce the variance of log wages by 0.221 and the correlation between worker and establishment effects by 0.165, holding production levels and firm entry fixed. However, scale, entry, and price adjustments — driven by the large estimated σ = 8.36 &amp;gt; β + 1 = 5 — reallocate skilled labor toward high-wage, skill-intensive firms, counteracting within-firm compression and raising assortative matching by 0.189. These two channels largely offset each other.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Concurrent supply and demand changes attenuate minimum wage impacts by roughly half.&lt;/strong&gt; When the minimum wage is the only shock, it would have reduced the variance of log wages by 0.13; in the presence of supply and demand changes, its incremental contribution is approximately 0.07. Minimum wage effects on sorting (which would reduce assortative matching when acting alone) disappear when accompanied by supply and demand transformations.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Minimum wage effects are concentrated in the bottom two productivity deciles.&lt;/strong&gt; Wage effects for workers in productivity deciles three through ten from the minimum wage are approximately 1 percent or less once all channels are considered. Strong wage gains are concentrated at the bottom, primarily through the monopsony channel. The wage-posting channel (within-firm returns to skill) reduces wages for low- and middle-skill workers and raises them at the top two deciles due to the reallocation of low-skilled workers toward high-wage firms, which reduces those workers&amp;rsquo; marginal products there.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Cross-firm differences in substitution patterns generate non-standard minimum wage spillovers.&lt;/strong&gt; Conditional on the task demands of the firm employing them, a pair of worker types may be substitutes in low-skill firms and complements in high-skill firms. This firm-heterogeneity channel causes minimum wage impacts to be non-monotone across the productivity distribution, contrasting with the smooth inequality-reducing effects predicted by both competitive task-based models and frictional minimum wage models.&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-novel-empirical-fact-that-motivates-the-unified-framework"&gt;Q1. What is the novel empirical fact that motivates the unified framework?&lt;/h3&gt;
&lt;p&gt;A: Using KSS bias-corrected AKM decompositions performed separately for each of 151 microregions, the paper documents that assortative matching — measured as the correlation between worker and establishment fixed effects — rises in 104 out of 151 regions between 1998 and 2012. The covariance term accounts for less than 7 percent of the average decline in the variance of log wages. This finding is inconsistent with the leading imperfectly competitive minimum wage model (Engbom and Moser 2022), in which minimum wages reduce assortative matching. It is also inconsistent with purely competitive supply/demand models, which have no role for firm wage premiums or sorting. The divergence from prior national-level studies (which do not find rising sorting) is explained by the fact that national-level sorting conflates geographical sorting with supply-demand dynamics.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-key-mechanism-through-which-the-task-based-production-function-generates-cross-firm-differences-in-substitution-patterns"&gt;Q2. What is the key mechanism through which the task-based production function generates cross-firm differences in substitution patterns?&lt;/h3&gt;
&lt;p&gt;A: In the task-based production function, each firm assigns workers to tasks assortatively — lower types handle lower-complexity tasks, higher types handle higher-complexity tasks, with cutoff thresholds determined by the firm&amp;rsquo;s blueprint. When a firm has a blueprint concentrated in complex tasks (a high-skill, high-wage firm), adjacent worker types are more differentiated in the tasks they perform, making them complements. When a firm has a blueprint concentrated in simple tasks (a low-skill, low-wage firm), adjacent worker types are assigned to a narrow, similar range of tasks and are therefore closer substitutes. The elasticity of complementarity between any pair of worker types is thus endogenous, depending on which tasks the firm uses and, in the monopsony case, on the firm&amp;rsquo;s skill intensity — a prediction validated empirically using nonroutine cognitive task content data for Brazilian occupations.&lt;/p&gt;
&lt;h3 id="q3-under-what-conditions-can-a-positive-supply-shock-rising-educational-attainment-widen-the-aggregate-skill-wage-premium-rather-than-compress-it"&gt;Q3. Under what conditions can a positive supply shock (rising educational attainment) widen the aggregate skill wage premium rather than compress it?&lt;/h3&gt;
&lt;p&gt;A: The paper&amp;rsquo;s Proposition 4 and Corollary 2 show that a supply shock that increases the relative supply of skilled workers can widen the aggregate skill wage premium when the elasticity of substitution between goods (σ) exceeds the firm-level elasticity of labor supply plus one (β + 1). Intuitively, when σ is large, the reduction in prices for skill-intensive goods generated by the supply shock shifts consumption toward those goods, causing net entry of skill-intensive firms. If the gains in firm wage premiums earned by skilled workers reallocated to those firms outweigh the compression in within-firm productivity differentials, the aggregate skill premium can rise. This mechanism does not require non-convexities from endogenous innovation; it operates through imperfect competition and firm entry alone. In the estimated Brazilian model, σ = 8.36 substantially exceeds β + 1 = 5, so this condition holds, explaining why rising education increases rather than compresses assortative matching in the data.&lt;/p&gt;
&lt;h3 id="q4-how-does-the-model-generate-positive-employment-effects-from-minimum-wages-and-how-do-these-interact-with-reallocation"&gt;Q4. How does the model generate positive employment effects from minimum wages, and how do these interact with reallocation?&lt;/h3&gt;
&lt;p&gt;A: In the monopsonistic baseline without a minimum wage, firms post wages below workers&amp;rsquo; marginal revenue products, causing some workers to choose non-employment. A minimum wage increase raises posted wages at constrained firms, shifting some workers from non-employment (or home production) to formal employment, generating positive employment effects at the margin where the minimum wage binds. Simultaneously, minimum wages price out the least productive workers at low-wage firms (disemployment), while workers in the intermediate productivity range reallocate from low- to high-wage firms, because high-wage firms have higher revenue productivity and can profitably hire workers that low-wage firms can no longer afford. The net employment elasticity for the lowest productivity decile with respect to the log minimum wage is −0.61 (Table 7), while the mean wage for that decile rises substantially through the monopsony channel.&lt;/p&gt;
&lt;h3 id="q5-what-are-the-three-channels-through-which-the-minimum-wage-affects-wages-and-employment-in-the-model-and-what-does-each-channel-contribute"&gt;Q5. What are the three channels through which the minimum wage affects wages and employment in the model, and what does each channel contribute?&lt;/h3&gt;
&lt;p&gt;A: The paper decomposes minimum wage effects into three channels. Channel 1 (monopsony): mechanical wage increases, positive employment effects at firms where the minimum wage binds, disemployment of very low-productivity workers, and reallocation from low- to high-wage firms, holding posted wage schedules, prices, and entry fixed. This channel accounts for nearly all of the strong wage effects at the bottom two productivity deciles. Channel 2 (wage posting): firms reoptimize earnings schedules following changes in worker composition and marginal products induced by Channel 1, holding prices and entry fixed. This channel reduces wages for low- and middle-skill workers (productivity deciles 1–7) by approximately 0.01–0.02 log points and increases wages for top deciles (decile 9: +0.04, decile 10: +0.11), because reallocation of low-skill labor to high-wage firms lowers those workers&amp;rsquo; marginal products there. Channel 3 (general equilibrium): firm entry and price responses. The fall in low-wage-firm profits causes entry of high-wage, skill-intensive firms, while the price of low-skill goods falls. General equilibrium effects generate modest positive wage effects for most workers but negative effects for very low-productivity workers due to reduced aggregate demand for low-skill labor.&lt;/p&gt;
&lt;h3 id="q6-why-do-the-minimum-wages-inequality-reducing-effects-diminish-when-accompanied-by-concurrent-supply-and-demand-changes"&gt;Q6. Why do the minimum wage&amp;rsquo;s inequality-reducing effects diminish when accompanied by concurrent supply and demand changes?&lt;/h3&gt;
&lt;p&gt;A: The paper documents that, under concurrent supply and demand transformations, the minimum wage&amp;rsquo;s reduction of the variance of log wages is approximately 0.07, roughly half the 0.13 reduction it would achieve acting alone. The attenuation occurs through interactions: supply and demand shocks raise the average productivity level of the labor market and shift workers toward high-wage, skill-intensive firms. In this altered equilibrium, the minimum wage binds less tightly (or hits a different part of the distribution), and the reallocation effects of the minimum wage that would normally reduce assortative matching are offset by the sorting-increasing effects of supply and demand changes. The estimated model shows that interactions between the minimum wage and supply/demand changes (columns 6, 7, 8 of Table 5) are economically meaningful, something undetectable without a unified framework.&lt;/p&gt;
&lt;h3 id="q7-how-does-the-models-prediction-regarding-minimum-wage-spillovers-differ-from-engbom-and-moser-2022-and-what-explains-the-difference"&gt;Q7. How does the model&amp;rsquo;s prediction regarding minimum wage spillovers differ from Engbom and Moser (2022), and what explains the difference?&lt;/h3&gt;
&lt;p&gt;A: Engbom and Moser (2022) find that the Brazilian minimum wage hike had significant wage effects extending far up the worker productivity distribution, while this paper&amp;rsquo;s model finds negligible effects (approximately 1 percent) beyond the bottom two productivity deciles. Two structural differences explain this divergence. First, Engbom and Moser (2022) assume perfect substitutability between worker types within firms, so a minimum wage increase at low-wage firms mechanically raises posted wages at all other firms to maintain relative competitiveness. In this paper&amp;rsquo;s framework, wage-posting responses at high-wage firms can be negative for low-skill workers because the inflow of reallocated low-skill workers reduces their marginal products — a channel absent under perfect substitution. Second, Engbom and Moser (2022) use a national model, allowing displaced low-skill workers to reallocate to top-productivity firms anywhere in the country, dampening disemployment; this paper&amp;rsquo;s local labor markets approach restricts reallocation to within-region boundaries, consistent with low rates of interregional migration documented for Brazil by Dix-Carneiro and Kovak (2017).&lt;/p&gt;
&lt;h3 id="q8-how-are-firm-wage-premiums-generated-in-the-model-and-why-do-differences-in-physical-productivity-between-firms-not-generate-wage-differentials"&gt;Q8. How are firm wage premiums generated in the model, and why do differences in physical productivity between firms not generate wage differentials?&lt;/h3&gt;
&lt;p&gt;A: Proposition 3 establishes that wage dispersion for similar workers across firms requires either (i) differences in blueprint shapes (firm heterogeneity in skill intensity) or (ii) differences in entry costs. Differences in physical productivity (z_g) or consumer taste parameters alone are insufficient, because with equal entry costs, differences in productivity lead to additional firm entry until the marginal revenue product of labor is equalized across firm types. Wage premiums proportional to entry costs arise because optimal firm creation requires larger-scale operation for higher-entry-cost firms, and hiring more workers forces those firms to post higher wages. Additionally, skill-intensive firms (firms with blueprints tilted toward complex tasks) pay relative wage premiums for the worker types they use most intensively, and if skill intensity and entry costs co-vary, all workers at high-skill firms may receive a wage premium.&lt;/p&gt;
&lt;h3 id="q9-how-does-the-estimation-procedure-handle-unobserved-regional-heterogeneity-in-labor-demand"&gt;Q9. How does the estimation procedure handle unobserved regional heterogeneity in labor demand?&lt;/h3&gt;
&lt;p&gt;A: Demand shocks are not directly observed; they are inferred as a residual from changes in targeted outcomes after accounting for observed supply (education shares from Census) and minimum wage changes. Five region-time-specific demand parameters — TFP (z), blueprint complexities (θ₁, θ₂), relative entry costs (F₂/F₁), and relative consumer preferences (γ₂/γ₁) — are modeled as linear functions of 1998 regional covariates (educational shares, agricultural share, manufacturing share, and initial minimum wage bindingness) with time-specific coefficients. This formulation allows unobserved demand shifters to correlate with initial educational levels, preventing incorrect attribution of demand-supply correlations to causal supply effects. Region-specific parameters (TFP in each period, education-group-specific formal employment shifters) are inverted exactly from six targeted moments within each region, eliminating incidental parameter bias.&lt;/p&gt;
&lt;h3 id="q10-what-micro-level-empirical-validations-does-the-paper-conduct-for-the-task-based-models-mechanisms"&gt;Q10. What micro-level empirical validations does the paper conduct for the task-based model&amp;rsquo;s mechanisms?&lt;/h3&gt;
&lt;p&gt;A: The paper tests four micro-level predictions using nonroutine cognitive task content data for Brazilian occupations. First, skill-intensive firms have greater demand for complex tasks (consistent with Figure 1 of the model). Second, within firms, more skilled workers are assigned to more complex tasks (Lemma 1). Third, workers who move to more skill-intensive firms are assigned more complex tasks (Lemma 2, consistent with the monopsony model&amp;rsquo;s mismatch prediction). Fourth, wage gaps between high- and low-skill firms are larger for skilled workers (Proposition 3). The paper reports finding strong support for all four predictions in the data, lending credibility to the theoretical structure and quantitative results.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Task-based production function (paper&amp;rsquo;s definition):&lt;/strong&gt; A production function in which a firm produces output by assigning workers of different types to tasks indexed by complexity. The assignment is assortatively optimal: lower-type workers handle lower-complexity tasks, with unique threshold complexities separating adjacent worker types. The critical property is distance-dependent complementarity — any pair of worker types that are &amp;ldquo;close&amp;rdquo; in skill rank are substitutes, while pairs distant in skill rank are complements. This differs from CES production functions where the elasticity of complementarity is the same for all pairs; in the task-based version, substitutability depends on endogenous assignment and thus on the firm&amp;rsquo;s blueprint.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Blueprint (paper&amp;rsquo;s definition):&lt;/strong&gt; A function b_g(x) that specifies the density of tasks of each complexity level x required to produce one unit of good g. It is the fundamental source of firm heterogeneity in the model: firms producing goods with blueprints tilted toward complex tasks are more skill-intensive, hire workers of higher average type, and pay higher wages. The paper parameterizes blueprints as Gamma distributions with shape parameter θ_g indexing average task complexity; firms with higher θ_g are more skill-intensive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Firm wage premium (paper&amp;rsquo;s definition):&lt;/strong&gt; The component of wages at a given establishment that accrues equally to all workers at that firm regardless of their type, measured as the establishment fixed effect ψ_j in AKM two-way fixed effects regressions. In this model, firm wage premiums arise from heterogeneity in blueprints (skill intensity) and entry costs, not from differences in TFP or consumer tastes. Under monopsony, firms with higher entry costs must operate at larger scale and post higher wages; blueprint heterogeneity generates differential wage premiums by skill type.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sorting / assortative matching (paper&amp;rsquo;s definition):&lt;/strong&gt; The correlation between the worker fixed effect (ν_i,r capturing worker skill) and the establishment fixed effect (ψ_j capturing firm wage premium) in the AKM decomposition, measured as Cov(ν_i,r, ψ_{J(i,r,τ)} | r). In this paper&amp;rsquo;s framework, sorting arises because firms with blueprints demanding complex tasks (high-wage firms) have a comparative advantage in employing high-skill workers; labor market sorting can therefore change over time due to supply, demand, or minimum wage shocks, even without changes in search frictions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Monopsony power / markdown (paper&amp;rsquo;s definition):&lt;/strong&gt; Arising from idiosyncratic worker preferences for firms (modeled as a nested logit), firms face upward-sloping labor supply curves with constant firm-level elasticity β. Optimal posted wages equal a constant markdown β/(β+1) of the marginal revenue product of labor, set to β = 4 (implying a 20 percent markdown). The macro elasticity of formal sector labor supply is governed by a separate parameter λ, estimated from the data, yielding aggregate formal-sector supply elasticities of approximately 0.6–0.7 for college workers and around 1.1 for less-educated workers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage posting responses (paper&amp;rsquo;s definition):&lt;/strong&gt; The second channel of minimum wage effects, in which firms reoptimize their entire earnings schedule following the wage-composition changes induced by the minimum wage&amp;rsquo;s mechanical and reallocation effects (Channel 1), while keeping goods prices and firm entry fixed. Because task-based production functions are concave, changes in factor proportions (due to reallocation of low-skill workers to high-wage firms) alter marginal products of all worker types within those firms, causing firms to adjust all posted wages — not just those directly constrained by the minimum wage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Distance-dependent complementarity (paper&amp;rsquo;s definition):&lt;/strong&gt; The property, proven as a Corollary to Proposition 1, that for a fixed worker type h, the partial elasticity of complementarity between h and any other type h&amp;rsquo; is strictly increasing in h&amp;rsquo; for h&amp;rsquo; ≥ h (more distant high types are stronger complements) and strictly decreasing in h&amp;rsquo; for h&amp;rsquo; ≤ h (more distant low types are weaker substitutes / stronger complements). This pattern results from the division of labor: adding a very different worker type allows specialization gains that do not arise when adding similar-type workers competing for the same tasks.&lt;/p&gt;</description></item><item><title>Take the Goods and Run: Contracting Frictions and Market Power in Supply Chains</title><link>https://macropaperwarehouse.com/papers/take-the-goods-and-run-contracting-frictions-and-market-power-in-supply-chains/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/take-the-goods-and-run-contracting-frictions-and-market-power-in-supply-chains/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;This paper studies the efficiency of self-enforced relational agreements in manufacturing supply chains when sellers have market power and contracts cannot be externally enforced. The setting is Ecuador, an upper-middle-income country with slow commercial courts (debt enforcement takes around two years even after a 2016 reform) and highly concentrated manufacturing markets (average Herfindahl-Hirschman Index of 0.6 for 6-digit economic codes, well above the 0.25 threshold used by the US Department of Justice to identify highly concentrated markets).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; How efficiently do long-term trade relationships operate, period by period, when the seller can price discriminate and the buyer can opportunistically default on trade-credit debt? Does seller market power worsen or mitigate enforcement-driven inefficiencies?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The paper uses three Ecuadorian government administrative databases: (1) an electronic invoicing (EI) system covering all sales of 49 large manufacturing firms in textiles, pharmaceuticals, and cement products for 2016–2017, providing product-level unit prices, quantities, and payment method for each buyer-seller pair (median seller has 600 buyers); (2) the universe of firm-to-firm VAT transactions from 2008–2015, used to measure relationship age (censored at 9 years); and (3) annual financial statements providing variable costs to proxy marginal cost.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The author develops a dynamic contracting model that embeds non-linear pricing with heterogeneous buyers (following Jullien 2000 and Attanasio-Pastorino 2020) into an infinitely repeated game with limited enforcement (following Martimort et al. 2017). The seller holds all bargaining power, commits to a long-term menu of prices and quantities, and finances every transaction through trade-credit. The buyer has a privately observed, fully persistent type (willingness to pay) and can opportunistically default after delivery — &amp;ldquo;take the goods and run&amp;rdquo; — at the cost of losing the future relationship. The seller uses the value of the ongoing relationship as the enforcement instrument. The paper solves the seller&amp;rsquo;s profit-maximization problem using a recursive Lagrangian approach, yielding a modified virtual-surplus condition that governs optimal quantity allocations as a function of current and past limited-enforcement Lagrange multipliers (LE multipliers).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Six motivating empirical facts&lt;/strong&gt; documented in the data: (1) New buyers are ~35% of pairs but account for only ~10% of total trade; relationships lasting nine or more years are less than 10% of pairs but generate over 30% of trade. (2) Trade-credit is used in approximately 65% of transactions in the first year and 70–75% in older relationships. (3) Quantities increase as relationships age. (4) A 10% increase in quantity purchased is associated on average with a 2% decrease in unit price (quantity discounts). (5) Conditional on quantity, older buyers pay up to 3% less; these price discounts appear only in trade-credit transactions, not in pay-in-advance transactions. (6) Approximately 40% of new relationships survive one additional year, 60% of relationships aged 1–3 years survive, and more than 75% of relationships aged four or more years survive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key structural finding.&lt;/strong&gt; Almost all new relationships have binding enforcement constraints. The estimated LE multiplier equals 1 (unconstrained) only for the top 1% of pairs at tenure 0. As relationships age, the constraint relaxes and quantities are backloaded — consistent with the seller making promises of higher future trade to incentivize current debt repayment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Efficiency results.&lt;/strong&gt; New relationships operate at approximately 30% of the frictionless (first-best) surplus level. Efficiency rises to 60% at tenure 2, 75% at tenure 4, and over 80% at tenure 5. Aggregating across buyers weighted by efficient quantities: only 5% of sellers trade efficiently with new buyers, rising to 70% by tenure 2 and 84% in the long term. By sector, 68% of textiles, 88% of pharmaceutical, and 95% of cement-product sellers reach efficient aggregate output by tenure 5. Sellers capture approximately 80% of generated surplus; the median buyer captures around 25%, and the smallest buyers may capture less than 10%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Counterfactuals reveal a second-best interaction.&lt;/strong&gt; Fixing enforcement alone (Counterfactual a: non-linear pricing with perfect enforcement) raises surplus for 75% of buyers in the early tenures but reduces surplus for essentially all buyers in later tenures, because the threat of buyer default was the force compelling the seller to promise growing quantities over time. Fixing market power alone (Counterfactual b: uniform pricing with limited enforcement) collapses surplus to 0–40% of the baseline because the seller can no longer tailor dynamic incentives to each buyer&amp;rsquo;s enforcement constraint, causing a large share of buyers to be excluded from trade. Addressing both frictions simultaneously (Counterfactual c: uniform pricing with perfect enforcement) raises surplus for most buyers in early tenures but remains welfare-reducing for high types in later tenures; the aggregate effect depends critically on weighting: positive (~40% gain) when weighted by number of buyers, negative (surplus falls to ~58% of baseline) when weighted by quantities.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-central-theoretical-mechanism-by-which-limited-enforcement-leads-to-backloading-of-quantities-in-the-model"&gt;Q1. What is the central theoretical mechanism by which limited enforcement leads to backloading of quantities in the model?&lt;/h3&gt;
&lt;p&gt;The buyer can default after delivery because payment is post-delivery (trade-credit). To prevent this, the seller must ensure the buyer&amp;rsquo;s discounted future net returns exceed the current payment obligation. This creates a forward-looking enforcement constraint: the seller must credibly promise sufficiently large future quantities at lower prices. As a result, current quantities are distorted downward (the seller delays granting full trade volumes), but quantities increase over time as past promises become binding promise-keeping constraints. The optimal contract is therefore non-stationary: total surplus generated and the buyer&amp;rsquo;s net return both increase over time even without efficiency gains in production.&lt;/p&gt;
&lt;h3 id="q2-how-does-seller-market-power-interact-with-enforcement-frictions--does-it-worsen-or-improve-efficiency-relative-to-a-perfect-enforcement-benchmark"&gt;Q2. How does seller market power interact with enforcement frictions — does it worsen or improve efficiency relative to a perfect-enforcement benchmark?&lt;/h3&gt;
&lt;p&gt;The paper&amp;rsquo;s key finding is that market power and enforcement constraints act as partially offsetting frictions. Seller market power creates downward quantity distortions (the seller restricts supply to extract rents). Limited enforcement, however, compels the seller to promise growing quantities to prevent buyer default, which counteracts the market-power distortion. Thus, in older relationships, the enforcement constraint effectively disciplines the seller&amp;rsquo;s rent-extraction incentives, producing trade levels that approach the frictionless first-best. This is an instance of the theory of second-best: each friction partially offsets the other, so removing only one friction can reduce total welfare.&lt;/p&gt;
&lt;h3 id="q3-what-are-the-six-motivating-empirical-facts-and-why-do-they-rule-out-standard-alternative-explanations"&gt;Q3. What are the six motivating empirical facts and why do they rule out standard alternative explanations?&lt;/h3&gt;
&lt;p&gt;The six facts are: (1) heavy concentration of trade in long-established relationships; (2) widespread trade-credit even in new relationships; (3) quantities increase with relationship age; (4) quantity discounts within any age cohort; (5) older buyers pay lower prices conditional on quantity; (6) survival rates increase with quantity and relationship age. Alternative models — efficiency gains, learning, demand assurance, and supply-side enforcement issues — cannot jointly account for all six patterns under realistic assumptions. Critically, Fact 5 holds only in trade-credit transactions and not in pay-in-advance transactions, which supports limited enforcement (not learning or demand assurance) as the underlying mechanism.&lt;/p&gt;
&lt;h3 id="q4-how-is-the-model-identified-from-cross-sectional-data-on-prices-and-quantities-for-a-single-seller"&gt;Q4. How is the model identified from cross-sectional data on prices and quantities for a single seller?&lt;/h3&gt;
&lt;p&gt;Identification exploits two sources of variation. First, because the seller offers non-linear price menus that induce type revelation, cross-sectional variation in prices and quantities across buyers reveals their underlying private types. Second, for the highest-type buyer at tenure 0, the cumulative LE multiplier equals 1 by construction, so the gap between the observed marginal price and marginal cost directly reveals the current enforcement multiplier for that type; cross-sectional variation across high-type buyers then identifies the elasticity parameter β. Once β is pinned down, the multipliers for all types and tenures are recovered as unique solutions to ordinary differential equations, and buyer types are recovered semi-parametrically. The approach requires only cross-sectional data from one seller per year — no panel of individual buyers is needed.&lt;/p&gt;
&lt;h3 id="q5-what-are-the-estimated-magnitudes-of-the-marginal-product-of-capital-wedge-and-how-do-they-compare-to-related-studies"&gt;Q5. What are the estimated magnitudes of the marginal product of capital wedge, and how do they compare to related studies?&lt;/h3&gt;
&lt;p&gt;The paper finds a wedge between the buyer&amp;rsquo;s marginal product of capital (MPK) and the transaction price of 40% for the median new relationship and 34% for the median tenure-5 relationship. These wedges are smaller than the 80% gaps estimated for Indian firms by Banerjee and Duflo (2014), and larger than the average 6% gap calculated by Blouin and Macchiavello (2019) in the international coffee market. They are also much smaller than the 300–500% gaps estimated for Mexican micro-enterprises by McKenzie and Woodruff (2008), which is consistent with the buyers in this sample being substantially larger (median yearly sales of USD 200,000).&lt;/p&gt;
&lt;h3 id="q6-what-does-counterfactual-a--perfect-enforcement-with-non-linear-pricing--reveal-about-the-intertemporal-trade-off"&gt;Q6. What does Counterfactual (a) — perfect enforcement with non-linear pricing — reveal about the intertemporal trade-off?&lt;/h3&gt;
&lt;p&gt;Counterfactual (a) shows massive short-run gains for low and middle types: surplus at tenure 0 increases to 1,508% and 628% of baseline for the bottom 10th and median buyer percentile groups respectively. However, for higher types (top 25%), perfect enforcement is immediately welfare-reducing because these buyers are already trading near efficiently and the seller loses the incentive to grow quantities over time once default is not a threat. By tenure 3 and beyond, perfect enforcement reduces surplus for essentially all buyers. The aggregate effect is negative because high-type buyers, who trade larger volumes, bear larger losses in later tenures when those tenures are weighted by quantity.&lt;/p&gt;
&lt;h3 id="q7-why-does-uniform-pricing-with-limited-enforcement-counterfactual-b-perform-so-poorly"&gt;Q7. Why does uniform pricing with limited enforcement (Counterfactual b) perform so poorly?&lt;/h3&gt;
&lt;p&gt;Under uniform pricing, the seller cannot tailor the dynamic contract to each buyer&amp;rsquo;s individual enforcement constraint. Without individualized price-quantity menus, many buyers cannot credibly commit to repaying their debts — because the seller cannot offer a sufficiently personalized future stream of benefits — and are thus excluded from trade entirely. For instance, at tenure 0, 95.8% of the bottom-decile buyers and 64% of median buyers are excluded. The aggregate surplus under this regime reaches only 3–68% of baseline across different tenures and percentile groups. This implies that the seller&amp;rsquo;s price discrimination ability, while generating informational rents, also serves a second purpose: it allows each buyer&amp;rsquo;s specific enforcement constraint to be satisfied, enabling trade that would otherwise be infeasible.&lt;/p&gt;
&lt;h3 id="q8-what-do-the-sector-level-results-suggest-about-the-generalizability-of-the-main-findings"&gt;Q8. What do the sector-level results suggest about the generalizability of the main findings?&lt;/h3&gt;
&lt;p&gt;All six motivating empirical facts are consistent across the three industries studied (textiles, pharmaceuticals, and cement products). The efficiency patterns also appear in all three sectors, though with heterogeneous speeds of convergence. Pharmaceutical and cement-product sellers converge faster (88% and 95% efficient at tenure 5) than textiles sellers (68% efficient at tenure 5). The finding that relationships approach efficiency in the medium and long term holds in every industry analyzed, suggesting that the underlying mechanisms — limited enforcement and seller market power — are broadly operative rather than sector-specific.&lt;/p&gt;
&lt;h3 id="q9-how-does-the-paper-establish-that-the-standard-non-linear-pricing-model-without-enforcement-constraints-does-not-explain-the-data"&gt;Q9. How does the paper establish that the standard non-linear pricing model without enforcement constraints does not explain the data?&lt;/h3&gt;
&lt;p&gt;The paper tests whether the LE multiplier at tenure 0 (G0) is statistically distinguishable from the null hypothesis of a standard non-linear pricing model (which would imply G0 = 1 for all buyers). Based on t-statistics from the estimated distribution of G0 across seller-year markets, the null of a standard model is rejected for 86% of the markets (seller-years) in the sample. Additionally, the dynamic price discounts conditional on quantity — which are the key signature of backloading — appear only in trade-credit transactions and not in pay-in-advance ones, ruling out alternative explanations such as learning about buyer quality or demand assurance.&lt;/p&gt;
&lt;h3 id="q10-what-are-the-models-main-limitations-and-how-do-they-affect-the-counterfactual-conclusions"&gt;Q10. What are the model&amp;rsquo;s main limitations and how do they affect the counterfactual conclusions?&lt;/h3&gt;
&lt;p&gt;The author flags three principal limitations. First, buyer types are assumed fully persistent due to data constraints (only two years of invoice-level data); a Markov type structure would require longer buyer-level panels. Second, the identification strategy relies on the seller&amp;rsquo;s first-order optimality conditions and cannot recover counterfactual dynamic quantities — the counterfactuals are therefore static comparisons of per-period surplus rather than full dynamic simulations. Third, if buyers have unobserved outside options, the counterfactual efficiency results may be biased, though the direction of the bias is uncertain and depends on the distribution of types and the curvature of the return function.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Limited enforcement constraint (LE-B).&lt;/strong&gt; The paper&amp;rsquo;s central friction: because payment is post-delivery, the buyer can default and keep the goods. In the model, the contract is &amp;ldquo;default-free&amp;rdquo; only if the buyer&amp;rsquo;s post-delivery payment is weakly less than the discounted value of all future truthful net returns. The constraint is binding when this condition is tight — the buyer is on the margin of defaulting. When binding, it forces the seller to reduce current tariffs and quantities (to lower the attractiveness of default) while promising higher future quantities (to raise the continuation value).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Limited enforcement Lagrange multiplier (LE multiplier), Gt(α).&lt;/strong&gt; The shadow price on the buyer&amp;rsquo;s enforcement constraint at tenure t for a buyer at quantile α. It takes values in [0,1], equals 1 only when the enforcement constraint is slack (unconstrained buyer), and equals zero for the lowest type at all tenures. In the paper&amp;rsquo;s framework, the entire trajectory of Gt(α) across tenures encodes the history of past enforcement promises and is the key object identified and estimated to recover the dynamic distortions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Backloading.&lt;/strong&gt; The equilibrium property whereby the total surplus generated by the relationship and the buyer&amp;rsquo;s net return both increase over time. The seller achieves this by initially restricting quantities and promising growing future allocations as an enforcement device. Formally, quantities increase over time if and only if enforcement constraints are relaxed (gt+1(q) ≤ gt(q)).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Modified virtual surplus.&lt;/strong&gt; The object that replaces ordinary virtual surplus (which appears in standard non-linear pricing models) in the seller&amp;rsquo;s first-order condition. It augments standard virtual surplus by adding shadow costs for current binding enforcement constraints and subtracting corrections for past enforcement promises. Optimal quantity allocations are determined by an inverse-markup rule applied to this modified virtual surplus.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Relational agreement / self-enforced relational contract.&lt;/strong&gt; An informal long-term agreement sustained purely through the repeated interaction between the parties, without access to third-party (court) enforcement. In this paper&amp;rsquo;s setting, the seller disciplines the buyer&amp;rsquo;s opportunism exclusively through the threat of relationship termination; no legal recourse is available or used in equilibrium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantity discounts (non-linear pricing / wholesale quantity discounts).&lt;/strong&gt; Price schedules under which the unit price decreases with the quantity purchased, offered by a seller with market power. In the paper&amp;rsquo;s empirical setting, a 10% increase in quantity is associated with a 2% decrease in unit price, and these discounts appear at every relationship age. The model generates them as the incentive-compatibility requirement that ensures higher-type buyers truthfully reveal their demand.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Trade-credit.&lt;/strong&gt; Seller financing of the transaction, in which goods are delivered before payment is received. In the Ecuadorian data, approximately 65% of first-year purchases and 70–75% of purchases in mature relationships are conducted via trade-credit. Because the seller bears the full cost of buyer default, trade-credit is the financial arrangement that gives rise to the limited enforcement constraint studied in the paper.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Second-best interaction of frictions.&lt;/strong&gt; The paper&amp;rsquo;s counterfactual finding that removing a single friction (either enforcement or market power) can reduce total welfare when both frictions are present simultaneously. This occurs because the two frictions partially offset each other: enforcement constraints discipline the seller&amp;rsquo;s monopoly distortions, and market power allows the seller to price-discriminate in ways that enable enforcement in the first place. Addressing both frictions simultaneously can improve welfare, consistent with the Lipsey-Lancaster theory of second-best.&lt;/p&gt;</description></item><item><title>Tell Me Something I Don't Already Know: Learning in Low- and High-Inflation Settings</title><link>https://macropaperwarehouse.com/papers/tell-me-something-i-dont-already-know-learning-in-low-and-high-inflation-settings/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/tell-me-something-i-dont-already-know-learning-in-low-and-high-inflation-settings/</guid><description>&lt;p&gt;This paper uses randomized control trials (RCTs) applied over time in multiple countries to study whether the economic environment — specifically the level of inflation — affects how agents learn from new information. The main finding is that as inflation rose in advanced economies, both households and firms became more attentive to and informed about publicly available news about inflation, causing them to respond less to exogenously provided information about inflation and monetary policy in the RCT treatments. When agents are already well-informed about the current inflation environment (because high inflation makes it salient), additional information provision moves their beliefs less — the marginal value of information is decreasing in prior attentiveness. Complementary evidence from Uruguay (persistently high inflation) and New Zealand (persistently low inflation) confirms the cross-sectional pattern: agents in high-inflation environments have stronger priors and respond less to information treatments. The results imply that central bank communication interventions are more effective during low-inflation periods when agents are less informed.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-how-does-the-rct-design-identify-the-effect-of-the-inflation-environment"&gt;Q1. How does the RCT design identify the effect of the inflation environment?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper exploits the within-country time variation in inflation — running comparable RCT information treatments before, during, and after the 2021-2023 surge in multiple countries — to identify whether the same information treatment has different effects depending on the prevailing inflation level.&lt;/strong&gt; By holding the content of the information treatment constant and varying the macroeconomic environment, the paper isolates how environment-driven changes in agent attentiveness mediate the treatment effect.&lt;/p&gt;
&lt;h3 id="q2-why-do-agents-respond-less-to-information-when-inflation-is-high"&gt;Q2. Why do agents respond less to information when inflation is high?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;When inflation is high and salient, agents actively monitor publicly available inflation news, reducing their prior uncertainty; the Bayesian posterior shift from a given information signal is smaller when the prior is more concentrated.&lt;/strong&gt; This mechanism — decreasing marginal value of information with prior precision — means that information provision campaigns are subject to diminishing returns as the macroeconomic environment itself provides more signal.&lt;/p&gt;
&lt;h3 id="q3-what-does-this-imply-for-central-bank-communication-strategy"&gt;Q3. What does this imply for central bank communication strategy?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Central bank forward guidance and information campaigns are most effective when agents have diffuse priors — i.e., during periods of low, stable inflation when inflation is not salient to households and firms.&lt;/strong&gt; During high-inflation episodes, agents become more sophisticated but also more resistant to updating on any individual information signal, making communication a weaker tool precisely when the inflation challenge is most acute.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;prior attentiveness&lt;/strong&gt; : the degree to which agents actively monitor publicly available information about inflation; the paper shows this rises with inflation level, reducing the marginal value of additional information provided by RCT treatments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;decreasing marginal value of information&lt;/strong&gt; : the property that additional information moves beliefs less when agents already have precise priors; the mechanism explaining why RCT treatment effects are smaller in high-inflation environments.&lt;/p&gt;</description></item><item><title>Temporary Layoffs, Loss-of-Recall, and Cyclical Unemployment Dynamics</title><link>https://macropaperwarehouse.com/papers/temporary-layoffs-loss-of-recall-and-cyclical-unemployment-dynamics/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/temporary-layoffs-loss-of-recall-and-cyclical-unemployment-dynamics/</guid><description>&lt;p&gt;This paper measures and models the role of temporary layoffs (TL) in cyclical unemployment dynamics, motivating the analysis by the extraordinary surge in temporary layoffs at the onset of the pandemic recession — roughly 15% of employed workers moved to temporary-layoff status from March to April 2020. The paper documents two opposing effects of temporary layoffs on total unemployment: a stabilizing direct effect (workers on TL return to employment rapidly via recall) and a destabilizing indirect effect through &amp;ldquo;loss-of-recall&amp;rdquo; — workers initially on temporary layoff who fail to be recalled and instead transition to jobless unemployment (JL), inheriting that state&amp;rsquo;s far lower reemployment probability. A new recursive accumulation method is used to construct a time series of the stock of workers in jobless unemployment whose most recent exit from employment was to temporary-layoff status (JL-from-TL); this stock has a standard deviation 16 times that of GDP and 2 times that of total unemployment, and is a high-correlation indicator of labor market slack. A search-and-matching model with staggered Nash wage bargaining, endogenous layoff thresholds, and separate recall and new-hire channels replicates the pre-pandemic cyclical behavior of TL and JL flows. Applying the model to the pandemic recession, the paper finds that the Paycheck Protection Program (PPP) reduced employment shortfalls by roughly 2 percentage points at peak, primarily by dampening loss-of-recall — the program&amp;rsquo;s forgivable loan structure reduced firms&amp;rsquo; incentive to permanently separate workers who had been placed on temporary layoff.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a published paper based on the NBER working paper full text (w30134), AI-assisted, pending human review. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Gertler, Huckfeldt, and Trigari study two distinct features of temporary layoffs in aggregate unemployment dynamics: the well-documented stabilizing role of recall hiring, and a less-studied destabilizing mechanism they term &amp;ldquo;loss-of-recall&amp;rdquo; — the countercyclical flow of workers from temporary-layoff unemployment into jobless unemployment. Using monthly CPS data from 1979 through the pandemic period, they construct a four-state Markov transition matrix (employment, TL unemployment, JL unemployment, inactivity) and develop a novel recursive method to track the accumulated stock of jobless unemployed workers whose most recent employment exit was via temporary layoff (JL-from-TL). This stock is small on average (roughly 40% of the average TL stock) but highly volatile — its standard deviation is 16 times GDP and twice total unemployment — and strongly co-moves with total unemployment (correlation 0.93) and the vacancy-unemployment ratio (0.83). Across historical recessions: TL unemployment contributed 36.1% of the increase in total unemployment during the 1980s recessions (25.1% direct, 11.0% indirect via loss-of-recall); 17.2% during the Great Recession (8.7% direct, 8.5% indirect — nearly equal); and 98% during the pandemic recession (almost entirely direct, because PPP dampened loss-of-recall). The structural model — DMP with staggered multiperiod Nash wage bargaining, firm-specific overhead cost shocks that generate endogenous exit and temporary layoffs, and separate hiring and recall margins — captures pre-pandemic dynamics and shows that loss-of-recall amplifies unemployment persistence following recessionary TFP shocks. In the pandemic recession application, the PPP counterfactual finds that without PPP: peak unemployment would have been roughly 2 percentage points higher; jobless unemployment would have peaked at 7.0% versus 5.9% in the PPP scenario; and cumulative TL-to-JL flows would have been roughly double, amounting to 47.4% of what they would otherwise have been.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-distinguishes-temporary-layoff-unemployment-from-jobless-unemployment-in-the-data-and-why-does-the-distinction-matter-for-cyclical-dynamics"&gt;Q1. What distinguishes temporary-layoff unemployment from jobless unemployment in the data, and why does the distinction matter for cyclical dynamics?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Temporary-layoff unemployment (TL) is the state in which a CPS respondent indicates an expectation of recall — either a specific return date or a stated expectation of recall within six months — while jobless unemployment (JL) is unemployment without such an expectation; the two states have starkly different reemployment probabilities, with TL workers returning to employment at substantially higher rates than JL workers, making the composition of total unemployment — not just its level — a key determinant of unemployment persistence.&lt;/strong&gt; In the Markov transition matrix estimated from CPS data 1979-2019 (Table 2), TL is a transient state: workers on TL transition to employment at a far higher rate than workers in JL, reflecting the attached recall relationship. The stock of TL unemployment is consequently small — averaging roughly one-eighth of total unemployment — even though TL separations account for roughly one-third of all separations from employment to unemployment. The distinction matters for aggregate dynamics because a recessionary increase in TL generates both a direct, relatively transient component (elevated TL stock) and an indirect, more persistent component (heightened loss-of-recall feeding into JL stock). Standard two-state unemployment models that lump TL and JL together miss the indirect channel entirely, understating both the volatility and persistence of total unemployment in the presence of countercyclical loss-of-recall.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-recursive-accumulation-method-for-estimating-jl-from-tl-and-what-does-it-reveal-about-the-indirect-contribution-of-temporary-layoffs"&gt;Q2. What is the recursive accumulation method for estimating JL-from-TL, and what does it reveal about the indirect contribution of temporary layoffs?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper proposes a novel method to estimate the time series stock of jobless unemployed workers whose most recent employment exit was through temporary-layoff unemployment — the JL-from-TL stock — by propagating forward through the Markov transition matrix each cohort of workers who enter TL from employment, tracking the fraction that survive in any unemployment state without returning to employment, and summing across all past cohorts.&lt;/strong&gt; Formally, if $x_{t-m,t}$ denotes the distribution of workers at time $t$ whose last exit from employment was to TL at time $t-m$, then $x_{t-m,t} = \tilde{P}&lt;em&gt;t x&lt;/em&gt;{t-m,t-1}$ where $\tilde{P}&lt;em&gt;t$ is a modified transition matrix, and the JL-from-TL stock is $u^{JL,TL}&lt;em&gt;t = \sum&lt;/em&gt;{j=0}^{T} e&amp;rsquo;&lt;/em&gt;{JL} x_{t-j-1,t}$. The method requires only the Markov transition matrix — no individual-level panel data — and extends the Shimer (2012) / Elsby-Hobijn-Sahin (2015) variance decomposition approach to level decompositions. Applied to CPS data, the JL-from-TL stock has a standard deviation 16 times that of GDP (versus 2 times for TL itself) and a correlation of 0.93 with total unemployment — substantially higher than the 0.83 correlation of the vacancy-unemployment ratio with total unemployment. The large relative volatility reflects that the JL-from-TL stock compounds both the volatility of TL separations and the cyclical variation in the TL-to-JL transition probability (loss-of-recall); both components are countercyclical, so they co-amplify in recessions.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-papers-structural-model-generate-endogenous-temporary-versus-permanent-layoffs-and-a-procyclical-recall-probability"&gt;Q3. How does the paper&amp;rsquo;s structural model generate endogenous temporary versus permanent layoffs and a procyclical recall probability?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Temporary layoffs and permanent exits arise endogenously from two cost shocks in the model: an employee-specific cost shock (ϑ) and a firm-specific overhead shock (γ), with thresholds ϑ&lt;/em&gt; and γ&lt;/em&gt; determined by firm optimization; workers whose idiosyncratic cost exceeds ϑ* are placed on temporary layoff (retaining recall rights), while firms whose overhead shock exceeds γ* exit, converting their TL workers to jobless unemployment.** The framework is a modified DMP model with staggered Nash wage bargaining (following Gertler-Trigari 2009), where firms can expand their workforce either by recalling workers from TL unemployment or by hiring new workers from JL unemployment, with separate quadratic adjustment costs for each margin ($\kappa$ for new hires, $\kappa_r$ for recalls). The recall elasticity exceeds the new-hire elasticity, consistent with the lower cost of re-integrating previously attached workers. Recall hiring (xr) and new hiring (x) are both driven by the discounted value of a worker to the firm, J(w,s), but respond with different sensitivities governed by their respective adjustment cost parameters. The TL-to-JL (loss-of-recall) flow is endogenous and driven by firm exit: when the overhead shock γ exceeds γ*(w,s), the firm exits and its TL workers lose their recall option, converting to JL unemployment. Because γ* rises in bad times (higher firm insolvency), loss-of-recall is countercyclical, matching the data pattern. An exogenous loss-of-recall probability $(1-\rho_r)$ is also included to capture TL-to-JL flows that occur even when the firm survives (e.g., firm restructuring or expiration of recall expectations), and this parameter is calibrated to long-run flow moments.&lt;/p&gt;
&lt;h3 id="q4-what-does-the-calibrated-model-reveal-about-the-amplification-role-of-loss-of-recall-and-how-is-this-quantified"&gt;Q4. What does the calibrated model reveal about the amplification role of loss-of-recall, and how is this quantified?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A counterfactual exercise that sets the TL-to-JL transition probability to zero (shutting off loss-of-recall) shows that total unemployment peaks earlier and at a lower level following a recessionary TFP shock, and that unemployment displays markedly less persistence — revealing loss-of-recall as an important amplification mechanism by which a recessionary increase in temporary layoffs can generate persistently higher total unemployment.&lt;/strong&gt; The model is calibrated to monthly frequency with 16 parameters: 9 assigned externally (β=0.991^{1/3}, δ=0.025^{3}, α=1/3, standard AR(1) TFP parameters, matching function elasticity σ=0.5, bargaining power η=0.5, λ=8/9 targeting quarterly wage adjustment frequency), and 7 calibrated to long-run flow moments and business cycle volatility moments (Table 8-9). The calibrated model captures the cyclical volatility of aggregate labor market stocks and flows, and the impulse response to a negative 1% TFP shock shows a hump-shaped increase in total unemployment with TL unemployment recovering within roughly two years (due to lower recall costs) while JL unemployment recovers more slowly (due to lower job-finding rates). The countercyclical overshooting of employment-to-JL transition probabilities during the subsequent expansion reflects the procyclicality of the reservation wage — workers are less willing to accept pay cuts in good times, triggering exits from employment at the margin. The overall result is that loss-of-recall accounts for a quantitatively significant share of unemployment persistence in recessions, particularly in the later part of the sample.&lt;/p&gt;
&lt;h3 id="q5-how-is-the-model-adapted-for-the-pandemic-recession-and-what-are-the-specific-mechanisms-through-which-ppp-reduced-jobless-unemployment"&gt;Q5. How is the model adapted for the pandemic recession, and what are the specific mechanisms through which PPP reduced jobless unemployment?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;The pandemic application introduces two temporary shock processes: (i) a &amp;ldquo;virus shock&amp;rdquo; that exogenously raises the TL rate above the threshold determined by ϑ&lt;/em&gt; (capturing mandatory closures and social distancing-induced reductions in effective labor demand), and (ii) a productivity shock from social-distancing requirements; PPP is modeled as a policy that subsidizes firms&amp;rsquo; wage bills conditionally on maintaining worker-firm attachments, reducing firms&amp;rsquo; incentive to exit and thereby directly dampening the endogenous TL-to-JL (loss-of-recall) flow.&lt;/em&gt;* With these modifications the model captures the key features of pandemic labor market dynamics: the extraordinary March-April 2020 TL spike, the rapid initial recall, and the subsequent slow recovery of employment. In the PPP counterfactual (no PPP), cumulative TL-to-JL flows over the pandemic period would have been approximately double their actual levels — the model generates a 47.4% ratio of actual-to-counterfactual cumulative TL-to-JL flows, indicating PPP prevented roughly 53% of the loss-of-recall that would have otherwise occurred. At peak (six months after the shock), employment under the no-PPP counterfactual is 8.8% below pre-pandemic levels versus 6.8% with PPP — a 2 percentage point gap. Jobless unemployment peaks at 7.0% without PPP versus 5.9% with PPP. Consistent with estimates from Hubbard and Strain (2020), the estimated average monthly PPP employment gain is approximately 2.0% over the first six months, with gains of 1.57% through February 2021 before convergence toward zero.&lt;/p&gt;
&lt;h3 id="q6-what-does-the-evidence-on-reemployment-probabilities-of-workers-who-transition-from-tl-to-jl-establish-and-why-is-it-important-for-identifying-loss-of-recall"&gt;Q6. What does the evidence on reemployment probabilities of workers who transition from TL to JL establish, and why is it important for identifying loss-of-recall?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Workers in jobless unemployment who were in temporary-layoff unemployment in the previous period have reemployment probabilities virtually indistinguishable from those of the full population of jobless unemployed (Table 3), which — because JL workers have far lower reemployment probabilities than TL workers — establishes that the TL-to-JL transition is a true loss-of-recall event: the worker has genuinely lost the recall relationship and now faces the same search frictions as other permanently separated workers.&lt;/strong&gt; This finding is important for the paper&amp;rsquo;s empirical strategy because it validates the interpretation of CPS-recorded TL-to-JL transitions as genuine loss-of-recall rather than mismeasurement or recategorization without substantive change in the worker&amp;rsquo;s employment prospects. The result also implies that TL-to-JL transitions create true duration dependence in reemployment probabilities among workers initially on TL: workers who spend longer in TL unemployment are more likely to lose recall, so the average reemployment probability of the TL cohort declines with duration. This duration dependence is consistent with the model&amp;rsquo;s mechanism — exit probability rises over time as firms facing prolonged overhead cost shocks eventually breach the exit threshold — and provides a micro-level validation of the endogenous loss-of-recall channel.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;temporary-layoff (TL) unemployment&lt;/strong&gt; : the labor market state in which an unemployed worker retains an expectation of recall to the prior employer (either a specific return date or an indication of recall within six months, per CPS classification); characterized by substantially higher reemployment probabilities than jobless unemployment, accounting for roughly one-third of separations from employment but only one-eighth of the total unemployment stock due to the transient nature of TL spells.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;loss-of-recall&lt;/strong&gt; : the conversion of a temporary layoff into a permanent separation — the event by which a worker initially on TL status transitions to jobless unemployment because the prior employer exits or cannot recall; the paper&amp;rsquo;s central amplification mechanism, shown to be countercyclical (higher in recessions), to account for 8.5–11.0% of unemployment increases in pre-pandemic recessions, and to be substantially dampened by PPP during the pandemic.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;JL-from-TL stock&lt;/strong&gt; : the accumulated stock of workers currently in jobless unemployment whose most recent exit from employment was through temporary layoff — constructed via the paper&amp;rsquo;s novel recursive accumulation method; has a standard deviation 16 times GDP and 2 times total unemployment, correlates 0.93 with total unemployment, and constitutes a leading slack indicator that captures the indirect destabilizing contribution of temporary layoffs to unemployment dynamics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;recall hiring versus new-hire margin&lt;/strong&gt; : the model&amp;rsquo;s two channels through which firms can expand their workforce — recalling workers from their own TL pool (lower adjustment cost, higher recall elasticity) versus hiring new workers from the pool of jobless unemployed (higher cost); both margins respond positively to the discounted firm value J(w,s) but with different sensitivities calibrated to match the differential volatility of TL-to-E and JL-to-E transition probabilities in the CPS.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;staggered Nash wage bargaining&lt;/strong&gt; : the model&amp;rsquo;s wage rigidity mechanism (following Gertler-Trigari 2009), in which firms and workers negotiate base wages with probability (1-λ) each period; the calibrated λ=8/9 targets a wage adjustment frequency of roughly one per quarter, consistent with Taylor (1999) and Gottschalk (2005) evidence; wage rigidity — combined with the allowance for temporary pay cuts to prevent exit — is quantitatively important for replicating the observed volatility of labor market flows and stocks.&lt;/p&gt;</description></item><item><title>The Cost of Consumer Collateral: Evidence From Bunching</title><link>https://macropaperwarehouse.com/papers/the-cost-of-consumer-collateral-evidence-from-bunching/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-cost-of-consumer-collateral-evidence-from-bunching/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper estimates the shadow cost that consumers assign to pledging their primary residence as collateral, using administrative loan application and performance data from the U.S. Federal Disaster Loan (FDL) Program, which offers low-interest loans to households following natural disasters. A loan amount threshold — set at $10,000 from 2005–2007, $14,000 from 2008–2013, and $25,000 from 2014–2018 — separates uncollateralized from collateralized borrowing, with no other loan terms changing at the threshold; this sharp, discontinuous design allows the paper to use bunching estimation to identify collateral aversion. Roughly one-third of all program borrowers, and 38% of those with losses above the threshold, choose exactly the maximum uncollateralized loan amount, generating sharp mass at the threshold. Traditional bunching estimates, corroborated by two alternative approaches using household-level damage data and originally requested loan amounts, consistently find that the median borrower is willing to forgo 40–47% of their potential loan amount to avoid pledging their home as collateral, equivalent in demand terms to a 200 basis point interest rate increase. The paper also exploits threshold variation over time as an instrument for collateralization and finds that posting collateral causally reduces default rates by approximately 35%, an effect comparable in magnitude to a 100-point increase in borrower credit score, establishing that collateral substantially mitigates moral hazard in consumer lending.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-setting-and-why-does-it-cleanly-identify-the-collateral-shadow-cost"&gt;Q1. What is the setting and why does it cleanly identify the collateral shadow cost?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The Federal Disaster Loan Program creates a quasi-experimental threshold because collateral is the only loan term that changes at the $25,000 boundary — interest rate, maturity, approval probability, and all other terms remain identical on both sides.&lt;/strong&gt; Households with uninsured disaster damages (median $51,000) can borrow up to their loss amount at a fixed 2.5% rate; those requesting above the threshold must post their home as collateral. Unlike mortgage or auto loan markets, which always require collateral, and credit card markets, which never do, this program generates a setting where the binary collateral requirement is the borrower&amp;rsquo;s own choice subject only to the threshold, eliminating the standard endogeneity between contract terms and borrower risk. The authors use administrative data covering over 1 million applications from 2005 to 2018 across all 50 states.&lt;/p&gt;
&lt;h3 id="q2-how-do-the-bunching-estimates-identify-the-distribution-of-collateral-aversion"&gt;Q2. How do the bunching estimates identify the distribution of collateral aversion?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The traditional bunching estimator fits a polynomial to the distribution of loan amounts below the threshold, extrapolates it above the threshold to construct a counterfactual without the collateral requirement, and attributes the excess mass at the threshold (the &amp;ldquo;missing&amp;rdquo; mass above) to collateral aversion.&lt;/strong&gt; The bunching region spans from the threshold to the upper loan amount beyond which essentially no borrower would give up to avoid collateral; for the $25,000 threshold, this region extends to $49,900. Across all three threshold regimes, 73–78% of borrowers within the bunching region move to the threshold, and the estimated mean private value of collateral ranges from $7,944 (at the $10,000 threshold) to $18,268 (at the $25,000 threshold), representing 37–44% of ideal loan amounts. The median borrower&amp;rsquo;s collateral aversion — 39–47% depending on the threshold — is robust across all three estimation methods.&lt;/p&gt;
&lt;h3 id="q3-what-alternative-bunching-methods-does-the-paper-develop-and-what-do-they-find"&gt;Q3. What alternative bunching methods does the paper develop and what do they find?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Two alternative estimation approaches — a difference-in-bunching (DiB) estimator that compares borrowers with identical damage levels across different threshold regimes, and an originally-requested-loan estimator that uses the amount households requested before collateral salience increased — both yield median collateral aversion estimates consistent with the traditional method (40–47%), while suggesting considerably wider heterogeneity in the tails.&lt;/strong&gt; The DiB approach exploits the fact that for a household with $20,000 in damages, the same loan amount was uncollateralized under the $25,000 regime but required collateral under the $10,000 regime, enabling consumer-level identification. The originally-requested-loan method shows that 70% of eventual bunchers initially requested an amount above the threshold, with the majority of the shift occurring after meeting with a loan officer when the collateral requirement became salient. Both methods are immune to the standard counterfactual mis-specification concern of the traditional bunching approach, and they suggest the traditional estimator substantially under-predicts the proportion of highly collateral-averse borrowers at the upper end of the distribution.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-mechanism-behind-collateral-aversion-and-what-does-heterogeneity-reveal"&gt;Q4. What is the mechanism behind collateral aversion and what does heterogeneity reveal?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Collateral aversion in this setting is driven by both financial incentives and behavioral/preference factors, as evidenced by the finding that roughly 30% of borrowers already underwater on existing mortgages — who have no real equity to lose — still bunch at the threshold to avoid adding a lien on their home.&lt;/strong&gt; More creditworthy borrowers (higher credit scores, higher incomes) are actually more likely to bunch, consistent with an &amp;ldquo;advantageous selection&amp;rdquo; interpretation in which borrowers who are confident in their repayment capacity are especially averse to the stigma and risk of pledging their home. Interest rates also matter: borrowers bunch more when program interest rates are higher, suggesting that financial incentives amplify the existing aversion. The magnitude of bunching — giving up thousands of dollars of subsidized low-interest disaster recovery loans — indicates that the perceived cost of a lien on the primary residence extends far beyond the financial value of potential foreclosure.&lt;/p&gt;
&lt;h3 id="q5-how-does-collateral-causally-reduce-default-rates"&gt;Q5. How does collateral causally reduce default rates?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Using time variation in the collateral threshold as an instrument for whether a borrower&amp;rsquo;s loan is collateralized — borrowers are more likely to collateralize when the threshold is low (so their ideal loan amount exceeds the low threshold) than when it is high — the paper estimates that collateral causally reduces default rates by about 35%.&lt;/strong&gt; This local average treatment effect applies to borrowers who would collateralize under the $10,000 threshold but not under the $25,000 threshold, and the magnitude is comparable to a 100-point FICO score improvement. The finding implies that collateral requirements address genuine moral hazard in consumer lending: when a primary residence is pledged, borrowers take repayment obligations substantially more seriously, reducing the probability of strategic or precautionary default. This provides causal evidence for the classic prediction of models like Bester (1985) and Chan and Thakor (1987) that collateral mitigates information asymmetries and expands efficient credit access.&lt;/p&gt;
&lt;h3 id="q6-what-are-the-aggregate-implications-and-contribution-to-methodology"&gt;Q6. What are the aggregate implications and contribution to methodology?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In aggregate, borrowers in the program have given up more than $1.1 billion in disaster recovery loans to avoid posting collateral, indicating that collateral requirements — standard in most large consumer credit markets — impose large implicit costs that are not captured in stated interest rates.&lt;/strong&gt; The methodological contribution is threefold: the paper is among the first to apply bunching to consumer (rather than corporate) collateral; it develops two consumer-level alternative estimators that relax assumptions of the standard method and reveal greater heterogeneity in collateral aversion; and it separately identifies the moral hazard effect of collateral from adverse selection by exploiting threshold variation, extending bunching beyond tax compliance and into household finance.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;collateral shadow cost&lt;/strong&gt; : the implicit cost a borrower assigns to pledging collateral beyond the direct financial cost; measured in this paper as the maximum loan amount a borrower forgoes to avoid posting their home, identified from bunching mass at the collateral threshold in the FDL program.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;bunching estimator&lt;/strong&gt; : an estimation strategy that infers a structural parameter — here collateral aversion — from the excess density of agents at a policy threshold, using a polynomial-extrapolated counterfactual distribution to identify the missing mass above the threshold.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;difference-in-bunching (DiB) estimator&lt;/strong&gt; : a consumer-level alternative to traditional bunching estimation that uses borrowers with identical damage levels but facing different threshold regimes over time to construct a within-person counterfactual for the ideal loan amount, avoiding assumptions about the counterfactual distribution shape.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;advantageous selection&lt;/strong&gt; : the pattern in which more creditworthy, higher-income borrowers are the ones most likely to avoid collateral requirements, the reverse of the adverse selection typically assumed in collateral models; consistent with these borrowers having strong repayment intent independent of the lien.&lt;/p&gt;</description></item><item><title>The crowding-in effects of local government debt in China</title><link>https://macropaperwarehouse.com/papers/the-crowding-in-effects-of-local-government-debt-in-china/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-crowding-in-effects-of-local-government-debt-in-china/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks how changes in the &lt;em&gt;composition&lt;/em&gt; (not the size) of Chinese local government debt influence bank risk-taking, credit allocation between privately owned enterprises (POEs) and state-owned enterprises (SOEs), and local total factor productivity. The focus is a 2015 debt-to-bond swap program in which local governments were required to convert outstanding implicit debt — primarily bank loans to local government financing vehicles (LGFVs) and LGFV-issued corporate bonds — into explicitly guaranteed local government bonds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Institutional Context&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Following China&amp;rsquo;s 2008–09 fiscal stimulus, local government debt outstanding rose from 5.8% of GDP in 2006 to 22% by 2013 and reached RMB 15.4 trillion (24% of GDP) by end-2014. The debt was largely held through LGFVs, which are nominally corporate firms but with implicit government backing. Under China&amp;rsquo;s amended budget law effective early 2015, all outstanding debt had to be converted to provincial government bonds through a three-year swap program. Before the swap, government bonds accounted for only 8% of outstanding local government debt; the remaining 92% (approximately RMB 14.17 trillion) needed to be swapped. Commercial banks hold on average 88% of newly issued local government bonds; the government bond share of commercial bank assets rose from 1.7% in 2014 to 14% in 2019.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanism&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under Basel III capital adequacy ratio (CAR) regulations, Chinese commercial banks — specifically the Big Five systemically important banks using the internal-ratings-based (IRB) approach — assign risk weights above 80% on average to corporate loans, but only 20% (the regulatory approach) to local government bonds. Converting LGFV debt to government bonds therefore reduces banks&amp;rsquo; risk-weighted assets, loosening the binding CAR constraint. The paper formalizes this through a partial-equilibrium model of bank portfolio choice: a lower risk weight on government-bond assets (modeled as a fall in ξ_g) loosens an effective capital constraint, inducing banks to shift toward riskier (POE) lending and reducing the POE-SOE loan rate spread. The model predicts this effect is larger in provinces with higher initial outstanding government debt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The empirical analysis uses: (1) confidential loan-level data from one of the Big Five Chinese commercial banks covering approximately 400,000 unique firm-loan pairs from 2008:Q1 to 2017:Q4 (regression sample 2013:Q1–2017:Q4); (2) province-level outstanding debt data at end-2014 for 25 provinces, constructed from prefectural-level data collected by Qu et al. (2023); and (3) firm-level balance sheet data from China&amp;rsquo;s Annual Survey of Industrial Firms (ASIF), covering above-scale manufacturing firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Using a triple-difference (DDD) identification — interacting POE status, a post-2015 dummy, and provincial initial government debt — the paper finds:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;At the average level of provincial government debt, the debt swap program reduced the POE credit spread (loan rate deviation from benchmark rate, relative to SOEs) by approximately &lt;strong&gt;3.18 percentage points&lt;/strong&gt; (coefficient α = −3.182, significant at p &amp;lt; 0.01).&lt;/li&gt;
&lt;li&gt;For provinces with initial outstanding debt &lt;strong&gt;one standard deviation above the mean&lt;/strong&gt; (approximately 0.402 log units above mean), the swap reduced the POE credit spread by an additional &lt;strong&gt;1.15 percentage points&lt;/strong&gt; (= 0.402 × 2.849; coefficient β = −2.849, significant at p &amp;lt; 0.01), accounting for 10.1% of the standard deviation of loan rates in the sample.&lt;/li&gt;
&lt;li&gt;In terms of the raw loan rate gap between SOEs and POEs (averaging 42 basis points in the sample), the program narrowed this spread by approximately 6 basis points in high-debt provinces (one standard deviation above mean), accounting for about 1/7 of the average gap.&lt;/li&gt;
&lt;li&gt;On the extensive margin, in provinces with outstanding debt one standard deviation above the mean, the swap raised the &lt;strong&gt;probability of bank lending to POE firms&lt;/strong&gt; by approximately &lt;strong&gt;1.2 percentage points&lt;/strong&gt; (= 0.402 × 0.0292).&lt;/li&gt;
&lt;li&gt;2SLS estimates instrumenting swapped debt by initial outstanding debt interacted with the post-2015 dummy confirm: one standard deviation increase in swapped debt leads to an &lt;strong&gt;11.21% decline&lt;/strong&gt; in the POE loan rate deviation from benchmark relative to SOEs (= 3.723 × 3.013%), accounting for 0.98 standard deviations of the loan rate variable.&lt;/li&gt;
&lt;li&gt;For provincial total factor productivity (TFP), provinces with 1% higher outstanding government debt before the swap experienced a &lt;strong&gt;2.2% larger increase in TFP&lt;/strong&gt; after 2015. The debt swap amount itself (instrumented) has a positive and significant effect on provincial TFP.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions and Parallel-Trends Validation&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Pre-trend tests show that neither the average POE-SOE rate spread (α_τ) nor its interaction with provincial government debt (β_τ) is significantly different from zero in 2014 relative to the base year 2013. Both turn significantly negative only from 2015 onward, validating the parallel-trends assumption. Results are robust to: excluding LGFV firms, excluding large firms (top 10% by assets), restricting to central SOEs as controls (dropping local SOEs), controlling for local debt capacity, GDP growth, FDI/GDP, aged population, total loans, and bank branch fixed effects. A placebo test using the 2016 deleveraging policy shows no significant effect on bank risk-taking, distinguishing the debt-swap mechanism from contemporaneous policy changes.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-key-theoretical-channel-through-which-the-debt-to-bond-swap-affects-bank-lending-to-poes"&gt;Q1. What is the key theoretical channel through which the debt-to-bond swap affects bank lending to POEs?&lt;/h3&gt;
&lt;p&gt;The channel is the risk-weighting mechanism under Basel III capital adequacy ratio (CAR) regulations. Under the IRB approach used by Big Five banks, corporate loans carry average risk weights above 80%, while local government bonds carry a fixed regulatory weight of 20%. Converting LGFV corporate loans and bonds to local government bonds on the bank&amp;rsquo;s balance sheet reduces total risk-weighted assets, loosening the binding CAR constraint. The bank responds by adopting a riskier investment policy — lowering the cutoff ω̂ in the model — which increases lending to POE firms and reduces the POE-SOE credit spread.&lt;/p&gt;
&lt;h3 id="q2-why-is-the-effect-of-the-swap-predicted-to-be-larger-in-provinces-with-higher-initial-outstanding-government-debt"&gt;Q2. Why is the effect of the swap predicted to be larger in provinces with higher initial outstanding government debt?&lt;/h3&gt;
&lt;p&gt;Proposition 2 of the model shows that the sensitivity of the POE loan rate spread to the debt swap policy (∂²ΔR_loan / ∂ξ_g ∂g) is positive, meaning it increases with the amount of government debt g. Provinces with more outstanding debt at end-2014 have more LGFV loans to swap into lower-risk-weight bonds, implying a larger reduction in risk-weighted assets for banks operating in those provinces and hence a larger relaxation of the CAR constraint. Empirically, the correlation between province-level outstanding debt and the amount of swapped debt from 2015–2017 is 0.85 (p-value &amp;lt; 0.0001), confirming the mechanism.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-empirical-specification-identify-the-effect-of-the-debt-swap-rather-than-pre-existing-trends"&gt;Q3. How does the empirical specification identify the effect of the debt swap rather than pre-existing trends?&lt;/h3&gt;
&lt;p&gt;The authors use a triple-difference (DDD) design: the outcome (loan rate deviation from benchmark) is regressed on the interaction POE × Post × GovDebt, where GovDebt is the demeaned log of province-level outstanding debt at end-2014. Pre-trend analysis (Equation 16) estimates year-specific coefficients α_τ and β_τ using 2013 as the reference year. For 2014, both coefficients are statistically indistinguishable from zero. From 2015 onward, both turn significantly negative at the 95% confidence level, consistent with the debt-swap policy triggering the change and inconsistent with pre-existing differential trends by province debt level.&lt;/p&gt;
&lt;h3 id="q4-how-do-the-authors-establish-that-the-risk-taking-channel-rather-than-a-demand-side-story-drives-the-results"&gt;Q4. How do the authors establish that the risk-taking channel rather than a demand-side story drives the results?&lt;/h3&gt;
&lt;p&gt;Two complementary exercises address demand versus supply. First, the authors add firm × year-quarter fixed effects, which absorb all firm-level time-varying factors (including loan demand). After removing demand effects, the triple-difference coefficient on GovDebt × POE × Post becomes more negative (−23.66, significant at 5%) than the baseline (−2.849), suggesting demand-side movements are not the source of the finding. Second, adding bank-branch × year-quarter fixed effects to remove supply-side heterogeneity makes the triple-difference term insignificant while leaving the POE × Post coefficient at −2.196 (significant at 5%), implying the result is primarily supply-driven and province-specific supply factors captured by the triple interaction absorb into the branch-level controls.&lt;/p&gt;
&lt;h3 id="q5-what-heterogeneous-effects-across-firm-types-provide-additional-evidence-for-the-risk-taking-interpretation"&gt;Q5. What heterogeneous effects across firm types provide additional evidence for the risk-taking interpretation?&lt;/h3&gt;
&lt;p&gt;Three dimensions of heterogeneity all point toward bank risk-taking. (a) Size: the credit-easing effect (coefficient on GovDebt × POE × Post) is larger in magnitude for small POEs (by firm assets or by loan size) than for large POEs, consistent with small firms being riskier borrowers. (b) Credit rating: the effect is larger for low-rating POEs (below AA-) than for high-rating POEs, consistent with banks taking on more risk in response to a loosened CAR constraint. (c) Firm-bank distance: the effect is larger for firms located farther from the lending bank branch, where information asymmetry is more severe, consistent with increased bank risk-taking toward harder-to-monitor borrowers.&lt;/p&gt;
&lt;h3 id="q6-how-do-the-authors-confirm-that-the-debt-swap-program-is-the-operative-channel-rather-than-the-overall-regulation"&gt;Q6. How do the authors confirm that the debt swap program is the operative channel rather than the overall regulation?&lt;/h3&gt;
&lt;p&gt;Using the Bertrand-Mullainathan (2001) 2SLS approach, the authors treat the amount of swapped debt (ln(1 + Swap_jy)) as the channel variable, instrumented by GovDebt_j × Post_y (and its interaction with POE_i for the intensive-margin regression). The first-stage results are strong (F-statistics of 158–268), confirming that provinces with more initial outstanding debt swap more debt after 2015. The second-stage results show: (a) on the intensive margin, a one-standard-deviation increase in swapped debt leads to an 11.21% decline in the POE loan rate deviation from benchmark relative to SOEs; (b) on the extensive margin, provinces with more swapped debt show significantly higher probability of POE lending. Both second-stage estimates are significant, confirming the debt swap program as the transmission channel.&lt;/p&gt;
&lt;h3 id="q7-what-is-the-effect-of-the-debt-swap-on-provincial-total-factor-productivity-and-through-what-channel"&gt;Q7. What is the effect of the debt swap on provincial total factor productivity, and through what channel?&lt;/h3&gt;
&lt;p&gt;Provinces with 1% higher outstanding government debt before the swap experienced a 2.2% larger increase in average provincial TFP after 2015 (column 2 of Table 13, coefficient = 0.0220, significant at p &amp;lt; 0.01), with the parallel-trend analysis showing no significant pre-2015 differential effect (the 2014 coefficient is 0.00346, insignificant). 2SLS estimates using swapped debt as the channel variable confirm a positive, significant effect of swapped debt on provincial TFP, with a coefficient of 0.0253 (p &amp;lt; 0.01) in the second stage. The mechanism is credit reallocation from less-productive SOEs to more-productive POEs, consistent with POEs having higher average productivity as documented in Hsieh and Klenow (2009).&lt;/p&gt;
&lt;h3 id="q8-how-do-the-authors-rule-out-that-the-deleveraging-policy-implemented-in-december-2015-drives-the-results"&gt;Q8. How do the authors rule out that the deleveraging policy (implemented in December 2015) drives the results?&lt;/h3&gt;
&lt;p&gt;A placebo test replaces the Post_y dummy (equal to 1 from 2015 onward) with DeLevy (equal to 1 from 2016 onward, coinciding with the deleveraging policy). Neither the coefficient on GovDebt × POE × DeLevy nor on POE × DeLevy is statistically significant in the placebo regressions (Table 11). This distinguishes the mechanism from the deleveraging policy and confirms that the debt swap program — not deleveraging — is the source of the credit reallocation to POEs.&lt;/p&gt;
&lt;h3 id="q9-how-do-the-authors-confirm-results-are-not-driven-by-the-debt-capacity-channel"&gt;Q9. How do the authors confirm results are not driven by the debt capacity channel?&lt;/h3&gt;
&lt;p&gt;The local government debt reform also regulated debt capacity (the ratio of outstanding debt to a centrally assigned debt limit) for each local government. The authors control for the province-level debt capacity measure (DebtCap_j, the average ratio of local government debt to the debt limit in 2016–2017) alongside the baseline interaction terms. Table 9 shows the baseline results remain valid and significant after including debt capacity controls: the coefficient on GovDebt × POE × Post is −2.210 (p &amp;lt; 0.05) and the POE probability of lending result (coefficient on GovDebt × Post = 0.0277, p &amp;lt; 0.01) both hold, ruling out the debt capacity channel as the driver.&lt;/p&gt;
&lt;h3 id="q10-what-does-the-model-predict-about-the-general-relationship-between-capital-adequacy-requirements-and-bank-risk-taking"&gt;Q10. What does the model predict about the general relationship between capital adequacy requirements and bank risk-taking?&lt;/h3&gt;
&lt;p&gt;Proposition 1 establishes that tightening the capital adequacy ratio requirement (increasing ψ) leads to a safer investment policy (ω̂ increases, meaning the bank sets a higher cutoff before taking risky projects) and a lower leverage ratio. This is the benchmark: the debt swap effectively softens the constraint by reducing risk-weighted assets, analogous to lowering the effective ψ̃, which induces the opposite effect — riskier investment policy (lower ω̂) and lower POE credit spreads. The IRB approach&amp;rsquo;s property that risk weights are higher and increasing in project riskiness (ξ&amp;rsquo;(ω) &amp;lt; 0 and ξ&amp;rsquo;&amp;rsquo;(ω) ≤ 0) is essential for these comparative statics to hold.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Debt-to-Bond Swap Program (2015):&lt;/strong&gt; China&amp;rsquo;s central government program requiring local governments to convert all outstanding non-government-bond debt (primarily bank loans to LGFVs and LGFV-issued corporate bonds) into explicitly guaranteed provincial government bonds over three years starting in 2015. The program covered RMB 15.4 trillion in outstanding debt, of which 92% needed to be converted; by end-2018, approximately 90% of non-government-bond debt had been swapped.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Risk-Weighting Channel:&lt;/strong&gt; The mechanism by which the change in debt composition affects bank lending. Under Basel III&amp;rsquo;s internal-ratings-based (IRB) approach, Chinese Big Five banks assign risk weights above 80% on average to corporate loans but only 20% (the regulatory approach) to local government bonds. Swapping LGFV debt for government bonds reduces the bank&amp;rsquo;s total risk-weighted assets without changing the size of assets, loosening the binding capital adequacy ratio constraint and enabling increased lending to riskier (POE) borrowers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;POE Credit Spread:&lt;/strong&gt; Defined in the paper as the difference between the loan rate for privately owned enterprises (POEs) and that for state-owned enterprises (SOEs), measured as the percentage deviation of each loan&amp;rsquo;s interest rate from the benchmark rate set by the central bank. SOEs are treated as effectively riskless borrowers due to implicit government guarantees; POEs are the riskier counterparts. The paper tracks the POE credit spread as the primary outcome variable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Local Government Financing Vehicles (LGFVs):&lt;/strong&gt; Nominally corporate firms established by Chinese local governments to raise funds for public investment — primarily through bank loans and LGFV-issued corporate bonds (&amp;ldquo;municipal corporate bonds&amp;rdquo;). LGFVs are implicitly backed by local governments but not explicitly guaranteed, so the bank loans and bonds they issue carry higher Basel III risk weights (treated as corporate exposures) than formal government bonds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Capital Adequacy Ratio (CAR) Constraint:&lt;/strong&gt; The Basel III requirement that a bank&amp;rsquo;s equity capital exceed a minimum fraction ψ of its risk-weighted assets. For systemically important Big Five banks in China, implemented via the IRB approach for corporate loans and the regulatory approach for government bonds since 2012. In the theoretical model, the CAR constraint is binding and determines the bank&amp;rsquo;s effective leverage; relaxing it (by reducing risk-weighted assets) permits the bank to shift toward riskier lending.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Internal Ratings-Based (IRB) Approach:&lt;/strong&gt; The Basel III methodology used by the Big Five Chinese banks to calculate risk-weighted assets for corporate loan portfolios. Under this approach, the risk weight is an increasing function of credit risk (higher-risk loans receive higher weights), so the average weight on corporate loans exceeds 80%, and even high-quality loans carry weights above 50%. This contrasts with the fixed 20% regulatory weight assigned to local government bonds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Crowding-In Effect:&lt;/strong&gt; In this paper&amp;rsquo;s usage, the mechanism by which restructuring local government debt composition — specifically, replacing corporate-form LGFV debt with low-risk-weight government bonds — frees up bank capacity to extend credit to private firms (POEs) that would otherwise face higher credit spreads or loan denial. This is framed as the opposite of the standard crowding-out effect (where more government debt squeezes private credit), arising because it is the &lt;em&gt;composition&lt;/em&gt; rather than the &lt;em&gt;size&lt;/em&gt; of government debt that changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Total Factor Productivity (TFP) Reallocation Effect:&lt;/strong&gt; The paper measures provincial average TFP (using the Brandt et al. 2013 methodology) and documents that provinces with more government debt outstanding before the swap experienced larger TFP gains after 2015, attributing this to credit reallocation from less-productive SOEs to more-productive POEs. The effect is interpreted as a reduction in credit misallocation rather than within-firm productivity improvement.&lt;/p&gt;</description></item><item><title>The Earnings and Labor Supply of U.S. Physicians</title><link>https://macropaperwarehouse.com/papers/the-earnings-and-labor-supply-of-u.s.-physicians/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-earnings-and-labor-supply-of-u.s.-physicians/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; What do U.S. physicians earn, how is that earnings variation structured across geography and specialty, and how much does government healthcare payment policy shape those earnings and — through them — physicians&amp;rsquo; labor supply and long-run talent allocation?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The paper builds a novel administrative panel by merging the universe of U.S. federal individual income tax returns (2005–2017) with: the National Plan and Provider Enumeration System (NPPES) physician registry; Medicare billing records with procedure-level Relative Value Unit (RVU) rates (2012–2017); restricted-use American Community Survey responses; Social Security Administration demographic records; and medical school ranking and graduation data. The main sample covers 11.6 million physician-year observations for 965,000 unique physicians aged 20–70.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Earnings Facts.&lt;/strong&gt; In 2017, average physician total individual income was $350,000 (median $265,000); the distribution is right-skewed — the top 1% of age-40–55 physicians averages $4.0 million. Physicians in aggregate earned $297 billion in pre-tax dollars, equaling 8.6% of total U.S. healthcare spending. The age-earnings profile is steep: earnings are approximately $60,000 during residency, rise to roughly $185,000 by the early thirties, and peak near $425,000 at age 50. Business income — systematically underreported in survey data (ACS estimates are approximately $140,000 lower than tax data during peak career years, almost entirely due to non-reporting of business income) — accounts for nearly one-quarter of earnings at age 50. Earnings differ sharply across specialties: primary care physicians average $201,200 (ages 40–55), about half the sample mean, while surgeons earn roughly twice as much.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Geographic Pattern.&lt;/strong&gt; Contrary to the pattern for lawyers and workers broadly, physician earnings are not highest on the coasts. A movers-based event study (physicians who changed commuting zones once during 2005–2017) finds that roughly 70% of the cross-location income difference is driven by place rather than worker composition. A two-way fixed-effects variance decomposition reveals pronounced negative physician-location sorting: high-earning physicians tend to locate in lower-income commuting zones, while lower-earning physicians locate in higher-income areas — the opposite of the pattern for lawyers. Medicare&amp;rsquo;s relatively weak adjustment of reimbursement rates for local costs (the empirical elasticity of the Geographic Adjustment Factor to median household income is 0.09, versus 0.33 for a broader local price index) can, by the authors&amp;rsquo; estimates, account for approximately one-third of this unusual geographic earnings pattern.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Government Influence — Medicare Price Changes.&lt;/strong&gt; Using procedure-specific RVU changes as a simulated instrument for each physician&amp;rsquo;s Medicare price exposure, the authors find that a 10% increase in the Medicare price instrument leads to a 2.4% increase in professional earnings of physicians aged 40–55. The behavioral supply response is substantial: physicians bill 4.4% more RVUs (supply elasticity of 0.4 after netting out the mechanical component), of which 3.9% reflects more unique procedures and the rest a shift toward higher-paid procedures. Nearly all of the procedure-level supply increase (3.4 out of 3.8 percentage points) comes from treating additional patients rather than more frequent treatment of existing patients. Converting to pass-through: physicians retain $62 of each $100 in additional Medicare spending directly, or approximately $25 of each $100 of any insurance spending once Medicare&amp;rsquo;s documented spillover into private insurance rates is accounted for. For physicians aged 56–70, a 10% increase in earnings driven by reimbursement changes reduces retirement probability by 0.5 percentage points in that year.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Government Influence — ACA Insurance Expansion.&lt;/strong&gt; Using county-level variation in pre-ACA uninsurance rates (as of 2013) as a source of differential exposure to the ACA&amp;rsquo;s Medicaid expansions and Marketplace subsidies (in 24 states expanding Medicaid in 2014 or early 2015), the authors estimate that a 10 percentage point higher baseline uninsurance rate led to 3.9% higher physician earnings four years post-expansion. Scaling by the first stage (a 10 p.p. higher uninsurance rate translating to 4.96 p.p. higher insurance coverage post-expansion), the implied elasticity of physician earnings to the insurance rate is 0.41. The ACA expansion also reduced retirement probability — a 10 p.p. higher insurance coverage rate leads to a 1 p.p. decline in retirement probability — consistent with a medium-run retirement-to-income elasticity of approximately −1.1. In aggregate, 6% of the $110 billion in annual ACA insurance expansion spending accrued to physicians personally, slightly below their 8.6% baseline share of healthcare spending.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Talent Allocation.&lt;/strong&gt; Specialty choice is sticky and entry-restricted. The authors estimate a discrete-choice model of specialty choice using graduates of top-5 medical schools — physicians with effectively unconstrained specialty access — and an aggregate model using USMLE Step 1 score buckets as ability proxies. At the top of the ability distribution, higher specialty earnings strongly attract physicians: increasing primary care physicians&amp;rsquo; hourly income from $98 to $168 per hour (the level of medicine subspecialists) would raise the share of top-5 medical school graduates choosing primary care by approximately 20 percentage points (nearly doubling their representation in primary care). Moving down the USMLE score distribution, the earnings coefficient falls monotonically and turns negative for the lowest score groups — consistent with the model&amp;rsquo;s prediction that entry restrictions cause higher-paying specialties to displace lower-ability applicants as earnings rise, rather than simply attracting more entrants. A more modest counterfactual — raising internal medicine earnings to dermatology levels — raises the average USMLE score in internal medicine by 10 points (from 230.2 to 239.6).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; The earnings estimates are for the period 2005–2017. Pass-through estimates use a short-run price instrument; long-run pass-through may differ depending on private market spillovers and entry. The ACA analysis is restricted to 24 early-expanding states. The specialty-choice model is estimated on medical graduates entering the residency match; the extensive margin of entering medicine itself is not modeled. Health outcome effects of changing physician ability distributions are not estimated.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-level-and-composition-of-physician-earnings-in-the-tax-data-and-how-do-they-compare-to-survey-based-estimates"&gt;Q1. What is the level and composition of physician earnings in the tax data, and how do they compare to survey-based estimates?&lt;/h3&gt;
&lt;p&gt;In 2017, average physician total individual income was $350,000 and median was $265,000; the top 1% of age-40–55 physicians earned $4.0 million on average, more than twice the average of the top 5%. Business income constitutes nearly one-quarter of earnings at age 50 and is concentrated among top earners: 80% of physicians in the top 1% have business income exceeding $25,000, versus 35% overall. ACS survey data for the same physicians underestimate earnings by approximately $140,000 (roughly one-third of the administrative mean) during peak career years, driven entirely by non-reporting of business income on the extensive margin.&lt;/p&gt;
&lt;h3 id="q2-what-share-of-total-us-healthcare-spending-do-physician-earnings-represent-and-what-does-this-imply-for-policy"&gt;Q2. What share of total U.S. healthcare spending do physician earnings represent, and what does this imply for policy?&lt;/h3&gt;
&lt;p&gt;Physicians in aggregate earned $297 billion pre-tax in 2017, equaling 8.6% of total U.S. healthcare spending (approximately $913 of the average American&amp;rsquo;s $10,611 annual healthcare expenditure). After applying a 30% income tax rate, after-tax physician earnings equal approximately 6% of total healthcare spending, or roughly 1% of GDP. The authors note this provides an upper bound on the magnitude of savings available from policies aimed at reducing physician incomes as a strategy for lowering overall healthcare spending.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-age-earnings-profile-of-physicians-evolve-and-what-drives-growth-during-peak-years"&gt;Q3. How does the age-earnings profile of physicians evolve, and what drives growth during peak years?&lt;/h3&gt;
&lt;p&gt;Physician earnings average approximately $60,000 during residency, rise to roughly $185,000 by the early thirties, and peak near $425,000 at age 50, before declining gradually to approximately $270,000 in the late 60s. Growth during peak earning years (ages 40–55) is driven almost entirely by business income: average wages are approximately flat at $285,000 across this age range, while business income and the probability of filing Schedule C rise steadily.&lt;/p&gt;
&lt;h3 id="q4-how-large-and-unusual-is-the-geographic-pattern-of-physician-earnings-and-what-is-the-causal-role-of-location"&gt;Q4. How large and unusual is the geographic pattern of physician earnings, and what is the causal role of location?&lt;/h3&gt;
&lt;p&gt;Physician earnings are highest in lower-income states (not on the coasts), unlike lawyers and the broader workforce. A movers event study finds that approximately 70% of the cross-commuting-zone income difference is attributable to location rather than worker characteristics; within specialty the estimate rises to approximately 85%. A two-way fixed-effects variance decomposition (with limited-mobility-bias corrections following Andrews et al. 2008 and Kline et al. 2020) reveals pronounced negative physician-location sorting, with the corrected covariance between individual and location effects being 0.6–0.8 times the variance of location effects in magnitude but opposite in sign — a pattern that reverses to positive sorting when the same methods are applied to lawyers.&lt;/p&gt;
&lt;h3 id="q5-what-instrument-is-used-to-identify-the-causal-effect-of-medicare-price-changes-on-physician-earnings-and-why-is-it-valid"&gt;Q5. What instrument is used to identify the causal effect of Medicare price changes on physician earnings, and why is it valid?&lt;/h3&gt;
&lt;p&gt;The authors construct a physician-year &amp;ldquo;Medicare price instrument&amp;rdquo; by fixing each physician&amp;rsquo;s service mix at its 2012–2017 average and then multiplying those fixed quantities by annually-updated RVU rates, summing over services. Because the fixed quantity weights exclude behavioral responses, and because national RVU changes from CMS periodic reviews affect physicians differentially according to their pre-determined service mix, variation across physicians and over time is plausibly exogenous to individual physicians&amp;rsquo; income shocks. Year-by-specialty fixed effects absorb common specialty-level price trends.&lt;/p&gt;
&lt;h3 id="q6-what-are-the-magnitudes-of-the-earnings-and-labor-supply-responses-to-medicare-price-changes"&gt;Q6. What are the magnitudes of the earnings and labor supply responses to Medicare price changes?&lt;/h3&gt;
&lt;p&gt;A 10% increase in the Medicare price instrument raises earnings of 40–55 year-old physicians by 2.4% (reduced-form), with a 2SLS elasticity of income to billed RVUs of 0.17. The total-RVU billing coefficient of 1.437 implies a supply elasticity of 0.437 (subtracting 1 for the mechanical component). At the procedure level, a 10% price increase for a specific code leads to 3.8% more billings for that code, of which 3.4 percentage points reflects treating additional patients. For physicians aged 56–70, a 10% earnings increase reduces that year&amp;rsquo;s retirement probability by 0.5 percentage points.&lt;/p&gt;
&lt;h3 id="q7-how-does-the-aca-insurance-expansion-affect-physician-earnings-and-retirement-and-what-is-the-implied-pass-through"&gt;Q7. How does the ACA insurance expansion affect physician earnings and retirement, and what is the implied pass-through?&lt;/h3&gt;
&lt;p&gt;Counties with a 10 percentage point higher pre-ACA uninsurance rate saw 3.9% higher physician earnings by 2017 (four years post-expansion). Scaled by the first stage (4.96 p.p. higher coverage), the elasticity of physician earnings to insurance coverage is 0.41. A 10 p.p. higher insurance coverage rate leads to a 1 p.p. lower retirement probability post-expansion (medium-run elasticity of retirement to income of approximately −1.1). In aggregate, 6% of $110 billion in annual ACA expansion spending — roughly $7.1 billion, or about $8,400 per physician — accrued to physicians.&lt;/p&gt;
&lt;h3 id="q8-how-does-the-earnings-specialty-choice-relationship-vary-across-the-physician-ability-distribution"&gt;Q8. How does the earnings-specialty choice relationship vary across the physician ability distribution?&lt;/h3&gt;
&lt;p&gt;In the individual-level discrete-choice model estimated on top-5 medical school graduates (likely unconstrained in specialty choice), the coefficient on hourly earnings is 0.014. In the aggregate score-group model, the implied earnings coefficient is 0.016 for USMLE scores above 260 and declines monotonically to −0.008 for scores at or below 190. This negative coefficient for low scorers is consistent with the theoretical prediction that higher earnings attract high-ability physicians, leaving fewer slots for lower-ability applicants due to binding entry restrictions — not a reversal of preferences.&lt;/p&gt;
&lt;h3 id="q9-what-are-the-quantitative-implications-for-specialty-choice-if-primary-care-incomes-were-raised-to-subspecialty-levels"&gt;Q9. What are the quantitative implications for specialty choice if primary care incomes were raised to subspecialty levels?&lt;/h3&gt;
&lt;p&gt;Raising primary care hourly income from $98 to $168 (the level of medicine subspecialists) would increase the share of top-5 medical school graduates choosing primary care by approximately 20 percentage points (about 48% would enter primary care, versus the current share), nearly doubling their representation. Nearly half of these reallocations would come from procedural specialties. An analogous exercise raising internal medicine earnings to dermatology levels shifts the average USMLE score in internal medicine from 230.2 to 239.6 — a 10-point increase — as higher-scoring applicants displace lower-scoring ones within a fixed slot constraint.&lt;/p&gt;
&lt;h3 id="q10-what-is-the-pass-through-from-medicare-reimbursements-to-physician-earnings-and-how-does-it-compare-to-rent-sharing-elsewhere"&gt;Q10. What is the pass-through from Medicare reimbursements to physician earnings, and how does it compare to rent-sharing elsewhere?&lt;/h3&gt;
&lt;p&gt;Direct estimates imply physicians retain $62 of each $100 in additional Medicare spending. Accounting for Medicare&amp;rsquo;s documented spillover into private insurance rates (following Clemens and Gottlieb 2017), the pass-through drops to $25 per $100 of total insurance spending. The authors note this is substantially higher than the modest rent-sharing found for average workers in response to firm-level shocks (Card et al. 2018), but comparable to rent-sharing with high-skilled workers benefiting from patent rents (Kline et al. 2019).&lt;/p&gt;
&lt;h3 id="q11-can-medicares-geographic-pricing-policy-explain-the-unusual-geographic-earnings-pattern-for-physicians"&gt;Q11. Can Medicare&amp;rsquo;s geographic pricing policy explain the unusual geographic earnings pattern for physicians?&lt;/h3&gt;
&lt;p&gt;The elasticity of Medicare&amp;rsquo;s Geographic Adjustment Factor (GAF) to commuting zone median household income is 0.09, compared to 0.33 for a broader local price index. Using the authors&amp;rsquo; short-run estimate that a 10% increase in Medicare prices raises earnings by 2.4%, a counterfactual simulation shows that if the GAF-to-income elasticity rose to 0.33 (aligning Medicare rates with the general cost-of-living gradient), the geographic physician earnings pattern would more closely resemble that of lawyers. The authors estimate that the gap in Medicare&amp;rsquo;s local cost adjustment explains approximately one-third of the unusual physician earnings geography, conditional on the short-run pass-through estimate.&lt;/p&gt;
&lt;h3 id="q12-how-does-the-theoretical-model-of-specialty-choice-and-entry-restrictions-guide-the-empirical-predictions"&gt;Q12. How does the theoretical model of specialty choice and entry restrictions guide the empirical predictions?&lt;/h3&gt;
&lt;p&gt;The model features a unit mass of physicians with heterogeneous ability (Pareto-distributed) and idiosyncratic specialty preferences (exponentially distributed). Physicians choose whether to specialize in period 1; government sets reimbursement rates in period 2; physicians choose labor supply in period 3. With a fixed number of residency slots, higher specialty earnings raise the ability cutoff for entry (rationing by ability). This generates a key nonmonotonic empirical prediction: higher-ability physicians respond positively to earnings increases (choosing a specialty more frequently), while lower-ability physicians respond negatively (displaced by the shift upward in the ability cutoff). The model also implies that demand shocks are not moderated by contemporaneous entry, so incumbents capture the full rent — motivating the estimated pass-through.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Medicare Price Instrument (Simulated RVU Instrument).&lt;/strong&gt; A physician-year measure of Medicare payment exposure constructed by holding each physician&amp;rsquo;s service mix fixed at its 2012–2017 average and multiplying those fixed quantities by time-varying national RVU rates, then summing across services. This purges the instrument of behavioral responses, creating exogenous cross-physician variation in price exposure arising from the interaction of fixed service mix with national RVU policy changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Relative Value Unit (RVU).&lt;/strong&gt; The unit by which Medicare defines and reimburses each physician service in the Physician Fee Schedule. RVUs are intended to reflect the time, effort, and resources required to provide each service, but are subject to periodic review by CMS&amp;rsquo;s RVU Update Committee (RUC) and influenced by political factors. Changes in RVUs translate directly into changes in Medicare reimbursement rates for affected services.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pass-Through (Reimbursement to Earnings).&lt;/strong&gt; The share of an additional dollar of Medicare (or insurance) spending that accrues to physicians personally as earnings, after accounting for practice costs, intermediaries, and behavioral responses. The paper estimates $62 per $100 of direct Medicare spending or $25 per $100 of total insurance spending (the latter accounting for Medicare&amp;rsquo;s spillover into private rates).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Negative Physician-Location Sorting.&lt;/strong&gt; The empirical finding — robust to limited-mobility-bias corrections — that higher-ability (higher-earning) physicians disproportionately locate in lower-income commuting zones, while lower-earning physicians concentrate in higher-income areas. This is the opposite of the pattern for lawyers and for worker-firm matching in the broader labor literature. The paper attributes part of this pattern to Medicare&amp;rsquo;s incomplete geographic adjustment of reimbursement rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ability Cutoff (am) in Residency Matching.&lt;/strong&gt; In the paper&amp;rsquo;s theoretical model, the minimum ability level required to gain entry into a restricted-entry specialty. Because the number of residency slots is fixed, the cutoff rises when a specialty&amp;rsquo;s relative earnings increase (attracting more high-ability applicants), displacing lower-ability physicians who would otherwise have entered. This makes the earnings-specialty relationship nonmonotonic across the ability distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Business Income (Pass-Through Entity Income).&lt;/strong&gt; Income from physician-owned practices organized as sole proprietorships, S-corporations, or partnerships, reported on Schedule C or through pass-through entities rather than on Form W-2. In the tax data, business income accounts for nearly one-quarter of physician earnings at career peak and is the main source of earnings for top physicians, but is systematically underreported in survey data (ACS), leading to a roughly one-third underestimate of total earnings during peak years.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Geographic Adjustment Factor (GAF).&lt;/strong&gt; A Medicare policy parameter that multiplies the national RVU rate to adjust physician reimbursements for local input costs (specifically physicians&amp;rsquo; work, practice expenses, and malpractice). The paper documents that the GAF&amp;rsquo;s elasticity to local median household income is 0.09 — far below the 0.33 elasticity of the general local price index — constituting an effective subsidy to rural and lower-income markets relative to higher-income areas.&lt;/p&gt;</description></item><item><title>The Effect of Provider Diversity on Racial Health Disparities: Evidence from the Military</title><link>https://macropaperwarehouse.com/papers/the-effect-of-provider-diversity-on-racial-health-disparities-evidence-from-the-military/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-effect-of-provider-diversity-on-racial-health-disparities-evidence-from-the-military/</guid><description>&lt;p&gt;This paper asks whether racial concordance between patients and medical providers — specifically, whether Black patients are treated by Black physicians — improves use of preventive care and reduces mortality among patients with chronic, manageable diseases. The authors argue that trust and communication deficits along racial lines cause Black patients to underuse low-cost, life-saving preventive care, and that increasing the share of Black providers addresses this deficit.&lt;/p&gt;
&lt;p&gt;The authors use data from the Military Health System (MHS) Data Repository covering fiscal years 2003–2013, encompassing roughly 9.6 million beneficiaries. A distinctive feature of the MHS is that active-duty providers are themselves MHS beneficiaries, so their race is observed in the same eligibility files used for patients — overcoming the typical absence of provider-race data in claims databases. The study focuses on four chronic, deadly but manageable conditions: diabetes, hypertension, hypercholesterolemia, and clinical atherosclerotic cardiovascular disease. Preventive care is measured by medication fill-days for condition-appropriate generic drugs, HEDIS-recommended Comprehensive Diabetes Care compliance, and (for a subset) blood pressure control. Mortality is tracked across the full sample period.&lt;/p&gt;
&lt;p&gt;The identification strategy exploits quasi-random variation in provider racial composition induced by across-base moves. The MHS setting generates abundant moves driven by DoD personnel management needs — not by patient health or preferences. Using a movers-only differences specification (analogous to Finkelstein et al. 2016), the authors compare differential changes in outcomes for Black versus non-Black patients who move to bases with larger versus smaller increases in the share of Black providers. This design includes fixed effects for both sending and receiving bases, controlling flexibly for regional quality differences. The estimand is an intent-to-treat effect among patients living within 10 miles of a base (who use on-base care 66% of the time).&lt;/p&gt;
&lt;p&gt;The findings are consistent across all four disease samples. For diabetes, a move-induced one-standard-deviation increase in the share of Black diabetes providers is associated with a roughly 6 additional metformin fill-days per year (approximately 16% relative to the mean) and a 3 percentage-point increase (roughly 8% relative to the mean) in Comprehensive Diabetes Care compliance for Black relative to non-Black patients. Mortality falls by 0.4 percentage points — a 33% relative decline — for Black relative to non-Black diabetes patients following such a move.&lt;/p&gt;
&lt;p&gt;Pooling across all four chronic-disease samples, a one-standard-deviation move-induced increase in the Black provider share is associated with approximately 3 additional fill-days of relevant preventive medication and a roughly 0.2 percentage-point reduction in mortality — approximately 15% relative to the mean mortality rate — for Black relative to non-Black patients.&lt;/p&gt;
&lt;p&gt;A decomposition analysis combining the paper&amp;rsquo;s estimates with medical-literature parameters on the mortality effects of preventive medications finds that between 55% and 69% of the concordance mortality effect across the four disease samples can be attributed to improved medication adherence alone, with the remainder attributed to other aspects of the provider-patient relationship (e.g., lifestyle effects, other preventive care).&lt;/p&gt;
&lt;p&gt;Scope conditions: results are local to MHS movers, who are on average slightly younger and healthier than non-movers, potentially understating concordance benefits for the full population. The MHS covers over 3% of all Black U.S. residents, but beneficiaries may differ from the general population. The paper measures Black patient / Black provider concordance specifically; it does not establish a symmetric concordance effect for non-Black patients. The concordance effect estimated is relative — it captures how much Black patients benefit more than non-Black patients from moving to a higher Black-provider-share base. A system-wide spillover mechanism (non-Black providers improving care for Black patients when working alongside more Black providers) cannot be ruled out and would also be consistent with the core concordance motivation.&lt;/p&gt;
&lt;p&gt;Q: What is the central research question and why is the MHS an advantageous setting?
A: The paper asks whether racial concordance between providers and patients causes Black patients to use more preventive care and achieve better health outcomes, focusing on the trust and communication channel. The MHS is advantageous because active-duty providers are themselves MHS beneficiaries, making their race observable — a feature absent in most claims databases. Across-base moves are driven by DoD staffing needs rather than patient health or preferences, providing quasi-random variation in provider racial composition. The system offers complete claims data covering both on- and off-base care, allowing full mortality tracking.&lt;/p&gt;
&lt;p&gt;Q: How does the empirical strategy address selection concerns that plague prior concordance studies?
A: Prior studies face selection problems from Black patients choosing different doctors than white patients and from residential segregation concentrating Black patients and Black physicians in regions with distinct care quality. The movers-based differences specification directly addresses both problems: it uses only patients who move across bases, comparing how the same individual&amp;rsquo;s outcomes change relative to non-Black patients experiencing the same move, as a function of the move-induced change in the Black provider share. Inclusion of fixed effects for both sending and receiving bases accounts flexibly for regional quality differences. Balance tests on observable patient characteristics show no differential sorting of Black versus non-Black patients toward high-Black-provider-share bases.&lt;/p&gt;
&lt;p&gt;Q: What specific preventive care and outcome measures are used for each disease?
A: For diabetes, the primary measures are annual metformin fill-days and Comprehensive Diabetes Care (CDC) compliance — defined as receiving HbA1c testing, a retinal eye exam, and medical attention for nephropathy in the focal year — plus blood pressure control (available only from 2009 onward for on-base patients). For hypertension, the measures are annual fill-days of WHO-recommended antihypertensives (thiazides, ACEs/ARBs, or long-acting dihydropyridine CCBs) and blood pressure control. For hypercholesterolemia, the measure is fill-days of antilipemic agents, bile acid sequestrants, and statins. For atherosclerotic cardiovascular disease, the HEDIS statin therapy receipt indicator is used. Mortality is tracked across all four samples.&lt;/p&gt;
&lt;p&gt;Q: What are the main quantitative results for the diabetes sample?
A: A move-induced one-standard-deviation increase in the share of Black diabetes providers is associated with approximately 6 additional metformin fill-days annually for Black relative to non-Black patients (roughly 16% relative to the mean). Compliance with Comprehensive Diabetes Care increases by 3 percentage points for Black relative to non-Black patients (roughly 8% relative to the mean). Mortality falls by 0.4 percentage points for Black relative to non-Black patients — a 33% relative decline — in connection with the same one-standard-deviation increase in Black provider share.&lt;/p&gt;
&lt;p&gt;Q: What are the pooled results across all four chronic-disease samples?
A: Pooling across diabetes, hypertension, hypercholesterolemia, and atherosclerotic cardiovascular disease, a one-standard-deviation move-induced increase in the Black provider share is associated with approximately 3 additional preventive medication fill-days per year for Black relative to non-Black patients. The pooled mortality effect is a 0.2 percentage-point reduction — roughly 15% relative to the mean mortality rate — for Black relative to non-Black patients.&lt;/p&gt;
&lt;p&gt;Q: How much of the concordance mortality effect operates through medication adherence?
A: The decomposition combines the paper&amp;rsquo;s estimated concordance effects on medication fill-days with medical-literature estimates of the mortality impact of each additional fill-day. For the diabetes sample, increased metformin adherence (4.2 additional fill-days) explains approximately 58.8% of the 0.4 percentage-point concordance mortality effect, with the residual 41.2% attributed to other channels such as lifestyle changes or other preventive care. Across all four disease samples, the medication fill-day channel explains between 55% and 69% of the respective concordance mortality effects.&lt;/p&gt;
&lt;p&gt;Q: What specification checks do the authors conduct to validate causal identification?
A: The authors conduct five main checks. First, balance regressions show that move-induced changes in Black provider share are not differentially related to baseline patient characteristics for Black versus non-Black patients. Second, regressions of the probability of moving on initial Black provider share and its interaction with patient race yield a near-zero concordance coefficient (0.008, SE 0.023), indicating no differential sorting. Third, regressions of post-move on-base care share on the concordance interaction term yield a near-zero coefficient (0.002, SE 0.003), indicating no differential race-specific selection into on-base care. Fourth, a distance falsification test shows that concordance coefficients are near zero and statistically insignificant for patients living more than 10 miles from the base. Fifth, event-study dynamics show no pre-move divergence in preventive care adherence between Black and non-Black patients, with a positive divergence emerging only after the move to a higher Black-provider-share base.&lt;/p&gt;
&lt;p&gt;Q: How does the paper separate a concordance effect from a pure Black-physician-quality effect?
A: The paper estimates a &amp;ldquo;first stage&amp;rdquo; specification on the subsample receiving on-base care (where provider race is observed), regressing the change in the probability of visiting a Black provider on the move-induced change in Black provider density. The results show an approximately one-to-one relationship between higher Black provider availability and increased visits to Black providers for all patients, with only a modest differential by patient race. This confirms that non-Black patients also see more Black providers when Black provider density rises, allowing the interaction specification to isolate concordance from a pure physician-quality effect.&lt;/p&gt;
&lt;p&gt;Q: How do the authors assess the potential role of spillover effects?
A: The authors acknowledge they cannot rule out that some of the estimated concordance effect arises through system-wide spillovers — for instance, non-Black providers on bases with more Black colleagues may improve their care for Black patients through peer learning or information transmission. They note that even if such a spillover mechanism operates, it is still consistent with the paper&amp;rsquo;s core concordance motivation, because provider-knowledge deficiencies about treating Black patients are among the theorized channels of racial discordance.&lt;/p&gt;
&lt;p&gt;Q: What do the results imply for the overall racial mortality gap?
A: Among MHS beneficiaries aged 20–65, Black beneficiaries are roughly 38% more likely to have diabetes and die over the sample period than non-Black beneficiaries; this gap appears driven primarily by higher diabetes prevalence rather than a within-diabetes mortality gap. Applying the diabetes concordance mortality estimate (a 0.4 percentage-point reduction), the authors calculate that a one-standard-deviation increase in the Black provider share would reduce the overall diabetes mortality gap from 38% to approximately 21% — a substantial narrowing driven by the concordance effect operating through conditional-on-prevalence outcomes.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of the findings?
A: The results imply that investments in increasing physician workforce diversity could meaningfully reduce racial mortality disparities in the United States, particularly for chronic diseases manageable through preventive medication. The paper notes the results are relevant to affirmative action policies in medical school admissions, specifically the pending Supreme Court cases Students for Fair Admissions v. University of North Carolina and Students for Fair Admissions v. Harvard at the time of writing. The MHS population covered in the study includes over 3% of all Black U.S. residents, so the policy stakes extend substantially beyond the military context.&lt;/p&gt;
&lt;p&gt;Q: What are the limitations of the study regarding generalizability?
A: Movers in the chronic-disease samples are on average about four years younger and 0.2 percentage points less likely to die than non-movers, suggesting the local average treatment effect for movers may understate concordance benefits for the full population. The MHS population may be healthier overall than the general population, though conditioning on chronic-disease patients mitigates this concern. The paper covers only Black-patient/Black-provider concordance; concordance effects for other racial and ethnic groups are not estimated. The estimate of the concordance coefficient technically captures how much the Black patient / Black provider concordance effect exceeds the non-Black patient / non-Black provider concordance effect, meaning the absolute magnitude of Black concordance benefits is understated if non-Black concordance effects are also positive.&lt;/p&gt;
&lt;p&gt;Racial concordance: In this paper&amp;rsquo;s usage, the match between the race of a patient and their treating physician — specifically Black patient / Black provider pairing — theorized to improve care through trust, communication, and reduced provider knowledge deficiencies about Black patients.&lt;/p&gt;
&lt;p&gt;Provider Black share: The fraction of outpatient office visits for a given chronic condition at a given military base that are attended by Black active-duty providers, used as the base-level treatment variable; varies across bases from zero to approximately 20 percentage points in the pooled sample.&lt;/p&gt;
&lt;p&gt;Movers-based differences specification: An identification strategy that restricts to patients who relocate across military bases exactly once during the sample period and estimates the differential change in outcomes for Black versus non-Black patients as a function of the move-induced change in the base&amp;rsquo;s Black provider share, including fixed effects for both the sending and receiving base.&lt;/p&gt;
&lt;p&gt;Intent-to-treat (ITT) effect: The concordance estimate as applied to all patients living within 10 miles of a base — regardless of whether they actually received on-base care — to avoid selection bias from differential race-specific decisions to seek care on versus off base.&lt;/p&gt;
&lt;p&gt;Comprehensive Diabetes Care (CDC): A HEDIS composite measure requiring receipt of all three of the following in the focal year: HbA1c testing, a retinal eye exam, and medical attention for nephropathy (via microalbumin exam, ACE/ARB therapy, or nephropathy treatment).&lt;/p&gt;
&lt;p&gt;Medication fill-days: Annual days of supply dispensed for condition-appropriate generic medications (metformin for diabetes; thiazides/ACEs/ARBs/CCBs for hypertension; antilipemic agents, bile acid sequestrants, and statins for hypercholesterolemia; statins for atherosclerotic cardiovascular disease), used as the primary preventive care adherence measure.&lt;/p&gt;
&lt;p&gt;Decomposition of concordance mortality effect: A calculation that uses the paper&amp;rsquo;s estimated concordance effect on medication fill-days, combined with medical-literature estimates of the mortality impact per fill-day, to determine what share of the total concordance mortality effect passes through medication adherence versus other channels (lifestyle, other preventive care).&lt;/p&gt;</description></item><item><title>The Effects of an Aging Population on the Structure of Bank Assets and Liabilities</title><link>https://macropaperwarehouse.com/papers/the-effects-of-an-aging-population-on-the-structure-of-bank-assets-and-liabilities/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-effects-of-an-aging-population-on-the-structure-of-bank-assets-and-liabilities/</guid><description>&lt;p&gt;Using 2001-2022 annual data on U.S. commercial and savings banks matched with county-level demographic data, this paper shows that banks operating in areas with older populations—measured by the deposit-weighted proportion of seniors (individuals over 65) in the counties where the bank has branches—issue more retail deposits and less wholesale funding, pay relatively lower retail deposit rates with greater stickiness across maturities, and experience smaller deposit withdrawals when market interest rates rise. On the asset side, these banks hold significantly more securities and fewer loans (particularly small business and residential mortgage loans) with longer maturities, substantially raising their asset-liability maturity gap. These findings are consistent with a lifecycle model in which seniors demand risk-free retail deposits as an investment vehicle while exhibiting lower borrowing demand, combined with the localization of banks&amp;rsquo; deposit-taking and lending. The paper instruments for a bank&amp;rsquo;s senior exposure using projected county-level senior population shares constructed from historical state-level fertility rates and county-level cohort change rates by race and sex, mitigating concerns about endogenous bank location relative to contemporaneous economic conditions.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on a working paper version, AI-assisted and human-reviewed. See the linked published article for the authoritative version.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-how-is-a-banks-exposure-to-seniors-measured-and-why-is-this-measure-preferred"&gt;Q1. How is a bank&amp;rsquo;s exposure to seniors measured, and why is this measure preferred?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A bank&amp;rsquo;s &amp;rsquo;exposure to seniors&amp;rsquo; is defined as the deposit-weighted senior population share of all counties where the bank operates branches, using each county&amp;rsquo;s deposits at that bank as weights; this measure is preferred because it captures the bank&amp;rsquo;s actual demographic exposure to older depositors while accounting for the relative importance of each local market to the bank.&lt;/strong&gt; The paper instruments for this measure using projected county-level senior population shares derived from historical demographic data (state-level fertility rates by race, historical county-level cohort change rates by race and sex), which are orthogonal to the contemporaneous economic conditions that could cause population migration and confound the results.&lt;/p&gt;
&lt;h3 id="q2-how-does-senior-exposure-affect-retail-deposit-rates-and-stickiness"&gt;Q2. How does senior exposure affect retail deposit rates and stickiness?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Banks with greater senior exposure pay significantly lower interest rates on retail time deposits, and the spread between an equivalent-maturity competitive market rate and the bank&amp;rsquo;s retail deposit rate widens by more as market rates rise, indicating greater deposit rate stickiness; this effect is especially pronounced at longer maturities (24- and 60-month CDs), where seniors&amp;rsquo; preference for deposits as an investment vehicle rather than a transaction account gives banks greater market power.&lt;/strong&gt; Moreover, these banks&amp;rsquo; deposits are less likely to be withdrawn when the Federal Funds Rate rises, despite lower and slower-adjusting deposit rates, consistent with seniors&amp;rsquo; lesser sensitivity to interest rate differentials (limited recall in monitoring rates, as in Kahn, Pennacchi, and Sopranzetti 1999).&lt;/p&gt;
&lt;h3 id="q3-how-does-senior-exposure-affect-the-composition-and-maturity-of-assets"&gt;Q3. How does senior exposure affect the composition and maturity of assets?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Banks exposed to more seniors hold significantly more securities and fewer loans—particularly small business loans and residential mortgages—and choose securities and loans with much longer maturities, which substantially raises their asset-liability maturity gap.&lt;/strong&gt; The lifecycle model predicts this: in markets with older populations, the demand for loans is lower (seniors are net savers, and local businesses benefit from greater labor supply in younger areas), leaving the bank&amp;rsquo;s retail deposit surplus to be invested in securities. The long-maturity asset allocation is supported by the bank&amp;rsquo;s stable retail deposit base, which is less sensitive to market rate movements (increasing the effective duration of deposits beyond their stated maturity).&lt;/p&gt;
&lt;h3 id="q4-what-are-the-macroeconomic-implications-as-populations-age"&gt;Q4. What are the macroeconomic implications as populations age?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper&amp;rsquo;s findings predict economically important changes in banks&amp;rsquo; future asset-liability structures as U.S. populations age: aggregate bank loan-to-asset ratios should decline, security-to-asset ratios rise, retail deposit shares increase, wholesale funding shares decrease, and the banking system&amp;rsquo;s aggregate asset-liability maturity gap should widen—with corresponding implications for banks&amp;rsquo; interest rate risk exposure and the transmission of monetary policy through the bank lending channel.&lt;/strong&gt; The demographic shift is projected to continue: the U.S. share of the population over 65 is predicted to reach 22% by 2050, while the EU&amp;rsquo;s share is projected at 28% and China&amp;rsquo;s share of those over 60 is projected at 40% in 2050, making these dynamics relevant across advanced economies.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;bank exposure to seniors&lt;/strong&gt; : the deposit-weighted proportion of individuals over age 65 in the counties where a bank has branches; the paper&amp;rsquo;s key explanatory variable, capturing how much of the bank&amp;rsquo;s deposit base is drawn from an older population.
&lt;strong&gt;deposit rate stickiness&lt;/strong&gt; : the slower adjustment of retail deposit rates to changes in equivalent-maturity competitive market interest rates; greater stickiness implies a widening of the deposit rate spread as market rates rise; found here to be more pronounced for banks with higher senior exposure.
&lt;strong&gt;asset-liability maturity gap&lt;/strong&gt; : the difference between the bank&amp;rsquo;s asset average maturity and its deposit average maturity; measures the bank&amp;rsquo;s exposure to interest rate risk; found here to be significantly larger for banks with higher senior exposure due to longer-maturity assets and stable retail deposit funding.&lt;/p&gt;</description></item><item><title>The Future in Mind: Aspirations and Long-Term Outcomes in Rural Ethiopia</title><link>https://macropaperwarehouse.com/papers/the-future-in-mind-aspirations-and-long-term-outcomes-in-rural-ethiopia/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-future-in-mind-aspirations-and-long-term-outcomes-in-rural-ethiopia/</guid><description>&lt;p&gt;This paper tests whether a light-touch behavioral intervention targeting aspirations can produce persistent economic effects on a poor rural population. The research question is whether changing how poor people perceive their future opportunities — by raising aspirations — alters their investment decisions in ways that persist over a multi-year horizon. The authors conduct a randomized controlled trial in Doba, a remote mountainous district in rural Ethiopia roughly 380 kilometers from Addis Ababa, selected partly because its extreme isolation meant residents had almost no exposure to television or media, making even a single video screening a memorable event.&lt;/p&gt;
&lt;p&gt;The sample consists of 1,152 households (2,112 individuals) across 64 villages. Households were randomly assigned to one of three conditions: a treatment group shown four 15-minute documentaries featuring real rural individuals from similar communities who escaped poverty through goal-setting and hard work; a placebo group shown an Ethiopian entertainment comedy with no aspirational content; and a within-village control group who were only surveyed. Both the household head and spouse in treatment and placebo groups were invited to attend. Compliance was very high, with only 2 percent of individuals not complying with their assigned condition. Data were collected at baseline (2010), six months after screening (2011), and five years after baseline (2015–2016). Attrition was notably low: 96 percent of households were re-interviewed at the five-year endline, and 94 percent of individual respondents.&lt;/p&gt;
&lt;p&gt;Five years after the screening, treated households show meaningfully larger investment across three domains relative to the control group, with all headline results significant at 5 percent or less and robust to multiple hypothesis testing. First, on agricultural effort and investment: treated household heads and spouses work approximately one extra hour per day on their own farms (roughly 8.6 percent of the control mean per spouse). Treated households are 10 percentage points more likely to have adopted modern crop inputs (improved seeds, inorganic fertilizer) and 10 percentage points more likely to have invested in modern livestock inputs (feed, veterinary supplies). Holdings of productive tools are 20 percent higher than in the control group. Second, on educational investment: treated households spend approximately 36 percent more on children&amp;rsquo;s schooling than the control group. Among children who were of school-going age at the time of the intervention (aged 11–15 then, 16–20 at endline), the number completing full primary school is nearly double the control rate (0.16 per household versus 0.07 in the control). Third, on living standards: treated households experienced 0.33 to 0.38 fewer months of food insecurity in the previous year. Their holdings of consumer durables (furniture, kitchenware, phones) are 29 percent higher than the control group in value. Estimated house values are 27 percent higher. However, there is no statistically significant effect on measured food or frequent non-food consumption expenditure, a finding the authors interpret as consistent with households continuing to divert resources toward future-oriented investments rather than current consumption.&lt;/p&gt;
&lt;p&gt;The intervention&amp;rsquo;s effects appear to operate primarily through aspirations — defined in this paper as desired goals for the future that motivate investment and effort. Treated households report significantly higher aspirations and expectations for income, assets, and children&amp;rsquo;s education five years later. By contrast, the paper finds no persistent changes in time preferences, risk preferences, grit, or beliefs about returns to technology. Locus of control shifted six months after the intervention but did not persist to the five-year endline, and the authors argue that if locus of control were the operative mechanism, investment effects would also have dissipated. The placebo group shows no significant effects relative to the control, ruling out screening exposure or social attention as mechanisms.&lt;/p&gt;
&lt;p&gt;The paper is explicit about scope conditions. The study area was deliberately chosen for its extreme remoteness and media isolation, and the authors caution that this may have amplified the intervention&amp;rsquo;s salience and persistence relative to less isolated populations. External validity beyond comparable settings is uncertain. A back-of-the-envelope cost-effectiveness calculation finds that increases in durable asset holdings alone outweigh intervention costs by a factor of approximately two at reasonable scale.&lt;/p&gt;
&lt;p&gt;Q: What was the intervention and what made it distinct from other role model studies?
A: Treated households were invited to watch four 15-minute documentary films featuring real rural individuals from similar socioeconomic backgrounds who had escaped poverty through goal-setting, perseverance, and hard work. The films were produced in Oromiffa, the local language, and featured two male and two female role models depicting achievable actions such as installing irrigation or starting a small business. Unlike studies that vary exposure to in-person mentors or peers, participants received no ongoing mentorship, financial resources, or support of any kind beyond the single video screening, isolating the aspirations channel from material or informational transfers.&lt;/p&gt;
&lt;p&gt;Q: How were aspirations measured and validated?
A: Aspirations were measured using locally validated survey instruments (Bernard and Taffesse, 2014) that asked respondents what level of annual income, asset wealth, and oldest child&amp;rsquo;s education they would like to achieve in their lifetime. Test-retest reliability over two weeks produced within-respondent correlations of 0.77 to 0.98 across domains, which the authors benchmark against Angrist and Krueger (1999) standards for reliable income and education measures. The measures correlated in expected directions with wealth: mean income aspirations in the upper wealth tercile were 1.5 times those in the lower tercile, and asset aspirations in the upper tercile were 1.9 times those in the lower tercile.&lt;/p&gt;
&lt;p&gt;Q: What were the five-year effects on agricultural effort and investment?
A: Treated household heads and spouses worked approximately half an hour more per day each on their own farms relative to control, implying roughly one extra hour per day across the typical household&amp;rsquo;s adult members — an 8.6 percent increase over the control mean. Treated households were 10 percentage points more likely to have adopted modern crop inputs and 10 percentage points more likely to have invested in modern livestock inputs. Holdings of productive tools were 20 percent higher in value than in the control group. The overall agricultural investment index increased by 0.21 standard deviations relative to the control and 0.18 standard deviations relative to the placebo.&lt;/p&gt;
&lt;p&gt;Q: What were the five-year effects on children&amp;rsquo;s education?
A: Among children aged 16 to 20 at endline (who were 11 to 15, upper primary school age, at the time of the intervention), the number per household completing full primary school nearly doubled: 0.16 in the treatment group versus 0.07 in the control. These children in treated households also spent on average 33 minutes more per day attending school than the control group. Across all children, schooling expenditures in the treatment group were 36 percent higher than in the control and 30 percent higher than in the placebo. The education index increased by 0.25 standard deviations relative to the placebo and 0.21 standard deviations relative to the control.&lt;/p&gt;
&lt;p&gt;Q: Why did consumption expenditure not increase despite improvements in assets and food security?
A: The authors argue that the consumption result is theoretically ambiguous: if treated households continue to divert resources toward future-oriented investments (savings, productive assets, durable goods, housing), intertemporal substitution effects could offset income effects within the five-year observation window. The measured consumption variables — food and frequent non-food spending — do not capture the service flow value of accumulated durables or housing improvements, both of which increased substantially. The authors interpret this as evidence that households were still in an investment phase rather than having converted accumulated wealth into current consumption by endline.&lt;/p&gt;
&lt;p&gt;Q: What evidence supports aspirations as the operative mechanism rather than alternative channels?
A: The treatment group had significantly higher aspirations and expectations for income, assets, and children&amp;rsquo;s education at the five-year endline, while the placebo group did not. Measured time preferences, risk preferences, grit, and beliefs about returns to technology were all statistically unchanged for treated households. Locus of control shifted six months post-intervention but did not persist to five years, and the authors note that if locus of control were the driver, investment effects would also have dissipated alongside it. The null placebo effect rules out screening exposure, social attention, or information salience from outside facilitators as mechanisms.&lt;/p&gt;
&lt;p&gt;Q: How were locus of control and fatalistic beliefs assessed in this population?
A: The sample scored twice as high as Western samples on the classic Levenson (1981) fatalism scale. On the Feagin (1975) scale of perceived causes of poverty, the sample was more likely to attribute poverty to structural or fatalistic explanations than Western samples, and both measures of fatalistic beliefs were higher among poorer households within the sample. The study region&amp;rsquo;s worldview — rooted in traditional Waaqeffannaa religion, local variants of Orthodox Christianity (Fekade Egziabher), and Islam (Qadar) — emphasizes deference to authority, predestination, and resistance to change, providing qualitative grounding for the aspirations deficit being targeted.&lt;/p&gt;
&lt;p&gt;Q: What were the effects on food insecurity and subjective wellbeing?
A: Treated households reported 0.33 fewer months of food insecurity in the previous year relative to the control group (from a base of 2.71 months in the control), and 0.38 fewer months relative to the placebo. Treated participants scored approximately a quarter of a step higher on the Cantril ladder of self-reported wellbeing than the control group. There was no significant difference on the USDA food insecurity questionnaire, which the authors attribute to that scale&amp;rsquo;s unsuitability for households that consume largely from own production.&lt;/p&gt;
&lt;p&gt;Q: What were the effects on durable goods and housing?
A: Treated households reported 29 percent higher value of consumer durables (furniture, kitchenware, phones) than the control group and 32 percent higher than the placebo. Estimated house replacement values were 27 percent higher than the control and 21 percent higher than the placebo. Enumerators directly observed that treated households were more likely to have their own toilet facility, though this result was not significant relative to the placebo. There were no effects on the probability of having a non-organic roof, which the authors note is an especially expensive upgrade.&lt;/p&gt;
&lt;p&gt;Q: How does the paper rule out spillover effects from treated to control households?
A: The authors collected data on a supplementary sample of non-treated villages to serve as a &amp;ldquo;pure control&amp;rdquo; and used this to run a suggestive test for spillovers from treated households to untreated households within the same village. They found little evidence of large spillover effects, although they acknowledge limitations in the power of these tests. The physical design of the screenings — held in rooms with shuttered windows, requiring tickets for entry, conducted separately from placebo screenings — also minimized contamination during the intervention itself.&lt;/p&gt;
&lt;p&gt;Q: What were the early (six-month) results and what do they suggest about the timing of effects?
A: At six months, the shorter follow-up found increases in savings and investment in education, consistent with behavioral change beginning soon after treatment. Aspirations showed positive but noisier effects at immediate post-screening and six-month follow-ups, which the authors interpret as consistent with aspirations increasing gradually as people experiment with alternative futures (Appadurai, 2004) or as demotivating beliefs shift incrementally (Carvalho et al., 2023), rather than changing abruptly. This gradual pattern is consistent with a learn-by-doing dynamic where small initial investments generate returns that further raise aspirations.&lt;/p&gt;
&lt;p&gt;Q: How does this study&amp;rsquo;s attrition and follow-up compare to the literature?
A: The five-year attrition rate was very low: 96 percent of baseline households were re-interviewed and 94 percent of individual respondents. The authors cite Bouguen et al. (2019) as a benchmark, noting this is a high tracking rate relative to recent long-run RCT follow-ups in low- and middle-income countries. The low attrition strengthens confidence that endline estimates are not contaminated by selective dropout.&lt;/p&gt;
&lt;p&gt;Q: What is the cost-effectiveness of the intervention?
A: A back-of-the-envelope calculation indicates that increases in durable asset holdings alone outweigh the costs of the intervention by a factor of approximately two at reasonable implementation scale. The authors present this as a proof-of-concept estimate, not a full social cost-benefit analysis, and caution that cost-effectiveness may differ in settings with higher baseline media exposure or less extreme isolation.&lt;/p&gt;
&lt;p&gt;Q: What are the key scope conditions limiting external validity?
A: The study district (Doba) was chosen specifically for its extreme remoteness: at baseline, only 11 percent of respondents watched TV at least weekly and no household owned a television. The authors argue this isolation likely made the screening event especially salient and memorable, potentially amplifying effects relative to what would be expected in less isolated contexts. They are explicit that the findings represent a proof of concept for the aspirations mechanism and that effect magnitudes should not be assumed to replicate in settings with higher baseline media exposure or different cultural belief systems.&lt;/p&gt;
&lt;p&gt;Aspirations: Defined in this paper as desired goals for the future that motivate investment and effort in order to attain them (following Bandura, 1977; Locke and Latham, 1990). Measured via validated survey instruments asking respondents the level of income, assets, or children&amp;rsquo;s education they would like to achieve in their lifetime — distinct from expectations (what one expects to achieve) and from the village maximum (what one believes the most successful person in the village could achieve).&lt;/p&gt;
&lt;p&gt;Aspirations gap: The difference between an individual&amp;rsquo;s aspired level of income, assets, or education and their current reported level. Median aspirations gaps in the sample are 55 percent of median wealth aspirations and 58 percent of median income aspirations, indicating that aspirations exceed current levels by meaningful but not unrealistic margins.&lt;/p&gt;
&lt;p&gt;Capacity to aspire: Drawn from Appadurai (2004), defined as a navigational capacity — the ability to read and navigate a map of a journey into the future. In contexts of poverty, this capacity is described as more brittle because poorer individuals have narrower social networks, fewer role models, and less material slack for experimentation with alternative futures.&lt;/p&gt;
&lt;p&gt;Role model: A real individual from a similar socioeconomic background whose documented experience of escaping poverty through goal-setting and effort provides vicarious experience that allows audience members to imagine what is possible for people like them. Role models are most effective when their success appears attainable and when the steps to achieve it are visible.&lt;/p&gt;
&lt;p&gt;Zero-sum beliefs: The belief that gains for one individual come at the expense of others in the community, documented in the study area as part of a broader fatalistic, deterministic belief system. These beliefs can suppress effort and future-oriented investment by making individual advancement appear normatively transgressive or materially impossible.&lt;/p&gt;
&lt;p&gt;Source text origin: A classification in the paper&amp;rsquo;s pipeline framework distinguishing whether a summary is based on a full working paper PDF or HTML text versus abstract-only text. Abstract-only summaries are blocked as they miss scope conditions, quantitative results, and the full argument structure.&lt;/p&gt;
&lt;p&gt;Placebo group: Households randomly invited to watch an Ethiopian comedy entertainment program (with no aspirational content) rather than the role model documentaries. Used to separate the effect of the aspirations content from the effects of the screening event itself, exposure to outside facilitators, or social attention accompanying selection for the intervention.&lt;/p&gt;</description></item><item><title>The Gender Pay Gap: Micro Sources and Macro Consequences</title><link>https://macropaperwarehouse.com/papers/the-gender-pay-gap-micro-sources-and-macro-consequences/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-gender-pay-gap-micro-sources-and-macro-consequences/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper uses linked employer-employee data from Brazil (RAIS, 2007–2014, covering 267 million worker-years, 56 million unique workers, and 607,000 employers) to document that the gender pay gap of 13.3 log points is overwhelmingly driven by women sorting into lower-paying employers — 78.7% of the gender gap in employer pay fixed effects is attributable to between-employer sorting, not within-employer discrimination. To interpret this sorting, the authors develop an equilibrium on-the-job search model (extending Burdett and Mortensen 1998) with endogenous firm pay, amenities, and hiring, and provide a constructive proof that all model parameters are point-identified from linked employer-employee data. The estimated model finds that amenities explain approximately half of total compensation for both genders (mean amenity share 48.8% for men, 52.2% for women), that compensating differentials account for roughly half of the gender pay gap (reducing it from 13.3 to 4.6 log points in total-compensation terms), and that higher-ranked employers offer women higher amenities rather than higher pay — resolving the puzzle that women disproportionately work at large employers despite a flat employer-size-pay gradient for women. Eliminating gender differences in employer preferences (gender wedges) would raise output by 12.9% but pull women into low-amenity firms, reducing their welfare, while equal-pay and equal-hiring policies close part of the pay gap but lower worker welfare through adverse incentive effects on firms&amp;rsquo; compensation and hiring decisions.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a published paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-empirical-patterns-motivate-the-papers-framework"&gt;Q1. What empirical patterns motivate the paper&amp;rsquo;s framework?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Three facts from Brazilian linked employer-employee data require a richer model than standard pay-only frameworks: (i) 78.7% of the 11.3 log-point gender gap in employer pay fixed effects is a between-employer sorting gap (women work at lower-paying firms); (ii) pay is increasing in employer size for men (R² = 3.3%) but essentially flat for women (R² = 0.1%); and (iii) women are disproportionately concentrated at the largest employers, which is inconsistent with models in which large firms pay more if pay is all that matters.&lt;/strong&gt; These three facts together reveal that women value employer attributes other than pay, particularly at larger firms. Direct amenity proxies confirm this: women at larger employers are substantially less likely to be exposed to workplace hazards (coefficient −0.013, p &amp;lt; 0.01), less likely to be fired unjustly (coefficient −0.005, p &amp;lt; 0.01), much more likely to receive generous parental leave (coefficient 1.054, p &amp;lt; 0.01), and more likely to work part time. The AKM two-way fixed effects decomposition further shows that employer fixed effects account for 12.5% of the variance of log earnings for men and 11.1% for women, with the variance of earnings explained at 92.3% (men) and 93.1% (women).&lt;/p&gt;
&lt;h3 id="q2-what-is-the-equilibrium-model-and-how-does-it-generate-compensating-differentials"&gt;Q2. What is the equilibrium model and how does it generate compensating differentials?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The model extends Burdett-Mortensen on-the-job search to allow firms to simultaneously choose wages, amenities, and vacancies, with firms differing in three dimensions: productivity p, gender wedges τ (an implicit tax on employing women capturing taste-based discrimination or comparative advantage), and gender-specific amenity cost shifters ca,0 — making firm pay, amenities, and hiring jointly determined in equilibrium.&lt;/strong&gt; Workers maximize flow utility x = w + a (wage plus amenity value), and each gender climbs a separate firm utility ladder. Firms with higher composite productivity p̃ = (1−τ)p + a* − c(a*) offer higher utility to attract more workers given convex vacancy posting costs. Because amenity costs are convex and increasing in amenity value, firms optimally set amenities so that the marginal cost equals one (the unit wage), creating endogenous compensating differentials: high-amenity firms can pay lower wages while still attracting workers. The model is isomorphic to a standard wage-only Burdett-Mortensen model with wages replaced by flow utility and productivity replaced by composite productivity.&lt;/p&gt;
&lt;h3 id="q3-how-are-all-model-parameters-identified-constructively"&gt;Q3. How are all model parameters identified constructively?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The authors provide a five-step constructive identification proof that recovers all parameters — including the unobservable amenity values, gender wedges, and productivity distribution — without distributional assumptions: (1) gender-specific employer pay components from AKM; (2) employer utility ranks from the employer size distribution (higher-utility firms are larger in equilibrium); (3) labor market flow hazards (λU, λE, λG, δ) from worker flow data conditional on ranks; (4) firm-level parameters (p, τ, ca,0) by inverting equilibrium profit functions; (5) economy-wide parameters (cv,0, ηv, ηa) from aggregate labor share, firm pay-profit gradient, and aggregate amenity cost share.&lt;/strong&gt; The key insight for step (4) is that unobserved firm profits per matched worker can be inferred from equilibrium firm sizes (more profitable firms post more vacancies and hire more workers), and comparing utility levels inferred from sizes with observed wages identifies amenity values. For step (3), the involuntary job offer hazard λG is separately identified because job-to-job transitions involving a decline on the utility rank ladder — which cannot be voluntary (workers strictly prefer higher utility) — must be involuntary, allowing the hazard to be estimated by counting down-rank transitions.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-estimated-structural-results-on-amenities-and-the-pay-amenity-tradeoff"&gt;Q4. What are the estimated structural results on amenities and the pay-amenity tradeoff?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Amenities are pervasive and quantitatively large: the mean amenity share of total compensation is 48.8% for men and 52.2% for women, yet compensating differentials explain the lion&amp;rsquo;s share of firm pay dispersion, with utility dispersion accounting for only 4.4% of pay dispersion for men and 3.6% for women — far less than pay dispersion alone might suggest.&lt;/strong&gt; Higher-ranked firms for men mostly offer higher pay, but higher-ranked firms for women mostly offer higher amenities. The estimated gender productivity gap is 8.3 log points (employment-weighted mean log productivity 0.864 for men, 0.781 for women), and the employment-weighted mean gender wedge is 0.059 for women but 0.235 for men (wedge represents an implicit disutility from hiring women, so higher means women face higher wedge on average in firms where they are less likely to work). Estimated labor market parameters show women receive fewer job offers from nonemployment (λU_F = 9.1% monthly vs. 10.4% for men) and have lower job destruction rates (δ_F = 2.8% vs. 3.6% for men), contributing to slower job-ladder climbing.&lt;/p&gt;
&lt;h3 id="q5-how-does-the-paper-decompose-the-gender-pay-gap-into-micro-sources"&gt;Q5. How does the paper decompose the gender pay gap into micro sources?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Shutting down firm heterogeneity in amenities — replacing gender-specific amenity values with their mean — closes 45% of the gender pay gap, largely because women relocate toward formerly male-dominated, higher-paying, lower-amenity firms; shutting down differences in employer preferences (gender wedges) eliminates the pay gap entirely; differences in labor market flow rates have little effect.&lt;/strong&gt; The total-compensation gender gap, which accounts for amenity differences, is only 4.6 log points — 40.7% of the raw pay gap of 11.3 log points — confirming that compensating differentials explain approximately half of the measured pay disadvantage. This decomposition is a novel contribution over Card et al. (2016), who rationalized the gap through exogenous gender-specific bargaining parameters without modeling amenities or their equilibrium provision.&lt;/p&gt;
&lt;h3 id="q6-what-are-the-macro-consequences-of-the-gender-pay-gap-for-output-and-welfare"&gt;Q6. What are the macro consequences of the gender pay gap for output and welfare?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Removing all gender differences from the economy (wedges, amenity costs, and flow rates) raises output by 6.1% and welfare by 2.1%, substantially below what pay differences alone might suggest; however, eliminating employer preferences over gender (gender wedges only) raises output by 12.9% at the cost of a welfare reduction for women, because women are pulled into high-paying, low-amenity firms.&lt;/strong&gt; The quantitative wedge between output gains (12.9%) and welfare gains when wedges are removed reveals that women&amp;rsquo;s sorting into amenity-rich firms is partly welfare-enhancing from their perspective, even if it involves accepting lower wages. This is a key insight for policy: policies targeting pay gaps without accounting for amenity losses can be welfare-reducing.&lt;/p&gt;
&lt;h3 id="q7-what-do-equal-pay-and-equal-hiring-policies-achieve-in-equilibrium"&gt;Q7. What do equal-pay and equal-hiring policies achieve in equilibrium?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Both equal-pay mandates (forcing firms to pay men and women identical wages) and equal-hiring mandates (requiring gender-neutral hiring) close part of the gender pay gap but lower worker welfare for both genders, because the policies generate adverse incentive effects: equal-pay mandates induce firms to reduce amenities for women (since the wage-amenity tradeoff is disrupted), and equal-hiring mandates distort firms&amp;rsquo; recruiting decisions in ways that raise vacancy costs.&lt;/strong&gt; These general-equilibrium effects would be missed in partial-equilibrium analyses. The paper thus provides a rigorous case that equal-treatment policies — while closing observable pay gaps — fail to achieve the underlying welfare gains from eliminating gender differences, and may generate unintended welfare losses.&lt;/p&gt;
&lt;h3 id="q8-how-does-the-model-resolve-the-employer-size-puzzle-and-what-discriminatory-mechanisms-does-it-admit"&gt;Q8. How does the model resolve the employer-size puzzle and what discriminatory mechanisms does it admit?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The flat employer-size-pay gradient for women (versus steeply increasing for men) is rationalized in the model because large employers offer women high amenities that substitute for pay; women optimally accept lower wages at large employers in exchange for amenity bundles that are unavailable at smaller firms.&lt;/strong&gt; The model accommodates three discrimination channels simultaneously: taste-based discrimination (Becker 1971, via the gender wedge τ), compensating differentials reflecting gender-specific job characteristics (Rosen 1986, via amenity cost shifters), and monopsony power (Robinson 1933, via search frictions). Even nondiscriminatory firms treat women differently than men as a best response to the equilibrium distribution of discriminatory firms — an equilibrium spillover of discrimination that purely partial-equilibrium analyses miss.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;gender wedge (τ)&lt;/strong&gt; : a firm-level parameter capturing the implicit disutility cost per unit of female employment, encompassing taste-based discrimination (Becker 1971) and comparative-advantage differences (Goldin 1992); estimated to explain substantial variation in women&amp;rsquo;s employment shares across firms, with female managers, routine manual tasks, and smaller size associated with lower wedges (R² = 54.6%).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;compensating differentials&lt;/strong&gt; : the wage reduction a worker accepts in exchange for favorable non-wage job attributes (amenities); in this paper, estimated to explain approximately half of the gender pay gap — the total-compensation gap is 4.6 log points vs. a pay gap of 11.3 log points — implying that women&amp;rsquo;s lower wages partly reflect their preference for amenity-rich employers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;amenity share&lt;/strong&gt; : the fraction of total compensation (wages plus amenities) attributable to non-wage job attributes; estimated at 48.8% for men and 52.2% for women, indicating that amenities are quantitatively as important as wages in total compensation for both genders.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;employer rank&lt;/strong&gt; : a revealed-preference ordering of employers by gender-specific utility offered to workers, identified by the employer size distribution (larger firms are higher-utility in equilibrium); the paper&amp;rsquo;s key object for separating the between-employer sorting component of the pay gap from the within-employer component.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;composite productivity (p̃)&lt;/strong&gt; : the model&amp;rsquo;s reduced-form measure of a firm&amp;rsquo;s profitability per worker, combining raw productivity p, the gender wedge τ, and the optimized amenity net of amenity costs; allows the equilibrium to be analyzed as a standard Burdett-Mortensen model with composite productivity replacing raw productivity and flow utility replacing wages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;involuntary job offer hazard (λG)&lt;/strong&gt; : the arrival rate of unsolicited job offers that workers must accept regardless of utility ranking, capturing spousal relocations and other idiosyncratic transitions; identified from the frequency of utility-rank-decreasing job transitions, since voluntary transitions can only increase utility.&lt;/p&gt;</description></item><item><title>The Geography of job creation and job destruction</title><link>https://macropaperwarehouse.com/papers/the-geography-of-job-creation-and-job-destruction/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-geography-of-job-creation-and-job-destruction/</guid><description>&lt;p&gt;This paper asks why unemployment rates differ so persistently across local labor markets, and what role job creation and job destruction play in generating those differences. The authors document a comprehensive set of spatial labor market facts using administrative and survey microdata from Germany, the United States, and the United Kingdom, then build and calibrate a quantitative theoretical framework that accounts for all documented regularities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and scope.&lt;/strong&gt; For Germany, the authors use administrative data from the German employment office (universe of vacancies and unemployed, 1999–2020) and the IAB social security sample (SIAB, 2% of all workers, 2000–2017) aggregated to 194 commuting zones. For the U.S., they use BLS Local Area Unemployment Statistics (2000–2019) at commuting zones, CPS worker flows at metropolitan areas, and JOLTS vacancy data for the 18 largest MSAs (covering roughly 40% of the U.S. labor force). For the UK, they use Nomis data and Jobcentre Plus vacancy records (2004–2006) for 378 Local Authority Districts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical findings.&lt;/strong&gt; Spatial unemployment rate differences are large and highly persistent. In Germany, the correlation of local unemployment rates across commuting zones over a 19-year span is 0.84 (West) and 0.77 (East). In the U.S., the correlation between 2000 and 2019 unemployment rates is 0.81; in the UK it is 0.76. In all three countries, local labor markets with lower unemployment are tighter (more vacancies per unemployed worker) and less productive. Firms in low-unemployment markets fill vacancies more slowly — in Germany, vacancy duration ranges from approximately 35 days in high-unemployment locations to approximately 65 days in low-unemployment locations, roughly an 85% difference.&lt;/p&gt;
&lt;p&gt;A formal steady-state decomposition reveals that across all three countries, differences in job-separation rates account for approximately two-thirds of the cross-sectional variation in unemployment rates, while differences in job-finding rates account for roughly one-third. Specifically: Germany 62.4% separations / 33.2% job-finding; U.S. 72.0% / 32.8%; UK 64.3% / 35.8%. This primacy of separation rates in the cross-section stands in stark contrast to business-cycle dynamics, where job-finding rates account for 50–60% of unemployment fluctuations (Fujita and Ramey, 2009).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Theory.&lt;/strong&gt; The authors embed a Diamond-Mortensen-Pissarides (DMP) model with endogenous separations — following Den Haan, Ramey, and Watson (2000) — into a Rosen-Roback spatial equilibrium framework. Locations differ in exogenous productivity; workers and firms are freely mobile; cost-of-living differences sustain the spatial equilibrium. The model is calibrated to the U.S. median-unemployment labor market (separation rate 0.0128, job-finding rate 0.2368, vacancy-filling rate 0.7365) plus the productivity differential between the 5th and 95th percentile unemployment locations (4.8% higher and 3.0% lower productivity than median, respectively). The baseline model, imposing the Hosios condition, matches the spatial patterns of separation rates, job-finding rates, tightness, vacancy duration, wages, and cost of living without targeting most of these. The decomposition in the calibrated baseline model attributes 33.5% of spatial unemployment variation to job-finding rates, compared to 32.8% in the data.&lt;/p&gt;
&lt;p&gt;The baseline model generates a counterfactual upward-sloping Beveridge curve and cannot explain why job-finding rates dominate business-cycle fluctuations. Introducing on-the-job search (with 12% of employed workers searching each period, calibrated from Faberman et al., 2017) resolves both problems. In the extended model, job-to-job transition rates are virtually constant across local labor markets (matching the data) but strongly procyclical over the business cycle. This asymmetry amplifies the response of vacancies and job-finding rates to aggregate productivity shocks while muting the cyclical variation in separation rates. The extended model&amp;rsquo;s business-cycle decomposition attributes 54.4% of unemployment volatility to job-finding rates, within the empirical 50–60% range.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy implications.&lt;/strong&gt; Under the Hosios condition, the decentralized equilibrium is efficient — large spatial differences in unemployment, tightness, and wages are efficient outcomes, not signs of mismatch. The relevant policy benchmark is not deviation of tightness from the national average but deviation from the model&amp;rsquo;s location-specific prediction conditional on local productivity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the central empirical puzzle the paper addresses?&lt;/strong&gt;
A: Spatial unemployment differences are large and persistent — in Germany, unemployment rates ranged from 1.9% to 11.9% across commuting zones even after 15 years of decline. These differences are not well understood theoretically, and the crucial missing empirical piece was data on job creation and vacancy filling across locations, which this paper provides for three countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How large and persistent are cross-sectional unemployment differences in each country?&lt;/strong&gt;
A: In Germany, commuting-zone unemployment ranged from 3.6% to 24.0% in 2000 and persisted with a 19-year correlation of 0.84 (West) and 0.77 (East). In the U.S., the 2000–2019 correlation is 0.81, with unemployment as low as 1.5% and as high as 16.9% in 2000. In the UK, the 2004–2018 correlation is 0.76, with 2004 unemployment ranging from 1.8% to 13.1%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What do the data show about the relationship between unemployment and labor market tightness across locations?&lt;/strong&gt;
A: In all three countries, lower-unemployment labor markets are tighter — they have more vacancies per unemployed worker. This is documented for Germany using the universe of registered vacancies, for the U.S. using JOLTS data for 18 large MSAs, and for the UK using Jobcentre Plus administrative data. The relationship holds after controlling for local labor market composition (age, gender, education, occupation, industry shares).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What do vacancy-filling rates look like across locations, and how large are the differences?&lt;/strong&gt;
A: Vacancy-filling rates are lower in low-unemployment (tight) labor markets. In Germany, the monthly probability of filling a vacancy is approximately 50% higher in high-unemployment markets than in low-unemployment markets. Completed vacancy duration ranges from about 35 days in high-unemployment locations to about 65 days in low-unemployment locations — a difference of approximately 85%. The UK data show a strikingly similar elasticity of vacancy-filling rates with respect to unemployment rates to Germany.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the formal decomposition reveal about the sources of spatial unemployment differences?&lt;/strong&gt;
A: In a steady-state two-state decomposition, separation rates account for 62.4% (Germany), 72.0% (U.S.), and 64.3% (UK) of cross-sectional unemployment variation, while job-finding rates account for 33.2%, 32.8%, and 35.8%, respectively, with small residuals. This consistently assigns primary importance to separation rates across all three countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why is the primacy of separation rates in the cross section surprising, and what literature does it contrast with?&lt;/strong&gt;
A: The business-cycle literature (Fujita and Ramey, 2009; Shimer, 2012) finds that job-finding rate variation accounts for 50–60% of unemployment fluctuations over the cycle, roughly twice the contribution of separation rates. The spatial pattern is the mirror image: separations dominate. Any credible theory of spatial unemployment must rationalize both patterns simultaneously — a challenge the paper explicitly takes up.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the baseline DMP model with endogenous separations generate the spatial patterns?&lt;/strong&gt;
A: Higher-productivity locations feature higher match surpluses. Higher surplus induces more vacancy creation and tighter markets, raising job-finding rates and lowering vacancy-filling rates. Crucially, a higher surplus means idiosyncratic shocks must be more negative to make the joint surplus negative, so fewer matches dissolve — separation rates are lower. The calibrated model reproduces the 32.8% job-finding / ~67% separation decomposition without targeting it (model yields 33.5% job-finding).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the calibration targets and key parameter values in the baseline model?&lt;/strong&gt;
A: The model is calibrated monthly to the U.S. economy. Median-unemployment-location targets: separation rate 0.0128, job-finding rate 0.2368, vacancy-filling rate 0.7365. Productivity targets: the 5th-percentile-unemployment location is 4.8% more productive than median, and the 95th-percentile-unemployment location is 3.0% less productive. Key calibrated values include matching elasticity alpha = 0.4711 (equal to worker bargaining power under Hosios), matching efficiency m = 0.4371, vacancy posting cost kappa = 0.3070, and flow nonmarket value z = 0.9072.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the two shortcomings of the baseline model, and how does on-the-job search resolve them?&lt;/strong&gt;
A: The baseline model generates a counterfactual upward-sloping Beveridge curve and cannot generate the asymmetry between cross-sectional and business-cycle drivers of unemployment. Adding on-the-job search (fraction phi = 0.12 of employed workers searching, calibrated from Faberman et al., 2017) resolves both. It corrects the Beveridge curve by allowing the model to match the spatial vacancy-unemployment relationship, and it introduces procyclical job-to-job mobility that amplifies the cyclical response of job-finding rates while dampening cyclical separation rate variation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How do job-to-job transition rates differ across space versus over the business cycle, and why does this matter?&lt;/strong&gt;
A: Job-to-job rates are virtually constant across the cross-section of local labor markets (the extended model is calibrated to match this). But they are strongly procyclical — high in booms, low in recessions, about as volatile as job-finding rates over the cycle. In a boom, more employed workers search, spurring vacancy creation, which raises both vacancy-filling probability (making vacancies easier to fill) and job-finding probability for the unemployed, amplifying the cyclical job-finding rate response while muting the cyclical separation rate response.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the extended model predict for business-cycle dynamics?&lt;/strong&gt;
A: The model with on-the-job search and aggregate productivity shocks (parameterized following Hagedorn and Manovskii, 2008) generates unemployment and vacancy rates that are an order of magnitude more volatile than productivity — matching the data. Labor market tightness is about twice as volatile as unemployment, as in the data. The Fujita-Ramey decomposition in the model attributes 54.4% of unemployment volatility to job-finding rates, which falls within the empirical range of 50–60%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the paper&amp;rsquo;s efficiency result and its policy implication?&lt;/strong&gt;
A: Under the Hosios condition (imposed in calibration), the decentralized equilibrium is efficient: job creation and destruction are privately efficient in each market, and free mobility of workers and firms ensures efficient spatial allocation. Therefore, large observed differences in unemployment, tightness, and wages across locations are not evidence of inefficiency. The relevant signal for policy is not deviation from the national average but deviation from the model&amp;rsquo;s location-specific prediction conditional on productivity. Locations where data deviate from model predictions are candidates for policy intervention.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Do the spatial patterns survive controls for worker and firm composition?&lt;/strong&gt;
A: Yes. The authors regress labor market tightness and vacancy-filling rates on local unemployment rates and a full set of composition controls (age, gender, education, occupation, and industry shares) derived from the IAB microdata for Germany, along with year fixed effects. The relationship between local unemployment and both tightness and job-filling rates remains highly statistically and economically significant after these controls, for both Germany and the U.S.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the model handle wages and cost of living, and does it match the data?&lt;/strong&gt;
A: Wages are determined by state-contingent generalized Nash bargaining with worker bargaining power eta. Cost-of-living differences are backed out as the values needed to sustain the spatial equilibrium (Rosen-Roback). Neither wages nor costs of living are calibration targets in the cross section, yet the model closely matches the empirically observed wage gradient across local labor markets and the negative correlation between cost of living and local unemployment (using Economic Policy Institute Family Budget Calculator data).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Labor market tightness:&lt;/strong&gt; The ratio of vacancies posted in a local labor market to the number of unemployed workers in that market; the paper documents that tightness is systematically higher (more vacancies per unemployed worker) in lower-unemployment locations across all three countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Job-separation rate (EU rate):&lt;/strong&gt; The share of employed workers who transition from employment to unemployment in a period; in the paper&amp;rsquo;s framework, this is endogenously determined by the idiosyncratic match productivity threshold below which the joint match surplus turns negative, and it is the primary driver of spatial unemployment differences (accounting for roughly two-thirds of cross-sectional variation).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Job-finding rate (UE rate):&lt;/strong&gt; The share of unemployed workers who transition from unemployment to employment in a period; in the paper&amp;rsquo;s framework, this is higher in tighter (lower-unemployment) markets, but accounts for only roughly one-third of spatial unemployment variation — the opposite of its dominant role in business-cycle fluctuations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Spatial Beveridge curve:&lt;/strong&gt; The cross-sectional relationship between vacancy rates and unemployment rates across local labor markets; in the data it is downward sloping (low-unemployment locations have both high vacancies and low unemployment), which the baseline model fails to capture but the extended model with on-the-job search reproduces.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous separation threshold:&lt;/strong&gt; The location-specific minimum idiosyncratic match productivity below which the joint match surplus becomes negative and the worker-firm pair dissolves; this threshold is lower (tolerates a wider range of idiosyncratic shocks) in higher-productivity locations because the average surplus is larger, generating lower separation rates in more productive locations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Spatial equilibrium (Rosen-Roback):&lt;/strong&gt; The equilibrium condition in which differences in local costs of living adjust to make workers and firms indifferent across locations, sustaining persistent productivity-driven differences in wages and unemployment as equilibrium outcomes rather than disequilibrium phenomena.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Procyclical on-the-job search:&lt;/strong&gt; The mechanism by which the fraction of employed workers actively searching — and thus the rate of job-to-job transitions — is approximately constant across the cross-section of local labor markets but strongly procyclical over the business cycle. This asymmetry is the key to reconciling why job-finding rates drive business-cycle unemployment variation while separation rates drive spatial unemployment variation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Hosios condition:&lt;/strong&gt; The parametric restriction equating the unemployment elasticity of the matching function (alpha) and the workers&amp;rsquo; Nash bargaining weight (eta); when satisfied, job creation is efficient in every local labor market. The paper imposes this condition deliberately to demonstrate that the decentralized equilibrium is efficient despite large spatial differences in outcomes.&lt;/p&gt;</description></item><item><title>The Hitchhiker's Guide to Markup Estimation: Assessing Estimates From Financial Data</title><link>https://macropaperwarehouse.com/papers/the-hitchhikers-guide-to-markup-estimation-assessing-estimates-from-financial-data/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-hitchhikers-guide-to-markup-estimation-assessing-estimates-from-financial-data/</guid><description>&lt;p&gt;Using matched data combining physical production records (EAP survey) and financial statements (FARE) for 147,403 firm-years across 18 two-digit manufacturing industries in France (2009–2019), the paper audits the production-approach markup estimator — μ = α × (PY/WV) — when α must be estimated from revenue rather than quantity data. The central analytical result is that revenue-based markup estimates are positively correlated with true markups whenever the estimated output elasticity falls strictly between zero and the revenue elasticity (the Bond et al. 2021 knife-edge case). For firms sharing the same production function, the correlation is exactly one — revenue markups rank firms identically to quantity markups up to an additive constant. Monte Carlo simulations in an Atkeson–Burstein (2008) oligopoly model with translog production confirm this: correlation between revenue and true markups is 0.94 (SD 0.05) across 200 simulations with 1,600 firms, 180 markets, and 40 periods. In the French matched data, within-sector Pearson correlations between quantity and revenue markup estimates are 0.61 in levels and 0.80 in first differences; rank correlations reach 0.62 and 0.83 respectively, with medians above 0.65 and 0.84 across sectors. Cross-sectional relationships between markups and profit rates, labor shares, material shares, and market shares are qualitatively robust across estimation methods (Table VI). However, average log markup levels differ sharply: 0.37 (quantity) vs 0.13 (revenue); aggregate French manufacturing markups average 1.45 (quantity) vs 1.08 (revenue). The policy implication: revenue-based financial data are adequate for studying markup dispersion, trends, and firm-level correlates; they cannot identify markup levels.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-analytical-result-about-when-revenue-data-can-identify-true-markups"&gt;Q1. What is the core analytical result about when revenue data can identify true markups?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The production-approach markup estimator μ_it = α_it × (P_it Y_it / W_t V_it) recovers true markups from revenue data if and only if the estimated output elasticity α̂ equals the true quantity elasticity; when α̂ equals the revenue elasticity instead, estimated markups are identically one; in the empirically relevant intermediate case where α̂ lies strictly between these extremes, revenue markups are positively correlated with true markups.&lt;/strong&gt; The key insight is that standard estimators (ACF, LP, OLS) applied to revenue data produce an α̂ that lies strictly between zero and the revenue elasticity (which is less than the true output elasticity for firms with market power), so revenue markup estimates are neither correct nor uninformative. For firms sharing the same production technology — the common case in Cobb-Douglas with industry-level coefficients — revenue markups equal true markups up to an additive constant, implying a within-industry correlation of exactly one. The Bond et al. (2021) result (revenue markups ≡ 1) arises only in the knife-edge case where ACF converges exactly to the revenue elasticity; under the empirically dominant case of partial revenue elasticity contamination, the correlation is high but the level is biased downward.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-paper-relate-to-the-bond-et-al-2021-critique-and-when-does-that-critique-apply-exactly"&gt;Q2. How does the paper relate to the Bond et al. (2021) critique, and when does that critique apply exactly?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Bond et al. (2021) showed that if the ACF procedure is applied to revenue data and converges to the revenue elasticity, then estimated markups are identically one regardless of true markups; the present paper demonstrates this is a knife-edge special case, not a generic property of revenue-based estimation.&lt;/strong&gt; The Bond et al. result requires that the estimated α̂ equals exactly the revenue elasticity r = α/(1+α·(μ−1)/μ) × (1/price index correction) — a relationship that holds only under specific functional form and market structure assumptions. In the Atkeson–Burstein Monte Carlo (translog production, oligopolistic competition), the ACF estimator applied to revenue data produces α̂ = 0.29 against true βv = 0.32 and revenue elasticity that is strictly lower — placing the estimate in the intermediate zone. The general principle: when α̂ is strictly between zero and the revenue elasticity, revenue markups are informative; when α̂ converges to the revenue elasticity, they are not.&lt;/p&gt;
&lt;h3 id="q3-what-do-monte-carlo-simulations-in-the-atkesonburstein-model-show-about-the-reliability-of-revenue-markup-estimates"&gt;Q3. What do Monte Carlo simulations in the Atkeson–Burstein model show about the reliability of revenue markup estimates?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Across 200 simulations with 1,600 firms organized in 180 markets over 40 periods, using a translog production function, the correlation between revenue-based ACF markup estimates and true quantity-based markups is 0.94 (SD 0.05), and all distributional moments of the markup distribution are well-recovered from financial data.&lt;/strong&gt; Table I reports the estimated variable input elasticity: true βv = 0.32, quantity estimate = 0.32, revenue-ACF estimate = 0.29 — the small downward bias in the elasticity generates a small downward bias in the markup level but does not destroy the cross-sectional ranking. Table II reports the markup distribution correlations: revenue vs. true = 0.94 (SD 0.05) vs. quantity vs. true = 1.00 (SD 0.01). Cross-sectional regression coefficients of revenue markups on quantity markups are 0.85 in levels and 0.99 in first differences; all moments (mean, standard deviation, median, interquartile range) are well-estimated from revenue data. The simulations thus confirm the analytical result: revenue data preserves the cross-sectional ranking of markups and their time-series variation, even though the level is biased downward.&lt;/p&gt;
&lt;h3 id="q4-what-does-the-matched-french-production-and-financial-statement-data-show-about-within-sector-correlations"&gt;Q4. What does the matched French production-and-financial-statement data show about within-sector correlations?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In the French manufacturing data (EAP physical production records matched to FARE financial statements, 2009–2019, 147,403 observations, 18 two-digit industries), within-sector Pearson correlations between quantity and revenue markup estimates are 0.61 in levels and 0.80 in first differences; rank correlations reach 0.62 (levels) and 0.83 (first differences), with medians above 0.65 and 0.84 respectively across sectors.&lt;/strong&gt; Table III documents the divergence in average output elasticities: 0.54 (quantity) vs. 0.40 (revenue), with quantity exceeding revenue in 16 of 18 industries and an average gap of 38%. Table IV shows the implied log markup averages: 0.37 (quantity, SD 0.23) vs. 0.13 (revenue, SD 0.16) — a large level gap but smaller dispersion gap. Table V reports within-sector correlations: Pearson 0.61 (levels) and 0.80 (first differences); rank 0.62 and 0.83. Binned scatter regressions confirm: slope 0.89 (levels) and 0.92 (first differences). These within-sector correlations are what matter for studying which firms have relatively higher or lower markups, and they are high enough to support revenue-based research on markup dispersion, cyclicality, and firm-level correlates.&lt;/p&gt;
&lt;h3 id="q5-are-the-relationships-between-revenue-markup-estimates-and-other-firm-characteristics-reliable"&gt;Q5. Are the relationships between revenue markup estimates and other firm characteristics reliable?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Table VI shows that the associations of markup estimates with profit rate, labor share, material share, and market share are qualitatively robust across revenue and quantity estimation methods — the sign and relative magnitude of correlations are preserved, even though the level of markups differs.&lt;/strong&gt; This result validates using financial data for the dominant research application of production-approach markup estimation: studying the firm-level determinants and correlates of markups. The positive correlation between markups and profit rates, the negative correlation with labor shares, and the positive correlation with market shares all survive the switch from quantity to revenue-based estimation. The key caveat is that the magnitudes of these correlations are attenuated under revenue estimation due to the downward bias in markup levels, but the qualitative patterns are preserved with statistical significance.&lt;/p&gt;
&lt;h3 id="q6-what-do-aggregate-french-manufacturing-markup-trends-show-and-what-is-the-papers-policy-conclusion"&gt;Q6. What do aggregate French manufacturing markup trends show, and what is the paper&amp;rsquo;s policy conclusion?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Aggregate French manufacturing markup trends are qualitatively well-captured by revenue-based estimates (Figure 5), but the average level differs substantially: 1.08 (revenue) vs. 1.45 (quantity) — a gap of 37 log points.&lt;/strong&gt; The time-series evolution of the markup distribution (aggregate mean, trends, cyclical variation) is reliably estimated from financial statements. This implies that the empirical literature documenting rising markups over the past decades — primarily using revenue-based financial data — has correctly identified the direction and approximate magnitude of markup trends, even if the levels are underestimated. The paper&amp;rsquo;s policy conclusion is precise: revenue data are adequate for studying markup dispersion across firms, trends over time, and correlates with other firm characteristics; they are not adequate for measuring the absolute level of markups or for calibrating models that require accurate markup levels (e.g., models of aggregate price-setting or optimal tax policy).&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;production-approach markup estimator&lt;/strong&gt; : the method pioneered by Hall (1986, 1988) and formalized by De Loecker and Warzynski (2012) that estimates firm-level markups as μ = α × (PY/WV), where α is the output elasticity of the variable input V, P is output price, Y is output, and W is input price; requires physical quantity data in principle but is widely applied to revenue (PY) data from financial statements.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;revenue elasticity&lt;/strong&gt; : the elasticity of firm revenue with respect to variable input, which equals the true output elasticity divided by a markup adjustment term; equals the output elasticity only for perfectly competitive firms; lies strictly below the output elasticity for firms with market power; when the estimated α̂ converges to the revenue elasticity, the production-approach estimator returns markups identically equal to one (the Bond et al. 2021 result).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bond et al. (2021) knife-edge&lt;/strong&gt; : the special case in which the ACF production function estimator applied to revenue data converges to the revenue elasticity — causing estimated markups to be identically one and uninformative about true markups; the present paper shows this is not a generic property but a specific functional-form restriction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;within-sector cross-sectional rank&lt;/strong&gt; : the ordering of firms within an industry by their estimated markups; the statistic most reliably preserved when using revenue rather than quantity data, because within-industry production function homogeneity causes revenue markups to equal true markups up to an additive constant, so ranks are identical analytically.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;level vs. dispersion distinction&lt;/strong&gt; : the paper&amp;rsquo;s central empirical finding that revenue markup estimates reliably measure within-industry dispersion, trends, and rankings (the first moment of changes and cross-sectional variation) but cannot recover the absolute level of markups (the mean log markup is 0.13 from revenue vs. 0.37 from quantity in French manufacturing, and aggregate average is 1.08 vs. 1.45).&lt;/p&gt;</description></item><item><title>The housing wealth effect: Quasi-experimental evidence</title><link>https://macropaperwarehouse.com/papers/the-housing-wealth-effect-quasi-experimental-evidence/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-housing-wealth-effect-quasi-experimental-evidence/</guid><description>&lt;p&gt;This paper estimates a causal housing wealth effect on consumption using a quasi-natural experiment in Stockholm, Sweden. The identification exploits an unanticipated political decision — announced in September 2007 — to renew the operating contract of Bromma Airport through 2038, reversing a long-standing expectation of closure by 2011. Because the decision resulted from opaque political bargaining and was widely characterized as a political coup by opposition parties, the announcement was genuinely unexpected. The negative externality of continued airport operations (primarily aircraft noise exceeding 70 decibels within a mapped contour) capitalized locally into house prices within one quarter of the announcement. Using difference-in-differences on all single-family house transactions in Stockholm Municipality from 2004 to 2012, the authors estimate a house price decline of 19.4 percent for dwellings within 1,000 meters of the noise contour relative to those farther away (t-statistics above 5; robust to control variables and sample period). Co-op apartment prices show no statistically significant response, consistent with greater structural noise insulation in multi-story concrete buildings.&lt;/p&gt;
&lt;p&gt;The consumption outcome is new car purchases, observed at quarterly frequency in a registry-based household panel covering all Stockholm residents, with balance sheet information (loan-to-value ratios, bank deposits, mortgage types) and GIS-located residences. The paper focuses on the intensive margin — the log value of new cars purchased conditional on a purchase — since no effect is found on the extensive margin (probability of buying). A two-sample IV approach yields a short-run elasticity of 0.39: homeowners near the noise contour reduce the value of new cars purchased by 7.7–8.5 log points relative to homeowners farther away. Converting to a marginal propensity for expenditures (MPX): conditional on purchasing a new car, the car MPX is 2.5 cents per dollar of housing wealth lost; scaling by the annual new-car purchase rate of 0.049 per household yields an aggregate new-car MPX of 0.12 cents per dollar per year. Including a symmetry assumption for used cars raises the overall car MPX to 0.38 cents per dollar per year.&lt;/p&gt;
&lt;p&gt;Heterogeneity analysis reveals that the collateral channel dominates the pure wealth channel. Homeowners with loan-to-value ratios above 50 percent respond almost twice as strongly as those below (elasticities of 0.526 versus 0.269). Homeowners with below-median bank deposits respond with an elasticity of 0.694, roughly five times larger than those with larger deposits. The financing data show that 47 percent of a new car&amp;rsquo;s value is financed with credit on average, of which 71 percent takes the form of mortgage debt; however, households with high LTV ratios borrow one-third less per dollar of car value, almost entirely through reduced mortgage use.&lt;/p&gt;
&lt;p&gt;A calibrated life-cycle model (quarterly, ages 30–85, Cobb-Douglas preferences over non-durables and cars, long-term fixed-rate mortgage, adjustment costs for cars and mortgages, information friction) replicates the empirical findings. In simulation, a 19.4 percent permanent house-price shock reduces new car values purchased by 6.1 log points on average over the first four quarters, implying an elasticity of 0.31 and a new-car MPX of 0.20 cents per dollar — close to the empirical 0.12 cents and within the 95 percent confidence interval. The model decomposes the response: the collateral effect accounts for 93 percent of the car MPX and 83 percent of the total MPX in the first four quarters; the pure wealth effect accounts for the remainder. The model further shows that full information awareness would roughly double the one-year response, and that smaller shock magnitudes, shorter measurement windows, and crisis-era credit conditions (where more households are already at borrowing limits) each amplify estimated MPXs — helping account for the wide range of estimates (0.12 to 2.3 cents per dollar) in prior literature.&lt;/p&gt;
&lt;p&gt;The identification is validated by dose-response monotonicity with distance to the noise contour, placebo tests showing no response for apartment owners or renters, and absence of income effects or differential moving behavior in the treatment group.&lt;/p&gt;
&lt;p&gt;Q: What is the quasi-experiment and why is it well-suited for identifying housing wealth effects?
A: The Stockholm municipality unexpectedly renewed Bromma Airport&amp;rsquo;s operating contract through 2038 in September 2007, reversing a broadly held expectation that the airport would close by 2011. The decision emerged from closed-door political negotiations and was denounced as a political coup by opposition parties, making it genuinely unanticipated. Because the shock is geographically contained within the airport&amp;rsquo;s noise contour, it is unrelated to macroeconomic conditions and unlikely to generate general equilibrium feedback. The authors also verify that no differential income effects, tax changes, or other policies affected the treatment versus control groups over the study window.&lt;/p&gt;
&lt;p&gt;Q: How large is the estimated house price effect, and how precisely is it measured?
A: Dwellings within 1,000 meters of the noise contour experienced a price decline of 19.4 percent relative to dwellings farther away (baseline estimate, longer sample period). The estimate is highly significant with t-statistics above 5 in all specifications and is robust to the inclusion of rich property-level controls; adding controls changes the pre-crisis estimate only trivially (from -21.4 to -21.3 percent). Co-op apartment prices show no statistically significant response across all specifications, consistent with better structural insulation of multi-story concrete buildings.&lt;/p&gt;
&lt;p&gt;Q: What is the main consumption response finding?
A: Homeowners near the noise contour reduce the log value of new cars purchased by 7.7–8.5 log points relative to homeowners farther away (reduced form, intensive margin). There is no detectable effect on the extensive margin — the probability of purchasing a new car changes by only 0.029 percentage points per quarter against a baseline of approximately 1.2 percent per quarter. Two-sample IV yields an elasticity of 0.39 (statistically significant at 1 percent), meaning a 1 percent decline in house prices leads to a 0.39 percent reduction in new car values among purchasers.&lt;/p&gt;
&lt;p&gt;Q: What does the elasticity of 0.39 imply for the marginal propensity to spend on cars?
A: Conditional on purchasing a new car, the car MPX is 2.5 cents per dollar of housing wealth lost (calculated as 0.393 × 19.4% × SEK 250,000 average car value, divided by SEK 774,060 housing wealth loss). Scaling by the annual new-car purchase frequency of 0.049 per household yields an aggregate new-car MPX of 0.12 cents per dollar per year. Assuming an equal response for used cars, the overall car MPX is 0.38 cents per dollar per year. These estimates are substantially smaller than Mian et al. (2013)&amp;rsquo;s 1.8–2.3 cents per dollar, a discrepancy the model helps explain.&lt;/p&gt;
&lt;p&gt;Q: What is the role of the loan-to-value ratio in shaping the consumption response?
A: Homeowners with LTV ratios above 50 percent respond almost twice as strongly (elasticity 0.526) as those with LTV below 50 percent (elasticity 0.269). The financing data confirm the mechanism: on average 71 percent of car-purchase borrowing takes the form of mortgage debt, but households with high LTV ratios borrow one-third less per dollar of car value, with the difference almost entirely attributable to reduced mortgage use. This pattern is consistent with binding borrowing constraints preventing high-LTV households from extracting home equity for collateral.&lt;/p&gt;
&lt;p&gt;Q: What is the role of liquid savings (bank deposits) in the response?
A: Homeowners with bank deposits below the median respond with an elasticity of 0.694, roughly five times larger than homeowners with larger deposits (elasticity approximately 0.139). This heterogeneity is consistent with deposits serving as a buffer stock that allows wealthier households to smooth consumption without altering borrowing behavior after a wealth shock.&lt;/p&gt;
&lt;p&gt;Q: What does the quantitative model find about the relative importance of the collateral channel versus the pure wealth effect?
A: In the first four quarters following the shock, the collateral effect accounts for 93 percent of the car MPX response and 83 percent of the total expenditure MPX; the pure wealth effect accounts for only 7.5 percent of car MPX and 19 percent of total MPX over the same horizon. Over a longer horizon of 20 quarters, the collateral channel remains dominant at 69 percent of the car baseline, while the wealth effect rises to 32 percent. For non-durable consumption, the short-run collateral effect is 81 percent and the wealth effect is 19 percent.&lt;/p&gt;
&lt;p&gt;Q: How does the model match the empirical estimates?
A: Simulating a permanent 19.4 percent house-price shock for 200,000 household pairs, the model produces a 6.1 log point average reduction in new car values over the first four quarters, corresponding to an elasticity of 0.31 and a new-car MPX of 0.20 cents per dollar. The empirical estimate is 0.12 cents, and the model value falls within the empirical 95 percent confidence interval. The model also replicates the pattern of no extensive-margin response in the short run and a gradual build-up in the non-durable consumption response (maximum elasticity of 0.079 reached only after ten quarters).&lt;/p&gt;
&lt;p&gt;Q: Why is the short-run response concentrated in cars rather than non-durables?
A: The paper establishes an intertemporal smoothing mechanism for durables analogous to McKay and Wieland (2021): households delay or bring forward lumpy durable purchases in response to shocks to borrowing capacity. Although cars represent only 5.5 percent of total consumption in the model (Cobb-Douglas expenditure share), they account for 45–72 percent of the total expenditure response in the first four quarters after the house-price shock. The non-durable consumption response builds slowly and reaches its maximum after about ten quarters.&lt;/p&gt;
&lt;p&gt;Q: What factors does the model identify as explanations for the wide range of MPX estimates across studies?
A: Three factors are identified. First, shock magnitude: larger shocks produce smaller partial-equilibrium MPXs because more households hit borrowing constraints; across shock sizes from -30 to +20 percent, car and total MPXs can range from 1 to 2 cents per dollar. Second, measurement period: short-run (1-year) MPXs exceed long-run (3-year) MPXs, especially for durable goods. Third, the state of the economy: in a crisis-era bust following credit-fueled boom, many more households are constrained when prices fall, amplifying MPXs; Guerrieri and Iacoviello (2017) report car elasticities of 0.24 in the boom phase and 0.49 in the bust phase of the US financial crisis.&lt;/p&gt;
&lt;p&gt;Q: What is the role of the information friction in the model?
A: Because the quasi-experiment occurred in &amp;ldquo;normal times&amp;rdquo; just before the global financial crisis became acute, the authors argue that households were not immediately aware of the house-price shock; they only update their perceived housing wealth when they attempt to adjust their mortgage, trade cars, or receive a random information update. Under full information awareness, the one-year MPX would be approximately twice as large, and the one-year total MPX could be as much as three times as large (with a car MPX of 3 cents per dollar and total MPX well above 6 cents per dollar under full information with small positive shocks). The information friction thus attenuates the estimated MPX relative to a world of full information.&lt;/p&gt;
&lt;p&gt;Q: What placebo and robustness tests support the identification?
A: Co-op apartment owners show no statistically significant price or consumption response, consistent with their structural insulation from aircraft noise. Renters also show no consumption response. The dose-response test confirms a monotone relationship between distance to the noise contour and both house price and car expenditure effects. Income effects are absent (Figure B.2), and there is no differential probability of moving in either the short or long run. Tax reforms benefited both groups equally and had already been announced before the quasi-experiment.&lt;/p&gt;
&lt;p&gt;Q: How does this study&amp;rsquo;s identification strategy compare to instrumental variable approaches using housing supply elasticity?
A: Supply elasticity IV approaches (Mian et al. 2013; Aladangady 2017; Kaplan et al. 2020) rely on regional variation in construction constraints and must assume that consumption demand factors are either observed or uncorrelated with supply elasticity — an assumption critiqued by Davidoff (2016). This paper&amp;rsquo;s identification exploits an exogenous change in a local negative externality, yielding a geographically granular shock unrelated to macroeconomic conditions and free from general equilibrium feedback. The result is interpretable as a partial equilibrium housing wealth effect in the sense of Berger et al. (2018) and Guren et al. (2020).&lt;/p&gt;
&lt;p&gt;Housing wealth effect: The causal effect of a change in housing wealth on household consumption expenditure, decomposed in this paper into a pure wealth channel (change in lifetime resources) and a collateral channel (change in borrowing capacity via home equity).&lt;/p&gt;
&lt;p&gt;Marginal propensity for expenditures (MPX): The change in spending per dollar change in housing wealth; distinct from the marginal propensity to consume (MPC) because spending on durables may be lumpy and differ from the flow of consumption services. The paper distinguishes the car MPX conditional on purchase (2.5 cents per dollar), the aggregate new-car MPX (0.12 cents per dollar per year), and the total expenditure MPX.&lt;/p&gt;
&lt;p&gt;Collateral channel: The mechanism by which a decline in house prices reduces homeowners&amp;rsquo; borrowing capacity — because the house serves as collateral for mortgage debt — thereby tightening credit constraints and reducing spending, independent of any change in permanent income. The model assigns 93 percent of the short-run car MPX to this channel.&lt;/p&gt;
&lt;p&gt;Two-sample instrumental variable (TSIV): The empirical strategy of Angrist and Krueger (1992) used here to estimate the consumption elasticity: the house-price first stage is estimated in one sample (transaction data), and the reduced-form consumption effect is estimated in a second sample (household panel), with the IV elasticity computed as the ratio.&lt;/p&gt;
&lt;p&gt;Information friction: The assumption in the model that households do not immediately observe the spatial divergence in house prices; they update their perceived housing wealth only when they attempt to adjust their mortgage, trade a durable good, or receive a random information shock. This friction attenuates the short-run consumption response and is calibrated to &amp;ldquo;normal times&amp;rdquo; conditions.&lt;/p&gt;
&lt;p&gt;Noise contour: The geographic boundary around Bromma Airport within which properties are regularly exposed to noise levels of at least 70 decibels, as adjudicated by the Swedish Land and Environment Court. Properties within 1,000 meters of this contour define the treatment group.&lt;/p&gt;
&lt;p&gt;Intertemporal smoothing of durables: The pattern, documented in the model and complementary to McKay and Wieland (2021), whereby households adjust lumpy durable purchases (cars) rapidly in response to changes in borrowing capacity, so that durables account for a disproportionately large share of the total expenditure response in the short run (45–72 percent in the first four quarters despite a 5.5 percent Cobb-Douglas expenditure share).&lt;/p&gt;</description></item><item><title>The Impact of EITC on Education, Labour Market Trajectories, and Inequalities</title><link>https://macropaperwarehouse.com/papers/the-impact-of-eitc-on-education-labour-market-trajectories-and-inequalities/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-impact-of-eitc-on-education-labour-market-trajectories-and-inequalities/</guid><description>&lt;p&gt;This paper studies the effect of the Earned Income Tax Credit (EITC) on educational attainment and labor market trajectories through two complementary approaches. Using policy discontinuities at U.S. state borders—exploiting variation in state EITC generosity set as a percentage of the federal EITC—the paper finds that an increase in the state EITC leads to a statistically significant increase in the high school dropout rate. The mechanism is that a tax credit targeted at low-wage (low-skilled) workers increases the value of low-skilled employment and reduces the relative return to schooling, generating a powerful disincentive to pursue long-term studies. A structural life-cycle matching model with directed search and endogenous educational choices, search intensities, hirings, hours worked, and separations is developed to quantify the long-run general equilibrium effects: in the long run, EITC reduces the proportion of high-skilled workers, with ambiguous effects on income inequality that depend on the competing channels through which EITC affects both the supply and demand sides of the labor market.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on a working paper version, AI-assisted and human-reviewed. See the linked published article for the authoritative version.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-empirical-strategy-for-identifying-the-effect-of-eitc-on-education"&gt;Q1. What is the empirical strategy for identifying the effect of EITC on education?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper identifies the causal effect of state EITC on education by exploiting policy discontinuities at U.S. state borders, comparing contiguous PUMA pairs on opposite sides of state borders that differ in state EITC generosity.&lt;/strong&gt; State EITC rates are set as a percentage of the federal EITC and have varied considerably since the mid-1980s. Borrowing from the minimum wage literature (Dube et al., 2010; Hagedorn et al., 2015), the border-discontinuity design controls for local labor market conditions that vary continuously across state borders while isolating the effect of the discrete EITC policy difference.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-labor-market-mechanism-linking-eitc-to-education"&gt;Q2. What is the labor market mechanism linking EITC to education?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;EITC raises the value of low-skilled employment by directly increasing the earnings of low-wage workers, which in turn reduces the relative return to investing in education, generating a powerful disincentive to pursue long-term studies.&lt;/strong&gt; When directed search is present—as supported by recent empirical studies—educational decisions affect both job-finding probabilities and labor incomes over the life cycle. EITC&amp;rsquo;s subsidization of low-skilled work contracts the education premium in this framework, making the forgone earnings cost of staying in school larger relative to the low-skilled employment option supported by the EITC.&lt;/p&gt;
&lt;h3 id="q3-what-does-the-life-cycle-matching-model-contribute"&gt;Q3. What does the life-cycle matching model contribute?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The structural life-cycle matching model with directed search and endogenous educational choices, search intensities, hirings, hours worked, and separations quantifies the general equilibrium and long-run effects of EITC that purely reduced-form studies cannot capture—including the feedback of an expanded low-skilled labor force on equilibrium wages and job creation.&lt;/strong&gt; The model endogenizes labor demand, capturing both household responses (education, hours, search intensity) and firms&amp;rsquo; responses (job creation and destruction). It is solved and estimated to replicate the life-cycle profile of labor market variables.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-long-run-implications-for-inequality"&gt;Q4. What are the long-run implications for inequality?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In the long run, EITC reduces the proportion of high-skilled workers in the economy, with ambiguous effects on income inequality because of offsetting channels: EITC directly increases earnings of low-skilled workers, but by expanding the supply of low-skilled labor it may also depress low-skilled wages; additional channels through unemployed workers&amp;rsquo; search effort and employed workers&amp;rsquo; hours further complicate the net effect.&lt;/strong&gt; The model is used to determine the optimal design of the EITC that balances the income-support objective against these unintended long-run effects.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;state EITC&lt;/strong&gt; : a supplement to the federal Earned Income Tax Credit set as a fixed percentage of the federal credit; varies across states; used in this paper as the identification source for the effect of EITC generosity on education via border discontinuities.
&lt;strong&gt;directed search&lt;/strong&gt; : a labor market framework in which workers and firms direct their search to specific submarkets with posted wages; in this setting, educational choice affects both job-finding probabilities and wages over the life cycle, amplifying the disincentive effects of EITC on education relative to random-search models.
&lt;strong&gt;education-EITC disincentive&lt;/strong&gt; : the mechanism by which EITC targeted at low-wage workers raises the relative value of low-skilled employment and reduces the return to schooling, generating an increase in high school dropout rates as a side effect of the anti-poverty policy.&lt;/p&gt;</description></item><item><title>The Impact of Incarceration on Employment, Earnings, and Tax Filing</title><link>https://macropaperwarehouse.com/papers/the-impact-of-incarceration-on-employment-earnings-and-tax-filing/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-impact-of-incarceration-on-employment-earnings-and-tax-filing/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;This paper estimates the causal effect of incarceration on employment, wage earnings, self-employment, and tax filing behavior using administrative criminal justice data linked to Internal Revenue Service (IRS) records for approximately half a million felony defendants in two U.S. states: North Carolina and Ohio. The study period covers cases filed from the early 2000s through 2014, with outcomes tracked through 2020 using IRS W-2 and 1040 records.&lt;/p&gt;
&lt;h3 id="research-question"&gt;Research Question&lt;/h3&gt;
&lt;p&gt;The central question is whether incarceration itself — as distinct from arrest, conviction, and other criminal justice interactions that precede or accompany it — causes lasting reductions in defendants&amp;rsquo; labor market outcomes. The paper explicitly holds fixed upstream interactions (conviction, arrest) to isolate the effect of the incarceration sentence.&lt;/p&gt;
&lt;h3 id="data-and-sample"&gt;Data and Sample&lt;/h3&gt;
&lt;p&gt;Criminal justice records from Ohio (Common Pleas courts in Franklin, Cuyahoga, and Hamilton counties, covering Columbus, Cleveland, and Cincinnati) and North Carolina (Administrative Office of the Courts and Department of Public Safety) are linked to de-identified IRS records via name, date of birth, sex, address, and partial Social Security Numbers. Match rates are 92% in Ohio and 95% in North Carolina. The sample is restricted to defendants aged 18–50 at time of offense with cases filed 2002–2014. IRS records include employer-reported W-2 wages (regardless of individual tax filing), self-employment income from Schedule C/SE, non-employee compensation (1099-MISC), and gig-economy earnings from 1099 returns. All dollar figures are adjusted to 2016 dollars using the PCE deflator.&lt;/p&gt;
&lt;h3 id="empirical-strategy"&gt;Empirical Strategy&lt;/h3&gt;
&lt;p&gt;Two independent quasi-experimental research designs are used:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;North Carolina — Sentencing guideline discontinuities&lt;/strong&gt;: North Carolina&amp;rsquo;s structured sentencing guidelines map offense class (E through I, the five least severe felony classes) and prior record points (a numerical criminal history score) into permissible punishment types (incarceration vs. probation) and sentence lengths. Allowable punishment types change discretely at five cell boundaries, generating discontinuities in incarceration sentences for otherwise similar defendants. The paper uses these five boundary discontinuities as excluded instruments in a parameterized regression discontinuity design stacked across offense classes. First-stage F-statistic = 115.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Ohio — Random assignment to judges&lt;/strong&gt;: Cases are randomly assigned by computer to judges at arraignment in the three counties studied. Judge leave-out mean sentence length is used as an instrument for individual sentence length. The design follows Norris et al. (2021) and yields F-statistic = 321. The instrument shifts sentences along both the extensive margin (any vs. no incarceration) and intensive margin (longer vs. shorter sentences).&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Both designs produce complier populations for whom at least 37–45% are shifted on the extensive margin (from no incarceration to some incarceration), based on partial identification bounds using linear programming.&lt;/p&gt;
&lt;h3 id="main-findings"&gt;Main Findings&lt;/h3&gt;
&lt;p&gt;The paper&amp;rsquo;s central finding is that incarceration generates &lt;strong&gt;large short-run reductions&lt;/strong&gt; in labor market activity during the incapacitation period, but &lt;strong&gt;no detectable long-run reductions&lt;/strong&gt; in annual employment or earnings once defendants have been released and the incapacitation effects have dissipated.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;In the first year after case filing, when incarceration rates peak (roughly 75–100 additional days incarcerated for a 12-month sentence), employment falls by approximately &lt;strong&gt;10 percentage points&lt;/strong&gt; and total W-2 earnings contract commensurately.&lt;/li&gt;
&lt;li&gt;Within 3–4 years of filing, employment effects return to near zero and are statistically insignificant in both states.&lt;/li&gt;
&lt;li&gt;Five to nine years after filing, when effects on contemporaneous incarceration have dissipated, the estimated effect of a 12-month sentence on annual earnings is &lt;strong&gt;positive or near zero&lt;/strong&gt; in both states. The combined 95% confidence interval rules out reductions in annual wages greater than &lt;strong&gt;$231&lt;/strong&gt; (approximately 5% of the untreated complier mean) and rules out any adverse employment effects.&lt;/li&gt;
&lt;li&gt;Despite no long-run level effects, losses during incapacitation are never recouped. A one-year sentence reduces &lt;strong&gt;cumulative earnings over five years by approximately $2,914&lt;/strong&gt; — a 13% reduction relative to the complier mean.&lt;/li&gt;
&lt;li&gt;Effects on self-employment, independent contracting, 1040 filing, adjusted gross income, EITC take-up, and interstate migration are similarly null in the long run.&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id="incapacitation-vs-post-release-scarring"&gt;Incapacitation vs. Post-Release Scarring&lt;/h3&gt;
&lt;p&gt;The paper provides two tests for whether short-run earnings losses reflect incapacitation alone or also post-release scarring (e.g., human capital depreciation, employer discrimination, or discouragement effects):&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;A &amp;ldquo;visual IV&amp;rdquo; regression of year-t earnings effects on year-t days-incarcerated effects yields an R² of 0.83–0.85 across states, with the intercept near zero (positive and small), indicating that virtually all dynamic earnings impacts flow through contemporaneous incapacitation and not through a post-release channel.&lt;/li&gt;
&lt;li&gt;Constructed outcomes that impose the null of pure incapacitation (scaling pre-case average earnings or covariate-predicted earnings by the share of the year free from prison) closely track actual earnings effects in both states, further confirming that incapacitation is the dominant mechanism.&lt;/li&gt;
&lt;/ol&gt;
&lt;h3 id="pre-existing-labor-market-detachment"&gt;Pre-existing Labor Market Detachment&lt;/h3&gt;
&lt;p&gt;A key scope condition is defendants&amp;rsquo; severe labor market disadvantage prior to their case. Fewer than 50–60% of defendants are employed in the year before filing; average pre-case W-2 earnings (including zeros) are below $6,000. Among employed defendants, only 10% earn more than $22,000 per year. Untreated complier means for earnings in the year after case filing are below $4,000, with virtually no earnings or employment growth over the following nine years. The paper concludes that returning to pre-filing earnings levels is sufficient for incarcerated defendants to match their non-incarcerated peers — a low bar that is readily met.&lt;/p&gt;
&lt;h3 id="policy-implications"&gt;Policy Implications&lt;/h3&gt;
&lt;p&gt;Back-of-envelope aggregation implies incapacitation losses of approximately &lt;strong&gt;$6.16 billion per year&lt;/strong&gt; in foregone earnings for the U.S. prison population, concentrated in communities heavily affected by incarceration. However, a marginal reduction in incarceration rates would increase average earnings by only &lt;strong&gt;$51 for white men&lt;/strong&gt; and &lt;strong&gt;$213 for black men&lt;/strong&gt;, suggesting incarceration&amp;rsquo;s direct contribution to labor market inequality is modest relative to the $21,100 black-white earnings gap estimated by Bayer and Charles (2018). The paper concludes that upstream factors — other criminal justice interactions, human capital deficits, and broader socioeconomic disadvantage — are more plausibly responsible for low earnings among the formerly incarcerated.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-exact-treatment-variable-and-what-is-the-counterfactual"&gt;Q1. What is the exact treatment variable, and what is the counterfactual?&lt;/h3&gt;
&lt;p&gt;The treatment variable is months of incarceration sentenced in the focal case (a continuous, weakly positive ordered treatment). The counterfactual for non-incarcerated defendants in North Carolina is probation (all defendants are convicted by construction under structured sentencing guidelines). In Ohio, the authors cannot reject that all compliers who do not receive a prison sentence are still convicted, implying the counterfactual is also conviction and probation. All compliers therefore acquire a criminal record regardless of sentence. The treatment effect is thus the effect of incarceration conditional on conviction, holding fixed the criminal record.&lt;/p&gt;
&lt;h3 id="q2-how-are-effects-interpreted-given-multiple-instruments-and-continuous-treatment"&gt;Q2. How are effects interpreted given multiple instruments and continuous treatment?&lt;/h3&gt;
&lt;p&gt;Under a &amp;ldquo;weakly positive ordered treatment&amp;rdquo; assumption and standard LATE conditions, the 2SLS estimates can be interpreted as Average Causal Responses (ACRs) — weighted averages of the marginal dose effects (12 vs. 11 months, 6 vs. 5 months, 1 vs. 0 months, etc.) for complier subgroups shifted by each instrument. In North Carolina with five parameterized RD instruments, the estimate averages ACRs weighted by first-stage strength. In Ohio with a leave-out mean instrument, the estimate is a convex average of ACRs under the assumption that the linear first-stage model is a good approximation. Dosage weights for both states put mass on a wide range of sentence lengths including both extensive and intensive margins, though Ohio&amp;rsquo;s weights are more skewed toward shorter sentences.&lt;/p&gt;
&lt;h3 id="q3-how-large-are-the-first-stage-effects-and-how-strong-is-the-instrument"&gt;Q3. How large are the first-stage effects, and how strong is the instrument?&lt;/h3&gt;
&lt;p&gt;In North Carolina, sentences jump by 50% or more at sentencing guideline cell boundaries where allowable punishment types change to include incarceration. The first-stage F-statistic is 115. In Ohio, defendants assigned to the most severe judge receive incarceration sentences approximately six months longer than those assigned to the least severe judge (roughly 30% of the average non-zero sentence), with a slope of approximately 0.8 in the first-stage regression; F-statistic = 321. At least 37% of compliers in North Carolina and 45% in Ohio are shifted on the extensive margin (from no incarceration to some positive incarceration), with upper bounds as high as 95%.&lt;/p&gt;
&lt;h3 id="q4-what-evidence-supports-instrument-validity-exclusion-restriction-and-independence"&gt;Q4. What evidence supports instrument validity (exclusion restriction and independence)?&lt;/h3&gt;
&lt;p&gt;Instrument validity is tested by estimating 2SLS &amp;ldquo;effects&amp;rdquo; on pre-case outcomes measured 2–4 years before the focal case. In both states, the instruments show no relationship with pre-case employment, W-2 wages, total days previously incarcerated, or binary severe prior incarceration. The probability of being matched to IRS records and the quality of the match are also uncorrelated with the instruments. In Ohio, potential exclusion restriction violations from judges affecting conviction (not just sentence) are addressed empirically: nearly 90% of defendants are convicted, the most severe judge is only 0.7 p.p. more likely to convict than the least severe judge (t-stat = 1.53), and the estimated conviction rate among untreated compliers is 0.972 (s.e. 0.018), so one cannot reject that all non-incarcerated compliers are convicted.&lt;/p&gt;
&lt;h3 id="q5-how-does-the-paper-test-for-the-incapacitation-mechanism-against-post-release-scarring"&gt;Q5. How does the paper test for the incapacitation mechanism against post-release scarring?&lt;/h3&gt;
&lt;p&gt;Two complementary exercises are conducted. First, a &amp;ldquo;visual IV&amp;rdquo; plot regresses year-t earnings effects on year-t days-incarcerated effects across all post-filing years. If incapacitation is the sole channel, all points should lie on a line through the origin. The R² is 0.83 in North Carolina and 0.85 in Ohio, the estimated intercept is near zero (positive and small) in both states, and the slope (earnings lost per day incarcerated) is approximately $12. This implies cumulative earnings losses of $12 × 268 days = $3,216, very close to the directly estimated $2,914. Second, constructed outcomes that scale pre-case earnings or covariate-predicted earnings by the share of the year not incarcerated closely track actual earnings effects throughout the post-filing period, and both converge to zero as incapacitation effects fade — consistent with pure incapacitation and no net scarring.&lt;/p&gt;
&lt;h3 id="q6-what-are-the-long-run-59-years-earnings-and-employment-estimates-and-how-precisely-are-null-effects-ruled-out"&gt;Q6. What are the long-run (5–9 years) earnings and employment estimates, and how precisely are null effects ruled out?&lt;/h3&gt;
&lt;p&gt;Averaged across both states using inverse-variance weights, the estimated effect of a 12-month sentence on annual W-2 earnings five to nine years after filing is positive but statistically indistinguishable from zero. The 95% confidence interval rules out reductions in annual wages greater than $231 (approximately 5% of the untreated complier mean of roughly $4,500–$5,000). The 95% CI also rules out any adverse employment effects. The untreated complier mean for employment 5–9 years post-filing is approximately 40% in North Carolina and slightly above 40% in Ohio.&lt;/p&gt;
&lt;h3 id="q7-what-happens-to-cumulative-earnings-over-five-years-despite-null-long-run-level-effects"&gt;Q7. What happens to cumulative earnings over five years despite null long-run level effects?&lt;/h3&gt;
&lt;p&gt;Even though long-run annual earnings are unaffected, earnings losses during incapacitation are never made up. A one-year sentence reduces cumulative employment (measured as years with any W-2) and cumulative earnings over five years by approximately $2,914 — a 13% reduction relative to the complier mean. This reflects the mechanical loss of earnings during the period of physical incapacitation, without a subsequent compensating period of higher earnings after release.&lt;/p&gt;
&lt;h3 id="q8-do-defendants-with-stronger-pre-case-labor-market-attachment-show-different-long-run-patterns"&gt;Q8. Do defendants with stronger pre-case labor market attachment show different long-run patterns?&lt;/h3&gt;
&lt;p&gt;The sample is split between defendants employed in at least 2 of the 4 years prior to the case (53–57% of the sample across states) and those less attached. Both groups show zero long-run earnings and employment effects. Previously employed defendants experience much larger short-run earnings drops — more than three times larger in the first year post-filing — and their earnings recover more slowly, reaching zero effect approximately six years after filing (vs. three years for the previously unemployed). For a stricter cut (pre-case average earnings above $15,000, representing only 12–15% of the sample), the long-run earnings effect is −$1,426 (8% of the untreated complier mean), significant only at the 10% level, and partly attributable to residual incapacitation (19.6 additional days incarcerated 5–9 years post-filing). For defendants with pre-case earnings below $15,000, incarceration slightly increases long-run employment (2.4 pp, p = 0.01) and earnings ($400, p = 0.03), possibly reflecting rehabilitative benefits (GED or educational programs) for labor-market-detached individuals.&lt;/p&gt;
&lt;h3 id="q9-does-first-time-incarceration-extensive-margin-exposure-have-larger-long-run-effects-than-repeat-exposure"&gt;Q9. Does first-time incarceration (extensive-margin exposure) have larger long-run effects than repeat exposure?&lt;/h3&gt;
&lt;p&gt;The paper tests this by splitting the sample into defendants with and without prior incarceration history. Among defendants with no prior incarceration, the instruments generate large differences in lifetime exposure: a 12-month sentence increases the probability of ever being incarcerated over the next 5–9 years by 26 p.p. (North Carolina) and 41 p.p. (Ohio). Among those not receiving a sentence, 48% (North Carolina) and 19% (Ohio) are eventually incarcerated anyway, implying treatment causes a 52 and 81 p.p. increase in lifetime incarceration probability for extensive-margin compliers. Despite these large differences in lifetime exposure, long-run earnings and employment effects remain small and statistically insignificant in both subsamples. The difference in long-run effects between previously and never incarcerated defendants is not statistically significant (p = 0.29 for employment, p = 0.82 for earnings).&lt;/p&gt;
&lt;h3 id="q10-are-there-heterogeneous-effects-by-race-sex-or-criminal-history"&gt;Q10. Are there heterogeneous effects by race, sex, or criminal history?&lt;/h3&gt;
&lt;p&gt;There is no evidence of long-run scarring for any demographic or criminal history subgroup. Effects for black and non-black defendants are both positive for long-run earnings and employment. Non-black defendants show somewhat larger cumulative losses (consistent with marginally higher counterfactual earnings), but differences are not statistically significant. Estimates for women are imprecise due to small sample size. Among defendants with and without prior felony charges in the four years preceding the case, there are neither economically nor statistically significant long-run earnings or employment effects. Cumulative losses are somewhat larger for defendants without prior felony charges (p = 0.07), reflecting their higher pre-case earnings.&lt;/p&gt;
&lt;h3 id="q11-how-does-the-paper-handle-potential-migration-bias-in-outcomes"&gt;Q11. How does the paper handle potential migration bias in outcomes?&lt;/h3&gt;
&lt;p&gt;Tax filing and W-2 receipt in the state of sentencing are used to proxy for whether defendants remain in the same state. Among untreated compliers, 88% of those with a tax footprint maintain it in the state of sentencing. No statistically significant effects of incarceration on migration (measured as filing or receiving a W-2 in North Carolina or Ohio) are detected, suggesting prior studies of recidivism measured within-state are unlikely to be severely biased by migration responses.&lt;/p&gt;
&lt;h3 id="q12-what-effect-does-incarceration-have-on-mortality"&gt;Q12. What effect does incarceration have on mortality?&lt;/h3&gt;
&lt;p&gt;Incarceration reduces five-year mortality by approximately 0.8 percentage points (about 20% of the untreated mean). The authors note this is too small to explain the null long-run labor market effects: even if all defendants whose death was averted were employed, removing them from the employment count would reduce the employment effect of a 12-month sentence only to approximately zero.&lt;/p&gt;
&lt;h3 id="q13-how-do-the-papers-findings-compare-to-prior-studies-particularly-mueller-smith-2015"&gt;Q13. How do the paper&amp;rsquo;s findings compare to prior studies, particularly Mueller-Smith (2015)?&lt;/h3&gt;
&lt;p&gt;Mueller-Smith (2015) finds large and persistent negative incarceration effects on labor market outcomes in Texas using a structural decomposition and Lasso-based judge-covariate interactions as instruments. The paper argues methodological differences are the likely explanation: the Lasso-selected interacted instruments can be susceptible to many-weak instruments bias toward OLS. It notes that Mueller-Smith&amp;rsquo;s simpler 2SLS specifications (analogous to those used here) show no statistically significant earnings effects. North Carolina and Ohio are documented to be broadly similar to Texas (and the U.S. average) in rehabilitation program participation, recidivism rates, and incarceration rates, reducing the likelihood that genuine geographic heterogeneity explains the divergence.&lt;/p&gt;
&lt;h3 id="q14-what-is-the-papers-aggregate-extrapolation-of-incapacitation-earnings-losses"&gt;Q14. What is the paper&amp;rsquo;s aggregate extrapolation of incapacitation earnings losses?&lt;/h3&gt;
&lt;p&gt;Scaling the estimated $2,914 cumulative loss per 12-month sentence by the ratio of days exposed to total days in a year gives a per-day loss of approximately $12. Applied to the 1,435,500 people incarcerated in U.S. prisons on any given day in 2019 (excluding the more than 700,000 in jail), the implied aggregate yearly earnings loss from incapacitation is approximately $6.16 billion.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Incapacitation effect&lt;/strong&gt;: The mechanical reduction in earnings and employment that occurs while a defendant is physically confined in prison and unable to work, as distinct from any post-release scarring effect. The paper shows this is the dominant — and essentially sole — causal channel through which incarceration affects labor market outcomes in their sample.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Post-release scarring&lt;/strong&gt;: Persistent reductions in earnings or employment that persist after a defendant is released from prison, caused by mechanisms such as employer discrimination based on incarceration history, human capital depreciation, loss of job contacts, or psychological discouragement effects. The paper finds no evidence of scarring in either state.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Average Causal Response (ACR)&lt;/strong&gt;: The weighted average of the marginal dose effects of incarceration (e.g., effect of 12 vs. 11 months, 1 vs. 0 months) for groups of defendants whose sentence lengths are shifted by a given instrument. Contrasted with a binary LATE, the ACR averages across the full dosage distribution for compliers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Complier&lt;/strong&gt;: An individual whose incarceration sentence is shifted by the instrument — either from zero to some positive sentence (extensive margin) or from a shorter to a longer sentence (intensive margin). Counterfactual outcome means for compliers sentenced to zero months provide the baseline for evaluating effect magnitudes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sentencing guideline discontinuity&lt;/strong&gt;: The discrete jump in permissible punishment types and minimum sentence lengths at specific criminal history score thresholds within North Carolina&amp;rsquo;s structured sentencing grid. Defendants just above a threshold are more likely to be incarcerated than otherwise similar defendants just below, generating quasi-experimental variation exploited as an instrument.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Leave-out mean judge instrument&lt;/strong&gt;: In Ohio, each defendant&amp;rsquo;s assigned judge&amp;rsquo;s average incarceration sentence length computed over all other cases that judge handles (excluding the defendant&amp;rsquo;s own case), residualized on court-by-month fixed effects. Because judges are randomly assigned to cases, this measure is conditionally independent of defendant potential outcomes and serves as an instrument for sentence length.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Control complier mean&lt;/strong&gt;: The estimated mean potential outcome for compliers under the counterfactual of receiving zero months of incarceration. Used as a benchmark to evaluate the magnitude of treatment effects and to characterize how low the earnings baseline is for the population driving the causal estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Extensive vs. intensive margin of incarceration&lt;/strong&gt;: The extensive margin refers to the binary shift from receiving no prison sentence to receiving any prison sentence; the intensive margin refers to increasing sentence length conditional on some incarceration. The paper argues that neither margin appears to produce long-run labor market scarring, and uses linear programming bounds to estimate that at least 37–45% of compliers in each state are shifted on the extensive margin.&lt;/p&gt;</description></item><item><title>The Macroeconomic Consequences of Early Childhood Development Programs</title><link>https://macropaperwarehouse.com/papers/the-macroeconomic-consequences-of-early-childhood-development-programs/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-macroeconomic-consequences-of-early-childhood-development-programs/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper embeds early childhood development (ECD) investment into a general-equilibrium (GE), heterogeneous-agent, overlapping-generations model calibrated to U.S. data in order to quantify the aggregate and distributional consequences of large-scale, universal government ECD programs. The central finding is that a universal program spending $13,500 per child-year on children aged 0–3 — the same level as a well-studied North Carolina randomized controlled trial — generates long-run welfare gains of 12.7% in consumption-equivalent units for newborns under the veil of ignorance, income growth of 10.6%, an intergenerational mobility increase of 28.2% (roughly half the US–Canada gap), and a lifetime-earnings inequality reduction of 2.0% (roughly half the US–Germany gap). The key mechanism is dynastic: investing in a child today not only raises that child&amp;rsquo;s own skills and income but creates a better parental background — in terms of skills, assets, and education — for the next generation, so that more than two-thirds of the welfare gains accrue through this intergenerational channel rather than from the direct effect on the intervened generation. General equilibrium compresses the college wage premium and reduces welfare gains by approximately one-third relative to partial-equilibrium projections, but the policy remains self-financing in the long run. The model is validated against the first- and second-generation experimental evidence from Garcia et al. (2020, 2024), replicating both the 15 p.p. college graduation rate increase and the 1.54 lifetime income return per dollar spent that those RCTs documented.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a published paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-market-failure-that-motivates-government-ecd-investment"&gt;Q1. What is the core market failure that motivates government ECD investment?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper identifies two inter-related reasons why private early childhood investments fall below the social optimum: parents cannot borrow against their children&amp;rsquo;s future income (no child-to-parent compensation contract), and borrowing constraints together with uninsurable idiosyncratic return risk further depress parental investment below even that constrained optimum.&lt;/strong&gt; Under complete markets with compensating contracts, a poor parent who invests in a high-skilled child could smooth lifetime consumption intergenerationally. Without such contracts, the entire cost of investment falls on the parent in the early life-cycle when assets and income are low, reducing investment incentives sharply. Government ECD spending financed by future taxation on the child&amp;rsquo;s higher income imperfectly replicates this missing insurance-borrowing mechanism. The model finds that uncertain returns to skill investments — which the government can spread across the population but parents cannot insure privately — and incomplete credit markets are quantitatively more important than imperfect altruism in driving underinvestment.&lt;/p&gt;
&lt;h3 id="q2-how-is-the-model-structured-and-what-is-the-role-of-the-dynastic-framework"&gt;Q2. How is the model structured, and what is the role of the dynastic framework?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The model is a dynastic overlapping-generations Aiyagari life-cycle economy with four stages (childhood, college, work/parenthood, retirement), each of four-year periods, in which children&amp;rsquo;s cognitive and non-cognitive skills are determined by CES-aggregated parental time and money investments in ages 0–3, calibrated using the Cunha et al. (2010) skill-formation function.&lt;/strong&gt; At age 28 (period j=8), the working agent becomes a parent and chooses parental time τ and money m to invest in the child&amp;rsquo;s skill development across two periods; these decisions interact because time and money are estimated to be imperfect complements (CES exponent γ estimated from data). College attendance is endogenous — it depends on assets, skills, and a school-taste shock — and can be financed by parental transfers (constrained to be non-negative), work, or subsidized student loans. The dynastic structure means that changes in the distribution of parental skills, assets, and education feed forward into the next generation&amp;rsquo;s initial conditions, which is the source of the large long-run intergenerational amplification.&lt;/p&gt;
&lt;h3 id="q3-what-are-the-main-quantitative-results-of-the-benchmark-universal-ecd-policy"&gt;Q3. What are the main quantitative results of the benchmark universal ECD policy?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A universal permanent policy investing $13,500 per child-year (ages 0–3), financed by an endogenous labor income tax, produces in the long-run steady state: a welfare gain of 12.7% in consumption-equivalent units (for a newborn under the veil of ignorance), a labor income increase of 10.6% (driven by an 11.7% rise in labor productivity), an intergenerational mobility increase of 28.2% as measured by minus the rank-rank coefficient, and a lifetime-earnings variance reduction of 2.0%.&lt;/strong&gt; The level of $13,500 is both the historically implemented per-child cost in the Garcia et al. (2020) RCT and close to the welfare-maximizing amount in the model (welfare peaks at 13.1% at a slightly higher spending level). Children of low-skilled, non-college parents gain the most (welfare gain 9.1%) versus children of college-educated, high-skilled parents (welfare gain 4.1%). Taxes in the long run are approximately unchanged from baseline because the expanded tax base offsets the direct program cost; in partial equilibrium without GE forces, taxes fall by 2.5 p.p. but GE compression of wages leaves only a negligible tax reduction in the benchmark.&lt;/p&gt;
&lt;h3 id="q4-how-do-the-decomposition-exercises-isolate-the-relative-importance-of-long-run-dynamics-versus-ge-and-taxation"&gt;Q4. How do the decomposition exercises isolate the relative importance of long-run dynamics versus GE and taxation?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Decomposing the 12.7% benchmark gain across four counterfactual implementations reveals that long-run intergenerational dynamics account for over two-thirds of total welfare gains, while GE forces reduce gains by roughly one-third and taxation costs are approximately offset by higher revenues in the long run.&lt;/strong&gt; Specifically: (i) a one-generation, partial-equilibrium version of the policy generates only 5.2% welfare gain; (ii) adding long-run intergenerational effects (permanent policy) raises this to 14.5%, an increase of 9.3 p.p.; (iii) further allowing for balanced-budget taxation in PE reduces gains from 17.3% to a net 12.5%; and (iv) incorporating GE effects — which compress the college wage premium — reduces labor productivity gains and welfare gains by about one-third. More than two-thirds of the aggregate welfare gain comes from changes in the distribution of initial conditions (newborns born into higher-skilled, better-resourced families) rather than from higher utility at a fixed initial-state distribution.&lt;/p&gt;
&lt;h3 id="q5-how-does-the-model-validate-against-experimental-evidence-and-what-does-this-imply-for-rct-estimates"&gt;Q5. How does the model validate against experimental evidence, and what does this imply for RCT estimates?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The model replicates two key experimental benchmarks: the Garcia et al. (2020) RCT finding of a 15 p.p. college graduation rate increase and a per-dollar lifetime income return of 1.55 (the model generates 1.54 for children of median-income parents), and the Garcia et al. (2024) estimate that second-generation income gains are 29% of first-generation gains (the model generates 20%, at or below that empirical estimate).&lt;/strong&gt; The validation is run as a small-scale, partial-equilibrium, one-generation exercise — exactly matching the RCT design — so the comparison is clean. The fact that the model replicates both generations&amp;rsquo; effects provides confidence in the intergenerational amplification mechanism. The paper interprets this as evidence that RCT evaluations of short-run, small-scale programs systematically underestimate the long-run benefits of universal programs, with the ratio of long-run GE gains to short-run PE estimates falling between 2 and 3 across a range of alternative education policies.&lt;/p&gt;
&lt;h3 id="q6-how-do-general-equilibrium-forces-shape-distributional-outcomes-and-why-do-they-cut-in-opposite-directions"&gt;Q6. How do general equilibrium forces shape distributional outcomes, and why do they cut in opposite directions?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;GE forces create a tension: they simultaneously generate most of the inequality reduction (by compressing the college wage premium) and eliminate most of the labor-productivity and welfare gains from partial-equilibrium projections.&lt;/strong&gt; As the universal program raises the share of college graduates, the relative wage of college workers falls. This wage compression is the primary driver of the 2.0% reduction in lifetime-earnings variance — approximately equal to half the US–Germany inequality gap — but it also lowers the productivity return on human capital investment, reducing GDP gains from 17.2% in partial equilibrium to 10.6% in general equilibrium. Because wages of college graduates fall, the government&amp;rsquo;s long-run tax savings from higher aggregate earnings are nearly eliminated compared to the partial-equilibrium case (from 2.5 p.p. reduction to negligible reduction). GE forces thus explain both the largest distributional benefit of the policy and its largest welfare cost.&lt;/p&gt;
&lt;h3 id="q7-what-do-robustness-and-extension-exercises-reveal-about-scalability-and-alternative-policies"&gt;Q7. What do robustness and extension exercises reveal about scalability and alternative policies?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The main results are robust across alternative elasticity-of-substitution parameter values for parental time and money investments, and the welfare-maximizing spending level closely tracks the benchmark $13,500 program.&lt;/strong&gt; A scale-up extension — in which the early childhood input requires college-graduate labor — finds nearly identical long-run welfare gains but smaller first-generation gains because program costs initially rise as college labor becomes scarcer. A comparison of alternative policies (investments in older children, investment subsidies rather than direct investments, college subsidies, parenting education programs) shows that policies investing directly in young children&amp;rsquo;s skills consistently achieve larger long-run GE welfare gains relative to their short-run PE estimates than alternative designs, because the intergenerational &amp;ldquo;better-parents&amp;rdquo; mechanism is most pronounced for early childhood. Among these alternatives, using equivalent resources as a lump-sum transfer at age 16 yields only 4.1% welfare gain — less than one-third of the 12.7% from early childhood investment — confirming that in-kind investments in childhood are more efficient than cash at the same fiscal cost when parents cannot be compensated by their children.&lt;/p&gt;
&lt;h3 id="q8-what-do-the-transition-dynamics-imply-for-the-political-economy-of-ecd-investment"&gt;Q8. What do the transition dynamics imply for the political economy of ECD investment?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;If the policy is introduced permanently, every new cohort born after the introduction is better off (each successive generation benefits more as the parental background improves), but older generations alive at the time of introduction face net welfare losses of approximately 1–3% on average because they bear higher taxes while receiving only indirect benefits through their children.&lt;/strong&gt; The paper calculates that if the government uses debt to smooth the financing cost over time — shifting part of the burden to future generations who will be richer — losses to initial cohorts are reduced to the point where a majority of adults would vote in favor of the policy. More than three-quarters of the long-run welfare gains are achieved within one generation of permanent policy implementation, alleviating concerns that benefits require an implausibly long time to materialize.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;dl&gt;
&lt;dt&gt;&lt;strong&gt;early childhood development (ECD) investment&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;publicly provided direct monetary investments in children aged 0–3, modeled as a perfect substitute for parental money investments m in the CES skill-formation technology; the paper&amp;rsquo;s primary policy instrument, calibrated at $13,500 per child-year to match the Garcia et al. (2020) North Carolina RCT.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;dynastic intergenerational amplification&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the mechanism by which a government ECD investment raises not only the directly intervened child&amp;rsquo;s skills and income but also improves the distribution of parental skills, assets, and education for the next generation, amplifying aggregate welfare gains so that more than two-thirds of the total gain accumulates via this channel rather than the direct first-generation effect.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;child-to-parent compensation constraint&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the assumption that parents cannot borrow against their child&amp;rsquo;s future income or receive direct compensation from the child for parental investments; identified as the primary source of underinvestment in the model alongside borrowing constraints and return uncertainty.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;skill-formation technology&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;a nested CES function adapted from Cunha et al. (2010) in which child skills θ&amp;rsquo; depend on current child skills (cognitive θc and non-cognitive θnc), parental skills θ, and a CES aggregate of parental time τ and money investments m; the calibrated complementarity between time and money implies that government money investments also crowd in parental time investment.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;general equilibrium skill-premium compression&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the decline in the relative wage of college graduates that occurs in a GE model when the policy raises the share of college workers; the mechanism that simultaneously generates the policy&amp;rsquo;s distributional gains (reduced wage inequality) and reduces its aggregate productivity gains by approximately one-third relative to partial equilibrium.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;consumption-equivalent welfare gain under the veil of ignorance&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the percentage increase in steady-state consumption that makes a newborn — who does not yet know her parental background — indifferent between being born in the baseline and the policy steady state; the paper&amp;rsquo;s primary welfare measure, equal to 12.7% under the benchmark universal program.&lt;/dd&gt;
&lt;/dl&gt;</description></item><item><title>The Macroeconomic Consequences of Exchange Rate Depreciations</title><link>https://macropaperwarehouse.com/papers/the-macroeconomic-consequences-of-exchange-rate-depreciations/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-macroeconomic-consequences-of-exchange-rate-depreciations/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;How does an exchange rate depreciation causally affect macroeconomic outcomes? The paper asks whether depreciations are expansionary or contractionary, and through which mechanism. The core identification challenge is endogeneity: exchange rate changes are driven by shocks that simultaneously affect output, making causal inference from unconditional variation misleading.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Strategy&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper studies &amp;ldquo;regime-induced&amp;rdquo; exchange rate depreciations by comparing macroeconomic outcomes for countries that peg their currency to the US dollar versus countries whose currencies float against the US dollar, in response to movements in the US dollar&amp;rsquo;s value. The identifying variation arises from the interaction between a country&amp;rsquo;s pre-existing exchange rate regime (peg vs. float) and changes in the US dollar&amp;rsquo;s nominal effective exchange rate (NEER), as measured by the BIS trade-weighted index against 24 relatively advanced economies (which are excluded from the analysis). This variation — which amounts to roughly 8% of total exchange rate variation in the sample — isolates a component of bilateral exchange rate changes that is orthogonal to idiosyncratic domestic shocks. The empirical specification is a local projection (Jorda, 2005) on annual data from 1973 to 2019 with country fixed effects and region-by-time fixed effects (four regions: Europe, Americas, Africa, Asia/Oceania). The main estimating equation regresses cumulative changes in outcome variables on the interaction term Peg × ΔUSD at horizons h = 0 to 9. Standard errors are two-way clustered by time and country. Exchange rate regime classification follows Ilzetzki, Reinhart, and Rogoff (2019); observations classified in the most ambiguous intermediate categories (coarse category 3) are dropped from the baseline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Empirical Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Regime-induced depreciations are strongly and persistently expansionary. In response to a 1% depreciation of the US dollar, the trade-weighted nominal effective exchange rate of pegger countries depreciates by 0.74% relative to floater countries on impact, rising to 0.9% before falling back to about 0.6% over years 3–5. The real effective exchange rate depreciates by a similar but slightly less persistent amount. The GDP response builds gradually, peaking after five years at approximately 0.4% per 1% US dollar depreciation. Expressed in terms of local currency depreciation, a 10% regime-induced depreciation results in a 5.5% increase in GDP over five years. Consumption rises by nearly 0.4% of GDP at peak. Investment also rises gradually, peaking after five years.&lt;/p&gt;
&lt;p&gt;Two findings are particularly important for identifying the transmission mechanism. First, net exports fall in response to a regime-induced depreciation. Imports rise more than exports for several years following the depreciation, ruling out an export-led boom driven by expenditure switching as the primary driver. Second, the short-term nominal interest rate rises modestly in pegging countries relative to floaters (by less than 0.1 percentage point per 1% depreciation), and the ex-post real interest rate response fluctuates around zero and is statistically insignificant throughout. This rules out looser monetary policy in pegger countries as the driver of the boom. Together, these two findings rule out a large set of standard open-economy models (including those with expenditure switching, monetary easing, and s = 0 financial frictions).&lt;/p&gt;
&lt;p&gt;The booms are concentrated in the service sector. Manufacturing, agriculture, and mining/construction responses are close to zero, indicating a domestic demand-led boom rather than an export-led one. The GDP response is entirely driven by countries with above-median capital account openness (as measured by the Chinn-Ito index); countries with below-median capital account openness show a similar exchange rate response but no significant output response. Results are similar across the early (1973–1995) and later (1996–2019) sub-periods.&lt;/p&gt;
&lt;p&gt;The Plaza Accord of 1985 provides a concrete illustration: the log real exchange rate of peggers depreciated by 12% (SE 2.7%) relative to floaters in the first year, while log GDP of peggers was 7.4% (SE 3.1%) higher after five years, implying a GDP response to a 10% depreciation of 6.2%, broadly consistent with the baseline estimates.&lt;/p&gt;
&lt;p&gt;Robustness checks controlling for Peg × US GDP growth, Peg × US inflation, Peg × US interest rate, Peg × commodity price changes, and Peg × global financial cycle (Miranda-Agrippino and Rey) leave results virtually unchanged.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Theoretical Framework&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;To explain these facts, the paper develops a four-region model (US, Euro Area, pegs to USD, pegs to euro) with imperfect financial openness. The model features (i) UIP deviations between the euro and US dollar driven by financial shocks (ψ_t), and (ii) sticky household portfolio shares, so that households invest a fixed fraction s of savings in foreign bonds and do not fully arbitrage cross-currency return differentials. When s = 0 (no household access to foreign assets), standard theory predicts that expenditure switching and real income channels dominate, yielding rising net exports — directly contradicting the data (Proposition 2). When s &amp;gt; 0, a &amp;ldquo;foreign credit channel&amp;rdquo; operates: following a regime-induced depreciation, expected future appreciation of the pegger currency makes foreign-currency borrowing cheaper, stimulating domestic consumption and investment, causing imports to rise more than exports (Proposition 3), consistent with the data.&lt;/p&gt;
&lt;p&gt;The model also accounts for unconditional exchange rate disconnect and the Mussa facts. Two shocks — UIP shocks (which generate a positive exchange rate–output correlation) and domestic discount factor shocks (which generate a negative correlation, since demand contractions lead to currency depreciations via monetary easing) — together produce a low unconditional correlation between exchange rates and output even though the conditional effect of regime-induced depreciation is large. The same logic explains why switching from fixed to floating exchange rates raises exchange rate volatility dramatically without raising macroeconomic volatility commensurately: pegging eliminates UIP shock exposure (reducing output volatility) but removes the ability to use monetary policy to offset discount factor shocks (raising output volatility), and these two effects roughly offset each other in the quantitative model.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-identification-strategy-and-what-assumption-is-required-for-it-to-yield-causal-estimates"&gt;Q1. What is the core identification strategy, and what assumption is required for it to yield causal estimates?&lt;/h3&gt;
&lt;p&gt;A1: The strategy compares macroeconomic outcomes in countries pegged to the US dollar versus countries floating against the US dollar when the US dollar&amp;rsquo;s value changes. The identifying assumption is that peggers are not differentially exposed (relative to floaters) to aggregate shocks that are correlated with the US dollar exchange rate. If this holds, the direct effects of shocks driving the US dollar move pegs and floats symmetrically and are absorbed by region-by-time fixed effects, leaving only the regime-induced component. Differential exposure to US dollar-correlated shocks is the main threat to identification, but the paper shows robustness by controlling for interactions of the peg indicator with US GDP growth, US inflation, US interest rate changes, commodity price changes, and the global financial cycle.&lt;/p&gt;
&lt;h3 id="q2-how-is-regime-induced-exchange-rate-variation-defined-and-how-large-is-it-relative-to-total-variation"&gt;Q2. How is &amp;ldquo;regime-induced&amp;rdquo; exchange rate variation defined, and how large is it relative to total variation?&lt;/h3&gt;
&lt;p&gt;A2: Regime-induced variation is the component of a country&amp;rsquo;s exchange rate change that arises from its pre-existing regime vis-à-vis the US dollar interacted with the change in the US dollar&amp;rsquo;s nominal effective exchange rate. It is identified via the interaction term Peg_i,t × ΔUSD_t in the local projection. This variation represents roughly 8% of total variation in exchange rates in the sample, so the strategy isolates a small but clean slice of total exchange rate movements.&lt;/p&gt;
&lt;h3 id="q3-how-do-nominal-and-real-effective-exchange-rates-respond-for-peggers-versus-floaters"&gt;Q3. How do nominal and real effective exchange rates respond for peggers versus floaters?&lt;/h3&gt;
&lt;p&gt;A3: In response to a 1% depreciation of the US dollar, the trade-weighted nominal effective exchange rate of peggers depreciates by 0.74% relative to floaters on impact, peaks around 0.9%, and then gradually declines to roughly 0.6% over years 3–5. The real effective exchange rate depreciates by a similar but slightly less persistent amount. The less-than-one-for-one response occurs because the classification includes imperfect pegs and imperfect floats; however, this misclassification attenuates both the first stage (exchange rate response) and the reduced form (output response) proportionally, so the ratio — the IV-style estimate — remains unbiased.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-quantitative-magnitude-of-the-output-effect-and-how-is-it-computed"&gt;Q4. What is the quantitative magnitude of the output effect, and how is it computed?&lt;/h3&gt;
&lt;p&gt;A4: In response to a 1% US dollar depreciation, GDP of peggers rises by approximately 0.4% relative to floaters, peaking after five years and building gradually. To express this as a response to a 10% local currency depreciation: the average nominal exchange rate response over the first five years is roughly 0.7%, so the implied GDP response per 10% depreciation is 10 × 0.4 ÷ 0.7 ≈ 5.5%. The Plaza Accord case study yields a similar magnitude: a 12% first-year real exchange rate differential is followed by a 7.4% differential in log GDP after five years, implying 6.2% per 10% depreciation.&lt;/p&gt;
&lt;h3 id="q5-why-does-the-behavior-of-net-exports-rule-out-the-expenditure-switching-mechanism-as-the-primary-driver"&gt;Q5. Why does the behavior of net exports rule out the expenditure-switching mechanism as the primary driver?&lt;/h3&gt;
&lt;p&gt;A5: Standard open-economy models predict that a depreciation improves competitiveness, boosting exports and reducing imports — generating an improvement in net exports as the engine of expansion. The paper finds the opposite: imports rise more than exports for several years following a regime-induced depreciation, so net exports fall. This is inconsistent with an export-led expenditure-switching boom. The finding is also inconsistent with the real income channel (as formalized in Proposition 2): even with s = 0, standard models predict rising net exports, but the data show the reverse.&lt;/p&gt;
&lt;h3 id="q6-why-does-the-behavior-of-interest-rates-rule-out-monetary-policy-easing-as-the-driver"&gt;Q6. Why does the behavior of interest rates rule out monetary policy easing as the driver?&lt;/h3&gt;
&lt;p&gt;A6: If the US dollar depreciated because of loose US monetary policy, countries with currencies pegged to the US dollar would share US monetary policy more strongly, and one would expect a relative decline in nominal interest rates for peggers. The opposite is found: the nominal interest rate of peggers rises slightly relative to floaters (by less than 0.1 percentage point per 1% depreciation), and the real interest rate response is statistically indistinguishable from zero throughout the nine-year horizon. This rules out the interpretation that the boom is driven by an easing of monetary conditions in the pegger countries.&lt;/p&gt;
&lt;h3 id="q7-what-are-ex-post-uip-deviations-and-what-do-they-imply-about-the-shock-driving-the-variation"&gt;Q7. What are ex-post UIP deviations, and what do they imply about the shock driving the variation?&lt;/h3&gt;
&lt;p&gt;A7: Ex-post UIP deviations measure the excess return to holding assets denominated in pegger currencies relative to floater currencies. After the initial depreciation of pegger currencies, those currencies subsequently appreciate and their nominal interest rates are (if anything) higher than floater interest rates. This means the ex-post return to holding pegger-currency assets is higher than for floater-currency assets — a positive UIP deviation that builds over several years after the shock. These deviations imply that the shocks driving the US dollar depreciation are financial in nature (UIP shocks), not changes in expected near-term monetary policy fundamentals.&lt;/p&gt;
&lt;h3 id="q8-what-is-the-foreign-credit-channel-and-how-does-it-work-in-the-model"&gt;Q8. What is the foreign credit channel, and how does it work in the model?&lt;/h3&gt;
&lt;p&gt;A8: The foreign credit channel (the second term in equation (18) of Proposition 1) operates through the cost of foreign-currency borrowing. When the pegger currency depreciates on impact and then is expected to appreciate subsequently, the exchange-rate-adjusted cost of borrowing in foreign currency falls — that is, expected future appreciation of the domestic currency reduces the real cost of foreign credit. To the extent that households have portfolio shares in foreign bonds (s &amp;gt; 0), this stimulates consumption via intertemporal substitution. The channel is operative only when s &amp;gt; 0; with s = 0 (no household access to foreign assets), net exports must rise rather than fall (Proposition 2), contradicting the data.&lt;/p&gt;
&lt;h3 id="q9-how-does-proposition-1-establish-that-real-interest-rates-and-real-exchange-rates-are-sufficient-statistics-for-the-relative-responses-of-all-macroeconomic-aggregates-in-this-setting"&gt;Q9. How does Proposition 1 establish that real interest rates and real exchange rates are sufficient statistics for the relative responses of all macroeconomic aggregates in this setting?&lt;/h3&gt;
&lt;p&gt;A9: Under Assumption 1 (pegs to the US dollar and pegs to the euro face symmetric non-monetary fundamental shocks), the relative responses of consumption, output, exports, and imports of USD-peggers versus euro-peggers are functions only of the relative path of the real interest rate and the real effective exchange rate. This is because the underlying shocks to the US economy and the Euro Area economy are common to both groups of peggers and cancel out in the comparison. The monetary regime of a country is fully summarized by the paths of the real interest rate and the real exchange rate. Since the estimated relative real interest rate response is close to zero, the paper infers that the observed output differential must arise from the real exchange rate path — hence the title.&lt;/p&gt;
&lt;h3 id="q10-why-does-the-output-response-differ-by-capital-account-openness-but-not-by-trade-openness"&gt;Q10. Why does the output response differ by capital account openness but not by trade openness?&lt;/h3&gt;
&lt;p&gt;A10: The GDP response to a regime-induced depreciation is entirely driven by countries with above-median capital account openness (Chinn-Ito index). Countries below the median show a similar real exchange rate response but no significant output response. In contrast, splitting by trade openness (exports plus imports as a share of GDP) yields similar output responses in both sub-groups. This pattern is consistent with the model&amp;rsquo;s foreign credit channel, which operates through international capital flows (the parameter s representing financial openness). Countries with restricted capital accounts cannot borrow cheaply from abroad when their currencies become &amp;ldquo;cheap,&amp;rdquo; so the foreign credit channel is shut down. The result is inconsistent with the expenditure-switching channel, which would predict larger effects for more trade-open economies.&lt;/p&gt;
&lt;h3 id="q11-what-is-the-sector-composition-of-the-output-boom-and-what-does-it-imply-about-the-transmission-mechanism"&gt;Q11. What is the sector composition of the output boom, and what does it imply about the transmission mechanism?&lt;/h3&gt;
&lt;p&gt;A11: The bulk of the output response is concentrated in the service sector. Manufacturing, agriculture, and the mining/construction/energy sectors show responses close to zero, with only a modest boom in the latter at very long horizons. Services are predominantly non-tradable, so this sectoral pattern is consistent with a domestic demand-led boom (via the foreign credit channel) rather than an export-led boom (via expenditure switching on tradable goods). The foreign credit channel stimulates domestic demand broadly, which disproportionately raises output in the non-tradable sector.&lt;/p&gt;
&lt;h3 id="q12-how-does-the-model-reconcile-large-conditional-effects-of-exchange-rates-with-unconditional-exchange-rate-disconnect"&gt;Q12. How does the model reconcile large conditional effects of exchange rates with unconditional exchange rate disconnect?&lt;/h3&gt;
&lt;p&gt;A12: The paper introduces two shocks: UIP shocks (ψ_t) and domestic discount factor shocks (β_t). UIP shocks cause the exchange rate to depreciate and output to rise (a positive conditional correlation). Discount factor shocks reduce domestic demand; monetary policy responds by lowering interest rates, which depreciates the exchange rate, but if the monetary response is insufficient to fully offset the shock, output falls — generating a negative conditional correlation between the exchange rate and output. The unconditional correlation between the exchange rate and output is a weighted average of these two conditional correlations. If these effects are of similar magnitude and opposite sign, the unconditional correlation can be close to zero even though each structural shock generates a large conditional response. This is directly analogous to how supply and demand shocks can generate a small unconditional price-quantity correlation in a standard market setting.&lt;/p&gt;
&lt;h3 id="q13-how-does-the-model-provide-a-new-interpretation-of-the-mussa-fact"&gt;Q13. How does the model provide a new interpretation of the Mussa fact?&lt;/h3&gt;
&lt;p&gt;A13: The Mussa fact is that the collapse of Bretton Woods dramatically increased the volatility of real exchange rates in countries that switched to floating, without a corresponding increase in macroeconomic volatility. In the model, pegging has two opposing effects on output volatility: it insulates the economy from UIP shocks (reducing output volatility) but prevents the use of monetary policy to offset discount factor shocks (raising output volatility). In the quantitative model (Appendix D), these effects roughly offset each other, so moving from a peg to a float raises exchange rate volatility substantially while leaving macroeconomic volatility roughly unchanged — consistent with the Mussa fact. This contrasts with the Itskhoki-Mukhin interpretation, which attributes Mussa facts to exchange rates (driven by UIP shocks) having little effect on output; in the present paper, the conditional effects are large but cancel in the unconditional moments.&lt;/p&gt;
&lt;h3 id="q14-what-does-the-paper-imply-for-the-tradeoffs-associated-with-adopting-a-fixed-versus-flexible-exchange-rate-regime"&gt;Q14. What does the paper imply for the tradeoffs associated with adopting a fixed versus flexible exchange rate regime?&lt;/h3&gt;
&lt;p&gt;A14: Traditional analyses of the monetary trilemma emphasize that pegging to the US dollar forces a country to follow US interest rate policy. The paper argues that a first-order consequence of pegging — one that may outstrip the traditional monetary policy tradeoff in importance — is that the country imports the financial shocks (UIP shocks) that drive the US exchange rate while potentially reducing its exposure to home-grown financial shocks. When the US dollar depreciates due to financial shocks, pegger countries experience a stimulatory foreign credit inflow. Conversely, when the US dollar appreciates due to financial shocks, pegger countries face tighter financial conditions. The importance of this financial shock trade-off, the paper argues, may greatly exceed the importance of the traditional monetary trilemma in environments where financial shocks are a dominant driver of exchange rate fluctuations.&lt;/p&gt;
&lt;h3 id="q15-how-does-the-paper-handle-the-potential-concern-that-the-peg-classification-is-imperfect"&gt;Q15. How does the paper handle the potential concern that the peg classification is imperfect?&lt;/h3&gt;
&lt;p&gt;A15: The paper notes that misclassification of pegs and floats attenuates both the exchange rate response (first stage) and the output response (reduced form) proportionally. Since the ultimate quantity of interest is the ratio of the output response to the exchange rate response (analogous to an IV estimate), misclassification in both the numerator and denominator does not introduce bias. This is analogous to an instrumental variables regression where the first stage need not have a high R-squared for the IV estimate to be valid. The paper also shows robustness to alternative treatments of the ambiguous intermediate categories (Ilzetzki-Reinhart-Rogoff coarse category 3), including them as pegs or floats, with similar results in both cases.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Regime-induced depreciation&lt;/strong&gt;: A change in a country&amp;rsquo;s bilateral exchange rate that arises specifically because the country has a pre-existing peg (or float) to a reference currency, and that reference currency&amp;rsquo;s value changes in world markets. The variation is defined as the component of a country&amp;rsquo;s exchange rate movement driven by the interaction between its exchange rate regime vis-à-vis the US dollar and changes in the US dollar&amp;rsquo;s nominal effective exchange rate. This is distinguished from all other exchange rate variation, including that driven by domestic idiosyncratic shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Foreign credit channel&lt;/strong&gt;: The mechanism in the paper&amp;rsquo;s model through which a regime-induced depreciation stimulates domestic demand. When the domestic currency depreciates on impact and is expected to appreciate subsequently, the exchange-rate-adjusted cost of borrowing in foreign currency falls. Households with portfolio shares in foreign bonds (s &amp;gt; 0) borrow more cheaply from abroad, stimulating consumption via intertemporal substitution. This channel requires imperfect financial openness (s &amp;gt; 0 but not full UIP arbitrage) and predicts that the output boom is domestic-demand-led with falling net exports — consistent with the data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;UIP shock (ψ_t)&lt;/strong&gt;: An exogenous shock to uncovered interest parity between the US dollar and the euro, interpreted as arising from frictions in international financial markets or from exogenous shifts in demand for one currency over another. A positive ψ_t represents an increase in demand for the euro (relative to the US dollar), depreciating the US dollar. These shocks are the paper&amp;rsquo;s preferred interpretation of the financial shocks driving the US dollar exchange rate, consistent with the observed joint behavior of exchange rates and interest rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Imperfect financial openness (parameter s)&lt;/strong&gt;: The share of household savings invested in foreign bonds. At s = 0, households have no access to foreign assets (as in Gabaix-Maggiori and Itskhoki-Mukhin); at full financial integration with UIP holding (ψ_t = 0), there is no foreign credit channel. The paper&amp;rsquo;s model is intermediate: s &amp;gt; 0 but portfolio weights are sticky, so households do not fully arbitrage cross-currency expected return differentials. The foreign credit channel is operative only when s &amp;gt; 0, and the strength of the output boom is increasing in s/σ (the ratio of financial openness to the coefficient of relative risk aversion).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sufficient statistics (real interest rate and real exchange rate)&lt;/strong&gt;: Under Proposition 1, conditional on Assumption 1 (symmetric non-monetary fundamental shocks across pegger groups), the relative responses of all macroeconomic aggregates for peggers to the US dollar versus peggers to the euro are functions only of the relative path of the real effective exchange rate and the relative path of the real interest rate. The full set of underlying shocks — monetary, financial, productivity, or discount factor — does not need to be separately identified; only the paths of these two prices matter for relative macroeconomic outcomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exchange rate disconnect&lt;/strong&gt;: The empirical finding, documented extensively since Meese and Rogoff (1983), that exchange rates have very low unconditional correlations with macroeconomic aggregates such as output and consumption. In the paper&amp;rsquo;s sample, real exchange rates of floating countries are three to four times more volatile than GDP and consumption, and the unconditional correlation of the real exchange rate with GDP is mildly negative (around −0.05 to −0.07). The paper offers a new explanation: this low unconditional correlation reflects the cancellation of large but opposite-signed conditional correlations from UIP shocks and discount factor shocks, rather than indicating that exchange rates have small effects on the economy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mussa fact&lt;/strong&gt;: The empirical observation (Mussa, 1986) that when countries switched from fixed to floating exchange rates after the collapse of Bretton Woods, real exchange rate volatility increased dramatically — for floaters roughly 50–60% higher standard deviation in the paper&amp;rsquo;s sample than for peggers — but the volatility of GDP, consumption, and other macroeconomic aggregates did not increase correspondingly. The paper interprets this through its two-shock model as the result of two opposing effects of pegging: insulation from UIP shocks (which reduces macroeconomic volatility) versus inability to use monetary policy to offset discount factor shocks (which raises macroeconomic volatility), with the two effects roughly offsetting in the quantitative model.&lt;/p&gt;</description></item><item><title>The Macroeconomic Impact of Climate Change: Global Versus Local Temperature</title><link>https://macropaperwarehouse.com/papers/the-macroeconomic-impact-of-climate-change-global-versus-local-temperature/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-macroeconomic-impact-of-climate-change-global-versus-local-temperature/</guid><description>&lt;p&gt;The paper shows that the macroeconomic impact of climate change is &lt;strong&gt;an order of magnitude larger&lt;/strong&gt; than what standard country-level panel estimates suggest. The key identification innovation is to measure the effect of global mean temperature shocks using time-series local projections, rather than using cross-country variation in local temperatures as in the conventional panel literature. A shock to global mean temperature tracks extreme weather events (droughts, heat waves, wind, precipitation anomalies) that affect all countries simultaneously; a local temperature anomaly in one country does not.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical approach&lt;/strong&gt;: The authors estimate local projections of world GDP growth on exogenous global mean temperature shocks. The shock is the innovation to global mean temperature after removing a 2-year autoregressive component and a low-frequency trend, following Hamilton (2018). Two estimation samples: &lt;strong&gt;BU&lt;/strong&gt; (Barro-Ursúa macro history, 43 countries, 1860–2019) and &lt;strong&gt;PWT&lt;/strong&gt; (Penn World Tables, 173 countries, 1960–2019).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key empirical results&lt;/strong&gt; (Section 3): A 1°C shock to global mean temperature causes world GDP to fall by &lt;strong&gt;14% after 6 years&lt;/strong&gt; in the PWT sample (95% CI: 6%–22%); significant at the 5% level in years 2–8; does not mean-revert within the 10-year sample horizon. In the BU sample, the peak GDP decline is &lt;strong&gt;18% after 5 years&lt;/strong&gt; (95% CI: 6%–30%). Converting the cumulative IRF ratio to a permanent temperature change yields a &lt;strong&gt;22–34% long-run GDP decline per 1°C&lt;/strong&gt; of permanent global warming (PWT and BU respectively). By contrast, local temperature shocks — estimated from a standard cross-country panel with country and year fixed effects — generate effects of &lt;strong&gt;1–3% per °C&lt;/strong&gt;, not statistically significant at the 5% level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why global &amp;gt; local&lt;/strong&gt; (Section 4): Four categories of extreme climatic events (heat waves, droughts, wind, precipitation anomalies) jointly account for roughly &lt;strong&gt;half&lt;/strong&gt; of the estimated global temperature effect on GDP. None of these are strongly correlated with local temperature anomalies because extreme weather reflects ocean-atmosphere dynamics (El Niño/ENSO) that elevate global mean temperature rather than any single country&amp;rsquo;s local temperature. In addition, capital and investment both decline persistently after global temperature shocks (capital response significant at 5% level), and warm/low-income countries are disproportionately affected.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Structural model&lt;/strong&gt; (Section 5): A parsimonious neoclassical growth model embeds climate change as aggregate TFP changes. Households maximize ∫e^{−ρt}U(C_t)dt; firms use Cobb-Douglas technology Z_t K_t^α L_t^{1−α}. The damage function governing TFP is:&lt;/p&gt;
&lt;p&gt;Z_t = Z_0 exp( ∫&lt;em&gt;0^t ζ_s T̂&lt;/em&gt;{t−s} ds )&lt;/p&gt;
&lt;p&gt;where T̂_t is excess global mean temperature above baseline and ζ_s = A(e^{−Bs} − e^{−Cs}) is the structural damage function. When ζ_s → 0, shocks have level but not growth effects; no statistically significant evidence of growth effects is found in Figure 3 of the paper. The model is calibrated with: risk aversion γ = 1 (log utility), capital share α = 0.33, annual capital depreciation δ = 0.08, and pure time preference ρ = 0.02. &lt;strong&gt;Proposition 1&lt;/strong&gt; (model inversion) shows that, to first order, ŷ_t = ẑ_t + α ∫K_{t,s} ẑ_s ds, where K_{t,s} is the sequence-space Jacobian of the neoclassical growth model. This delivers identification: observed output impulse responses recover the structural TFP damage function ζ_s without imposing functional form on the capital channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Estimation results&lt;/strong&gt; (Section 5.3, Figure 12): The estimated damage function implies a &lt;strong&gt;4% peak short-run productivity decline 2 years after&lt;/strong&gt; a 1°C transitory global temperature shock; the effect decays slowly and remains significant for up to 10 years. The capital response (non-targeted moment) closely matches its empirical counterpart, providing an overidentification check. The local temperature damage function, estimated by targeting the local-panel output IRF, peaks at only &lt;strong&gt;0.5%&lt;/strong&gt; and is &lt;strong&gt;more than 8× smaller&lt;/strong&gt; in cumulative productivity effect; it is not statistically different from zero at the 5% level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Business-as-usual counterfactual&lt;/strong&gt; (Section 6.1–6.2): Temperature rises from 2024, reaching &lt;strong&gt;3°C above preindustrial by 2100&lt;/strong&gt; (asymptoting to 3.3°C), equivalent to 2°C of additional warming since 2024. Under the global temperature damage function:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;World output by 2050: &lt;strong&gt;−28%&lt;/strong&gt; vs. no-warming baseline&lt;/li&gt;
&lt;li&gt;World output by 2100: &lt;strong&gt;−53%&lt;/strong&gt; (accumulated TFP losses reach −40%)&lt;/li&gt;
&lt;li&gt;Capital by 2100: &lt;strong&gt;−51%&lt;/strong&gt; (investment initially rises as households anticipate lower permanent income, then decumulates rapidly)&lt;/li&gt;
&lt;li&gt;Consumption by 2100: &lt;strong&gt;−53%&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;2024 welfare loss (consumption equivalent): &lt;strong&gt;35%&lt;/strong&gt;; welfare continues declining as temperatures rise, eventually reaching &lt;strong&gt;56%&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;95% CI for 2100 output loss: &lt;strong&gt;29%–77%&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;All effects statistically significant at the 5% level&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Under the local temperature damage function with the same warming scenario: long-run output declines only &lt;strong&gt;9%&lt;/strong&gt;, welfare loss is &lt;strong&gt;5%&lt;/strong&gt;, and neither is statistically significant at the 5% or 10% level — consistent with conventional estimates (Nordhaus 1992, Dell et al. 2012, Burke et al. 2015).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Social Cost of Carbon&lt;/strong&gt; (Section 6.2, Panel F): The SCC is defined as the consumption-equivalent amount households would pay at time 0 to avoid one additional ton of CO2, using the temperature-response function from Dietz et al. (2021a). Baseline result: &lt;strong&gt;$1,207 per ton&lt;/strong&gt; (2024 international dollars), more than &lt;strong&gt;6× larger&lt;/strong&gt; than the $185/ton estimate in Rennert et al. (2022). 95% CI: &lt;strong&gt;$399–$2,015 per ton&lt;/strong&gt;. Climate sensitivity range (half/double median): &lt;strong&gt;$600–$2,400 per ton&lt;/strong&gt;. BU sample (larger damage functions): &lt;strong&gt;&amp;gt;$1,500 per ton&lt;/strong&gt;. Using the local temperature damage function yields an SCC of only &lt;strong&gt;$149/ton&lt;/strong&gt;, consistent with conventional estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sensitivity&lt;/strong&gt; (Section 6.4): Higher time preference ρ &amp;gt; 0.04 lowers welfare losses below 20% and the SCC below 3× conventional high-end estimates — the only scenario where results converge toward prior estimates. Near-Stern discount rates (ρ → 0): welfare loss &amp;gt;40% and SCC &amp;gt;$2,500/ton. A 6°C-by-2100 scenario yields welfare losses &amp;gt;60%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Historical growth accounting&lt;/strong&gt; (Section 6.3): Starting the model in 1960 and imposing the realized 1960–2019 warming path, then holding temperature constant at its 2019 level, reveals:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;World GDP per capita would be &lt;strong&gt;25% higher today&lt;/strong&gt; without warming since 1960&lt;/li&gt;
&lt;li&gt;By 2040, output is &lt;strong&gt;32% below potential&lt;/strong&gt; from past warming — one-quarter of losses from historical warming are yet to materialize (due to delayed damage function and transitional capital dynamics)&lt;/li&gt;
&lt;li&gt;Climate change reduced the annual world growth rate by as much as &lt;strong&gt;a third of baseline&lt;/strong&gt; by the 21st century&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Policy implication&lt;/strong&gt;: Most decarbonization interventions cost ~$80/ton on average (Bistline et al. 2023). Under conventional SCC estimates based on local temperature ($149/ton), the US Domestic Climate Cost (DCC) falls below policy cost, making unilateral emissions reduction prohibitively expensive. Under the paper&amp;rsquo;s global temperature SCC of $1,207/ton, the DCC of the United States exceeds $80/ton even accounting for the fraction of global climate benefits that accrue domestically — &lt;strong&gt;unilateral decarbonization becomes cost-effective for large economies such as the US&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions&lt;/strong&gt;: The neoclassical model abstracts from adaptation, mitigation, trade, urbanization, and endogenous emissions. The identification assumption requires that global mean temperature innovations are uncorrelated with other global economic confounders at business-cycle and trend frequencies; the paper checks robustness against alternative detrending, exclusion of WWII and COVID-19 years, El Niño/ENSO controls, and instrumental variables for temperature based on solar/volcanic forcing. The conversion from medium-run to long-run effects relies on the constrained ζ_s = A(e^{−Bs} − e^{−Cs}) functional form ruling out growth effects — consistent with the data but not formally testable beyond the 10-year horizon. Counterfactuals involve 2–3°C temperature changes substantially beyond the sample&amp;rsquo;s moderate perturbations; the model&amp;rsquo;s extrapolation may understate damages if nonlinearities exist at extreme temperatures (the authors note their conservative constrained-form approach).&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-why-do-global-temperature-shocks-produce-gdp-effects-an-order-of-magnitude-larger-than-local-temperature-panel-estimates"&gt;Q1. Why do global temperature shocks produce GDP effects an order of magnitude larger than local temperature panel estimates?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Global mean temperature shocks are strongly correlated with extreme weather events — heat waves, droughts, wind storms, and precipitation anomalies — that simultaneously affect all countries; these four event categories jointly account for roughly half of the global temperature effect on GDP.&lt;/strong&gt; Local temperature anomalies in a given country (as measured in standard cross-country panels with year fixed effects absorbed) are not correlated with these same events, because El Niño/ENSO and related ocean-atmosphere dynamics elevate global mean temperature without proportionally elevating any one country&amp;rsquo;s local temperature. Local panel studies also implicitly allow economic activity to shift toward cooler regions within a given year — an option unavailable when global warming affects all locations simultaneously. The resulting bias in local-panel estimates is not &amp;ldquo;aggregation bias&amp;rdquo; in the sense of Jensen&amp;rsquo;s inequality, but rather an identification problem: local panels identify a different object (the effect of temperature relative to other countries in the same year) rather than the aggregate climate impact the paper measures.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-identification-strategy-and-what-are-the-main-threats"&gt;Q2. What is the identification strategy and what are the main threats?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The global temperature shock is identified as the innovation to global mean temperature after removing a 2-year AR component and a Hamilton (2018) low-frequency trend, yielding a shock orthogonal to its own recent history and to long-run trends.&lt;/strong&gt; The main threats are: (i) global business-cycle confounders (worldwide recessions that simultaneously lower activity and emissions), addressed by controlling for quadratic time trends and global aggregate demand proxies; (ii) reverse causality (economic expansion warming the atmosphere), addressed by IV estimates using solar/volcanic forcing as instruments; (iii) low-frequency correlation between climate trends and productivity growth, addressed by flexible detrending and robustness to sample period. All major specification checks generate quantitatively similar results, and the paper passes placebo tests for large global confounders (WWII, COVID-19).&lt;/p&gt;
&lt;h3 id="q3-how-does-the-structural-model-translate-medium-run-shock-responses-into-long-run-warming-effects"&gt;Q3. How does the structural model translate medium-run shock responses into long-run warming effects?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Proposition 1 (model inversion) shows that the output impulse response decomposes into a direct TFP effect ẑ_t and a capital channel ŷ_t = ẑ_t + α ∫K_{t,s} ẑ_s ds, where K_{t,s} is the sequence-space Jacobian of the neoclassical growth model (Auclert et al. 2021); this allows recovery of the structural TFP damage function {ζ_s} from the observed 10-year output IRF by non-linear least squares, without having to observe TFP directly.&lt;/strong&gt; The counterfactual for a gradually rising temperature path (BAU scenario with 2°C additional warming since 2024) is then solved via the full nonlinear model — not via the log-linearization used in estimation — because the 2–3°C excursion far exceeds the sample&amp;rsquo;s modest temperature perturbations. The capital response (non-targeted moment) closely tracks its empirical counterpart, providing a strong overidentification check that the model&amp;rsquo;s capital dynamics are correctly specified.&lt;/p&gt;
&lt;h3 id="q4-why-does-capital-initially-rise-in-the-bau-counterfactual-before-declining"&gt;Q4. Why does capital initially rise in the BAU counterfactual before declining?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Following standard permanent-income logic, when households learn at date 0 that global temperatures will rise and future TFP will fall, they temporarily increase saving and investment to accumulate buffer capital before the productivity decline materializes; this front-loads some capital accumulation in the early transition years (2024–2030s), briefly pushing capital above baseline, before the accumulated TFP losses overwhelm the saving motive and capital begins an extended decline.&lt;/strong&gt; The net effect is still a 51% capital shortfall by 2100 because persistently lower TFP reduces the marginal product of capital over decades, depressing investment and allowing the capital stock to drift far below its no-warming balanced growth path.&lt;/p&gt;
&lt;h3 id="q5-how-is-the-social-cost-of-carbon-defined-and-computed"&gt;Q5. How is the Social Cost of Carbon defined and computed?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The SCC is defined as the dollar amount C such that households are indifferent between (a) a world where one additional ton of CO2 is emitted at time 0 and (b) a world in steady-state where the household has paid C at time 0 (equation 7: V^{ss}(K^{ss} − C) = V^{SCC}_0(K^{ss})).&lt;/strong&gt; The temperature response to a 1-ton CO2 pulse is taken from Dietz et al. (2021a) — temperature peaks at 0.002°C after a 1-gigaton pulse and stabilizes. The model generates the productivity path {Z^{SCC}_t} via the structural damage function, solves for equilibrium capital and consumption paths, and computes the value function V^{SCC}_0. The resulting $1,207/ton exceeds prior estimates by 6× because the global-temperature damage function implies 4% peak TFP losses per 1°C transitory shock, compared to the ~0.5% peak implied by local temperature — and the SCC is essentially the capitalized sum of these future productivity losses, so the ratio scales proportionally.&lt;/p&gt;
&lt;h3 id="q6-why-are-historical-climate-losses-so-large-if-year-to-year-warming-is-small"&gt;Q6. Why are historical climate losses so large if year-to-year warming is small?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The key is cumulation: annual warming increments are individually small (tenths of a degree), but the damage function {ζ_s} is persistent (effects last 10+ years), so each year&amp;rsquo;s increment adds a flow of persistent TFP losses that stack on top of prior increments.&lt;/strong&gt; The paper&amp;rsquo;s growth accounting shows that climate change reduced the world growth rate by up to one-third of baseline in the 21st century — a number that appears modest in any single year but, compounded over decades, translates into a 25% GDP per capita shortfall by 2019. Additionally, because the estimated damage function has a 2-year lag before peak TFP impact, a substantial share of past warming&amp;rsquo;s losses are yet to be realized — the paper estimates GDP will be 32% below its potential by 2040 even with no further warming.&lt;/p&gt;
&lt;h3 id="q7-what-does-the-sensitivity-analysis-reveal-about-the-robustness-of-the-results"&gt;Q7. What does the sensitivity analysis reveal about the robustness of the results?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The key sensitivity is the rate of time preference ρ: at ρ = 0.02 (baseline, consistent with secular interest rate decline), welfare loss is 35%; at ρ = 0.04 (above recent market rates), welfare loss is still above 20%; only at implausibly high discount rates does the welfare loss fall below 15%.&lt;/strong&gt; The SCC is more sensitive to ρ than welfare because the SCC is a capitalized stock valuation while welfare is an annualized flow. BU sample damage functions (larger IRF) raise welfare loss to 42% and 2100 GDP loss to 61%; these represent the high end of the estimates. The climate sensitivity range ($600–$2,400/ton for the SCC) reflects uncertainty in the physics of CO2-to-temperature conversion, not in the estimated economic damage function. Across all these dimensions, the global-temperature estimates remain order-of-magnitude larger than local-temperature estimates.&lt;/p&gt;
&lt;h3 id="q8-what-is-the-policy-implication-for-large-economies-considering-unilateral-decarbonization"&gt;Q8. What is the policy implication for large economies considering unilateral decarbonization?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The domestic decarbonization test compares the Domestic Climate Cost (DCC) — the fraction of the global SCC that accrues to the decarbonizing country — against the marginal cost of abatement (~$80/ton average, Bistline et al. 2023).&lt;/strong&gt; Under conventional local-temperature estimates ($149/ton global SCC), the US DCC falls below $80/ton, implying unilateral action destroys domestic value. Under the paper&amp;rsquo;s $1,207/ton global SCC, the US DCC comfortably exceeds $80/ton even if the US only captures a fraction of world welfare gains — because global temperature extremes (hurricanes, heat waves, droughts) strike the US directly, the DCC/SCC ratio is much higher than under local estimates where the US appears less exposed. This fundamentally changes the cost-benefit calculus for large-economy unilateral climate policy.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;global mean temperature shock&lt;/strong&gt;: a time-series innovation to world average surface temperature, identified by Hamilton (2018) detrending; captures ocean-atmosphere climate variability (El Niño/ENSO) correlated with extreme weather events affecting all countries simultaneously; the paper&amp;rsquo;s key identification variable, distinct from local temperature variation used in standard cross-country panels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;global vs. local temperature effect&lt;/strong&gt;: the paper&amp;rsquo;s central finding that the GDP effect per 1°C global mean temperature shock (14–18%) is an order of magnitude larger than the effect per 1°C local temperature shock (1–3%); the gap is explained by extreme climatic events (heat waves, droughts, wind, precipitation) that co-move with global mean temperature but not with individual countries&amp;rsquo; local temperatures.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;structural damage function&lt;/strong&gt; (ζ_s): the kernel relating excess global mean temperature T̂_{t−s} to log TFP at time t, specified as ζ_s = A(e^{−Bs} − e^{−Cs}); estimated from the PWT output impulse response via model inversion (Proposition 1); implies a 4% peak TFP loss 2 years after a 1°C transitory shock, decaying slowly over 10 years; rules out permanent growth effects consistent with the data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Social Cost of Carbon&lt;/strong&gt; (SCC): the one-time dollar amount households would pay at time 0 to avoid one additional ton of CO2; equals (in the linear limit) the present discounted value of all flow consumption-equivalent welfare losses from the induced warming; paper estimates $1,207/ton (2024 international dollars), more than 6× prior estimates, because the global-temperature damage function implies much larger per-degree productivity losses than local-temperature estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;committed climate losses&lt;/strong&gt;: future GDP shortfalls already locked in by past warming, arising because the estimated damage function has a delayed peak (year 2) and slow decay (10+ years) — temperature rises in recent years continue reducing productivity for the following decade; the paper estimates these committed losses alone will lower GDP 32% below potential by 2040 even with temperature held constant at 2019 levels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;BAU scenario&lt;/strong&gt;: the business-as-usual warming path used for the main counterfactual — global mean temperature reaches 3°C above preindustrial by 2100 (asymptoting to 3.3°C), implying 2°C of additional warming since the 2024 baseline; under this scenario the model implies 53% GDP loss, 51% capital loss, 53% consumption loss, and a 35% consumption-equivalent welfare loss by 2100.&lt;/p&gt;</description></item><item><title>The Margins of Trade</title><link>https://macropaperwarehouse.com/papers/the-margins-of-trade/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-margins-of-trade/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Eaton and Fieler seek to reconcile two literatures that have advanced in parallel but remained at odds: (i) general equilibrium models of bilateral trade flows (the &amp;ldquo;gravity&amp;rdquo; tradition) and (ii) empirical work on the margins of trade — the decomposition of bilateral trade into the extensive margin (number of products traded), the quantity margin (physical volumes), and the unit-value (price) margin. Standard GE models cannot accommodate two of the most robust empirical regularities: that richer importing countries pay higher unit values for the same product, and that richer exporting countries charge higher unit values. The paper builds a framework that captures all three margins jointly while still delivering the standard gravity equation and the Arkolakis-Costinot-Rodriguez-Clare (ACR) welfare formula.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The analysis uses UN COMTRADE bilateral merchandise trade data for the 50 largest economies by GDP in 2007, the most disaggregated 6-digit HS product classification (HS6). The working sample covers 2,611,700 importer-exporter-HS6 triads representing US $9.62 trillion of trade. Country characteristics (GDP, population) come from the World Development Indicators; geographical variables from CEPII.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Regularities Addressed&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In a standard gravity decomposition of bilateral trade, the elasticities of total trade value with respect to importer and exporter GDP are both approximately 1.1 and the distance elasticity is approximately −0.81. Decomposing total value into its extensive, quantity, and price margins reveals: (i) the extensive margin of exporters rises strongly with exporter GDP (elasticity 0.76) but the corresponding importer extensive margin is much smaller (0.34), contrary to what the Eaton-Kortum (2002) model predicts; (ii) the unit-value margin rises with both importer GDP per capita (elasticity approximately 0.13 in product-level regressions controlling for exporter-product fixed effects) and exporter GDP per capita (elasticity approximately 0.22 controlling for importer-product fixed effects); (iii) there is no significant interaction between importer and exporter per capita income in bilateral trade values (coefficient 0.002, statistically insignificant), rejecting the Linder-type prediction from one-dimensional quality models that rich countries disproportionately sell to other rich countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Building on the Ricardian EK framework with a continuum of varieties, CES aggregation, and perfect competition, the paper introduces two dimensions of quality:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Vertical quality&lt;/strong&gt; (q) complements quantity: as spending on a variety increases, both physical quantity and vertical quality rise. This drives the positive relationship between importer per capita income and unit values, because buyers in richer (higher-wage) countries optimally demand higher vertical quality.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Horizontal quality&lt;/strong&gt; (Q) perfectly substitutes for quantity and is determined by the producing country&amp;rsquo;s endowment of intermediates per worker. Because a better-equipped worker produces higher horizontal quality, this dimension rises with the exporter&amp;rsquo;s wage, explaining why richer countries charge higher unit values.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The model uses Fréchet-distributed productivities as in EK. Despite the non-homothetic intricacies introduced by the two quality dimensions, the trade-share equation is identical to EK&amp;rsquo;s homothetic formulation, and the welfare formula takes the standard ACR form with the elasticity of real income with respect to the home trade share equal to −1/(α̃θ).&lt;/p&gt;
&lt;p&gt;To accommodate the extensive margin, the paper introduces stochastic minimum shipment sizes: small-value flows are observed probabilistically, generating zeros in the trade matrix. Products are treated as bundles of varieties, and the number of varieties per product follows a discretized Weibull distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Estimation and Key Parameter Values&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Multilateral resistance terms (Φ) are estimated from bilateral trade flow regressions. Using product-level unit values and wages/Φ estimates, the authors estimate three structural parameters: γ = 0.13 (cost elasticity of vertical quality, governing how spending splits between quantity and price), ν = 0.22 (elasticity of horizontal quality with respect to intermediate use), and θ = 4 (Fréchet shape parameter, calibrated from the literature as it is imprecisely identified from prices). From IV regressions of product-level spending on unit values — instrumenting a given destination&amp;rsquo;s price with the same exporter&amp;rsquo;s average price to other destinations — the implied demand elasticity with respect to price is −2.83, and the corresponding β (governing the distribution of the structural error across varieties) is estimated at 0.65. Three shipment-size parameters (λ₁ = 2.26×10⁻⁷, λ₂ = 0.042, λ₃ = 0.48) are fitted to match the observed bilateral extensive margins (R-squared 0.79).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A simulation of five million varieties, aggregated into approximately 3,807 traded products, reproduces the key margins of trade in the data. The model with only seven parameters (γ, ν, θ, β, λ₁, λ₂, λ₃) captures: (i) the positive relationship between unit values and both importer and exporter per capita income; (ii) the concave relationship between GDP and the extensive margin (leveling off for large countries); (iii) the standard gravity elasticities of bilateral trade on GDP and distance. Two discrepancies remain: the model understates the effects of per capita income on the extensive margin (shifting them toward the quantity margin), and it does not generate the Alchian-Allen distance effect on unit values.&lt;/p&gt;
&lt;p&gt;Disaggregation to the level of 15 HS sections confirms the pooled results: 80% of HS6 products show positive importer-income elasticities and 94% show positive exporter-income elasticities of unit values. Although section-specific γ and ν estimates are formally rejected to be equal (χ²(28) = 1,249 against a critical value of 41), the implied improvement in model fit is only 3.6% of the total sum of squared residuals, vindicating the aggregate approach.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-are-the-two-dimensions-of-quality-in-the-model-and-why-does-the-paper-require-both"&gt;Q1. What are the two dimensions of quality in the model, and why does the paper require both?&lt;/h3&gt;
&lt;p&gt;A1: Horizontal quality (Q) substitutes perfectly for physical quantity and is valued identically by all users — a worker equipped with more intermediates produces goods of higher horizontal quality. Vertical quality (q) complements quantity: the isobeneft surface requires a CES aggregator with ρ &amp;lt; 0 (elasticity of substitution below one between effective quantity and vertical quality) so that a buyer spending more on a variety optimally raises both the amount and the vertical quality dimension. One dimension of quality is insufficient: with a single dimension, if rich countries both produce and prefer higher-quality goods, market shares of rich exporters should be systematically higher in rich importing destinations than in poor ones. No such interaction is found in the data (the interaction coefficient in bilateral trade regressions is 0.002, statistically insignificant), requiring the two-dimension structure to break the link.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-model-predict-that-unit-values-rise-with-importer-per-capita-income"&gt;Q2. How does the model predict that unit values rise with importer per capita income?&lt;/h3&gt;
&lt;p&gt;A2: The optimal spending on vertical quality for any variety is governed by the elasticity γ/(1+γ): as a buyer&amp;rsquo;s wage rises, spending on any variety rises, and the fraction of that spending that goes to higher unit values (price) has elasticity γ/(1+γ) with respect to spending. The structural elasticity of unit values with respect to the importer wage is δ_{w,M} = γ/(1+γ). With γ = 0.13, this equals approximately 0.115, close to the empirically estimated importer per capita income elasticity of 0.12–0.13 from product-level regressions with exporter-product fixed effects. Richer importers also face a higher price index Φ (lower competition), contributing an additional negative elasticity δ_{Φ,M} = −1/[θ(1+γ)] on Φ, reinforcing the unit-value–income gradient.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-model-predict-that-unit-values-rise-with-exporter-per-capita-income"&gt;Q3. How does the model predict that unit values rise with exporter per capita income?&lt;/h3&gt;
&lt;p&gt;A3: In the model, horizontal quality Q is produced by equipping workers with intermediates: Q = m^ν where ν &amp;gt; 0 and m is intermediate use per worker. Because m is determined by the optimal factor mix and rises with the wage (w), horizontal quality rises endogenously with the exporter&amp;rsquo;s wage. The structural elasticity of unit values with respect to the exporter wage is δ_{w,X} = ν/(1+γ). With ν = 0.22 and γ = 0.13, this equals approximately 0.195, consistent with the estimated exporter per capita income elasticity of 0.20–0.22. The exporter Φ contributes δ_{Φ,X} = ν/[θ(1+γ)] &amp;gt; 0.&lt;/p&gt;
&lt;h3 id="q4-why-does-the-model-still-deliver-a-standard-gravity-equation-despite-the-non-homothetic-quality-structure"&gt;Q4. Why does the model still deliver a standard gravity equation despite the non-homothetic quality structure?&lt;/h3&gt;
&lt;p&gt;A4: The key result is that the trade-share equation — the fraction of varieties that destination n sources from country i — takes the same Fréchet-based form as in Eaton-Kortum (2002): πni = Ti(dni C̃i)^{−θ} / Φn, where C̃i = Ci/Qi is the horizontal-quality-adjusted unit cost. Although quality is non-homothetic in individual variety demands, the distribution of the maximum effective inverse cost across sources conditional on country i being the cheapest is independent of the source country — the key aggregation property inherited from the Fréchet structure. As a result, country i&amp;rsquo;s share in total absorption by n equals its share in the number of varieties sourced from i, and aggregate bilateral trade flows satisfy a standard log-linear gravity equation. The gains from trade are given by the standard ACR formula Un = constant × (Tn d^{−θ}_{nn} / πnn)^{1/(α̃θ)}.&lt;/p&gt;
&lt;h3 id="q5-how-does-the-paper-handle-the-extensive-margin-empirically-and-what-does-the-data-show"&gt;Q5. How does the paper handle the extensive margin empirically, and what does the data show?&lt;/h3&gt;
&lt;p&gt;A5: The extensive margin is defined as the fraction of HS6 product categories that destination n imports from source i. In panel regressions, the elasticity of the extensive margin with respect to exporter GDP is 0.76 (much less than the total trade value elasticity of 1.16, implying an intensive margin of 0.38), while the importer extensive margin elasticity is 0.34. Both elasticities display a concave relationship with GDP in levels: the range of products both exported and imported expands rapidly for small countries but levels off at high GDP. The standard EK model predicts an importer extensive margin elasticity that is zero or negative (larger importers source more domestically), inconsistent with the positive 0.34 found in the data.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-paper-model-the-extensive-margin-and-what-are-the-parameter-estimates"&gt;Q6. How does the paper model the extensive margin, and what are the parameter estimates?&lt;/h3&gt;
&lt;p&gt;A6: The extensive margin arises from stochastic minimum shipment sizes. Trade flows for individual varieties exist according to model-implied values, but are only observed in a given year if the flow exceeds the stochastic shipment size drawn from an exponential distribution H(x) = 1 − exp(−λ₁x). Products are treated as bundles of varieties drawn from a discretized Weibull distribution f(M) parameterized by λ₂ and λ₃. The three parameters are estimated by minimizing squared differences between model-predicted and observed bilateral extensive margins across all country pairs. The estimates are λ₁ = 2.26×10⁻⁷ (SE 1.21×10⁻⁷), λ₂ = 0.042 (SE 0.020), λ₃ = 0.48 (SE 0.10), with an R-squared of 0.79. These imply a mean shipment size of $4.42 million (median $3.07 million) and a mean number of varieties per product of 1,597 (median 344).&lt;/p&gt;
&lt;h3 id="q7-how-is-β-estimated-and-what-does-it-govern"&gt;Q7. How is β estimated, and what does it govern?&lt;/h3&gt;
&lt;p&gt;A7: β governs how spending is distributed across varieties — specifically, the elasticity of spending on a variety with respect to its effective inverse cost. It also governs the elasticity of physical demand with respect to price: the model implies that log spending on a product equals log value minus (β/(1−β)) times log unit price. To identify β, the authors regress product-level trade values on unit prices, instrumenting a given importer&amp;rsquo;s price for a product with the same exporter&amp;rsquo;s average price of that product to all other destinations. The IV estimate of −β/(1−β) is −1.83 (SE 0.019), compared with the OLS estimate of −0.25, indicating substantial simultaneity bias. The implied price elasticity of demand is −2.83 and the implied β is 0.65.&lt;/p&gt;
&lt;h3 id="q8-what-are-the-price-regularities-documented-at-the-product-level-and-how-does-the-two-quality-model-explain-price-overlaps-across-country-pairs"&gt;Q8. What are the price regularities documented at the product level, and how does the two-quality model explain price overlaps across country pairs?&lt;/h3&gt;
&lt;p&gt;A8: Regressions of unit values at the importer-exporter-HS6 level show that individual exporters charge systematically higher prices to richer importers for the same product (elasticity 0.12 with exporter-product fixed effects), and that buyers pay systematically higher prices for products from richer exporters (elasticity 0.22 with importer-product fixed effects). A one-dimensional quality model would predict no overlap between prices charged by a rich and a poor exporter across destinations: even Japan&amp;rsquo;s lowest-priced sales should exceed Malaysia&amp;rsquo;s highest-priced sales for the same product. Back-of-envelope calculations using the regression coefficients predict a Malaysian product should sell in Norway at 0.3 log points above a Japanese product in Pakistan — systematic overlap. The paper documents this overlap in the raw data using two HS6 examples: motorcycle hubs (HS871493) and washing machines under 10kg (HS845011). The two-quality model resolves this by making horizontal quality an exporter attribute that raises prices proportionally but leaves market share determination to the EK gravity equation, allowing rich and poor country exporters to coexist in all markets.&lt;/p&gt;
&lt;h3 id="q9-how-does-the-model-address-the-absence-of-a-linder-type-income-interaction-effect-in-aggregate-trade-flows"&gt;Q9. How does the model address the absence of a Linder-type income interaction effect in aggregate trade flows?&lt;/h3&gt;
&lt;p&gt;A9: In one-dimensional quality models (e.g., Fajgelbaum, Grossman, and Helpman 2011), rich countries produce high-quality goods appealing primarily to high-income households, so rich-to-rich bilateral trade flows should be systematically higher than rich-to-poor flows. A gravity regression of bilateral trade values on importer and exporter fixed effects, distance, and an interaction of log importer GDP per capita × log exporter GDP per capita yields a coefficient of 0.0020 (SE 0.016), which is small and statistically insignificant. The two-quality model is consistent with this: horizontal quality enters as an exporter fixed effect (it affects prices proportionally for all destinations) and the demand system is structured so that all destinations spend the same share of absorption on a given source&amp;rsquo;s varieties, regardless of income level.&lt;/p&gt;
&lt;h3 id="q10-how-robust-are-the-results-to-disaggregation-by-industry"&gt;Q10. How robust are the results to disaggregation by industry?&lt;/h3&gt;
&lt;p&gt;A10: For each of 4,786 HS6 products with more than 20 country pairs, separate price regressions are estimated. Across all products, 80% have positive importer per capita income elasticities and 94% have positive exporter per capita income elasticities. The 15 broad HS sections account for only 10% of the variance in importer-income elasticities and 13% of the variance in exporter-income elasticities across HS6 products, suggesting high within-industry heterogeneity. A quasi-likelihood ratio test formally rejects equal γ and ν across sections (χ²(28) = 1,249 against a critical value of 41), but the reduction in total sum of squared residuals from allowing section-specific parameters is only 3.6%, and the R-squared increases from 0.353 to 0.376. The authors conclude the aggregate approach is vindicated for the purpose of characterizing common patterns.&lt;/p&gt;
&lt;h3 id="q11-how-does-the-model-simulate-trade-and-how-many-products-does-it-generate"&gt;Q11. How does the model simulate trade, and how many products does it generate?&lt;/h3&gt;
&lt;p&gt;A11: The simulation draws productivities for 5,000,000 varieties across 50 countries using the estimated model parameters (θ = 4, γ = 0.13, ν = 0.22, β = 0.65, and the estimated gravity fixed effects). For each variety, the cheapest source is determined; trade values and unit values are computed using equations (28) and (29); censoring due to stochastic shipment sizes generates zeros. Varieties are aggregated into products by partitioning sequentially using the estimated Weibull distribution. The simulation yields 3,842 total simulated products of which 3,807 are traded between at least one country pair, compared with 4,973 HS6 products in the COMTRADE data. A Monte Carlo exercise confirms that the estimation procedure recovers parameter values close to the true values when applied to simulated data.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Vertical quality (q):&lt;/strong&gt; A dimension of quality that complements physical quantity. In the paper&amp;rsquo;s utility specification, vertical quality and effective quantity enter a CES aggregator with elasticity of substitution below one (ρ &amp;lt; 0). A buyer spending more on a variety raises both quantity and vertical quality simultaneously, in proportions governed by γ. Vertical quality rises endogenously with the importer&amp;rsquo;s wage because higher-income buyers optimally demand it; it is the mechanism behind the positive relationship between importer per capita income and unit values.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Horizontal quality (Q):&lt;/strong&gt; A dimension of quality that substitutes perfectly for physical quantity (enters the aggregator multiplicatively with quantity). All buyers value an increase in Q equivalently regardless of income level, so it does not generate Linder-type income-matching in trade flows. Horizontal quality is produced by the exporter: better-equipped workers produce higher horizontal quality (Q = m^ν), so it rises with the exporter&amp;rsquo;s wage. It is the mechanism behind the positive relationship between exporter per capita income and unit values.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Extensive margin (E_{ni}):&lt;/strong&gt; In the paper&amp;rsquo;s empirical framework, the fraction of HS6 product categories that destination n imports from source i in a given year. The paper shows this margin rises with both importer and exporter size but in a concave, nonlinear fashion. It is generated in the model by stochastic minimum shipment sizes that probabilistically censor small-value variety flows.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intensive margin:&lt;/strong&gt; Total bilateral trade value divided by the extensive margin. The paper further decomposes the intensive margin into a quantity margin and a unit-value (price) margin. The paper&amp;rsquo;s key contribution is to generate all three margins jointly from one parsimonious framework.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stochastic minimum shipment size:&lt;/strong&gt; A modeling device, drawn from distribution H(x) (parameterized as exponential with parameter λ₁), that determines whether a given variety&amp;rsquo;s trade flow is observed in any year. If the annual flow x_{ni}(ω) exceeds the drawn minimum size x̄, the shipment is observed with certainty; otherwise, it is observed with probability x_{ni}(ω)/x̄. This mechanism generates the concavity of the extensive margin with respect to GDP without departing from the standard gravity framework.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effective inverse cost (v_{ni}):&lt;/strong&gt; Defined as Z_i(ω)/[C̃_i d_{ni}], where Z_i is country i&amp;rsquo;s Fréchet-distributed productivity for variety ω, C̃_i = C_i/Q_i is the horizontal-quality-adjusted unit cost, and d_{ni} is the iceberg trade cost. A buyer in n sources variety ω from the country maximizing v_{ni}. This formulation ensures that horizontal quality differences across exporters are absorbed into the effective cost, preserving the EK aggregation result.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;γ (vertical quality cost elasticity):&lt;/strong&gt; The parameter governing how spending on a variety divides between physical quantity and vertical quality. Spending has elasticity 1/(1+γ) with respect to quantity and elasticity γ/(1+γ) with respect to unit price. The paper estimates γ = 0.13 from product-level unit value regressions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;ν (horizontal quality elasticity):&lt;/strong&gt; The parameter governing how horizontal quality rises with intermediate use per worker: Q = m^ν. Combined with γ, it determines the structural elasticity of unit values with respect to exporter per capita income: δ_{w,X} = ν/(1+γ). The paper estimates ν = 0.22.&lt;/p&gt;</description></item><item><title>The Nature of Long-Term Unemployment: Predictability, Heterogeneity, and Selection</title><link>https://macropaperwarehouse.com/papers/the-nature-of-long-term-unemployment-predictability-heterogeneity-and-selection/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-nature-of-long-term-unemployment-predictability-heterogeneity-and-selection/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper studies the sources of long-term unemployment (LTU, defined as failing to find a job within six months) using administrative data on the universe of unemployment spells in Sweden from 1992 to 2016, merged with exceptionally rich individual characteristics including income and employment histories, employer records, asset portfolios, and IQ scores. The central question is how much of the observed decline in aggregate job-finding rates with unemployment duration reflects genuine state dependence — structural deterioration of individual prospects from being unemployed — versus dynamic selection, the mechanical compositional shift as higher-job-finding workers exit first. Using machine-learning prediction models and a complementary multiple-spell identification strategy, the paper finds that observable heterogeneity in LTU risk is substantial: the hold-out R-squared of the baseline prediction model is 15%, and rises to more than twice that value when rich administrative variables are added relative to a model using only standard socio-demographics. Applying the prediction model across durations, the paper shows that persistent heterogeneity can account for approximately 49% of the observed decline in aggregate job-finding rates over the spell (from 70% to 55% between 0 and 6 months), and potentially as much as 88% under a proportionality assumption on selection based on unobservables. In contrast, the same rich data reject the hypothesis that compositional changes in the pool of unemployed workers explain the cyclical rise in LTU risk in recessions.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-conceptual-framework-for-decomposing-duration-dependence"&gt;Q1. What is the conceptual framework for decomposing duration dependence?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper develops a statistical framework that decomposes the observed change in average job-finding rates over the spell into true duration dependence — the individual-level within-spell decline — and dynamic selection, the change in pool composition as higher-job-finding workers exit first.&lt;/strong&gt; The key insight is that dynamic selection is identified by the persistent covariance in job-finding probabilities across durations: if a worker who has a high job-finding probability early in a spell also has a high probability later, selection of such workers out of unemployment will lower the average for the remaining pool even if no individual&amp;rsquo;s rate changes. The framework shows that the hold-out R-squared of a prediction model provides a lower bound for the share of variance in outcomes that is ex ante determined, and that the cross-duration covariance of predictions identifies the persistent heterogeneity component that drives selection.&lt;/p&gt;
&lt;p&gt;The paper further shows that combining prediction-based identification (which recovers observable plus transitory heterogeneity) with multiple-spell identification (which recovers persistent observable and unobservable heterogeneity) yields a tighter lower bound on overall heterogeneity. Observable and unobservable approaches are complementary: the multiple-spell approach identifies persistent unobservable heterogeneity that observables miss, while the observables approach captures transitory heterogeneity that changes across spells.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-predictive-power-of-observable-characteristics"&gt;Q2. What is the predictive power of observable characteristics?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The baseline prediction model using standard administrative variables achieves a hold-out R-squared of 15%, measuring the share of variance in job-finding outcomes that is predictable from characteristics determined before the spell begins.&lt;/strong&gt; This estimate is more than twice as large as a model using only basic socio-demographics (age, gender, education, marital status, citizenship, number and age of children). Prior employment history — even if available for only one or two years — is the most powerful predictor, potentially proxying for unobservable worker characteristics. Additional variables available for limited samples or years (occupation, assets, IQ, union membership) add only modest further predictive power beyond the baseline, suggesting saturation of the observable signal. The predictive power of a linear model is nearly as high as the ensemble of LASSO, gradient-boosted trees, and random forests, indicating that the gains come from data richness rather than nonlinearities exploited by machine learning algorithms.&lt;/p&gt;
&lt;p&gt;The unobserved heterogeneity estimated using repeated unemployment spells corresponds to roughly half of the estimated observable heterogeneity. The combined lower bound on the variance that is ex ante determined is at least 19% of total variation in job-finding outcomes.&lt;/p&gt;
&lt;h3 id="q3-how-much-of-duration-dependence-is-explained-by-dynamic-selection"&gt;Q3. How much of duration dependence is explained by dynamic selection?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Applying the prediction model at multiple durations within the spell, the paper finds that nearly three-quarters of predictable heterogeneity is persistent over the spell of unemployment, implying that dynamic selection accounts for at least 49% of the observed aggregate decline in job-finding rates from the start to 6 months into the spell.&lt;/strong&gt; In 2006, the aggregate 6-month job-finding rate fell from 70% at spell start to 55% at 6 months of ongoing unemployment — a 15 percentage point decline. The lower bound from persistent observable heterogeneity accounts for a decline from 70% to 62.7%, or 49% of the total observed decline. Under a proportionality assumption that unobservable selection mirrors observable selection, the paper estimates that dynamic selection can explain as much as 88% of the observed decline.&lt;/p&gt;
&lt;p&gt;The paper also finds substantial heterogeneity in the individual-level dynamics across workers with different observable characteristics: the individual-level decline in job-finding over the spell is strongly negatively correlated with the job-finding probability at the start of the spell, meaning that workers who start with lower job-finding chances also experience stronger within-spell declines, further compressing the heterogeneity in job-finding over time.&lt;/p&gt;
&lt;h3 id="q4-what-does-the-analysis-imply-about-the-proportional-hazard-assumption"&gt;Q4. What does the analysis imply about the proportional hazard assumption?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper tests and rejects the key assumption in proportional hazard models that job-finding rates decline at the same proportional rate across all workers.&lt;/strong&gt; The individual predictions at different durations reveal significant heterogeneity in the dynamics across workers with different observable characteristics: workers do not all experience the same proportional decline with duration. This rejection is robust to corrections for sampling error and to non-parametric tests. The heterogeneity in dynamics is empirically distinguishable from the level heterogeneity in job-finding probabilities, and exists over and above what dynamic selection alone would generate.&lt;/p&gt;
&lt;h3 id="q5-can-compositional-changes-in-the-pool-of-unemployed-explain-ltu-risk-in-recessions"&gt;Q5. Can compositional changes in the pool of unemployed explain LTU risk in recessions?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Despite the paper&amp;rsquo;s finding that rich observable characteristics explain substantial duration dependence over the spell, the same data reject the heterogeneity hypothesis for cyclicality: compositional changes in the observable characteristics of workers who become unemployed in recessions do not translate into higher predicted LTU risk for the average unemployed worker.&lt;/strong&gt; The distribution of predicted job-finding risk changes over the business cycle, but the direction is not consistent with the hypothesis that recessions selectively pull in high-LTU-risk workers. Instead, unemployed workers are exposed to substantial changes in LTU risk over the business cycle that operate through within-individual declines in job-finding chances, not through composition. Recessions disproportionately hurt the job-finding prospects of workers with lower education and income, but this is a general worsening of individual prospects rather than a compositional shift in the pool.&lt;/p&gt;
&lt;h3 id="q6-what-are-the-policy-implications"&gt;Q6. What are the policy implications?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The policy implications differ sharply between genuine state dependence and dynamic selection as sources of LTU.&lt;/strong&gt; If duration dependence at the individual level is large — skill atrophy, scarring, signaling stigma — early intervention is warranted to interrupt the deterioration process before it becomes irreversible. The paper&amp;rsquo;s evidence that selection is quantitatively dominant over the spell of unemployment suggests that much of the observed fall in aggregate job-finding with duration does not reflect structural deterioration of individual prospects, limiting the case for early intervention aimed at preventing skill erosion. However, the paper also finds that individual-level declines are heterogeneous and concentrated among workers with already-low job-finding chances, suggesting that targeted risk-profiling — already used by Public Employment Services in many countries including Sweden — is well motivated. The separate finding that recessions generate genuine within-individual worsening of prospects (rather than compositional shift) implies that countercyclical support is warranted on different grounds.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;dl&gt;
&lt;dt&gt;&lt;strong&gt;dynamic selection&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the mechanical change in the composition of the unemployment pool as workers with higher job-finding probabilities exit unemployment first, leaving an increasingly low-job-finding-probability remainder; identified in the paper as the dominant driver of the observed aggregate decline in job-finding rates with unemployment duration, accounting for at least 49% and potentially up to 88% of the observed decline.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;genuine state dependence (true duration dependence)&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the within-individual structural deterioration of a worker&amp;rsquo;s job-finding rate from the experience of unemployment itself — skill erosion, employer stigma signaling — as distinct from compositional selection; the paper finds this is a minor contributor to the observed aggregate duration dependence pattern in Sweden.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;persistent heterogeneity&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the covariance of individual job-finding probabilities across different durations (or spell cohorts); the component of overall heterogeneity that drives dynamic selection, identified empirically by the cross-duration covariance of prediction model outputs in hold-out samples.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;LTU risk profiling&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the practice of predicting workers&amp;rsquo; probability of becoming long-term unemployed at spell entry using observable characteristics, to target active labor market programs; the paper provides the statistical foundation and quantifies the gains from richer administrative data relative to standard survey-based characteristics.&lt;/dd&gt;
&lt;/dl&gt;</description></item><item><title>The Optimal Taxation of Couples</title><link>https://macropaperwarehouse.com/papers/the-optimal-taxation-of-couples/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-optimal-taxation-of-couples/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; What is the optimal joint nonlinear earnings tax schedule for married couples? How should one spouse&amp;rsquo;s marginal tax rate depend on the other&amp;rsquo;s earnings? When is individual earnings-based (separable) taxation optimal versus family-income-based taxation, and what determines the sign and magnitude of &amp;ldquo;jointness&amp;rdquo; — the dependence of one spouse&amp;rsquo;s marginal tax on the other&amp;rsquo;s earnings?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The paper studies a canonical unitary household model in which each couple consists of two spouses who jointly maximize utility subject to a joint budget constraint. Spousal productivities are drawn from a joint distribution F with arbitrary dependence structure. The planner maximizes a weighted sum of couples&amp;rsquo; utilities, with Pareto weights that are decreasing functions of productivities. Utility takes a quasi-linear form in consumption and labor disutility with constant labor supply elasticity parameter γ (implying earnings elasticity γ/(γ-1)). The tax problem is equivalent to a two-dimensional mechanism design problem in which the planner chooses allocations as functions of reported productivity types, subject to incentive compatibility and budget feasibility. Because spousal productivities are two-dimensional, the problem is a multi-dimensional screening problem whose properties are poorly understood in general.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology.&lt;/strong&gt; The authors proceed in two directions. First, they establish conditions under which the first-order approach (FOA) — restricting attention to local incentive constraints — is valid in this bi-dimensional setting. They show, for the special case of the benchmark economy (symmetric, independent types, separable Pareto weights), that FOA validity is equivalent to convexity of a certain transformation of the value function, and derive necessary and sufficient conditions that are strictly weaker than their unidimensional analogs — so the FOA is more likely to hold in two dimensions than in one. For the general economy, they invoke an Implicit Function Theorem argument in Hölder space to show that the FOA holds for Pareto weights sufficiently close to utilitarian (i.e., when the planner is not &amp;ldquo;too redistributive&amp;rdquo;). Second, assuming FOA validity, they characterize optimal taxes via a second-order nonlinear PDE. Since this PDE cannot be solved analytically in general, they apply the Coarea Formula to derive closed-form expressions for conditional averages of optimal tax distortions over various subsets of the type space, expressed entirely in terms of structural primitives (labor supply elasticities, Pareto weights, and elasticities of the joint distribution of productivities).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Average distortions and assortativeness.&lt;/strong&gt; Average optimal distortions on married individuals are ranked by the degree of positive quadrant dependence (PQD) in spousal productivities: more assortative matching implies higher optimal tax rates. Optimal distortions on married individuals are always weakly lower than on single individuals with the same productivity, same elasticities, and same marginal productivity distribution — strictly so unless matching is perfectly positively assortative. The intuition is that when couples pool resources, intra-family redistribution already occurs, and distortionary taxation crowds this out; more random matching produces more within-family redistribution, reducing the marginal social value of public redistribution through taxation.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Optimality of separable (individual earnings-based) taxation.&lt;/strong&gt; In the benchmark economy with independent types, optimal taxes are exactly separable (individual earnings-based), and optimal distortions on married individuals equal precisely one-half of those on comparable single individuals. With separable Pareto weights and independent types more generally, taxes remain separable. Once types are positively dependent, however, the planner optimally introduces jointness even under separable social weights.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Jointness and tail (in)dependence.&lt;/strong&gt; Optimal jointness — whether one spouse&amp;rsquo;s marginal tax rate increases or decreases in the other&amp;rsquo;s earnings — depends critically on tail dependence of the joint productivity distribution, captured by the copula and survival copula elasticities. For right-tail dependent distributions (so that extremely productive individuals are likely to be matched with extremely productive partners), positive jointness is optimal at the top (raising taxes on high earners whose partners are also high earners) and negative at the bottom. For right-tail independent distributions (such as the Gaussian copula, which is tail-independent for any finite ρ), the distortion-reducing motive dominates: optimal jointness is negative at the top and positive at the bottom, conditional on standard convergence conditions.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Primary vs. secondary earners.&lt;/strong&gt; The secondary earner (lower-productivity spouse) faces on average higher optimal distortions than the primary earner when the planner values redistribution to couples with a very unproductive spouse (α(w,0) ≥ 1), because the phasing out of transfers targeted to such couples generates high marginal tax rates on secondary earners. Family earnings-based taxation is optimal only when total family productivity and relative spousal productivity are independent, and when social weights are measurable only with respect to total family output.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Restricted taxation.&lt;/strong&gt; Optimal distortions under any of the three restricted tax regimes (anonymous, separable, family earnings-based) exactly equal the relevant conditional average of unrestricted optimal distortions. This establishes that the welfare difference between the restricted and unrestricted optimum stems solely from the planner&amp;rsquo;s inability to tag taxes to individual productivity types within the restricted class.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Quantitative Findings (calibrated to 2020 CPS data on U.S. married couples, ages 25-65, worked ≥ 20 weeks).&lt;/strong&gt; Spousal productivities are positively but not perfectly dependent, with Kendall&amp;rsquo;s tau = 0.21 and Pearson correlation = 0.25 for productivities (0.21 for earnings). The joint distribution is well approximated by a Gaussian copula (ρ = 0.33) with Pareto-lognormal marginals (a = 2.95, Gini = 0.31). The Gaussian copula is tail-independent, so consistent with analytical results, optimal jointness is positive for low earners and negative for high earners (the latter arising at earnings above approximately $8.5 million in the benchmark specification). The quantitative magnitude of optimal jointness is small — marginal taxes for one spouse change by at most several percentage points as a function of the other spouse&amp;rsquo;s earnings. Individual earnings-based taxation provides a good approximation to the unrestricted optimum. By contrast, family earnings-based (joint) taxation is a poor approximation in all specifications, with marginal taxes on family income varying substantially with the earnings share of the secondary earner, and this conclusion holds even when Pareto weights explicitly favor family earnings-based taxation (k = 0 case). The implied top marginal tax rate converges toward approximately 55 percent (corresponding to limiting distortion of ≈1.35 = 1/γa with γ = 0.25, a = 2.95) but the convergence is slow, so optimal marginal rates remain substantially below this limit even at earnings of $300,000.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-mechanism-design-formulation-and-why-is-foa-validity-a-key-concern-in-the-bi-dimensional-setting"&gt;Q1. What is the mechanism design formulation, and why is FOA validity a key concern in the bi-dimensional setting?&lt;/h3&gt;
&lt;p&gt;A: The planner&amp;rsquo;s problem is cast as a direct mechanism in which couples report their two-dimensional productivity type (w1, w2) and receive allocations (consumption, earnings). Incentive compatibility requires that no couple prefers to misreport. In one-dimensional models (Mirrlees 1971), restricting attention to local incentive constraints (the FOA) yields the standard ODE characterization of optimal taxes and is valid for a broad class of primitives. In two dimensions, solutions to multi-dimensional screening problems generically display &amp;ldquo;bunching&amp;rdquo; (Rochet-Choné 1998, Armstrong 1996), and the FOA may fail. The key difference exploited in this paper is the absence of participation constraints in the public finance setting, which eliminates the main force driving FOA failure in industrial organization models.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-necessary-and-sufficient-conditions-for-foa-validity-in-the-benchmark-economy-with-independent-types"&gt;Q2. What are the necessary and sufficient conditions for FOA validity in the benchmark economy with independent types?&lt;/h3&gt;
&lt;p&gt;A: (Proposition 1) In the benchmark economy (symmetric, independent types, separable Pareto weights), FOA validity is equivalent to the condition that x·(1 + λ̃(x^{-γ})/2) is increasing in x, where λ̃(t) = [∫_t^∞ (1-α̃(w))g(w)dw] / (γtg(t)). The unidimensional analog requires x·(1 + λ̃(x^{-γ})) to be increasing. Since the bi-dimensional condition multiplies λ̃ by 1/2 rather than 1, the set of primitives satisfying it is strictly larger: every (G, α̃, γ) for which the unidimensional FOA holds also satisfies the bi-dimensional condition, but not vice versa. Economically, the FOA holds as long as the planner is not &amp;ldquo;too redistributive&amp;rdquo; — i.e., Pareto weights on low types are not so high as to violate these monotonicity conditions.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-coarea-formula-result-equation-27-and-why-is-it-the-central-technical-tool"&gt;Q3. What is the Coarea Formula result (equation 27) and why is it the central technical tool?&lt;/h3&gt;
&lt;p&gt;A: Given that the optimality conditions form a PDE system that cannot generally be solved pointwise, the authors integrate the optimality condition (equation 20) over subsets of the type space defined by level sets of an arbitrary function Q(w1, w2). The Coarea Formula allows them to express the result as: E[Σ_i λ*_i γ_i (∂lnQ/∂lnw_i) | Q=t] = [1 − E[α|Q≥t]] / [−∂ln P(Q≥t)/∂ln t]. By choosing different Q functions (e.g., Q = w_i, Q = max{k_1 w_1, k_2 w_2}, Q = R(w) for total family productivity, Q = I(w) for relative productivity), the formula delivers closed-form expressions for distinct conditional averages of optimal distortions, all expressed in terms of exogenous primitives. This contrasts with variational approaches (Golosov et al. 2014, Spiritus et al. 2022) that express optimal taxes in terms of endogenous moments.&lt;/p&gt;
&lt;h3 id="q4-how-do-optimal-distortions-on-married-individuals-compare-to-those-on-single-individuals-and-what-is-the-exact-quantitative-relationship-in-the-independent-types-benchmark"&gt;Q4. How do optimal distortions on married individuals compare to those on single individuals, and what is the exact quantitative relationship in the independent-types benchmark?&lt;/h3&gt;
&lt;p&gt;A: (Proposition 4) In the benchmark economy with independent types, the optimal distortion on spouse i with productivity t equals exactly one-half of the optimal distortion λ^{sng,&lt;em&gt;}(t) in the corresponding unidimensional economy: λ&lt;/em&gt;&lt;em&gt;i(t, w&lt;/em&gt;{-i}) = (1/2)λ^{sng,*}(t), and this is independent of the partner&amp;rsquo;s productivity w_{-i}. The intuition: the deadweight cost of taxing any individual depends only on her own characteristics (elasticity, productivity, density), not on whom she is married to. However, the redistributive benefit of taxation depends on matching — when matching is random, every high-productivity individual is married on average to an average person, so the incremental social benefit of extracting tax revenue from her is exactly half of what it would be if she were single (since half the benefit goes to a partner who is already average). More generally (Proposition 5 and Corollary 2), average distortions are weakly lower for married individuals than for singles as long as matching is not perfectly positively assortative.&lt;/p&gt;
&lt;h3 id="q5-what-is-average-jointness-and-how-is-it-measured"&gt;Q5. What is average jointness and how is it measured?&lt;/h3&gt;
&lt;p&gt;A: Average jointness J_i(t) is defined as the ratio of average distortions on spouse i conditional on the partner having above-t productivity to average distortions conditional on the partner having below-t productivity, minus one. Jointness is positive if the marginal tax rate on spouse i is on average increasing in the partner&amp;rsquo;s productivity, negative if decreasing, and zero for separable (individual earnings-based) taxes. The paper characterizes jointness through auxiliary functions H_i(t) (conditional distortion relative to unconditional average), whose behavior is determined by the copula elasticities η_i and survival copula elasticities η̄_i — the percentage change in the conditional quantile of the partner&amp;rsquo;s productivity when one spouse&amp;rsquo;s productivity quantile increases by 1%.&lt;/p&gt;
&lt;h3 id="q6-what-is-the-role-of-tail-dependence-in-determining-the-sign-of-optimal-jointness"&gt;Q6. What is the role of tail dependence in determining the sign of optimal jointness?&lt;/h3&gt;
&lt;p&gt;A: (Proposition 7, Lemma 4) For right-tail dependent distributions — where the probability that an extremely productive person is married to an extremely productive partner remains bounded away from zero as productivity → ∞ — the redistributive benefit of positive jointness (targeting taxes to the richest couples) dominates its distortionary cost, so optimal average jointness is positive at the top. For right-tail independent distributions (where this probability converges to zero), the distortionary cost of positive jointness dominates, and optimal jointness is negative at the top. Exactly symmetric logic applies at the bottom using the survival copula and left-tail dependence. The bivariate lognormal/Gaussian copula is right-tail independent for any finite correlation ρ, while a distribution with perfect assortative matching in the tails would be right-tail dependent. The speed of convergence to tail independence, measured by κ = lim_{u→0} ln(u)/ln(C(u,u)) ∈ [1/2, 1), also matters: slower convergence (κ closer to 1) implies smaller optimal jointness under tail independence.&lt;/p&gt;
&lt;h3 id="q7-when-is-individual-earnings-based-separable-taxation-optimal-and-when-is-family-earnings-based-taxation-optimal"&gt;Q7. When is individual earnings-based (separable) taxation optimal, and when is family earnings-based taxation optimal?&lt;/h3&gt;
&lt;p&gt;A: (Propositions 4, 8, Corollary 1) Individual earnings-based taxation is optimal when Pareto weights are separable and spousal productivities are independent. When types are positively dependent, the planner introduces jointness even with separable social weights, because conditioning taxes on both spouses&amp;rsquo; earnings facilitates redistribution across couple types. Family earnings-based taxation is optimal when: (i) social weights are measurable only with respect to total family productivity r (i.e., the planner cares only about total family output, not the identity or relative productivity of individual spouses), and (ii) total family productivity r and relative spousal productivity ι are statistically independent. When r and ι are not independent, even a planner with an intrinsic preference for family earnings-based taxation will find it optimal to depart from it.&lt;/p&gt;
&lt;h3 id="q8-what-does-proposition-9-corollary-7-establish-about-the-relationship-between-restricted-and-unrestricted-optimal-taxes"&gt;Q8. What does Proposition 9 (Corollary 7) establish about the relationship between restricted and unrestricted optimal taxes?&lt;/h3&gt;
&lt;p&gt;A: (Corollary 7) For each restricted tax regime (anonymous, individual earnings-based, family earnings-based), the optimal distortions under the restricted tax equal the corresponding conditional average of unrestricted optimal distortions. Specifically: optimal individual earnings-based distortions equal E[λ*_i | w_i = t] (the average unrestricted distortion at productivity t); optimal family earnings-based distortions equal E[weighted average of λ*_i | R(w) = r]. This reveals that the unrestricted and restricted planners solve the same tradeoff between redistribution benefits and distortionary costs, but the restricted planner must apply a single tax rate to groups of couples that cannot be distinguished under the restriction. The welfare loss from restriction comes entirely from this forced bunching, not from a different objective or a different first-order condition.&lt;/p&gt;
&lt;h3 id="q9-what-do-the-quantitative-results-say-about-the-goodness-of-approximation-of-separable-vs-family-earnings-based-taxation"&gt;Q9. What do the quantitative results say about the goodness of approximation of separable vs. family earnings-based taxation?&lt;/h3&gt;
&lt;p&gt;A: In the calibrated benchmark economy (Gaussian copula, ρ = 0.33, Pareto-lognormal marginals, γ = 0.25, m = 0.35), optimal jointness is quantitatively small — the marginal tax rate on one spouse changes by at most several percentage points as a function of the other spouse&amp;rsquo;s earnings over the plotted range. Individual earnings-based (separable) taxation therefore provides a good approximation to the unrestricted optimum across all specifications considered. By contrast, family earnings-based taxation is a poor approximation: the marginal tax rate on family income varies substantially with the earnings share of the secondary earner (the ratio min{y1,y2}/(y1+y2)), and the deviation from the optimal unrestricted tax is large. This finding is robust across different Pareto weight specifications (m ∈ {0.35, 1.5}, k ∈ {0, 1, 2}) and holds even when k = 0, i.e., when the planner&amp;rsquo;s social weights inherently prefer family earnings-based taxation.&lt;/p&gt;
&lt;h3 id="q10-how-do-the-calibration-results-relate-to-the-analytical-comparative-statics-predictions"&gt;Q10. How do the calibration results relate to the analytical comparative statics predictions?&lt;/h3&gt;
&lt;p&gt;A: The calibration validates the analytical predictions quantitatively. The analytical result (Proposition 5) that optimal distortions in the U.S. lie between those under random matching (1/2 of single-individual rates) and perfect assortative matching (same as single-individual rates) is confirmed: optimal tax rates for married individuals in the calibrated economy lie between the independence and perfect-dependence gray-line benchmarks in Figure 6. The analytical prediction (Proposition 7) that the Gaussian copula implies positive jointness at the bottom and negative at the top is confirmed, with the switch to negative jointness occurring above approximately $8.5 million in earnings. The slow convergence of the Gaussian copula to tail independence (κ = (1+ρ)/2 ≈ 0.665) explains the small magnitude of optimal jointness relative to the FGM copula (which has κ = 1/2, faster convergence, and exhibits more pronounced jointness as shown in the appendix). The analytical limiting distortion of E[λ*_i | w_i = t] → 1/(γa) ≈ 1.35 as t → ∞ (corresponding to a top marginal tax rate of approximately 55 percent) is confirmed, though convergence is slow and rates remain substantially below this limit at $300,000 in earnings.&lt;/p&gt;
&lt;h3 id="q11-how-does-the-paper-relate-to-and-advance-beyond-kleven-kreiner-and-saez-20072009"&gt;Q11. How does the paper relate to and advance beyond Kleven, Kreiner, and Saez (2007/2009)?&lt;/h3&gt;
&lt;p&gt;A: Kleven et al. (2009) studied couples taxation but avoided the multi-dimensional screening complexity by restricting the secondary earner to binary labor supply. The working paper by Kleven et al. (2007) considered the continuous setting but noted the difficulty of the FOA and derived several special-case insights. The current paper extends KKS in several systematic ways: it provides the first formal proof that the FOA conditions are strictly weaker in bi-dimensional than unidimensional settings; generalizes the formula for average distortions to arbitrary joint distributions (not just independent types); characterizes optimal jointness under positive dependence (not just independence); establishes the role of tail (in)dependence in determining the sign of jointness; compares optimal taxes for married vs. single individuals; and derives conditions under which family earnings-based or individual earnings-based taxation is optimal. It also shows that the KKS result on jointness sign (determined by the third derivative of the SWF) applies only under independence and can be reversed even with arbitrarily small positive dependence, as demonstrated with the Gaussian copula example.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;First-Order Approach (FOA) in multi-dimensional taxation.&lt;/strong&gt; The restriction of the mechanism design problem to local incentive constraints only — dropping global (non-local) incentive compatibility conditions and solving a relaxed problem. In the paper&amp;rsquo;s context, FOA validity is equivalent to convexity of a specific transformation vx* of the optimal utility function in the &amp;ldquo;linearized&amp;rdquo; type space X. The paper shows that the condition for FOA validity is strictly weaker (i.e., a strictly larger set of primitives satisfies it) in the bi-dimensional couples setting than in the corresponding unidimensional model, because the absence of participation constraints eliminates the main force driving FOA failure in industrial organization multi-dimensional screening.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Optimal tax distortion λ&lt;/em&gt;_i(w).&lt;/em&gt;* The monotone transformation of the marginal tax rate defined by λ_i(w) = [∇_i T(y(w))] / [1 − ∇_i T(y(w))], where ∇_i T is the partial derivative of the tax function with respect to spouse i&amp;rsquo;s earnings. This transformation maps [−∞, ∞] marginal tax rates to (−1, ∞) distortions. The optimal tax schedule is characterized by the function λ* satisfying a system of PDEs; the paper studies conditional averages of λ* rather than λ* pointwise.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Coarea Formula.&lt;/strong&gt; A mathematical result from geometric measure theory that, in this context, converts an integral of the PDE optimality condition over a two-dimensional domain into an integral over the level sets of an arbitrary function Q(w). Applied to equation (20), it yields: E[Σ_i λ*_i γ_i (∂lnQ/∂lnw_i) | Q=t] = [1 − E[α|Q≥t]] / [−∂ln P(Q≥t)/∂ln t]. By choosing different Q functions, the formula delivers conditional averages of optimal distortions over different subsets of the type space, all in terms of exogenous primitives. This is the paper&amp;rsquo;s principal analytical tool for characterizing optimal taxes without solving the PDE explicitly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Jointness (positive/negative).&lt;/strong&gt; The dependence of the optimal marginal tax rate on one spouse&amp;rsquo;s earnings on the other spouse&amp;rsquo;s earnings. Taxes are positively jointed at w if ∂²T/∂y_1∂y_2 &amp;gt; 0 (so raising one spouse&amp;rsquo;s earnings increases the marginal tax rate on the other); negatively jointed if this cross-partial is negative; disjointed (separable) if it is zero. Average jointness J_i(t) at productivity t is measured as the ratio of conditional average distortions above and below the partner&amp;rsquo;s productivity threshold, minus one. Optimal jointness is the paper&amp;rsquo;s primary policy object for understanding how taxes on one spouse should respond to the other&amp;rsquo;s earnings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Copula and survival copula elasticities (η_i, η̄_i).&lt;/strong&gt; Defined as η_i(t) = ∂ln C(u)/∂ln u_i and η̄_i(t) = ∂ln C̄(u)/∂ln ū_i, where C is the copula of the joint productivity distribution, C̄ is the survival copula, and u_i = G_i(t_i), ū_i = 1−G_i(t_i) are the corresponding quantiles. These elasticities measure the percentage change in the conditional quantile of the partner&amp;rsquo;s productivity when one spouse&amp;rsquo;s productivity quantile increases by 1%. They quantify the additional distortionary cost introduced by jointness relative to a separable tax schedule: smaller elasticities (stronger dependence) correspond to larger distortionary costs of jointness at the boundaries of probability mass.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Tail (in)dependence.&lt;/strong&gt; A joint distribution F is right-tail dependent if lim_{t→∞} P(w_{-i}≥t | w_i≥t) &amp;gt; 0, i.e., extremely productive individuals have a positive probability of being matched with equally extreme partners. It is right-tail independent if this limit is zero. The speed of convergence to tail independence is measured by κ = lim_{u→0} ln(u)/ln(C(u,u)) ∈ [1/2, 1). Tail dependence determines the sign of optimal average jointness in the tails: right-tail dependence favors positive jointness at the top; right-tail independence favors negative jointness at the top. The Gaussian copula is right-tail independent for any finite ρ; a perfectly assortative matching distribution is right-tail dependent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Positive quadrant dependence (PQD) order.&lt;/strong&gt; A partial ordering on joint distributions with the same marginals: F^b ≥_{PQD} F^a if F^b(w) ≥ F^a(w) for all w, equivalently if Cov(φ_1(w_1), φ_2(w_2)) ≥ 0 for any two increasing functions. The paper uses this order to rank economies by the &amp;ldquo;assortativeness&amp;rdquo; of matching, and shows that optimal average distortions are monotone in this order (Proposition 5): more assortative matching implies weakly higher optimal tax distortions on each married individual.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pareto-lognormal (PLN) distribution.&lt;/strong&gt; Used in the calibration to model the marginal distribution of spousal productivities. Defined as G(t) = Φ((ln t − μ)/σ) − a·exp(aμ + a²σ²/2)·Φ((ln t − μ)/σ − aσ), parameterized by location μ, scale σ, and tail parameter a. The PLN family has a lognormal body and a Pareto tail with tail parameter a, making it suitable for capturing the empirical finding of a thin left tail (implying optimal marginal taxes approaching zero as earnings → 0) and a thick right tail (implying a positive limiting marginal tax rate of approximately 1/(1 + 1/(γa)) as earnings → ∞).&lt;/p&gt;</description></item><item><title>The Price of Housing in the United States, 1890–2006</title><link>https://macropaperwarehouse.com/papers/the-price-of-housing-in-the-united-states-18902006/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-price-of-housing-in-the-united-states-18902006/</guid><description>&lt;p&gt;Lyons, Shertzer, Gray, and Agorastos construct the first consistent, annual, quality-adjusted market rent and home sales price series for American cities spanning 1890–2006. The paper addresses a fundamental data gap: no annual city-level series existed for market rents at any point in the 20th century, and no annual city-level sales price series existed prior to 1975. Existing national series—the BLS Rent of Primary Residence (RoPR) for rents and the Shiller index for sales—carry well-documented methodological limitations that the authors argue have produced materially misleading stylized facts about long-run U.S. housing markets.&lt;/p&gt;
&lt;p&gt;The Historical Housing Prices (HHP) dataset draws on just under 2.7 million newspaper real estate listings from 30 U.S. cities across 1890–2006. Listings must contain a price, a size measure (rooms or bedrooms), property type (house or apartment), and a location indicator. The authors construct hedonic price indices using a rolling-windows methodology—baseline three-year rolling windows with annual step size—that controls for size, type, and standardized within-city location, allowing coefficients to vary over time rather than imposing a fixed vector across the full century. City-level indices are aggregated to national indices using population weights from census data interpolated between census years. Listed prices serve as proxies for transaction prices; the authors validate these against census distributions and against post-1975 FHFA and Case-Shiller series.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s findings revise several established stylized facts. First, real market rents did not fall over the 20th century as implied by the RoPR series. Instead, real rental price levels were approximately 20% higher in 2006 than in 1890, fluctuating within a relatively narrow band. The RoPR series, by contrast, implies a near-halving of real rents between 1914 and 2006. Second, the paper documents a substantial interwar housing boom-bust absent from the Shiller index: real sales prices rose approximately 47% between 1920 and 1928, then fell 27% by 1935, with the 1928 peak not recovered in real terms until 1968. Third, contrary to the Shiller index&amp;rsquo;s depiction of minimal housing price growth from 1950 to 1995, the HHP series shows real sales prices rising 21% between 1953 and 1974—a period for which Shiller relies on a truncated sample of government-backed mortgages that excluded higher-valued homes.&lt;/p&gt;
&lt;p&gt;On the return to homeownership, the paper finds average nominal housing returns across 1890–2006 of approximately 11% per year, composed of 3.8% capital gain and 7.2% rental return. Gross market rental yields exceeded 8% annually for much of 1900–1945, fell to 7% by 1960, and to 3% by 2006. Capital gains were largely unimportant before the 1940s and became the dominant return component only from 1970 onward; the post-1980 period with sustained capital gains is characterized as historically anomalous. Returns varied substantially across cities, with some cities outperforming the S&amp;amp;P 500 in the prewar era while most underperformed equities from 1981–2006.&lt;/p&gt;
&lt;p&gt;The paper also examines implications for the CPI. The HHP series implies nominal rents grew at approximately 3.5% per year from 1914 to 2006, versus 2.6% per year for the RoPR component. A back-of-the-envelope alternative CPI using HHP rental data yields overall price growth of 3.3% per year rather than the official 3.1%, suggesting the measured increase in U.S. living standards since World War I may be modestly overstated. Finally, cross-city analysis shows that land constraints and, increasingly, regulatory constraints explain divergence in price growth across cities, with the role of zoning becoming more pronounced after 1980.&lt;/p&gt;
&lt;p&gt;Q: What is the core data source and how are the indices constructed?
A: The HHP dataset comprises just under 2.7 million newspaper real estate listings from 30 U.S. cities, 1890–2006, sampled from real estate sections (typically the last Sunday of each month). Valid listings require price, size, property type, and within-city location. Hedonic indices are estimated using rolling three-year windows with annual steps, controlling for size, type, and standardized location, allowing hedonic coefficients to evolve over time rather than imposing a fixed vector. City indices are aggregated to national indices using population-weighted census data interpolated between census years.&lt;/p&gt;
&lt;p&gt;Q: Why are the HHP series based on listing prices rather than transaction prices, and how is this limitation addressed?
A: Transaction-price records require local archival effort infeasible across 30 cities over 116 years, and rental transaction data are essentially unavailable historically. The authors argue that hedonic mix-adjustment makes listed prices strong predictors of selling prices during normal market conditions, and that a substantial share of houses transact at their exact listing price. Validation against census distributions and against post-1975 FHFA and Case-Shiller series supports the approach; the authors acknowledge listing prices may diverge from transaction prices at cyclical peaks and troughs.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about the long-run trajectory of real market rents, and how does this revise existing understanding?
A: The HHP series shows real rental price levels in 2006 were approximately 20% higher than in 1890 or 1914, fluctuating within a relatively narrow band over the century. The BLS RoPR series implies real rents fell by nearly half between 1914 and 2006. The HHP findings align with the most influential proposed corrections to the RoPR by Gordon &amp;amp; van Goethem (2007) for 1915–1939 and broadly with Crone et al. (2010) in terms of overall growth levels for 1940–1995, though the HHP series shows a sharper rental spike after World War II rent controls were lifted that the BLS methodology captures only with deliberate lag.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about the interwar housing cycle, and why does the Shiller index miss it?
A: The HHP series documents that real sales prices rose approximately 47% between 1920 and 1928, then fell 27% by 1935, with the 1928 nominal peak not regained until 1946 and the real peak not until 1968. The Shiller index for 1890–1934 is based on a 1934 survey of owner recollections of past transaction prices and assessed values, which the authors argue reflects homeowners&amp;rsquo; lack of awareness of the changing value of their homes over prior decades. The HHP finding is consistent with census data, Nicholas &amp;amp; Scherbina&amp;rsquo;s study of New York City, and Fishback &amp;amp; Kollmann&amp;rsquo;s analysis of New Deal reports.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about the 1953–1974 period, and what explains the divergence from the Shiller index?
A: The HHP series shows housing sales prices increased 21% in real terms between 1953 and 1974, while the Shiller index (based on the Home Purchase Component of the CPI) implies a moderate decline of around 10%. The Shiller index for this period uses a truncated sample of government-backed mortgages subject to FHA loan limits; when the authors truncate their own data using the same statutory FHA limits ($30,000 in 1973, $45,000 in 1974, $60,000 in 1977), approximately 50% of their 1971–1979 listings are excluded and their truncated series matches the Shiller index more closely. This supports the Greenlees (1982) critique of downward bias in the Home Purchase CPI component.&lt;/p&gt;
&lt;p&gt;Q: What are the long-run return components to homeownership at the national level?
A: Average nominal housing returns across 1890–2006 were approximately 11% per year: 3.8% capital gain and 7.2% rental return. Before World War II (1890–1945), average nominal rental returns ranged from 7.9% to 8.3% per sub-period while capital gains averaged near zero or negative in real terms. Only in 1981–2006 did capital gains (averaging 5.8%) exceed the rental return (averaging 5.3%). The return to housing has thus been dominated by rental income over the long run, with the post-1980 era of sustained capital gains constituting a historical anomaly.&lt;/p&gt;
&lt;p&gt;Q: How do rental yields evolve over the sample period?
A: Gross market rental yields exceeded 8% annually for much of 1900–1945, with spikes after both World Wars and a dramatic fall from nearly 11% to below 7% during the early 1920s boom, consistent with a bubble dynamic before the Great Depression. Yields fell to approximately 7% by 1960 and to 3% by 2006. City-level heterogeneity was substantial: rental returns exceeded 15% in some cities in the two decades before the Great Depression, and most cities saw returns above 10% nominally during 1930–1945, while even by 1981–2006 cities like Phoenix and St. Louis averaged above 12%.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about housing and the business cycle?
A: Real growth rates in GDP and housing prices moved in the same direction in 72 of 116 years for sales prices and 65 of 116 years for rental prices. The paper identifies three major downturns where falling rents led falling prices which led falling GDP: the Great Depression (rents fell from 1924, prices from 1929, GDP from 1930), the early 1990s recession (rents from 1988, prices from 1990, GDP from 1991), and the end-of-sample period (rents from 2002). Only after World War I (1920–21) and World War II (1945–46) did clear economic contractions occur without equivalent housing price downturns.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about cross-city variation in housing returns, and what does this imply for the volatility puzzle?
A: Capital gains and rental returns vary substantially across cities and time periods; some cities saw returns exceeding the S&amp;amp;P 500 before World War II (including New York and Chicago), while most underperformed equities from 1981–2006. The authors argue that the apparently low volatility of housing returns at the national level documented by Jordà et al. (2019) is partly an aggregation artifact: local housing markets with very different trajectories are combined into a national index, dampening measured variance. The mild positive correlation between city-level capital gains and rental returns has an R² of 0.24.&lt;/p&gt;
&lt;p&gt;Q: What are the implications for CPI measurement?
A: The HHP series implies nominal rents grew at approximately 3.5% per year from 1914 to 2006, compared with 2.6% per year for the BLS RoPR component, with higher growth concentrated in the years after both World Wars and in the 1965–1985 period. A back-of-the-envelope alternative CPI substituting HHP rental data yields overall price growth of 3.3% per year rather than the official 3.1%. If rental price growth before 1985 is understated in the BLS data, then there has been less improvement in the U.S. standard of living since World War I than was previously understood.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about the role of supply constraints in explaining cross-city price divergence?
A: Natural land constraints are positively linked to price growth throughout the 20th century, with the relationship sharpest during 1930–1945 (before the postwar suburban expansion) and again after 1980. Regulatory constraints—measured at the turn of the millennium—have become an increasingly important driver of cross-city price differences, consistent with zoning functioning as a tax (Gyourko &amp;amp; Krimmel 2021). The paper also finds evidence suggesting land-use regulations are partly driven by expectations of future price growth, consistent with the homeowner-voter hypothesis (Fischel 2015; Trounstine 2018).&lt;/p&gt;
&lt;p&gt;Q: How does the paper validate its series against existing sources?
A: The HHP rental series aligns closely with the Rees and Jacobs (1961) series for 1890–1914. For sales, the HHP series matches the Case-Shiller-Weiss and FHFA repeat-sales indices at both national and city level after 1990 despite methodological differences. The paper finds approximately 25% more price growth than the CSW series over 1975–2006 (117% versus 90% in the 30 HHP cities), attributing some of the divergence to OFHEO appraisal-based valuations before 1992 and the HHP coverage of the broader owned housing market beyond single-family homes.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;Historical Housing Prices (HHP) Project: A dataset of just under 2.7 million newspaper real estate listings from 30 U.S. cities, 1890–2006, used to construct annual, quality-adjusted hedonic price indices for both rented and owned housing segments at the city and national level.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Rolling-windows hedonic methodology: An index construction approach that runs sequential hedonic regressions over two-, three-, or five-year overlapping windows with annual step size, allowing the coefficients on size, type, and location to evolve over time rather than imposing a fixed vector across the full sample period, reducing bias from unobserved quality changes.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Market rent vs. contract rent: Market rent (the listing price for a rental unit actively advertised) is conceptually distinct from contract rent (the rent paid by tenants currently in situ), which is what the BLS RoPR series measures. Market rents adjust to vacancy and lease resets faster than contract rents, producing substantially more short-run volatility and a materially different long-run trend.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Gross rental yield (rent-to-price ratio): Annual rental income from a property divided by its market sales price, computed as RI_{c,t} / HPI_{c,t}. Gross yields exceeded 8% annually for much of 1900–1945 and fell to 3% by 2006 nationally, making rental income the dominant component of total housing returns for most of the century.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Total return to housing: The sum of the capital gain (percentage change in sales price) and the rental return (rental income divided by sales price), computed at annual, city, and national frequency for 1890–2006. The average nominal total return was approximately 11% per year, with 3.8% from capital gains and 7.2% from rental income.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Rent of Primary Residence (RoPR): The BLS survey-based series measuring changes in contract rents for a rotating panel of rental units, used as the shelter component of the CPI. The HHP series implies this series understates rental price growth by approximately 0.9 percentage points per year (3.5% vs. 2.6% nominal growth), concentrated in post-World War periods and 1965–1985, due to tenant non-response bias and delayed incorporation of new construction.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Supply constraints and cross-city divergence: Natural land constraints (geographic barriers to development) and regulatory constraints (zoning and land-use regulation) that limit housing supply, both positively associated with price growth, with regulatory constraints becoming increasingly important after 1980 and consistent with the hypothesis that land-use regulations are partly driven by homeowner expectations of future price appreciation.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;</description></item><item><title>The price of intelligence: How should socially-minded firms price and deploy AI?</title><link>https://macropaperwarehouse.com/papers/the-price-of-intelligence-how-should-socially-minded-firms-price-and-deploy-ai/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-price-of-intelligence-how-should-socially-minded-firms-price-and-deploy-ai/</guid><description>&lt;p&gt;Leading AI firms such as OpenAI and Anthropic publicly claim dual mandates of profit and social welfare, raising the question of whether—and how—a social mandate should change their pricing and deployment decisions. This paper provides a framework to answer this question, deriving a Modified Lerner Rule for socially minded AI firms that extends the standard profit-maximizing Lerner Rule to incorporate incentives for aggregate efficiency, distributional concerns, and labor market stability. Using U.S. data on 525 detailed occupations, the paper evaluates optimal pricing and deployment paths for an AI capable of replacing human labor in each job at 50% of the cost. The main finding is that a welfarist firm (one that values profits and social welfare) should price closer to marginal cost because, for the jobs considered, efficiency gains outweigh distributional concerns—AI does not primarily displace low-income workers. A conservative firm focused on labor market stability should price above the profit-maximizing level in the short run, but not in the long run. The paper concludes that the most pro-social course of action for AI firms with market power is to refrain from exercising that power, and that proposals to tax AI to protect labor markets miss the counteracting role of market power.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-modified-lerner-rule-and-what-motives-does-it-capture"&gt;Q1. What is the Modified Lerner Rule and what motives does it capture?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The Modified Lerner Rule states (P − MC)/P = M/ε, where ε is the demand elasticity and M is a modifier that summarizes the motives of the socially minded firm; M = 1 corresponds to the standard profit-maximizing Lerner Rule.&lt;/strong&gt; The modifier M reflects four distinct considerations: (1) profit motives push M toward 1; (2) aggregate efficiency considerations push M toward 0 (marginal-cost pricing, which maximizes the &amp;ldquo;size of the pie&amp;rdquo;); (3) distributional concerns (who benefits from AI) can be positive or negative depending on whether AI substitutes for high- or low-income workers; and (4) the incentive to minimize labor market disruptions pushes M above 1 in the short run, because the cost of labor disruption is higher while workers are still adjusting, but not in the long run. The formula is derived in a general equilibrium model where the AI firm has a monopoly over an AI capable of replicating human skills.&lt;/p&gt;
&lt;h3 id="q2-what-does-the-welfarist-case-imply-for-pricing"&gt;Q2. What does the welfarist case imply for pricing?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A firm that values both profits and aggregate social welfare should price closer to marginal cost than the profit-maximizing firm, because for all 525 occupations considered, the aggregate efficiency gains from AI adoption outweigh the distributional costs.&lt;/strong&gt; This finding reflects the structure of AI&amp;rsquo;s labor market effects: since AI does not primarily displace low-income workers in the US occupational data used, distributional concerns do not push toward restricting AI access. The welfarist firm therefore faces a dominant efficiency motive to expand access by pricing down toward marginal cost, accepting lower profits in exchange for greater welfare gains from AI adoption.&lt;/p&gt;
&lt;h3 id="q3-what-does-the-conservative-case-imply"&gt;Q3. What does the conservative case imply?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A firm focused solely on labor market stability should price above the profit-maximizing level in the short run, because restricting AI deployment reduces the speed of worker displacement; but this above-profit-maximizing pricing is optimal only temporarily, and converges toward profit-maximizing pricing in the long run as workers adjust.&lt;/strong&gt; The intuition is that the cost of disrupting the labor market is highest when workers have not yet adjusted—their human capital is not yet redeployed—so a conservative firm acts as a gradual deployer. This conservative pricing is distinct from the welfarist case: the conservative motive restricts access more than a welfarist mandate, since it is willing to sacrifice efficiency to slow disruption.&lt;/p&gt;
&lt;h3 id="q4-why-does-the-paper-argue-against-taxing-ai"&gt;Q4. Why does the paper argue against taxing AI?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper argues that proposals to tax AI firms to protect workers from displacement overlook the fact that AI firms may already exercise significant market power, which protects workers by restricting AI supply below the efficient level.&lt;/strong&gt; Adding a tax on top of an already-restricted supply would harm consumers (who face high AI prices and limited access) without providing meaningful additional protection for workers (since output is already suppressed by market power). The paper&amp;rsquo;s analysis implies that the first-order social priority is to have AI firms refrain from exercising their market power—by pricing closer to marginal cost—rather than further restricting supply through taxation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Modified Lerner Rule&lt;/strong&gt; : (P − MC)/P = M/ε, where M captures a socially minded firm&amp;rsquo;s weighting of profit, aggregate efficiency, distributional, and stability motives; the paper&amp;rsquo;s key pricing formula, derived from a GE model with a monopoly AI firm.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;welfarist vs. conservative firm&lt;/strong&gt; : two polar cases: the welfarist firm maximizes a weighted sum of profits and aggregate welfare (implying near-marginal-cost pricing); the conservative firm prioritizes labor market stability (implying above-profit-maximizing pricing in the short run to slow displacement).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;labor disruption cost&lt;/strong&gt; : the welfare cost to workers of being displaced by AI, which is higher in the short run when workers must reallocate across jobs or sectors and lower in the long run after adjustment; the paper&amp;rsquo;s formal treatment of this cost motivates the conservative firm&amp;rsquo;s gradual deployment strategy.&lt;/p&gt;</description></item><item><title>The role of wage expectations in the labor market</title><link>https://macropaperwarehouse.com/papers/the-role-of-wage-expectations-in-the-labor-market/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-role-of-wage-expectations-in-the-labor-market/</guid><description>&lt;p&gt;This paper develops a Mortensen-Pissarides (DMP) search and matching model with internally rational (IR) agents who hold subjective beliefs about wages rather than perfect knowledge of the Nash bargaining outcome. The standard DMP model struggles with two empirical regularities: high volatility of U.S. labor market variables relative to productivity, and a near-zero correlation between labor market tightness and productivity post-1989. The IR model significantly improves alignment with U.S. labor market data relative to the standard rational expectations benchmark, by generating a self-referential belief mechanism: shifts in beliefs about the future returns to labor affect current wages, which agents use to update beliefs. Wage expectations in the model are consistent with European Commission professional forecasters data, and an econometric test rejects the rational expectations null hypothesis for survey real wage expectations.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on a working paper version, AI-assisted and human-reviewed. See the linked published article for the authoritative version.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-internal-rationality-and-how-does-it-differ-from-standard-rational-expectations"&gt;Q1. What is internal rationality and how does it differ from standard rational expectations?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Internal rationality (IR) means agents know all internal aspects of their optimization problem and maximize their objectives given their knowledge, but lack perfect information about the equilibrium wage function that emerges from Nash bargaining; they therefore hold subjective beliefs about wages.&lt;/strong&gt; Under standard rational expectations, workers and firms know the exact wage function from Nash bargaining. Under IR, they have limited foresight about the outcome of wage negotiations and use a subjective model to form wage expectations. This is a small but disciplined departure from RE: the paper considers belief systems implying only a small deviation from rational expectations that match aspects of survey wage expectations.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-empirical-failure-of-the-standard-dmp-model-that-motivates-the-paper"&gt;Q2. What is the empirical failure of the standard DMP model that motivates the paper?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The standard DMP model fails on two counts: it cannot reproduce the high observed volatility of unemployment, vacancies, and market tightness relative to productivity, and it cannot generate the near-zero post-1989 correlation between productivity and labor market tightness.&lt;/strong&gt; The first failure—the Shimer (2005) puzzle—has attracted extensive research, but the near-zero tightness-productivity correlation has been largely neglected. The paper shows that allowing for small deviations from rational expectations in the form of internal rationality resolves both puzzles simultaneously.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-self-referential-belief-mechanism-and-how-does-it-generate-extra-dynamics"&gt;Q3. What is the self-referential belief mechanism and how does it generate extra dynamics?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The model has a self-referential mechanism: shifts in beliefs about future returns to labor affect current wages, and agents use realized wages to update their beliefs about future wages; this creates an additional dynamic source beyond technology shocks that helps match the data.&lt;/strong&gt; When firms and workers revise beliefs about future wages upward, current wages rise through the Nash bargaining outcome (since reservation values of both parties shift); this realization then feeds back into updating beliefs, generating wage and employment dynamics not tied to current productivity. This mechanism provides a microfoundation for previous adaptive learning models of unemployment.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-empirical-validation-of-the-models-wage-expectations"&gt;Q4. What is the empirical validation of the model&amp;rsquo;s wage expectations?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Wage expectations in the IR model are validated against survey data from European Commission professional forecasters, and an econometric test rejects the rational expectations null hypothesis for real wage expectations from survey data.&lt;/strong&gt; The consistency between model-implied and surveyed wage expectations provides external validation for the IR departure from RE, showing that the subjective beliefs assumed in the model correspond to beliefs actually held by professional forecasters rather than to arbitrary deviations.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;internal rationality (IR)&lt;/strong&gt; : a bounded rationality concept in which agents fully optimize given their beliefs and knowledge of their own decision problem, but lack perfect knowledge of equilibrium objects (here, the wage function emerging from Nash bargaining); allows small, disciplined deviations from rational expectations.
&lt;strong&gt;DMP model&lt;/strong&gt; : the Mortensen-Pissarides-Diamond search and matching model; the standard theory of equilibrium unemployment; criticized for generating insufficient labor market volatility relative to productivity (the Shimer puzzle) and for counterfactual positive tightness-productivity correlation.
&lt;strong&gt;belief shock&lt;/strong&gt; : an exogenous shift in agents&amp;rsquo; subjective beliefs about future wages; generates employment and wage dynamics independently of current productivity shocks via the self-referential mechanism; introduced as an additional structural shock in the IR-DMP model.&lt;/p&gt;</description></item><item><title>The Social Tax: Redistributive Pressure and Labor Supply</title><link>https://macropaperwarehouse.com/papers/the-social-tax-redistributive-pressure-and-labor-supply/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-social-tax-redistributive-pressure-and-labor-supply/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks whether informal redistributive pressure — the social obligation to share earned income with kin and social networks — distorts labor supply in low-income communities. The authors conceptualize such pressure as a &amp;ldquo;social tax&amp;rdquo; on earnings and develop the first direct causal test of whether it reduces labor supply, output, and earnings among full-time workers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting and Sample&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The study works with 474 full-time piece-rate factory workers (464 of whom are women) employed in cashew processing plants run by Olam in Côte d&amp;rsquo;Ivoire. Workers are paid biweekly in cash entirely through piece rates for individual nut-peeling output, creating a direct mapping between labor supply and income. At baseline, workers report transferring 25–35% of their income to individuals outside their household, with 77% having made at least one transfer in the previous 3 months. Workers also strongly believe that earning more triggers more transfer requests: 77% agree that if someone starts earning more by working harder, people will ask that person more often for financial help.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intervention&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors introduce a blocked savings account into which workers can deposit any earnings above a self-chosen threshold (set at least as high as their own baseline average earnings). Earnings above the threshold are automatically deposited by the factory directly into the account with the Banque Populaire de Côte d&amp;rsquo;Ivoire; the cash component of pay is unchanged. Funds cannot be withdrawn until the end of the blocked period (9 months in Phase 1; 3 months in Phase 2). The key design feature is that the account reduces the effective social tax rate only on earnings &lt;em&gt;increases&lt;/em&gt; above baseline, thereby eliminating income effects and generating only a pure substitution effect — an unambiguous positive prediction on labor supply if a social tax exists.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Experimental Design&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Workers are randomized into three conditions: (1) Control (no account); (2) Private account (existence unknown to anyone outside the worker); (3) Non-private account (existence and forthcoming unblock date revealed to network members via promotional text messages). The contrast between Private and Non-private isolates the role of redistributive pressure specifically — holding constant all other features of the blocked account product. The experiment runs in two cross-randomized phases conducted between 2018 and 2019.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Take-up of blocked accounts is dramatically higher when accounts are private: 60% in Phase 2 (Private) versus 14% (Non-private), a 77% decline (p&amp;lt;0.001). Among workers who declined Non-private accounts, 96% cite anticipated increases in transfer requests as an important factor.&lt;/p&gt;
&lt;p&gt;Being offered a Private account sharply raises labor supply. Pooling both phases, the Private arm increases average daily earnings by 175.9 FCFA, or &lt;strong&gt;11.4%&lt;/strong&gt; (p=0.012), relative to Control or Non-private arms. This is accompanied by a &lt;strong&gt;6.2 percentage point (9.7%)&lt;/strong&gt; increase in daily work attendance (p=0.023), with the entire attendance effect driven by reduced absenteeism rather than turnover. Effects in Phase 1 (Private vs. Control: +11.3%, p=0.032) and Phase 2 (Private vs. Non-private: +11.5%, p=0.043) are nearly identical in magnitude, indicating the results are not sensitive to cross-phase design. The treatment effect magnitude is equivalent to each worker working an additional 1.19 days in every two-week paycycle. Because 89% of workers have no income outside the factory, these constitute increases in total earned income.&lt;/p&gt;
&lt;p&gt;Heterogeneity is consistent with the hypothesized mechanism: among workers who report difficulty saving due to redistributive pressure, the Private treatment increases earnings by &lt;strong&gt;15.0%&lt;/strong&gt; (p=0.018); among those not reporting such difficulty, the estimated effect is near zero and insignificant (p=0.95). Among workers who report transfers to acquaintances (the most likely social-tax-motivated transfers), the effect is &lt;strong&gt;17.5%&lt;/strong&gt; (p=0.014). Workers without a partner — for whom intra-household redistribution is irrelevant — experience a &lt;strong&gt;15.8%&lt;/strong&gt; earnings increase (p=0.017), indicating that extra-household pressure drives the results.&lt;/p&gt;
&lt;p&gt;Outgoing transfers do not decline. The design leaves cash-on-hand unchanged by construction, and consistent with this, there is no significant change in the likelihood or amount of transfers from treated workers to their networks. Total outgoing transfers are if anything higher among Private account workers (p=0.049), suggesting no loss in redistribution to the network.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Social Tax Rate Estimation&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Combining the 11.4% treatment effect on output with a labor supply elasticity estimated from an end-of-experiment piece-rate randomization (intensive-margin elasticity of 0.17; total elasticity of approximately 1.11), the authors estimate the social tax rate for the average worker in the sample at &lt;strong&gt;9–14%&lt;/strong&gt;. For the subset who actually take up Private accounts, the implied social tax rate is &lt;strong&gt;19–23%&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results pertain to full-time female piece-rate workers in formal cashew processing plants in Côte d&amp;rsquo;Ivoire, with average tenure of 1.7 years. Because the intervention lowers the tax only on earnings &lt;em&gt;above&lt;/em&gt; baseline (not on all earnings), the estimates do not directly capture the total distortion from eliminating all redistributive pressure. Alternative confounds — fairness/morale effects, self-control, privacy concerns, goal-setting — are each tested and ruled out as primary drivers.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-theoretical-basis-for-predicting-that-private-accounts-unambiguously-increase-labor-supply"&gt;Q1. What is the theoretical basis for predicting that Private accounts unambiguously increase labor supply?&lt;/h3&gt;
&lt;p&gt;The authors model redistributive pressure as a social tax rate τ₁ on gross earnings. The blocked account reduces this tax to τ₂ &amp;lt; τ₁ only on earnings &lt;em&gt;above&lt;/em&gt; baseline labor supply e₁, creating a kink in the budget constraint. Starting from e₁, the worker faces only a pure substitution effect (no income effect) when τ₂ falls, because her net earnings at e₁ are unchanged. Equation (2) in the paper shows formally that the income effect term drops out, and the derivative of labor supply with respect to τ₂ is unambiguously negative (i.e., reducing τ₂ increases effort). This &amp;ldquo;clean&amp;rdquo; prediction — no income effect, no ambiguity — is the central design advantage relative to simply shielding existing earnings.&lt;/p&gt;
&lt;h3 id="q2-how-do-take-up-rates-differ-between-private-and-non-private-accounts-and-what-do-workers-say-explains-the-difference"&gt;Q2. How do take-up rates differ between Private and Non-private accounts, and what do workers say explains the difference?&lt;/h3&gt;
&lt;p&gt;In Phase 2, take-up of Private accounts is 60% versus only 14% for Non-private accounts — a 77% reduction (p&amp;lt;0.001). Among workers who declined a Non-private account, 96% cite the anticipation of increased transfer requests from network members knowing about the account as an important factor in their decision. Only 5% cite any other reason. This pattern is strong direct evidence that the fear of redistribution — not other features of the accounts — drives take-up differences.&lt;/p&gt;
&lt;h3 id="q3-what-are-the-treatment-effects-on-earnings-and-attendance-and-how-consistent-are-they-across-phases-and-subsamples"&gt;Q3. What are the treatment effects on earnings and attendance, and how consistent are they across phases and subsamples?&lt;/h3&gt;
&lt;p&gt;Pooled across both phases, the Private arm raises daily earnings by 175.9 FCFA (11.4%, p=0.012) and attendance by 6.2 percentage points (9.7%, p=0.023). In Phase 1 alone (Private vs. Control), earnings rise 11.3% (p=0.032). In Phase 2 alone (Private vs. Non-private), earnings rise 11.5% (p=0.043). Restricting to workers not previously treated in Phase 1, the effect is 12.8% (p=0.034); restricting further to workers new to the study in Phase 2 only, the effect is 17.3% (p=0.020). The authors cannot reject that effects across these three Phase 2 subsamples are statistically the same (p=0.427), ruling out sensitivity to the cross-randomized design.&lt;/p&gt;
&lt;h3 id="q4-how-does-treatment-effect-heterogeneity-support-the-redistributive-pressure-mechanism"&gt;Q4. How does treatment effect heterogeneity support the redistributive pressure mechanism?&lt;/h3&gt;
&lt;p&gt;Workers who report difficulty saving because &amp;ldquo;someone else will need it for something urgent&amp;rdquo; see earnings increase by 15.0% (p=0.018) from the Private treatment; those not reporting this difficulty see near-zero, insignificant effects (p=0.95). Workers who make transfers to acquaintances — transfers especially unlikely to reflect altruism — see earnings rise 17.5% (p=0.014). Workers with below-median baseline earnings, potentially those facing the strongest relative disincentive to work, see larger effects. Each of these heterogeneous patterns is in the direction predicted if the social tax is the operative mechanism.&lt;/p&gt;
&lt;h3 id="q5-do-the-treatment-effects-reflect-substitution-away-from-outside-earnings-or-genuine-total-income-gains"&gt;Q5. Do the treatment effects reflect substitution away from outside earnings or genuine total income gains?&lt;/h3&gt;
&lt;p&gt;No. The paper finds no treatment effects on earnings outside the factory. At baseline, 89% of workers report zero outside earnings, and on average 93% of total income comes from factory wages. Consequently, the 11.4% earnings increase represents a near-one-for-one increase in total earned income.&lt;/p&gt;
&lt;h3 id="q6-do-private-accounts-reduce-transfers-to-the-network"&gt;Q6. Do Private accounts reduce transfers to the network?&lt;/h3&gt;
&lt;p&gt;No. The design ensures that cash-on-hand is unchanged by construction — workers receive the same or slightly higher take-home cash pay (the difference is positive but insignificant). Consistent with this, neither the probability of making transfers (p=0.37) nor transfers to family (p=0.35) or non-family (p=0.93) change significantly. Total outgoing transfers in the endline survey are if anything higher in the Private arm (p=0.049, though this may partly reflect redistribution of unblocked savings). The net transfer amount is positive but insignificant (p=0.32). The authors conclude the intervention did not make others in workers&amp;rsquo; networks worse off.&lt;/p&gt;
&lt;h3 id="q7-how-do-the-authors-rule-out-morale-or-fairness-effects-as-an-explanation"&gt;Q7. How do the authors rule out morale or fairness effects as an explanation?&lt;/h3&gt;
&lt;p&gt;Treatment assignment was conducted by lottery with ID numbers drawn in front of workers, clearly dissociating it from employer favoritism. More directly, the authors test for morale effects using the 3–4 week &amp;ldquo;announcement period&amp;rdquo; between treatment disclosure and account activation. If disgruntlement among non-Private workers drove results, output should fall during this period — but estimated announcement effects are near zero (0.8% of control mean, p=0.859 in Phase 2). In contrast, effects arise immediately in the first active paycycle: earnings jump 11.4% (p=0.082) even before workers have seen any deposits occur. The fairness story also cannot explain why effects are concentrated precisely among workers who report more redistributive pressure.&lt;/p&gt;
&lt;h3 id="q8-how-do-the-authors-test-and-rule-out-self-control-as-the-primary-mechanism"&gt;Q8. How do the authors test and rule out self-control as the primary mechanism?&lt;/h3&gt;
&lt;p&gt;Self-control cannot explain why Non-private accounts — which offer the same commitment benefit — have dramatically lower take-up than Private accounts. Separately, the authors test a core prediction of time inconsistency models by surprising workers with an option to opt out of the next deposit, randomly varying whether the offer comes 4 days before payday or on payday itself. Under quasi-hyperbolic preferences, workers should be more likely to opt out on the payday itself. Counter to this prediction, 94% of workers keep their earnings in the account on payday, compared to 86% four days before — and these means are not statistically distinguishable, with the relative magnitudes actually running opposite to time inconsistency predictions.&lt;/p&gt;
&lt;h3 id="q9-how-do-the-authors-address-the-concern-that-non-private-accounts-may-raise-the-tax-rate-above-the-baseline-inflating-treatment-effect-estimates"&gt;Q9. How do the authors address the concern that Non-private accounts may raise the tax rate above the baseline, inflating treatment effect estimates?&lt;/h3&gt;
&lt;p&gt;The concern is that Non-private SMS alerts could make network members more aware of available cash than under the status quo, pushing the effective comparison above the Control level. The authors note that (a) paydays are already publicly known in this setting and workers regularly face transfer requests around them; (b) workers must physically withdraw savings from a bank after the unblock date, and can even re-block funds; and (c) the magnitude of effects when comparing Private to Control is nearly identical to the effect when comparing Private to Non-private (11.3% vs. 11.5%), suggesting the Non-private condition does not materially raise the tax above the status quo.&lt;/p&gt;
&lt;h3 id="q10-how-do-the-authors-rule-out-privacy-concerns-rather-than-redistributive-pressure-as-the-driver-of-low-non-private-take-up-and-treatment-effects"&gt;Q10. How do the authors rule out privacy concerns (rather than redistributive pressure) as the driver of low Non-private take-up and treatment effects?&lt;/h3&gt;
&lt;p&gt;Four arguments are provided. First, Phase 1 effects (Private vs. Control, no Non-private arm) are the same magnitude as Phase 2 effects, yet Phase 1 cannot be confounded by privacy concerns. Second, among workers who refused Non-private accounts, 96% cite transfer request anticipation; none volunteer generic privacy concerns. Third, heterogeneity effects — concentrated among high-redistributive-pressure workers — have no obvious connection to privacy preferences. Fourth, two placebo SMS exercises: 95% of Non-private workers grant permission to send generic bank promotional texts, and 88% of workers who had Phase 1 Private accounts grant permission for messages about their past (already-spent) savings — indicating no inherent aversion to having some financial information shared with networks. Since these workers forgo 11.5% of full-time earnings by refusing Non-private accounts, privacy concerns alone are implausible as a full explanation.&lt;/p&gt;
&lt;h3 id="q11-how-is-the-social-tax-rate-estimated-and-what-does-the-range-look-like"&gt;Q11. How is the social tax rate estimated and what does the range look like?&lt;/h3&gt;
&lt;p&gt;The authors combine the 11.4% ITT treatment effect (used as the ratio e₁/e₂) with a compensated labor supply elasticity ζ estimated from an end-of-experiment piece-rate randomization. The piece-rate experiment (varying piece rates over four values from −15% to +30% of baseline over 6 days) yields an intensive-margin elasticity of 0.17. Using the ratio of attendance to intensive-margin effects from Table 3, the implied extensive-margin elasticity is 0.94, giving ζ ≈ 1.11. With this elasticity and assuming τ₂ = 0 (most conservative), the ITT-implied social tax rate is 9%; assuming τ₂ = 5%, it is 14%. For compliers (workers who actually take up Private accounts), the estimated rate is 19–23%. If instead the lower elasticity estimate of 0.32 (comparable to Goldberg 2016) is used, the ITT tax rate would be at least 29%.&lt;/p&gt;
&lt;h3 id="q12-what-are-the-broader-implications-discussed-by-the-authors"&gt;Q12. What are the broader implications discussed by the authors?&lt;/h3&gt;
&lt;p&gt;The authors propose that if redistributive pressure distorts work incentives, it may also distort other costly income-generating actions: technology adoption, human capital investment, and formal sector participation. They note that 74% of workers believe taking a formal job would increase transfer requests, even though network members could also access such jobs. A speculative but highlighted policy implication is that formal safety nets (health or unemployment insurance) could reduce social tax burdens on non-recipients by absorbing demand for redistribution, potentially generating positive productivity externalities.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Social Tax&lt;/strong&gt;: The paper&amp;rsquo;s central concept. Redistributive pressure from kin and social networks is modeled as a tax rate τ₁ on gross earnings — not altruistic transfers, but transfers made under social pressure that workers would prefer to avoid. The &amp;ldquo;tax&amp;rdquo; analogy captures that the obligation is proportional to visible income and reduces the private return to earning more. The paper explicitly does not take a stance on the underlying microfoundation (risk-sharing, cultural norms, or a mix).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Blocked Savings Account&lt;/strong&gt;: A date-based savings account (implemented with Banque Populaire de Côte d&amp;rsquo;Ivoire) into which any earnings above a worker-chosen threshold are automatically deposited by the factory. Funds are inaccessible until the blocked period ends (3–9 months). Workers cannot withdraw during the period, making deposited earnings unavailable to fulfill transfer requests and therefore effectively reducing the social tax rate on earnings increases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Private vs. Non-private Treatment&lt;/strong&gt;: The paper&amp;rsquo;s key experimental contrast. A Private account&amp;rsquo;s existence is unknown to anyone in the worker&amp;rsquo;s network. A Non-private account triggers SMS messages to network members disclosing that the worker is saving and announcing when the unblock date approaches. The contrast isolates whether the shielding of income from social visibility — not the commitment device per se — drives take-up and labor supply.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Substitution Effect without Income Effect&lt;/strong&gt;: The paper&amp;rsquo;s design deliberately places the tax reduction only on earnings &lt;em&gt;above&lt;/em&gt; baseline, creating a kink in the budget constraint. Starting from the existing labor supply level, there is no change in net earnings at the margin — eliminating the income effect of a tax reduction — so any labor supply response is a pure compensated (substitution) effect. This makes any observed increase in labor supply an unambiguous signal that a distortionary social tax exists.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intent to Treat (ITT) vs. Treatment on the Treated (ToT)&lt;/strong&gt;: The ITT estimate (11.4% earnings increase) reflects the effect of being &lt;em&gt;offered&lt;/em&gt; a Private account on all offered workers, including those who did not take up. The ToT estimate — relevant for workers who actually used the accounts — implies a higher social tax rate (19–23%) because only roughly half of offered workers take up the accounts and only those workers face a materially reduced effective tax rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Compensated (Hicksian) Labor Supply Elasticity (ζ)&lt;/strong&gt;: The ratio used to infer the social tax rate from the observed treatment effect. The paper estimates ζ ≈ 1.11 (extensive margin ζₐ ≈ 0.94, intensive margin ζₑ ≈ 0.17) from an end-of-experiment piece-rate randomization. The social tax rate is recovered as τ₁ = 1 − (1−τ₂)(e₁/e₂)^(1/ζ) from Equation (5).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Piece Rate Setting&lt;/strong&gt;: Workers earn a linear piece rate for every kilogram of cashews peeled, with no fixed pay component. This setting ensures that every unit of additional effort by a worker translates directly into higher earnings, and that any observed earnings changes cleanly reflect labor supply responses rather than hour or schedule effects.&lt;/p&gt;</description></item><item><title>The Zero-Beta Interest Rate</title><link>https://macropaperwarehouse.com/papers/the-zero-beta-interest-rate/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-zero-beta-interest-rate/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper proposes and measures the zero-beta rate — the expected return on a portfolio of stocks with zero market beta, constructed to be orthogonal to the SDF innovations spanned by standard factors — as the correct intertemporal price of consumption, and argues that safe interest rates (Treasury bill yields) are not. Using 130 stock portfolios (81 sorted on combinations of beta, size, value, investment, and profitability; 49 industry portfolios) and GMM estimation with five macro instruments (T-bill yield, inflation, term spread, excess bond premium, and the U6 unemployment rate) over January 1973 to December 2020, the paper estimates the zero-beta rate to average 8.3% per year in real terms with a standard deviation of 9.3%, producing a spread of roughly 7.6% per year over the expected real Treasury bill yield. The paper then shows that this zero-beta rate fits the aggregate consumption Euler equation remarkably well: the macro instruments that best predict the real return of the zero-beta portfolio are nearly proportional to those that predict real consumption growth, a non-mechanical result that survives when the sample is restricted to exclude COVID. Statistical Euler equation tests (Stock-Wright [2000] weak-instrument-robust GMM) reject the Euler equation for all IES values when applied to the Treasury bill, fail to reject it for any IES value when applied to the volatile market return (weak identification), but fail to reject it only for IES below 0.5 (risk aversion above 2) when applied to the zero-beta rate — providing identification from the intermediate predictability of the zero-beta portfolio. Monetary policy shock regressions using Romer-Romer and Nakamura-Steinsson shocks further show that an unexpected monetary tightening raises the real Treasury bill yield but lowers the real zero-beta rate, consistent with the Euler equation&amp;rsquo;s prediction that the intertemporal price should fall when expected consumption growth falls. Finally, the high level and volatility of the zero-beta rate implies that the entire variation of the price-dividend ratio of a consumption claim can be attributed to variation in the zero-beta rate without requiring time-varying equity risk premia — resolving the equity premium puzzle and Campbell&amp;rsquo;s [1991] excess volatility puzzle simultaneously, at the cost of an unexplained convenience spread on safe assets.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-zero-beta-rate-and-how-does-the-paper-construct-it"&gt;Q1. What is the zero-beta rate and how does the paper construct it?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The zero-beta rate is the expected return on a portfolio of stocks that is constructed to have zero covariance with all included asset-pricing factors; in a model where the SDF innovations are spanned by those factors, the expected return of this portfolio equals the intertemporal marginal rate of substitution — the correct price for rearranging consumption over time.&lt;/strong&gt; The paper follows a three-step procedure. First, it estimates the betas of 130 CRSP stock portfolios with respect to seven factors (Fama-French 5-factor model augmented with a bond excess return factor and a default spread factor) using the Ledoit-Wolf (2017) shrinkage estimator for the factor covariance matrix, to mitigate the rank problem arising from 130 portfolios and 574 monthly observations. Second, it uses the betas to construct the minimum-variance zero-beta portfolio — the portfolio that minimizes return variance subject to having zero exposure to each factor, exploiting all 130 portfolios. Third, it regresses the return of this portfolio on five macro instruments (T-bill yield, lagged inflation, term spread, excess bond premium [EBP], and U6 unemployment) using GMM, in an exactly-identified system in which the same instruments used to predict the portfolio return are used as moment conditions. The fitted value of this regression is the zero-beta rate — the predictable component of the zero-beta portfolio return.&lt;/p&gt;
&lt;p&gt;The key econometric property is that the GMM procedure simultaneously estimates the factor loadings and the predictive regression in a way that accounts for the estimation error in both. The standard errors for the predictive coefficients (γ) account for the fact that the betas used to construct the zero-beta portfolio are themselves estimated. Comparatively, an infeasible OLS regression predicting the zero-beta portfolio return has nearly identical point estimates and only slightly smaller standard errors, showing that the main estimation uncertainty comes from the predictive regression rather than from the beta estimation step.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-key-properties-of-the-estimated-zero-beta-rate"&gt;Q2. What are the key properties of the estimated zero-beta rate?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The estimated zero-beta rate has three properties that distinguish it from safe rates and make it a plausible intertemporal price: it is predictable by macro instruments (with all predictors except the unemployment rate individually significant at 5%), it is high and volatile (8.3% annually on average, std dev 9.3%, compared to a low and stable Treasury bill yield), and it co-moves in the direction economic theory predicts with macro conditions.&lt;/strong&gt; The spread between the zero-beta rate and the expected real Treasury bill yield averages roughly 7.6% per year. The average real zero-beta rate is similar to the average real return of the CRSP market index (which averages 11.8% annualized nominal, or about 8.1% real), consistent with earlier estimates of the average zero-beta return by Hong and Sraer (2016) and Bali et al. (2017). The standard deviation of the zero-beta portfolio&amp;rsquo;s excess return over its expected value is about 2.7% per month (9.4% annualized), substantially below the standard deviation of the market return, which is why the paper can reject predictability for the zero-beta portfolio even though predicting market returns is notoriously difficult.&lt;/p&gt;
&lt;p&gt;In terms of time-series patterns: (1) the zero-beta rate increases more than one-for-one with the Treasury bill yield (consistent with Treasury bills having money-like qualities per Nagel [2016]); (2) it is decreasing in lagged inflation; (3) it falls when a recession is likely — specifically, the unemployment rate, term spread, and EBP collectively predict the zero-beta rate so that it is particularly low when U6 unemployment is low, the yield curve is inverted, and the EBP is high, conditions associated with elevated recession risk (Kiley [2022]).&lt;/p&gt;
&lt;h3 id="q3-how-does-the-zero-beta-rate-fit-the-aggregate-consumption-euler-equation"&gt;Q3. How does the zero-beta rate fit the aggregate consumption Euler equation?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The central empirical finding is that the macro instruments that best predict the real return of the zero-beta portfolio are nearly proportional to the macro instruments that best predict real consumption growth — a non-mechanical result because the two regressions are estimated entirely separately, on different data series, with no consumption data used to construct the zero-beta rate.&lt;/strong&gt; In the linearized Euler equation, the real zero-beta rate should predict real consumption growth in proportion to 1/σ (the IES). If the prediction coefficients on the instruments for the zero-beta rate are γ₀, and the corresponding coefficients for consumption growth are γc, then the vector Δ = γ₀ − σ × γc should be close to zero. Graphically, the expected real zero-beta rate and expected real consumption growth track each other closely over the full sample (Figure 1 in the paper), while the expected real Treasury bill return bears essentially no resemblance to expected consumption growth. With five instruments (L=5), the result is not mechanical: one would need L=1 to always find a σ that makes the result hold. The result is even stronger when the sample ends in December 2019, excluding the COVID episode, because the COVID consumption collapse introduces four-to-seventeen standard deviation consumption growth realizations that attenuate the fit.&lt;/p&gt;
&lt;p&gt;Robustness: ridge-penalized estimation (using cross-validation to minimize out-of-sample squared forecast error) substantially attenuates both expected consumption growth and the zero-beta rate toward zero, but they remain approximately proportional. The result holds across alternative specifications (different factor models, different instruments) tested in Appendix Section G.&lt;/p&gt;
&lt;h3 id="q4-what-do-the-statistical-euler-equation-tests-find"&gt;Q4. What do the statistical Euler equation tests find?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Using Stock-Wright (2000) weak-instrument-robust GMM tests of the non-linear consumption Euler equation, the paper finds: (a) the T-bill return fails the Euler equation test for all IES values (rejected); (b) the market return fails to reject the Euler equation for almost any IES values (weak-instruments problem, consistent with Yogo [2004]); and (c) the zero-beta rate fails to reject the Euler equation for IES values below 0.5 (σ above 2, i.e. risk aversion above 2 under CRRA), and rejects for IES above 0.5.&lt;/strong&gt; The procedure tests the instrumented non-linear Euler equation: for a conjectured value of σ, it estimates δ (the discount factor) from the unconditional Euler moment, then tests the instrumented Euler moments for each of the five instruments. The test statistic is chi-square with 5 degrees of freedom; the confidence set is the set of σ values that cannot be rejected. The zero-beta portfolio&amp;rsquo;s intermediate predictability (between the easily-predicted T-bill and the hard-to-predict market return) provides the identification that gives this test meaningful power. The paper obtains its preferred IES estimate of approximately 0.2 (σ ≈ 5) by noting that scaling the zero-beta rate down by a factor of five makes it match expected consumption growth most closely.&lt;/p&gt;
&lt;p&gt;The test faces a boundary problem: when σ is very large (above 10), the April 2020 consumption collapse creates a very large SDF realization that dwarfs all others, making the variance-covariance matrix nearly singular and the test uninformative. For this reason, the analysis is restricted to σ ≤ 10.&lt;/p&gt;
&lt;h3 id="q5-what-happens-to-the-zero-beta-rate-after-a-monetary-policy-shock"&gt;Q5. What happens to the zero-beta rate after a monetary policy shock?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Using the Romer-Romer (2004) and Nakamura-Steinsson (2018) identified monetary policy shocks, the paper finds that a surprise tightening raises the nominal and real Treasury bill yield but lowers the real zero-beta rate — opposite to what the standard Euler equation with the Treasury bill predicts.&lt;/strong&gt; Both shocks are normalized to a 100-basis-point increase in the federal funds rate on impact. The estimated effect on the real Treasury bill yield is an immediate increase of roughly the same magnitude (slightly more transitory for the Romer-Romer shock). In contrast, the real zero-beta rate falls following the shock, and the effect is larger and more persistent for the Nakamura-Steinsson shock.&lt;/p&gt;
&lt;p&gt;This result is consistent with the consumption Euler equation applied to the zero-beta rate, because the monetary shock also lowers expected consumption growth (well-established in the impulse-response literature). The decomposition in Appendix Section E shows the mechanism: a higher Treasury bill yield raises the zero-beta rate, but the monetary tightening also flattens the yield curve and widens credit spreads (raises the EBP); since both the term spread and EBP are predictors of the zero-beta rate with large coefficients, the indirect effects through these variables dominate and lower the zero-beta rate overall. This finding resolves a tension in structural macro models: while the standard New Keynesian model uses the Euler equation with the safe rate (as in Smets and Wouters [2003, 2007]) and requires habits or wedges to match the hump-shaped consumption response to monetary shocks, the Euler equation with the zero-beta rate is satisfied without additional mechanisms. A stylized three-period New Keynesian model in Appendix Section F shows a monetary tightening can simultaneously raise the safe rate and lower the zero-beta rate through endogenous changes in the convenience spread.&lt;/p&gt;
&lt;h3 id="q6-can-the-zero-beta-rate-explain-valuation-ratio-variation-without-time-varying-risk-premia"&gt;Q6. Can the zero-beta rate explain valuation ratio variation without time-varying risk premia?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper shows that the zero-beta rate is sufficiently high, volatile, and persistent to generate the observed variation in the price-dividend ratio of a consumption claim under the assumption of a constant and small equity risk premium, reproducing the finding of Campbell (1991) that discount rates must vary — but attributing the variation to the zero-beta rate rather than to time-varying excess returns.&lt;/strong&gt; The Campbell-Shiller decomposition of the log price-dividend ratio implies that the expected price-dividend ratio must predict either future real zero-beta rates, future excess returns, or a combination. Under the hypothesis of constant expected excess returns, the price-dividend ratio variation is driven entirely by zero-beta rate variation. In a VAR that includes the five macro instruments plus the CAPE ratio, the implied standard deviation of expected log price-dividend ratio of a consumption claim is approximately 28% — comparable to the 27% standard deviation in the Campbell-Cochrane (1999) model calibration, which achieves this variation through habit formation and time-varying risk premia. The equivalent calculation using the Treasury bill yield rather than the zero-beta rate produces a standard deviation of only 9%, consistent with Campbell&amp;rsquo;s (1991) original finding that the risk-free rate variation is insufficient.&lt;/p&gt;
&lt;p&gt;The paper interprets this as a resolution of both the equity premium puzzle (the average zero-beta return is approximately equal to the average market return, implying a roughly zero equity premium over the zero-beta rate) and the excess volatility puzzle (the zero-beta rate generates sufficient variation in the discount rate). The trade-off is that the unexplained spread between the zero-beta rate and the Treasury bill yield — averaging 7.6% annually — is instead left as a &amp;ldquo;convenience puzzle.&amp;rdquo; The paper argues this reframing is progress because the type of models required to explain a large convenience spread on safe assets (frictions, segmented markets, money-like demand for liquid assets) are quite different from those designed to explain large time-varying risk premia.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;dl&gt;
&lt;dt&gt;&lt;strong&gt;zero-beta rate&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the expected return on the minimum-variance portfolio of stocks that has zero covariance with each of the included asset-pricing factors; in a model where the SDF innovations are spanned by those factors, equals the conditional expectation of the intertemporal marginal rate of substitution, making it the correct intertemporal price of consumption; measured in this paper at 8.3% annually on average using GMM on 130 CRSP stock portfolios, January 1973–December 2020.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;convenience spread&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the gap between the zero-beta rate and the expected real Treasury bill yield, averaging approximately 7.6% per year in this paper&amp;rsquo;s estimates; interpreted as the non-pecuniary value that holders of safe assets (Treasury bills and equivalents) receive from liquidity, collateral, and money-like services — not an expected excess return relative to consumption risk but a departure from the risk-return tradeoff for investors who value safety and liquidity independently of consumption hedging.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;SDF-orthogonal equity portfolio&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the minimum-variance portfolio constructed by the paper to have zero covariance with all seven asset-pricing factors (Fama-French 5 plus bond and default factors); the portfolio whose expected return equals the zero-beta rate because, by construction, no factor risk premium enters its expected return; estimated using the Ledoit-Wolf (2017) shrinkage estimator applied to 130 stock portfolios to address the rank problem in beta estimation.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;Euler equation failure with safe rates&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the empirical finding that the expected real Treasury bill return does not co-move with expected real consumption growth — the standard failure documented by Hansen-Singleton (1983), Dunn-Singleton (1986), and Yogo (2004) — which the paper reinterprets not as a structural failure of the representative agent model but as a consequence of using the wrong interest rate; the same Euler equation holds when applied to the zero-beta rate, which the paper argues is the correct intertemporal price.&lt;/dd&gt;
&lt;/dl&gt;</description></item><item><title>To Own or to Rent? The Effects of Transaction Taxes on Housing Markets</title><link>https://macropaperwarehouse.com/papers/to-own-or-to-rent-the-effects-of-transaction-taxes-on-housing-markets/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/to-own-or-to-rent-the-effects-of-transaction-taxes-on-housing-markets/</guid><description>&lt;h2 id="layer-1--summary"&gt;Layer 1 — Summary&lt;/h2&gt;
&lt;p&gt;Using sales and leasing transaction records for the Greater Toronto Area (2006–2018), this paper finds three novel effects of a higher property transaction tax: higher buy-to-rent transactions alongside lower buy-to-own transactions despite both being taxed, a lower sales-to-leases ratio, and a lower price-to-rent ratio. The empirical identification exploits the City of Toronto&amp;rsquo;s introduction of a city-level Land Transfer Tax (LTT) in February 2008 — covering only the city and not surrounding GTA municipalities — comparing outcomes on opposite sides of the city border before and after the tax change. A 1.3 percentage-point higher effective LTT rate causes buy-to-rent purchases to rise by 9.3% while owner-occupier purchases fall by 9.6%; the leases-to-sales ratio rises by 26% and the price-to-rent ratio falls by 3.8%. To explain these facts, the paper develops a search model featuring household tenure choice (own vs. rent) subject to heterogeneous credit costs, endogenous homeowner moving decisions, and free entry of buy-to-rent investors; the key mechanism is that the LTT reduces homeowners&amp;rsquo; mobility — because owner-occupiers expect to transact multiple times over their lifetimes and thus bear the tax repeatedly — discouraging entry into ownership and raising demand for rentals, which in turn attracts investor entry even though investors too pay the tax, since investors need not re-transact whenever a tenant vacates. The implied deadweight loss is large at 111% of tax revenue, with more than half of this due to distorting decisions to own or rent; taking the rental market into account accounts for losses equal to 73% of tax revenue, which is two-thirds of the total loss.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-q-what-are-the-three-novel-empirical-facts-documented-in-this-paper"&gt;Q1. Q: What are the three novel empirical facts documented in this paper?&lt;/h3&gt;
&lt;p&gt;A: Using MLS data on both sales and leases in the Greater Toronto Area, the paper documents: (1) a 1.3 pp higher effective LTT rate causes buy-to-rent (BTR) investor purchases to increase by 9.3%, in stark contrast to a 9.6% fall in owner-occupier (buy-to-own) purchases — a divergence that is counterintuitive because both types of buyer are subject to the same tax; (2) the ratio of leases to sales rises by 26%, indicating that rental-market activity increases relative to ownership-market activity; and (3) the price-to-rent ratio falls by 3.8%, meaning house prices decline relative to rents.&lt;/p&gt;
&lt;h3 id="q2-q-what-is-the-empirical-identification-strategy-and-why-is-it-credible"&gt;Q2. Q: What is the empirical identification strategy and why is it credible?&lt;/h3&gt;
&lt;p&gt;A: The paper uses a geographic regression discontinuity approach comparing communities on opposite sides of the Toronto city border, where the new city-level LTT applies on one side but not the other, in a difference-in-differences framework spanning January 2006–January 2008 (pre-policy) and February 2008–February 2012 (post-policy). The sample is restricted to properties within 3 or 5 km of the boundary. The paper verifies that property characteristics do not differ significantly across the border and that cross-border differences do not change after the LTT, supporting the parallel-trends assumption. The effective LTT rate increase is measured at 1.3 percentage points (assuming 40% first-time buyers, who receive a partial exemption). Buy-to-rent transactions are identified in the MLS data by matching properties that appear in both the sales and leases datasets within an 18-month window following sale.&lt;/p&gt;
&lt;h3 id="q3-q-what-is-the-intuition-for-why-the-ltt-raises-buy-to-rent-investment-even-though-it-taxes-investors"&gt;Q3. Q: What is the intuition for why the LTT raises buy-to-rent investment even though it taxes investors?&lt;/h3&gt;
&lt;p&gt;A: The mechanism hinges on the asymmetry in expected future transaction costs between owner-occupiers and investors. Owner-occupiers face idiosyncratic match-quality shocks — they periodically want to move to a different property as their circumstances or preferences change — so choosing homeownership means expecting to pay the LTT on each future move. This makes homeownership less attractive relative to renting, reducing household entry into the ownership market and increasing demand for rental properties. Investors (landlords), by contrast, do not need to re-transact in the ownership market simply because a tenant moves out; they retain the property and find a new tenant. Investors therefore face a lower expected frequency of LTT payments per year of property holding than owner-occupiers. As a result, the LTT&amp;rsquo;s negative effect on investor returns is smaller in magnitude than the increase in rental demand it generates. In equilibrium, the price-to-rent ratio falls by enough to attract more BTR investors in spite of the direct cost the tax imposes on them, and investor purchases rise.&lt;/p&gt;
&lt;h3 id="q4-q-how-does-the-ltt-affect-homeowner-mobility-the-lock-in-effect-and-what-are-its-welfare-implications-within-the-ownership-market"&gt;Q4. Q: How does the LTT affect homeowner mobility (the &amp;ldquo;lock-in&amp;rdquo; effect) and what are its welfare implications within the ownership market?&lt;/h3&gt;
&lt;p&gt;A: The LTT makes existing homeowners more tolerant of poor match quality with their current property, since the cost of moving — paying the tax again — has risen. Moving rates therefore decline as households remain in properties for longer on average. To mitigate future tax costs, buyers also become more selective (&amp;ldquo;picky&amp;rdquo;) when initially matching with a property, requiring higher match quality before purchasing. This reduces the frequency of moves but increases the cost and duration of search for new buyers. The welfare consequences within the ownership market are: (a) misallocation of properties among owner-occupiers as average match quality falls because households move less often to renew it; partially offset by (b) higher initial match quality for newly matched buyers, but at the cost of longer search. The LTT-induced distortions within the ownership market account for a loss equal to 38% of tax revenue.&lt;/p&gt;
&lt;h3 id="q5-q-what-are-the-models-quantitative-predictions-for-the-four-year-post-reform-period-and-how-do-they-compare-to-the-empirical-estimates"&gt;Q5. Q: What are the model&amp;rsquo;s quantitative predictions for the four-year post-reform period, and how do they compare to the empirical estimates?&lt;/h3&gt;
&lt;p&gt;A: The model is calibrated to the City of Toronto for 2006–8 (homeownership rate ~54%) and simulated for a 1.3 pp LTT increase, with the mobility hazard rate used as the internal calibration target. For the four-year period following the tax change, the model predicts: owner-occupier transactions fall by 14%; buy-to-rent transactions rise by 35%; the leases-to-sales ratio rises by 15%; the price-to-rent ratio falls by 1.6%; and the homeownership rate falls by 0.23 percentage points. These figures are broadly consistent in magnitude with the estimated LTT effects on the variables not directly targeted in calibration (i.e., the transaction-volume and price-to-rent results from the empirical estimation).&lt;/p&gt;
&lt;h3 id="q6-q-what-are-the-long-run-steady-state-effects-and-why-do-they-differ-from-the-four-year-effects"&gt;Q6. Q: What are the long-run (steady-state) effects and why do they differ from the four-year effects?&lt;/h3&gt;
&lt;p&gt;A: Tenure-choice variables are very slow to adjust because annual flows are small relative to housing stocks. In the new steady state, the homeownership rate falls by 2.4 percentage points and the leases-to-sales ratio rises by 23% — both substantially larger than the four-year effects. By contrast, four-year effects on owner-occupier transactions and the price-to-rent ratio are already close to their new steady states. Buy-to-rent transactions overshoot their steady-state level (the four-year rise of 35% compares to a steady-state rise of 5.1%) because of a one-off surge in investor entry as the rental market absorbs the transition; once the stock of rental properties has adjusted, the flow of new buy-to-rent purchases settles lower.&lt;/p&gt;
&lt;h3 id="q7-q-how-are-the-welfare-deadweight-losses-decomposed-across-distortion-channels"&gt;Q7. Q: How are the welfare (deadweight) losses decomposed across distortion channels?&lt;/h3&gt;
&lt;p&gt;A: The new LTT generates a total welfare loss equivalent to 111% of the extra revenue it raises. The decomposition is: distortions to flows between the rental and ownership markets (i.e., the tenure-choice margin) account for a loss equal to 60% of extra revenue; distortions within the rental market account for 13% of tax revenue; distortions within the ownership market (lock-in and match-quality misallocation) account for 38% of tax revenue. The presence of the rental market in the analysis — encompassing both the across-market and within-rental-market channels — accounts for a loss equivalent to 73% of tax revenue, which is two-thirds of the total loss. The paper characterises this as &amp;ldquo;large.&amp;rdquo;&lt;/p&gt;
&lt;h3 id="q8-q-what-is-the-across-market-misallocation-mechanism-behind-the-60-welfare-loss-from-tenure-distortions"&gt;Q8. Q: What is the across-market misallocation mechanism behind the 60% welfare loss from tenure distortions?&lt;/h3&gt;
&lt;p&gt;A: Because owner-occupiers expect to transact more frequently than buy-to-rent investors, the same ad valorem tax falls more heavily on owner-occupiers. In equilibrium, the cost of credit paid by the marginal home-buyer must fall — that is, fewer creditworthy households enter ownership. This displaces some creditworthy households into the rental market, creating a misallocation: properties are allocated away from owner-occupiers (who value them as a place of residence and benefit from match quality) toward rentals intermediated through investors. The welfare loss arises because credit-worthy households who would prefer to own are now renters, and the resource costs of intermediating through investors are incurred unnecessarily.&lt;/p&gt;
&lt;h3 id="q9-q-what-policy-experiment-does-the-paper-consider-beyond-the-baseline-ltt-analysis"&gt;Q9. Q: What policy experiment does the paper consider beyond the baseline LTT analysis?&lt;/h3&gt;
&lt;p&gt;A: The paper studies an alternative tax structure that imposes a higher LTT rate on buy-to-rent investors relative to owner-occupiers, calibrated to nullify the implicit tax advantage investors enjoy under a uniform rate. By raising barriers to investor entry, this differential tax reduces the across-market welfare losses from lower homeownership. However, the paper notes an important caveat: pushing the investor tax rate ever higher to boost homeownership would ultimately produce large welfare costs in the opposite direction, as households who cannot qualify for mortgage credit (uncreditworthy households) would be displaced into the ownership market by a shortage of rental properties. Investors play a socially valuable role in providing housing access to households who cannot or choose not to bear the costs of credit.&lt;/p&gt;
&lt;h3 id="q10-q-what-data-source-is-used-and-why-is-it-unusually-well-suited-to-this-analysis"&gt;Q10. Q: What data source is used and why is it unusually well-suited to this analysis?&lt;/h3&gt;
&lt;p&gt;A: The paper uses Multiple Listing Service (MLS) records from the Toronto Regional Real Estate Board covering the Greater Toronto Area, 2006–2018. The dataset is distinctive in including both sales transactions and lease transactions, allowing the paper to match the two and construct the novel buy-to-rent identifier. MLS data cover approximately 78% of detached-house transactions in the Toronto Land Registry for 2006–2012, and the rental listings capture over 90% of properties listed on alternative platforms. This combination of sales and lease records is what makes it possible to document the three novel empirical facts and to study both the ownership and rental markets jointly.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Buy-to-rent (BTR) transaction:&lt;/strong&gt; In this paper&amp;rsquo;s definition, a sale in the ownership market where the buyer subsequently lists the same property on the rental market within 18 months. BTR buyers are investors/landlords who supply rental housing by purchasing from the ownership market. Distinct from buy-to-own (owner-occupier purchases) and buy-to-sell (flipping) transactions. Identified in the MLS data by matching address and transaction dates across the sales and leases databases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Buy-to-own (BTO) transaction:&lt;/strong&gt; A sale in the ownership market where the buyer occupies the property as a homeowner — the residual category after removing BTR and buy-to-sell transactions from total sales. In the City of Toronto, the fraction of all transactions classified as BTO declined from 89% to 84% between 2006 and 2017.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effective LTT rate:&lt;/strong&gt; The mean land transfer tax paid as a percentage of the sales price, combining provincial- and city-level taxes, averaged over detached-house transactions in the City of Toronto and adjusted for first-time buyer exemptions. The introduction of the city-level LTT in February 2008 raised the effective LTT rate by 1.3 percentage points (assuming 40% first-time buyers).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Match quality:&lt;/strong&gt; In the paper&amp;rsquo;s search model, the idiosyncratic value a particular household places on a particular property, which evolves stochastically over time. When match quality deteriorates sufficiently, a homeowner wishes to move to a better-matched property. Match quality is the source of the &amp;ldquo;lock-in&amp;rdquo; effect: higher transaction taxes raise the threshold quality decline a household is willing to tolerate before moving, reducing mobility. Because investors are not tied to a specific property in the same way (a tenant moving out does not require the investor to transact), this mechanism falls more heavily on owner-occupiers than on BTR investors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Lock-in effect:&lt;/strong&gt; The reduction in homeowner mobility caused by a higher transaction tax. Homeowners become more tolerant of deteriorating match quality (stay longer in poorly matched properties) and more selective when initially purchasing (require higher match quality to justify the transaction cost). The paper treats this as operating on the intensive margin of homeownership decisions, contrasted with the extensive margin (the own-vs.-rent choice).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Credit cost / credit friction:&lt;/strong&gt; Heterogeneous household-level costs of accessing mortgage finance or credit. In the model, a household must pay a credit cost to enter the ownership market. Households with lower credit costs are more likely to choose homeownership; a higher transaction tax effectively raises the total cost of ownership (since it must be paid on each future move), shifting the margin at which the credit cost equals the net benefit of owning, thereby reducing the equilibrium homeownership rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Leases-to-sales ratio:&lt;/strong&gt; The ratio of new lease transactions to sales transactions in the housing market, used as a measure of the relative activity of the rental and ownership markets. A higher ratio indicates more households are being accommodated in the rental market relative to the ownership market. The LTT raises this ratio by 26% in the empirical estimation and 15% in the four-year model simulation, with a steady-state increase of 23%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Price-to-rent ratio:&lt;/strong&gt; The ratio of house prices to rents, used as a summary statistic for the relative cost of owning versus renting. In the paper&amp;rsquo;s model, a fall in the price-to-rent ratio is the price signal that attracts additional buy-to-rent investor entry: as tenure-choice distortions shift more households toward renting, rents rise relative to prices, improving the return to BTR investment until the rental market clears. The LTT lowers the price-to-rent ratio by 3.8% empirically and 1.6% in the four-year model simulation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Deadweight loss as a fraction of tax revenue:&lt;/strong&gt; The welfare cost of the LTT measured in units of tax revenue raised, allowing comparison across tax instruments. The paper finds a deadweight loss of 111% of tax revenue for the Toronto LTT. Prior literature, which focused only on the intensive margin (mobility distortions within the ownership market), missed the across-market and within-rental-market channels that together account for 73 percentage points of this total.&lt;/p&gt;
&lt;hr&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on published open-access version. AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>Tokenomics: Optimal monetary and fee policies</title><link>https://macropaperwarehouse.com/papers/tokenomics-optimal-monetary-and-fee-policies/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/tokenomics-optimal-monetary-and-fee-policies/</guid><description>&lt;p&gt;The rapid proliferation of cryptocurrency tokens—roughly 10,000 outstanding with a total market capitalization around $3 trillion as of early 2024—raises new questions about the design of token monetary policy and fee structures. This paper makes two contributions. Empirically, using supply histories for approximately 2,000 tokens, it documents three systematic patterns: average token money growth rates decline with age and stabilize at about 0.2% per month; long-run money growth rates and convergence speeds are positively correlated across tokens in the cross-section; and tokens more widely held by retail investors have relatively lower long-run money growth rates and convergence speeds. Theoretically, the paper derives optimal issuance and fee policies for a profit-maximizing issuer in a dynamic model where commitment matters, showing that a fully committed (Ramsey) issuer who maximizes profits after the initial period makes choices that maximize the total utility value of all tokens, that without any commitment no equilibrium with a positive token price exists unless fees are charged, and that under partial commitment issuers with higher commitment credibility optimally choose lower long-run money growth rates and fee ratios.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-are-the-three-empirical-facts-about-crypto-monetary-policies"&gt;Q1. What are the three empirical facts about crypto monetary policies?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Fact 1: the average money growth rate across the ~2,000 tokens declines with cohort age and stabilizes at approximately 0.2% per month, with more recent cohorts converging faster to their long-run growth rates.&lt;/strong&gt; Fact 2: in the cross-section of tokens, the estimated long-run money growth rate and the speed of convergence to it are positively correlated—tokens that will eventually have higher money growth also converge more quickly to that level. Fact 3: tokens whose circulating supply is widely distributed among retail investors (measured by the proportion of wallet addresses holding less than 0.1% of total supply) have both lower long-run money growth rates and lower convergence speeds. Together these facts suggest that the degree of decentralization of token ownership systematically influences the choice of monetary policy.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-main-theoretical-results-about-commitment"&gt;Q2. What are the main theoretical results about commitment?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Without any commitment, no equilibrium with a positive token price can be sustained if the issuer cannot charge user fees; fees create credibility by giving a Markov-perfect issuer an incentive to restrict future token issuance (to collect fees on legacy token holders), which supports a positive price.&lt;/strong&gt; At the other extreme, with full commitment the Ramsey issuer maximizes profits subject to the full sequence of resource constraints, and a central analytical result is that at steady state the Ramsey issuer&amp;rsquo;s profit-maximizing choices are equivalent to maximizing the total utility value of all tokens outstanding. This equivalence—profit maximization = welfare maximization for token holders—holds because commitment prevents the issuer from diluting legacy holders; under full commitment, the issuer has no incentive to deviate from the policy that maximizes total token value.&lt;/p&gt;
&lt;h3 id="q3-what-does-partial-commitment-imply-for-policy-design"&gt;Q3. What does partial commitment imply for policy design?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Under partial commitment (modeled as a probability of sticking to a given policy rather than re-optimizing), issuers with higher commitment probability optimally choose lower long-run money growth rates and reduce both the money growth rate and the fee ratio more slowly over time.&lt;/strong&gt; This is because higher commitment credibility raises the present value of legacy tokens (users expect the issuer to honor its policy), which makes it optimal to extract less seigniorage and fewer fees in each period while sustaining a higher token price. The model is analytically solvable under partial commitment despite rich economic mechanisms, and the results help interpret Fact 3: tokens with more decentralized ownership (higher effective commitment due to governance constraints) exhibit lower money growth.&lt;/p&gt;
&lt;h3 id="q4-how-do-the-models-predictions-compare-with-the-bitcoin-and-ethereum-cases"&gt;Q4. How do the model&amp;rsquo;s predictions compare with the Bitcoin and Ethereum cases?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Bitcoin exemplifies near-full commitment: its supply schedule is essentially deterministic and hard-coded, resembling the Ramsey policy, though the paper notes a non-zero probability of a fork that could alter the policy.&lt;/strong&gt; Ethereum exemplifies frequent policy changes (the transition from proof-of-work to proof-of-stake altered issuance and burn rules significantly). Discretionary platforms like MakerDAO, which allow issuer discretion over token supply, more closely resemble the Markov-perfect benchmark. The paper&amp;rsquo;s framework covers this entire spectrum and predicts that tokens with more commitment (closer to Bitcoin) should have lower long-run money growth and slower convergence—consistent with the empirical facts.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;token monetary policy&lt;/strong&gt; : the issuer&amp;rsquo;s choice of the rate at which new tokens are minted over time (the money growth rate) and the fees charged per transaction; the paper shows these jointly determine the value of a token and the issuer&amp;rsquo;s profit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ramsey policy for tokens&lt;/strong&gt; : the full-commitment profit-maximizing policy for a token issuer; a key result is that the Ramsey issuer&amp;rsquo;s profit-maximizing choices, after the initial period, are equivalent to maximizing the total utility value of all tokens.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;partial commitment&lt;/strong&gt; : a probabilistic commitment technology in which the issuer maintains a given policy with some probability and re-optimizes with the complementary probability; the paper uses this to model the spectrum between Bitcoin (high commitment) and more discretionary platforms.&lt;/p&gt;</description></item><item><title>Trade with Nominal Rigidities: Understanding the Unemployment and Welfare Effects</title><link>https://macropaperwarehouse.com/papers/trade-with-nominal-rigidities-understanding-the-unemployment-and-welfare-effects/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/trade-with-nominal-rigidities-understanding-the-unemployment-and-welfare-effects/</guid><description>&lt;p&gt;Standard international trade models assume perfectly flexible prices and full employment. This paper introduces nominal rigidities (downward wage rigidity) into a quantitative trade model and asks how this changes the welfare gains from trade liberalization. The central finding is that standard flexible-price estimates overstate the welfare gains by approximately one-third: trade liberalization can generate unemployment in import-competing sectors when wages cannot fall, and the forgone output from these workers is a welfare cost that flexible-price models miss entirely. The paper calibrates the degree of downward wage rigidity to cross-country data on unemployment volatility and shows that the magnitude of the overstatement is robust across a range of calibrations. An analytical decomposition separates the allocative efficiency gains (which standard models capture) from the employment losses (which they miss), clarifying when the overstatement is large versus small.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-how-does-downward-wage-rigidity-create-unemployment-from-trade-liberalization"&gt;Q1. How does downward wage rigidity create unemployment from trade liberalization?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;When import competition reduces demand for domestic labor in an affected sector, wages cannot fall fast enough to clear the labor market under downward rigidity; workers are priced out of re-employment in the short run, generating cyclical unemployment in the import-competing sector that persists until real wages adjust through inflation erosion.&lt;/strong&gt; The unemployment is involuntary and represents forgone production — a social cost that flexible-price models attribute to zero by assumption.&lt;/p&gt;
&lt;h3 id="q2-why-is-the-overstatement-approximately-one-third"&gt;Q2. Why is the overstatement approximately one-third?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The one-third figure comes from the ratio of the employment-loss welfare cost to the total flexible-price welfare gain in the paper&amp;rsquo;s benchmark calibration; the rigidity-driven employment loss is large enough relative to the allocative efficiency gain to reduce net welfare gains substantially, but not so large as to eliminate them.&lt;/strong&gt; This ratio is not universal — it depends on the degree of wage rigidity, the sectoral composition of trade exposure, and the speed of labor reallocation — but the paper shows it is robust across plausible parameter ranges.&lt;/p&gt;
&lt;h3 id="q3-does-trade-liberalization-still-generate-net-welfare-gains"&gt;Q3. Does trade liberalization still generate net welfare gains?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Yes, on net the welfare gains from trade remain positive even with downward wage rigidity — the overstatement of one-third means the true gains are positive but smaller than flexible-price models predict, not negative.&lt;/strong&gt; The paper does not argue against trade liberalization but against using flexible-price welfare estimates without adjustment for unemployment costs.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;downward wage rigidity&lt;/strong&gt; : the empirical constraint that nominal wages adjust slowly downward; the key friction this paper adds to the quantitative trade model, generating unemployment in sectors hit by import competition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;welfare overstatement&lt;/strong&gt; : the gap between the flexible-price welfare gain from trade liberalization (the standard model&amp;rsquo;s prediction) and the true gain once unemployment costs from nominal rigidity are accounted for; approximately one-third in the paper&amp;rsquo;s benchmark calibration.&lt;/p&gt;</description></item><item><title>Traditional Institutions in Modern Times: Dowries as Pensions When Sons Migrate</title><link>https://macropaperwarehouse.com/papers/traditional-institutions-in-modern-times-dowries-as-pensions-when-sons-migrate/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/traditional-institutions-in-modern-times-dowries-as-pensions-when-sons-migrate/</guid><description>&lt;p&gt;This paper asks whether dowry — a transfer from the bride&amp;rsquo;s family to the groom&amp;rsquo;s household upon marriage, prevalent throughout India — enables male migration by providing liquidity that compensates parents for the old-age support they would otherwise lose when sons leave the village. The core friction is that in patrilocal societies, sons traditionally co-reside with parents and share income in old age; migration disrupts this arrangement and introduces income-sharing frictions (limited commitment, information asymmetries, remittance costs). Dowry attenuates this friction by providing a liquid pool of resources at the time of marriage that the son can transfer to parents, lowering the net return to migration needed for a household to find migration optimal.&lt;/p&gt;
&lt;p&gt;The authors develop a collective household model in which parents and sons jointly maximize a Pareto-weighted utility function. The model yields six testable predictions: (1) net marriage transfers can flow in either direction; (2) parents are more likely to take from the dowry when sons migrate; (3) conditional on migration, the probability of parental taking increases in the son&amp;rsquo;s income and in parental bargaining power; (4) aggregate male migration rates are higher in districts with stronger historical dowry traditions; (5) migration responses to a reduction in migration costs are larger in dowry areas, provided migration rates are relatively low; and (6) parents who receive remittances from migrant sons are more likely to have also taken from the dowry.&lt;/p&gt;
&lt;p&gt;To test predictions 1–3 and the remittance auxiliary prediction, the authors collected two original datasets: a Destination Survey of 557 prime-age men in Gurugram (near Delhi) conducted in 2018, of whom 62% were migrants; and an Origin Survey of 2,541 households across 34 districts in six North Indian states conducted in 2020, covering 3,069 sons, 20% of whom were migrants. These are the first quantitative data on property rights over dowry in India. Across the Destination and Origin surveys, 45% and 27% of grooms&amp;rsquo; parents, respectively, took from the dowry on net. Parents of migrants are 27 percentage points (Destination) and 8 percentage points (Origin) more likely to take than parents of non-migrants. For migrant sons, a doubling of the son&amp;rsquo;s occupational score raises the likelihood of parental taking by 19 percentage points; no such relationship exists for non-migrants. When sons report that parents held veto power over the marriage — a proxy for parental Pareto weight — parents of migrant sons are 28 percentage points more likely to be net takers. Parents whose migrant son sends financial remittances are 17 percentage points more likely to have taken from the dowry (coefficient 0.168, SE 0.074).&lt;/p&gt;
&lt;p&gt;To test predictions 4 and 5, the authors use the Ancestral Characteristics data (Giuliano and Nunn 2018) to construct district-level measures of dowry tradition strength, validated against 1999 REDS and IHDS survey data, where a one-unit increase in the historical dowry measure is associated with 81–109% higher gross or net dowry payments. Using the NSS Round 64 migration module (2007–08), they find that the continuous dowry tradition measure is associated with a 2.7–3.7 percentage point increase in migration probability against a mean of 23.8%. For the highway construction identification strategy, the authors exploit the staggered rollout of the Golden Quadrilateral and North-South/East-West corridor (5,846+ km, $71 billion), using modern staggered-entry difference-in-differences estimators (Borusyak et al. 2021; Callaway and Sant&amp;rsquo;Anna 2020). Young men (ages 15–30) in dowry districts exhibit a large, significant increase in out-migration following highway construction with no pre-trends, while the effect for non-dowry males is indistinguishable from zero. Older males (ages 31–45) show no such effect in either group, consistent with the mechanism operating at marriage. The highway effects are concentrated in inter-district, employment-driven migration.&lt;/p&gt;
&lt;p&gt;Scope conditions: the migration-enabling mechanism operates through marriage-age liquidity and patrilocal support norms; results are specific to male migration in India. The model assumes parents and sons act collectively, matching is based on grooms&amp;rsquo; earning potential, and migration frictions cause income-sharing transfers to be infeasible when the son migrates.&lt;/p&gt;
&lt;p&gt;Q: What is the central hypothesis of the paper?
A: The hypothesis is that dowry, by providing a liquid transfer at the time of marriage, allows sons to compensate parents for the old-age support that would otherwise be lost when sons migrate. Because migration introduces frictions that prevent optimal post-migration income sharing between parents and sons, dowry lowers the minimum net return to migration required for the household to find migration optimal, thereby enabling more migration.&lt;/p&gt;
&lt;p&gt;Q: What is the &amp;ldquo;Seeking&amp;rdquo; versus &amp;ldquo;Satisfied&amp;rdquo; distinction in the model, and why does it matter?
A: &amp;ldquo;Satisfied&amp;rdquo; parents are those whose own income plus the maximum feasible marriage transfer (bounded by the bride&amp;rsquo;s endowment dE when dowry is present) is at least as large as their consumption allocation under no migration; migration then Pareto-improves the household for any non-negative return R. &amp;ldquo;Seeking&amp;rdquo; parents have insufficient income plus endowment, so migration reduces their consumption unless the son&amp;rsquo;s return R exceeds a threshold B(d). Because dowry strictly increases the feasible transfer ceiling, B(d=1) ≤ B(d=0), meaning dowry converts some Seeking households into effectively Satisfied ones and lowers the migration threshold for the rest.&lt;/p&gt;
&lt;p&gt;Q: What share of grooms&amp;rsquo; parents actually take from the dowry, and how does migration status affect this?
A: In the Destination Survey (62% migrants), 45% of parents take from the dowry on net; in the Origin Survey (20% migrants), 27% do. Parents of migrants are 27 percentage points more likely to take in the Destination Survey and 8 percentage points more likely in the Origin Survey, consistent with the model prediction that migration increases net taking.&lt;/p&gt;
&lt;p&gt;Q: How does the son&amp;rsquo;s earnings level affect parental taking, and does this pattern hold for non-migrants?
A: For migrant sons, a 100% increase in the son&amp;rsquo;s occupational score increases the likelihood of parents taking by 19 percentage points. For non-migrant sons, the son&amp;rsquo;s occupational score has no meaningful association with taking. This asymmetry is consistent with prediction 3: when migration occurs and the alpha income-sharing channel is shut down, parents with higher-income migrant sons have a higher relative marginal return to consumption and thus take more of the dowry.&lt;/p&gt;
&lt;p&gt;Q: What is the remittance auxiliary prediction, and is it borne out in the data?
A: The model predicts that parents who receive remittances from migrant sons should also be more likely to have taken from the dowry, because households first exhaust the costless dowry transfer before making costly or risky remittances — so remittance-receiving parents are precisely those Seeking households where dowry was already taken. The data confirm this: parents whose migrant son sends financial remittances are 17 percentage points more likely to have taken from the dowry (coefficient 0.168, SE 0.074, significant at 5%) compared to parents of migrants who do not remit.&lt;/p&gt;
&lt;p&gt;Q: How is the district-level dowry tradition measure constructed and validated?
A: The measure merges the Giuliano and Nunn (2018) Ancestral Characteristics data — which uses ethnographic sources to estimate the share of each district&amp;rsquo;s current population belonging to historically dowry-practicing groups — with district-level demographic data. Validation against the 1999 REDS shows that a one-unit increase in the historical dowry measure is associated with 81% higher gross dowry payments and 109% higher net dowry payments without region fixed effects, with a still-significant 79% for net dowry including region fixed effects. Additional validation in the IHDS confirms the historical measure predicts gold payments at marriage (coefficient 0.152 without state fixed effects, 0.185 with state fixed effects).&lt;/p&gt;
&lt;p&gt;Q: What is the association between historical dowry traditions and migration in nationally representative data?
A: Using the NSS Round 64 migration module (2007–08) for males aged 15–45, against a mean migration rate of 23.8%, the continuous dowry measure is associated with a 2.66 percentage point increase in migration probability with no controls (significant at 1%), and 3.67 percentage points with full controls including state fixed effects, year-of-birth fixed effects, caste fixed effects, distance controls, and education controls (significant at 5%).&lt;/p&gt;
&lt;p&gt;Q: What is the highway construction identification strategy, and what does it show?
A: The authors exploit the staggered construction timing of the Golden Quadrilateral and NS-EW highway corridors (beginning 1999, 5,846+ km, $71 billion investment) across Indian districts, assembling new data on district-level construction timing from a complete capital projects database. Using staggered-entry event study estimators robust to heterogeneous treatment effects, they separately estimate highway effects in districts with and without strong dowry traditions. For young men aged 15–30, dowry districts show a large, significant increase in out-migration after highway construction with no pre-trends; non-dowry districts show an effect indistinguishable from zero. Older men (31–45) show no significant effect in either group.&lt;/p&gt;
&lt;p&gt;Q: Why is the age heterogeneity (15–30 vs. 31–45) in the highway results important for the mechanism?
A: The model predicts that dowry&amp;rsquo;s migration-enabling role operates at the time of marriage, when the liquid transfer is made. Men aged 31–45 at the time of highway construction would largely have already been married before the roads were built, so they cannot retroactively benefit from the new liquidity channel. Young men (15–30) are near or below marriage age and can time their marriages and migration decisions in response to reduced migration costs. The null result for older men and the strong result for younger men together confirm the marriage-time liquidity channel.&lt;/p&gt;
&lt;p&gt;Q: Why is the highway effect concentrated in inter-district rather than intra-district migration?
A: The Golden Quadrilateral connects districts to other districts, and the model&amp;rsquo;s mechanism relies on migration creating income-sharing frictions that are more severe at longer distances. Intra-district moves are shorter, less likely to disrupt co-residence and informal support arrangements, and less likely to require the dowry&amp;rsquo;s compensatory role. The concentration of effects in inter-district migration is directly consistent with the proposed channel.&lt;/p&gt;
&lt;p&gt;Q: How does the paper address concerns about pre-trends and robustness in the highway analysis?
A: The event study plots show no pre-trends in migration for either dowry or non-dowry districts prior to highway construction. Robustness checks include additional geographic controls, caste-by-year fixed effects, time-varying cultural controls, the alternative Callaway-Sant&amp;rsquo;Anna estimator, adjusted age distributions, and varying dowry tradition cutoffs at 1%, 10%, and 25% thresholds. Results are stable across these specifications.&lt;/p&gt;
&lt;p&gt;Q: What do the theory and evidence imply about the modern transformation of dowry&amp;rsquo;s function?
A: While dowry historically served as a pre-mortem bequest to the bride adapted to patrilocal society, the modern practice has evolved so that grooms&amp;rsquo; parents frequently capture the transfer. The evidence is consistent with this reallocation of property rights serving a new function: providing parents with a pension substitute when sons migrate and traditional co-residential support breaks down. The authors speculate this functional evolution may partly explain why dowry prevalence has grown despite legal bans, as declining patrilocality creates rising demand for this type of intergenerational transfer mechanism.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of the findings?
A: The paper suggests that policies discouraging dowry — which has many well-documented negative consequences including intimate partner violence, female infant mortality, and adverse resource allocation — may be more effective if paired with expansions of formal pension programs or other mechanisms for old-age support. Without such alternatives, eliminating dowry could inadvertently reduce male migration and associated economic development benefits because the migration-enabling liquidity function of dowry would go unfilled.&lt;/p&gt;
&lt;p&gt;Q: Does the mechanism apply equally to households with both sons and daughters?
A: The theoretical appendix shows that in a household with a son and a daughter, the daughter&amp;rsquo;s dowry outflow partially offsets the son&amp;rsquo;s inflow, reducing but not eliminating the migration-enabling effect. However, the net aggregate effect on male migration remains positive because more sons live in households where sons outnumber daughters, so the dowry inflow for the son exceeds the outflow on average across the population.&lt;/p&gt;
&lt;p&gt;Dowry (in the paper&amp;rsquo;s sense): A transfer from the bride&amp;rsquo;s family accompanying marriage that in the modern Indian context is liquid at the time of the wedding and over which grooms&amp;rsquo; parents frequently exercise property rights — distinct from the traditional anthropological conception of dowry as a pre-mortem bequest to the bride.&lt;/p&gt;
&lt;p&gt;Net Taker: A groom&amp;rsquo;s parent who receives a positive net transfer from the son&amp;rsquo;s dowry (tau &amp;gt; 0 in the model), meaning the flow of dowry resources is from the son/bride&amp;rsquo;s side to the groom&amp;rsquo;s parents.&lt;/p&gt;
&lt;p&gt;Seeking vs. Satisfied parents: Model categories distinguishing parents whose consumption needs can be met from own income plus the maximum feasible marriage transfer (Satisfied, no migration distortion) from those whose needs cannot (Seeking, requiring a minimum migration return threshold B(d) &amp;gt; 0 for migration to be household-optimal).&lt;/p&gt;
&lt;p&gt;Migration friction (alpha = 0 under migration): The modeling assumption that income-sharing transfers between migrant sons and parents are infeasible or prohibitively costly due to limited commitment, information asymmetries, and remittance costs — the friction that dowry&amp;rsquo;s lump-sum transfer at marriage is designed to circumvent.&lt;/p&gt;
&lt;p&gt;Ancestral Characteristics dowry measure: The district-level variable from Giuliano and Nunn (2018) measuring the share of the current population belonging to historically dowry-practicing ethnic groups, used as a proxy for the strength of local dowry traditions.&lt;/p&gt;
&lt;p&gt;Patrilocality: The residential norm in which sons remain with or near their parents after marriage and provide old-age support — the norm whose breakdown via migration creates the income-sharing friction that dowry helps resolve.&lt;/p&gt;
&lt;p&gt;Pareto weight (theta): The weight assigned to parents&amp;rsquo; utility in the collective household problem, capturing parental bargaining power; empirically proxied by whether sons report that parents held veto power over the marriage choice.&lt;/p&gt;</description></item><item><title>Trust and Innovation Within the Firm</title><link>https://macropaperwarehouse.com/papers/trust-and-innovation-within-the-firm/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/trust-and-innovation-within-the-firm/</guid><description>&lt;p&gt;This paper investigates whether and how a CEO&amp;rsquo;s inherited generalized trust enhances innovation within firms, offering a micro-foundation for the well-documented macro-level relationship between societal trust and economic growth. The author argues that trust — by inducing tolerance of failure — encourages researchers to undertake high-risk, explorative R&amp;amp;D rather than safe exploitation of known approaches.&lt;/p&gt;
&lt;p&gt;The empirical foundation is a matched CEO-firm-patent dataset covering 5,753 CEOs at 3,598 US public firms during 2000–2011, encompassing 700,000 patents and over one million inventors. CEO trust is measured as an inherited trait: each CEO&amp;rsquo;s ethnic origin is inferred probabilistically from their last name using de-anonymized US censuses from 1910–1940, and ethnic-origin-specific trust levels are drawn from the US General Social Survey (GSS), restricted to respondents in highly prestigious occupations. The resulting trust measure is the weighted average of ethnic-specific trust scores across a CEO&amp;rsquo;s likely ethnic composition.&lt;/p&gt;
&lt;p&gt;The main empirical strategy exploits within-firm variation across CEO transitions, using firm and year fixed effects to compare patenting before and after a CEO change. The identifying assumption — that the timing of CEO transitions and the new CEO&amp;rsquo;s trust level are not predicted by prior firm patenting trends — is supported by event-study tests showing flat pre-trends. A one-standard-deviation increase in CEO inherited generalized trust (equivalent to the difference between Greek and English averages) is associated with a 6.2–6.3% increase in patent filings, statistically significant at the 1% level. For the average firm, this equals approximately 1.1 additional patents annually, worth roughly $6.8 million. The effect is larger among exogenous transitions (CEO retirement or death): 8.5% in the restricted sample, and an IV estimate of 8.2%. The back-of-envelope calculation suggests this trust-innovation channel could account for approximately 37% (range: 16–58%) of the effect of trust on GDP per capita growth.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central mechanism — risk taking — is tested by examining the distribution of patent quality rather than the mean. Under the risk-taking mechanism, trust should increase the variance of R&amp;amp;D project quality, raising high-quality patents without necessarily increasing low-quality ones. Consistent with this, CEO trust raises only above-median quality patents (measured by forward citation decile), with effects increasing monotonically toward the top decile and no statistically significant effect on below-median patents. Average patent quality as measured by citation-weighted counts or patent value rises by 4–6%. Trust also disproportionately raises the share of explorative patents (those with at least 90% of backward citations outside the firm&amp;rsquo;s existing knowledge stock) by 1 percentage point over a base of 17%.&lt;/p&gt;
&lt;p&gt;The transmission channel is examined using BERT-based classification of nearly one million Glassdoor employee reviews. Under more trusting CEOs, firms exhibit stronger top-down trust sentiment (managers trusting workers), particularly among R&amp;amp;D workers and scientists. The effect materializes within the first two years of a CEO term. Director selection provides an additional transmission mechanism: under more trusting CEOs, newly appointed directors are more trusting and departing directors are less trusting.&lt;/p&gt;
&lt;p&gt;A within-CEO design using bilateral trust (toward researchers in specific countries) with CEO fixed effects addresses omitted CEO characteristics. A one-standard-deviation increase in CEO bilateral trust toward a country is associated with a 5% increase in patents by inventors in that country&amp;rsquo;s R&amp;amp;D lab, controlling for firm-by-year, CEO, and inventor-country fixed effects.&lt;/p&gt;
&lt;p&gt;The effect is strongest when CEO trust is matched to a high-quality researcher pool; in firms with mostly low-quality researchers, high trust may be counterproductive. Trust is also a substitute for R&amp;amp;D knowledge: the effect disappears when the CEO holds a non-MBA graduate degree or has prior R&amp;amp;D experience.&lt;/p&gt;
&lt;p&gt;Q: What is the main research question?
A: The paper asks whether a CEO&amp;rsquo;s generalized trust causes more and higher-quality innovation within the firm, and through what mechanism. It also asks how trust transmits from the CEO to researchers who rarely interact with the CEO directly.&lt;/p&gt;
&lt;p&gt;Q: How is CEO trust measured?
A: CEO trust is measured as an inherited trait using a two-step procedure. First, each CEO&amp;rsquo;s last name is probabilistically mapped to one or more ethnic origins using four de-anonymized US censuses (1910–1940). Second, ethnic-origin-specific trust is computed from GSS respondents in highly prestigious occupations. The CEO&amp;rsquo;s trust measure is the weighted average across ethnic compositions. This measure is shown to be more precise than an individual-level survey measure and approximately 80% as precise as a game-based measure, without introducing attenuation bias.&lt;/p&gt;
&lt;p&gt;Q: What is the baseline patent effect and how large is it economically?
A: A one-standard-deviation increase in CEO inherited trust is associated with a 6.2–6.3% increase in patent filings (statistically significant at 1%). For the average baseline firm, this is approximately 1.1 additional patents per year, valued at roughly $6.8 million. When patent quality is accounted for, the effect rises to 9.9% using citation-weighted patent count and 11.5% using patent value based on excess stock returns on grant dates.&lt;/p&gt;
&lt;p&gt;Q: Is the effect causal? What identification strategy is used?
A: The main strategy uses firm and year fixed effects, identifying the effect from within-firm variation around CEO transitions. Pre-trend tests confirm that neither the timing of CEO changes nor the new CEO&amp;rsquo;s trust level predicts prior firm patenting. Among exogenous transitions (CEO retirements and deaths), the effect is 8.5%, and an IV estimate using the predecessor&amp;rsquo;s trust as instrument yields 8.2% (significant at 10%), both comparable to the baseline.&lt;/p&gt;
&lt;p&gt;Q: What is the macroeconomic significance of the trust-innovation channel?
A: Combining the paper&amp;rsquo;s trust-to-patents estimate (0.042–0.062) with Akcigit et al.&amp;rsquo;s (2017) patents-to-GDP-growth estimate (0.026–0.066) and the cross-country trust-to-growth coefficient (0.007), the trust-innovation channel could explain approximately 37% of the effect of trust on growth, with a plausible range of 16–58%.&lt;/p&gt;
&lt;p&gt;Q: What is the mechanism linking CEO trust to innovation?
A: The conceptual mechanism is that a more trusting manager interprets researcher failure as bad luck rather than bad type, making her more likely to tolerate failure and continue employing the researcher. This increases the researcher&amp;rsquo;s incentive to pursue explorative, high-risk R&amp;amp;D over safe exploitation of known approaches. The mechanism implies a variance-increasing effect on the R&amp;amp;D quality distribution, rather than a mean shift.&lt;/p&gt;
&lt;p&gt;Q: How is the risk-taking mechanism tested against alternative mechanisms?
A: The paper examines the distribution of patent quality by citation decile. Under mean-shifting alternatives (delegation, cooperation, relational contracting), trust should raise all quality brackets. Under risk-taking, trust raises only high-quality patents. The results show CEO trust has monotonically increasing effects from low to high quality deciles, with no statistically significant effect on below-median patents, consistent only with the variance-increasing (risk-taking) mechanism.&lt;/p&gt;
&lt;p&gt;Q: What patent quality measures are used and what do they show?
A: Beyond forward citation deciles, the paper uses explorativeness (patents with at least 90% of backward citations outside the firm&amp;rsquo;s existing knowledge stock), disruptiveness (Funk and Owen-Smith, 2017), patent importance (Kelly et al., 2021), backward citations to scientific literature, and patent scope. Trust increases all these measures with statistically significant positive coefficients. The share of explorative patents rises by 1 percentage point over a base of 17%. Average citation count and patent value increase by 4–6%.&lt;/p&gt;
&lt;p&gt;Q: Does CEO trust raise R&amp;amp;D expenditure?
A: No. The coefficients from regressing R&amp;amp;D expenditure on CEO trust are neither statistically significant nor large enough to explain the innovation effect. The patent effect is also robust to controlling for R&amp;amp;D inputs, suggesting that trust affects the type of projects chosen (consistent with risk-taking) or their realized outcomes, rather than the scale of R&amp;amp;D.&lt;/p&gt;
&lt;p&gt;Q: How does CEO trust transmit to corporate culture?
A: Using BERT-based classification of nearly one million Glassdoor reviews covering 266 firms and 397 CEO terms between 2008 and 2017, the paper finds that CEO trust is associated with stronger top-down trust sentiment (managers trusting workers). The normalized effect of a one-standard-deviation increase in CEO trust on overall trust sentiment is 0.257, on top-down trust 0.531, and on bottom-up trust only 0.141 (statistically insignificant). The effect is strongest among reviewers who identify as scientists, researchers, or engineers, and materializes within the first two years of the CEO term.&lt;/p&gt;
&lt;p&gt;Q: What evidence exists for transmission via director selection?
A: Under more trusting CEOs, newly appointed directors — especially those who remain until the end of the CEO term — are more trusting, and departing directors are less trusting. The average director trust improves during the CEO&amp;rsquo;s term. Because 54% of director hirings and 46% of turnovers occur within the first two years, this change also materializes quickly, consistent with the dynamic pattern of trust culture change.&lt;/p&gt;
&lt;p&gt;Q: What is the within-CEO bilateral trust result and what does it add?
A: Using within-CEO variation in bilateral trust toward researchers from different countries (from Eurobarometer surveys), and controlling for CEO, inventor-country, and firm-by-year fixed effects, a one-standard-deviation increase in CEO bilateral trust toward a country is associated with a 5% increase in patents by inventors in that country&amp;rsquo;s R&amp;amp;D lab. This design allows CEO fixed effects, ruling out unobserved CEO-level confounders such as management style or R&amp;amp;D ability.&lt;/p&gt;
&lt;p&gt;Q: When is CEO trust counterproductive?
A: CEO trust is beneficial only when matched to a high-quality researcher environment. Using residual patent output (controlling for observable firm and CEO characteristics) as a proxy for researcher quality, the effect of CEO trust on patents, patent output per R&amp;amp;D dollar, and future sales/employment/TFP is significant only among firms in the top two quintiles of researcher quality. In firms with mostly low-quality researchers, high CEO trust may be counterproductive by failing to screen out bad researchers.&lt;/p&gt;
&lt;p&gt;Q: How does the trust effect vary by industry and CEO background?
A: The effect is ubiquitous across industries but especially pronounced in pharmaceutical and ICT firms. The timing varies: it manifests quickly in ICT (short R&amp;amp;D lag) and more slowly in pharma (long R&amp;amp;D horizon). The effect vanishes when the CEO holds a non-MBA graduate degree or has prior R&amp;amp;D experience, suggesting trust is a substitute for direct knowledge of R&amp;amp;D processes.&lt;/p&gt;
&lt;p&gt;Q: Are the results robust?
A: Yes. The paper reports 14 categories of robustness checks including alternative patent transformations, alternative trust measures (LASSO, World Value Survey, Global Preference Survey, alternative GSS questions), alternative standard error clustering, Poisson count models, restriction to granted patents, exogenous transition subsamples, modern difference-in-differences estimators (de Chaisemartin et al., 2024; Sun and Abraham, 2021; Callaway and Sant&amp;rsquo;Anna, 2021; Borusyak et al., 2024), and leave-one-ethnicity-out. The baseline result is stable across all these checks.&lt;/p&gt;
&lt;p&gt;Inherited generalized trust: The paper&amp;rsquo;s measure of a CEO&amp;rsquo;s trust disposition, defined as the probability-weighted average of ethnic-origin-specific trust levels (from the GSS) based on the CEO&amp;rsquo;s likely ethnic composition inferred from their last name and historical census records. It captures the culturally transmitted component of trust, distinct from individual-level noise.&lt;/p&gt;
&lt;p&gt;Explorative R&amp;amp;D: In the paper&amp;rsquo;s framework (building on March, 1991), research activities that involve testing untested paths, carrying high risk of failure but high potential for innovation, as opposed to exploitation of well-known approaches with low failure risk. The paper argues CEO trust encourages researchers to shift toward exploration.&lt;/p&gt;
&lt;p&gt;Tolerance of failure: A manager&amp;rsquo;s propensity to attribute a researcher&amp;rsquo;s failure to bad luck rather than bad type. Under the paper&amp;rsquo;s mechanism, a more trusting manager gives greater weight to bad luck, making her more likely to retain the researcher after failure, thereby incentivizing risk taking.&lt;/p&gt;
&lt;p&gt;Top-down trust: In the paper&amp;rsquo;s BERT-based classification of Glassdoor reviews, the direction of trust from managers toward workers (as opposed to bottom-up trust from workers toward managers). The paper finds CEO trust primarily raises top-down trust sentiment, especially among R&amp;amp;D workers.&lt;/p&gt;
&lt;p&gt;Patent explorativeness: A patent quality measure defined as the share of its backward citations that fall outside the firm&amp;rsquo;s existing knowledge stock; patents are classified as explorative if at least 90% of backward citations are outside that stock. The paper uses this as a direct measure of explorative R&amp;amp;D output.&lt;/p&gt;
&lt;p&gt;Bilateral trust: CEO d&amp;rsquo;s directed trust toward individuals from country c, computed analogously to inherited generalized trust but using Eurobarometer survey data on country-pair trust attitudes among European-origin populations. Used in the within-CEO design to control for CEO fixed effects.&lt;/p&gt;
&lt;p&gt;Variance-increasing mechanism: The paper&amp;rsquo;s characterization of the risk-taking channel, in which CEO trust raises the variance (not the mean) of the R&amp;amp;D project quality distribution by encouraging researchers to pursue high-risk, high-reward exploration. Empirically identified by the pattern that trust raises only above-median quality patents with monotonically increasing effects toward the top decile.&lt;/p&gt;</description></item><item><title>Turbulent business cycles</title><link>https://macropaperwarehouse.com/papers/turbulent-business-cycles/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/turbulent-business-cycles/</guid><description>&lt;p&gt;Firm-level evidence shows that recessions are characterized not just by aggregate downturns but by a sharp rise in turbulence—a reshuffling of firms&amp;rsquo; productivity rankings in which high-productivity firms are less likely to maintain their relative standing. This paper documents four stylized facts about the macroeconomic and cross-sectional effects of turbulence (measured as one minus the Spearman rank correlation of firm-level TFP between adjacent years in Compustat data): turbulence is countercyclical; increases in turbulence reallocate labor and capital from high- to low-productivity firms; turbulence is negatively correlated with aggregate manufacturing TFP and the aggregate stock market; and an increase in turbulence is associated with persistent declines in real GDP, consumption, investment, and employment. To explain the mechanism, the authors build a real business cycle model with heterogeneous firms and financial frictions: when turbulence rises, high-productivity firms&amp;rsquo; expected equity values fall because their productivity is less likely to persist, which tightens their borrowing constraints relative to low-productivity firms, inducing reallocation that reduces aggregate TFP. Crucially, turbulence differs from uncertainty shocks because it changes both the conditional mean and variance of the firm productivity distribution, enabling it to generate synchronized recessions with declining aggregate activity.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-how-is-turbulence-measured-and-how-does-it-differ-from-uncertainty"&gt;Q1. How is turbulence measured and how does it differ from uncertainty?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Turbulence is measured as one minus the Spearman rank correlation (ρₜ) of firm-level total factor productivity between adjacent years using Compustat data; a low correlation indicates more churning of productivity rankings, so 1 − ρₜ rises in recessions.&lt;/strong&gt; The authors use an instrumental variable approach to correct for attenuation bias from measurement error in firm-level TFP, following Bloom et al. (2018) for the baseline construction. The conceptual distinction from uncertainty is that uncertainty shocks only raise the conditional variance of the productivity distribution while leaving the conditional mean unchanged. A turbulence shock changes both: it makes the conditional mean of future productivity lower for currently high-productivity firms and higher for currently low-productivity firms, thereby inducing reallocation from high to low producers and generating first-moment effects on aggregate output that pure uncertainty shocks cannot produce.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-empirical-facts-about-turbulence-and-how-are-they-established"&gt;Q2. What are the empirical facts about turbulence, and how are they established?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper documents four facts using a vector autoregression with turbulence orthogonalized against uncertainty and other aggregate shocks: (1) turbulence is countercyclical, rising sharply in recessions; (2) an increase in turbulence reallocates labor and capital from high- to low-productivity firms, an effect that is amplified by financing constraints; (3) turbulence is negatively correlated with aggregate manufacturing TFP and aggregate stock market value; and (4) turbulence shocks generate persistent declines in GDP, consumption, investment, and employment.&lt;/strong&gt; The reallocation effects in fact (2) remain significant after controlling for the confounding effects of recessions and uncertainty, and the amplification by financing constraints is separately identified.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-model-mechanism-through-which-turbulence-drives-recessions"&gt;Q3. What is the model mechanism through which turbulence drives recessions?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In the model, firms produce using capital and labor subject to idiosyncratic productivity and borrowing constraints tied to expected equity value; when turbulence rises, high-productivity firms are less likely to remain productive, reducing their expected equity value and tightening their borrowing constraints relative to low-productivity firms.&lt;/strong&gt; This differential tightening induces reallocation of labor and capital toward low-productivity firms, reducing aggregate TFP. The feedback through equity values and collateral constraints amplifies the reallocation and generates aggregate-level recessions with synchronized declines in activity. The mechanism is distinct from models in which all firms face symmetric uncertainty shocks: turbulence creates differential effects by firm productivity level.&lt;/p&gt;
&lt;h3 id="q4-how-does-the-model-match-the-observed-macroeconomic-dynamics"&gt;Q4. How does the model match the observed macroeconomic dynamics?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The calibrated model replicates the empirical dynamics: it generates the observed reallocation from high- to low-productivity firms, declines in aggregate TFP and stock market value, and persistent contractions in GDP, consumption, investment, and employment following a turbulence shock.&lt;/strong&gt; The financial frictions play a quantitatively important role in amplifying the reallocation effects, consistent with the empirical finding that financing constraints amplify the cross-sectional reallocation documented in fact (2).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;turbulence&lt;/strong&gt; : the rate of churning in firms&amp;rsquo; productivity rankings, measured as one minus the Spearman rank correlation of firm-level TFP between adjacent years; distinct from uncertainty in that it changes both the conditional mean and variance of the productivity distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;reallocation channel&lt;/strong&gt; : the mechanism through which turbulence depresses aggregate TFP by shifting labor and capital from high- to low-productivity firms, amplified by tighter credit constraints on high-productivity firms whose expected equity value falls when productivity persistence declines.&lt;/p&gt;</description></item><item><title>Unconventional monetary policy spillovers and the (in)convenience of Treasuries</title><link>https://macropaperwarehouse.com/papers/unconventional-monetary-policy-spillovers-and-the-inconvenience-of-treasuries/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/unconventional-monetary-policy-spillovers-and-the-inconvenience-of-treasuries/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper asks why unconventional monetary policy (UMP) spillovers from the European Central Bank (ECB) to the U.S. Treasury yield curve vary so substantially over time, and whether the time-varying &amp;ldquo;convenience&amp;rdquo; of Treasuries — their non-pecuniary premium as the world&amp;rsquo;s preeminent safe asset — can explain that variation. The core claim is that a declining convenience yield on Treasuries makes them more substitutable with other safe sovereign bonds, thereby amplifying the portfolio-balance channel through which foreign large-scale asset purchases (LSAPs) depress U.S. term premia.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors use high-frequency identification of ECB monetary policy surprises following Altavilla et al. (2019), defined as the first principal component of intraday changes in 1-, 3-, 6-, 12-, and 24-month euro OIS rates plus 5- and 10-year German and French bond yields, measured in the 10-20 minute window bracketing each ECB decision press conference. Surprises are normalized so that one unit raises the 24-month euro OIS by 10 basis points. The sample runs from March 2001 to December 2023, covering approximately 265-268 ECB announcement dates. U.S. zero-coupon Treasury yields come from Gürkaynak et al. (2007); the yield is decomposed into an expected short-rate path and a term premium using the shadow-rate term structure model (SRTSM) of Wu and Xia (2016). The convenience yield on Treasuries is proxied by the spread between the 10-year Treasury yield and the maturity-matched overnight index swap (OIS) rate, so that a positive (and rising) spread indicates declining convenience. Structural breaks in the convenience yield are identified via the Bai-Perron test.&lt;/p&gt;
&lt;p&gt;The empirical strategy has three main components: (i) 700-business-day rolling regressions of Treasury yields and their decomposition on ECB surprises to document time variation; (ii) interaction regressions (following equation 5/9) that condition the ECB shock effect on lagged convenience-yield proxies, net Treasury supply, intermediary balance-sheet constraints (proxied by G10 covered-interest-parity deviations), and inflation-anchoring indicators; and (iii) a policy decomposition following Swanson (2021) that decomposes ECB surprises into &amp;ldquo;target,&amp;rdquo; &amp;ldquo;forward guidance,&amp;rdquo; and &amp;ldquo;LSAP&amp;rdquo; components. These empirical findings are rationalized in a two-country preferred-habitat model, extending Gourinchas, Ray, and Vayanos (in press) (GRV) by allowing the demand-slope parameter governing investor price elasticity to vary with the convenience yield. Functional derivatives and Malliavin calculus are used to characterize dynamic impulse responses to elasticity shifts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Rising spillovers post-GFC, concentrated at long maturities.&lt;/strong&gt; Rolling regressions show that ECB-to-U.S. spillovers were statistically indistinguishable from zero during the conventional-policy era but grew significantly after 2010, well before the ECB&amp;rsquo;s Expanded Asset Purchase Programme (EAPP) launched in 2015 and before &amp;ldquo;whatever it takes&amp;rdquo; (summer 2012). Spillovers began to dissipate not when ECB purchases ended (March 2022) but when the Fed announced tapering in November 2021 — consistent with the convenience channel rather than mere co-movement in LSAP volumes. A Bai-Perron test detects five structural breaks in the relationship between ECB surprises and 10-year Treasury yields.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Term-premium dominance, amplified by inconvenient Treasuries.&lt;/strong&gt; At average convenience-yield levels, a one-standard-deviation ECB loosening shock (lowering the 24-month euro OIS by 10 basis points) reduces the 10-year Treasury yield by approximately &lt;strong&gt;4.4 basis points&lt;/strong&gt; (column 5, Table 2). When the Treasury convenience yield is one standard deviation below its historical average (i.e., Treasuries are less convenient), the spillover increases by &lt;strong&gt;1.64 basis points&lt;/strong&gt;, making the total effect approximately &lt;strong&gt;6.1 basis points&lt;/strong&gt; — a shift from the bottom 20th to below the 12th percentile of the unconditional distribution of daily Treasury yield changes. This amplification operates entirely through the term premium; the expected path of short rates shows no statistically significant sensitivity to the convenience yield interacted with ECB shocks.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Net Treasury supply amplification.&lt;/strong&gt; Conditional on the net publicly available U.S. debt stock (Treasury debt less Fed holdings, as a percent of GDP), a one-standard-deviation ECB shock at average supply reduces the 10-year yield by approximately &lt;strong&gt;3.9 basis points&lt;/strong&gt;; when net supply is one standard deviation above its historical average (approximately 7.6 percentage points of GDP), the same shock generates a &lt;strong&gt;5.35 basis-point&lt;/strong&gt; decline — a 50-percent amplification (Table 5, column 5).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Intermediary constraints amplification.&lt;/strong&gt; Conditioning on the first principal component of G10 CIP deviations against the dollar (a proxy for intermediary balance-sheet tightness), a CIP deviation one standard deviation above average amplifies the ECB spillover from approximately &lt;strong&gt;3.9 basis points to 6.2 basis points&lt;/strong&gt; (Table 7).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Inflation anchoring.&lt;/strong&gt; Periods when inflation expectations lie outside the interquartile range of the historical distribution are associated with larger spillovers to 10-year Treasury yields, an effect that is statistically significant both above the 75th and below the 25th percentile of expectations, with point estimates of the interaction coefficient reaching approximately &lt;strong&gt;5.0-5.3 basis points&lt;/strong&gt; (Table 6).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Policy asynchronicity.&lt;/strong&gt; Spillovers are especially pronounced when the Federal Reserve is tightening while the ECB is easing. The rolling regressions show term-premium spillovers become dominant (relative to expected-path spillovers) post-2014, coinciding with U.S. normalization. The calibrated model shows that, during policy asynchronicity combined with lower convenience, the home short-rate tightening is partially offset by capital inflows induced by foreign QE, with the attenuation especially pronounced at intermediate and long maturities and persistent across multiple periods.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Alternative channels ruled out.&lt;/strong&gt; Horse-race regressions against the VIX, MOVE index, Economic Policy Uncertainty (EPU) index, Monetary Policy Uncertainty (MPU) index, and 30-day EUR/USD spot variance show none of these candidates displaces the convenience channel. Short-rate-risk decompositions (Bundick et al. 2017) and equity-orthogonal risk premium shocks (Leombroni et al. 2021) cannot explain the post-Taper Tantrum timing pattern of rising term-premium spillovers.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;All empirical results apply to ECB-to-U.S. spillovers; the paper explicitly leaves Bank of England-to-U.K. Gilt spillovers for future work.&lt;/li&gt;
&lt;li&gt;The portfolio-balance amplification through convenience is specific to unconventional monetary policy (LSAP shocks); target and forward-guidance components drive spillovers through different channels (expected short-rate path) and do not exhibit the same convenience-contingent amplification.&lt;/li&gt;
&lt;li&gt;The mechanism operates through preferred-habitat investors demanding sovereign-grade credit; the Bund convenience yield does not amplify U.S. spillovers, consistent with Bunds being an imperfect representation of the full portfolio requiring substitution under ECB capital-key-based purchases.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-how-do-the-authors-measure-ecb-monetary-policy-surprises-and-why-do-they-prefer-this-measure"&gt;Q1. How do the authors measure ECB monetary policy surprises, and why do they prefer this measure?&lt;/h3&gt;
&lt;p&gt;A1: Surprises are the first principal component of intraday changes in 1-, 3-, 6-, 12-, and 24-month euro OIS rates plus 5- and 10-year German and French bond yields, measured from 10-20 minutes pre-announcement to 10-20 minutes post-press conference. This cross-section of yields is preferred because it summarizes shocks to the overall stance of policy both at and away from the effective lower bound, including effects on different parts of the yield curve. The composite measure therefore subsumes both conventional rate actions and unconventional (LSAP, forward guidance) dimensions. Surprises are normalized so one unit raises the 24-month euro OIS by 10 basis points.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-key-empirical-fact-about-the-timing-of-spillover-emergence-and-dissipation"&gt;Q2. What is the key empirical fact about the timing of spillover emergence and dissipation?&lt;/h3&gt;
&lt;p&gt;A2: Rolling regressions show ECB spillovers to U.S. Treasury yields became statistically significant when the rolling window began integrating observations starting in approximately 2010 — substantially before the ECB&amp;rsquo;s EAPP (2015) and even before &amp;ldquo;whatever it takes&amp;rdquo; (summer 2012). Moreover, spillovers began to dissipate not when the ECB&amp;rsquo;s Pandemic Emergency Purchase Programme ended (March 2022) but when the Fed announced tapering in November 2021. This timing pattern is inconsistent with a simple &amp;ldquo;both central banks doing QE simultaneously&amp;rdquo; explanation and instead points to the importance of Federal Reserve balance sheet behavior for the convenience of Treasuries.&lt;/p&gt;
&lt;h3 id="q3-how-do-the-authors-decompose-the-treasury-yield-and-what-does-the-decomposition-reveal-about-the-channel-of-transmission"&gt;Q3. How do the authors decompose the Treasury yield, and what does the decomposition reveal about the channel of transmission?&lt;/h3&gt;
&lt;p&gt;A3: Following standard term-structure decomposition, the n-year yield equals the expected path of short-term rates over the maturity plus a maturity-specific term premium. Rolling regressions on this decomposition show that term-premium spillovers dominate expected-path spillovers, especially post-2014 when the Federal Reserve is out of sync with other advanced economies. Early ECB UMP spillovers showed a more even mix of expected-path and term-premium effects; later spillovers loaded much more heavily on the term premium. This is consistent with the portfolio balance channel — LSAPs remove duration risk and compress term premia, and this effect transmits internationally.&lt;/p&gt;
&lt;h3 id="q4-how-is-the-convenience-yield-proxied-and-why-does-the-paper-use-this-proxy-in-particular"&gt;Q4. How is the convenience yield proxied, and why does the paper use this proxy in particular?&lt;/h3&gt;
&lt;p&gt;A4: The authors use the spread between the sovereign bond yield and the maturity-matched overnight index swap rate (Y − OIS), expressed so that a larger spread (sovereign yield higher than OIS) reflects less convenience. Prior to the GFC, Treasury yields ran below swap rates (negative spread, high convenience); post-GFC, the spread reversed and turned positive, reflecting deterioration in Treasury specialness. This proxy is preferred because it captures the relative convenience as priced by the marginal investors the model focuses on — those with sovereign credit quality preferences and arbitrageurs — rather than broader measures such as the Treasury-to-corporate spread.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-quantitative-impact-of-convenience-yield-variation-on-the-size-of-ecb-spillovers-to-us-yields"&gt;Q5. What is the quantitative impact of convenience yield variation on the size of ECB spillovers to U.S. yields?&lt;/h3&gt;
&lt;p&gt;A5: In the most conservative specification (Table 2, column 5), an ECB loosening shock that lowers 24-month euro OIS by 10 basis points reduces the 10-year Treasury yield by 4.4 basis points when the convenience yield is at its historical average. When the convenience yield falls one standard deviation below average (Treasuries are less convenient), the spillover increases by 1.64 basis points to approximately 6.1 basis points. A one-standard-deviation change in 10-year Treasury yields in the sample is 5.86 basis points; the 4.4 bp response falls in the bottom 20th percentile of unconditional daily yield changes, while the 6.1 bp response falls below the 12th percentile.&lt;/p&gt;
&lt;h3 id="q6-does-the-amplification-of-spillovers-from-ecb-shocks-by-treasury-inconvenience-operate-through-the-term-premium-or-the-expected-short-rate-path"&gt;Q6. Does the amplification of spillovers from ECB shocks by Treasury inconvenience operate through the term premium or the expected short-rate path?&lt;/h3&gt;
&lt;p&gt;A6: The amplification operates entirely through the term premium. In Table 2, columns 7 and 8, the interaction coefficient between the ECB shock and the convenience yield proxy is positive and statistically significant for the 10-year term premium but is not statistically different from zero for the expected path of short rates. The authors interpret this as confirming the portfolio balance channel: displaced Bund investors substitute into Treasuries, raising Treasury prices and compressing term premia, with no mechanical connection to market participants&amp;rsquo; updating of expected future Federal Reserve policy rates.&lt;/p&gt;
&lt;h3 id="q7-how-does-net-treasury-supply-interact-with-the-size-of-ecb-spillovers"&gt;Q7. How does net Treasury supply interact with the size of ECB spillovers?&lt;/h3&gt;
&lt;p&gt;A7: Net U.S. Treasury supply (debt outstanding as a percent of GDP, less Fed holdings) is strongly positively correlated with the swap spread, confirming the link between supply and convenience. Interaction regressions (Table 5) show that a one-standard-deviation ECB shock at average net supply reduces 10-year yields by 3.9 basis points. When net supply is one standard deviation above average (approximately 7.6 percentage points of GDP), the same shock generates a 5.35 basis-point decline — roughly a 50 percent amplification. The point estimates suggest this operates primarily through term premia, though those interaction coefficients are statistically insignificant in the term premium specification.&lt;/p&gt;
&lt;h3 id="q8-how-do-intermediary-balance-sheet-constraints-relate-to-treasury-convenience-and-ecb-spillover-amplification"&gt;Q8. How do intermediary balance-sheet constraints relate to Treasury convenience and ECB spillover amplification?&lt;/h3&gt;
&lt;p&gt;A8: The authors follow Du, Hébert, and Huber (2023) in using deviations from covered interest parity (CIP) among G10 currencies against the dollar as a proxy for the shadow cost of intermediary balance-sheet constraints. When CIP deviations are at historical average, the ECB spillover to 10-year Treasury yields is approximately 3.9 basis points; when CIP deviations are one standard deviation above average, the spillover rises to approximately 6.2 basis points. The authors also use the plausibly exogenous variation from quarter-end &amp;ldquo;window dressing&amp;rdquo; (per Correa, Du, and Liao 2020): LSAP-type ECB surprises landing near quarter-end generate larger spillovers to the term premium, and the further into the quarter an announcement occurs, the larger the LSAP shock&amp;rsquo;s effect on the term premium — consistent with balance-sheet constraints amplifying the portfolio balance channel.&lt;/p&gt;
&lt;h3 id="q9-what-is-the-theoretical-model-and-what-is-the-key-innovation-relative-to-the-baseline-grv-framework"&gt;Q9. What is the theoretical model, and what is the key innovation relative to the baseline GRV framework?&lt;/h3&gt;
&lt;p&gt;A9: The paper extends the two-country preferred-habitat model of Gourinchas, Ray, and Vayanos (in press), in which segmented investors demand bonds of specific maturities and currencies while capital-constrained global arbitrageurs partially bridge the segmentation. The key innovation is allowing the demand-slope parameter α_j(τ) — which in GRV is fixed and governs how inelastic investors are with respect to price — to vary over time as a function of the convenience yield. When Treasuries are special (high convenience), α_H(τ) is large, demand is inelastic, and foreign shocks have limited pass-through. When convenience falls, α_H(τ) shrinks, demand becomes more elastic, investors reallocate more aggressively in response to yield differentials, and U.S. term premia respond more strongly to ECB purchases. Functional derivatives and Malliavin calculus are used to characterize both instantaneous and dynamic amplification effects.&lt;/p&gt;
&lt;h3 id="q10-what-does-the-calibrated-model-predict-about-the-maturity-structure-of-spillover-amplification"&gt;Q10. What does the calibrated model predict about the maturity structure of spillover amplification?&lt;/h3&gt;
&lt;p&gt;A10: In the calibration exercise (Figure 4), the elasticity perturbation is modeled as a smooth function (transformed Cauchy distribution) centered at the 10-year maturity, and the ECB QE shock is a purchase concentrated at the 5-year maturity amounting to 10 percent of euro-area GDP. The marginal change in the home yield impulse response (the quantity ∂²_{α_H,b} log P^τ_{Hs}) is positive across nearly all maturities and horizons, but is most pronounced around the 5-year maturity and during the first few periods after the shock — where the ECB purchase profile and the demand perturbation are most closely aligned in tenor. Amplification effects are persistent across horizons due to the dynamic multiplier in Theorem 3.1.&lt;/p&gt;
&lt;h3 id="q11-how-does-the-model-rationalize-the-2019-yield-curve-inversion"&gt;Q11. How does the model rationalize the 2019 yield curve inversion?&lt;/h3&gt;
&lt;p&gt;A11: In August 2019, the 10-year Treasury yield fell below short-term rates despite a robust domestic labor market, while the Fed was raising rates and the ECB remained accommodative. The model&amp;rsquo;s asynchronicity exercise (Section 3.3) shows that combining a home short-rate increase with ongoing foreign QE and a contemporaneous decline in Treasury convenience produces attenuated or even reversed yield curve responses. More elastic investors facing a flatter demand curve shift into longer-term Treasuries — whose relative yields remain attractive globally — resulting in a yield-curve inversion driven not by recession expectations but by asymmetric monetary policy and a time-varying convenience premium.&lt;/p&gt;
&lt;h3 id="q12-do-alternative-explanations--risk-sentiment-policy-uncertainty-exchange-rate-volatility--explain-the-time-variation-in-ecb-spillovers"&gt;Q12. Do alternative explanations — risk sentiment, policy uncertainty, exchange rate volatility — explain the time variation in ECB spillovers?&lt;/h3&gt;
&lt;p&gt;A12: No. Horse-race regressions in Table 9 condition the ECB shock on lagged VIX, MOVE index, Economic Policy Uncertainty (Baker et al. 2016), Monetary Policy Uncertainty (Husted et al. 2020), and 30-day EUR/USD spot variance. None of these measures displaces the baseline convenience-yield interaction, which remains statistically significant across all specifications. Elevated EPU is associated with smaller spillovers (consistent with uncertainty impairing substitution), but this does not reduce the magnitude or significance of the convenience-yield interaction. Exchange-rate variance does not alter spillover size. A rolling regression decomposing the term premium into a short-rate-uncertainty component (Bundick et al. 2017) and a residual shows the empirical pattern is more consistent with the residual — not the short-rate-volatility channel. An equity-orthogonal risk premium shock (Leombroni et al. 2021) explains some term premium effects in the early GFC period (2008-2012) but cannot rationalize the post-Taper Tantrum pattern of growing term-premium spillovers.&lt;/p&gt;
&lt;h3 id="q13-how-does-the-swanson-2021-decomposition-confirm-the-portfolio-balance-channel"&gt;Q13. How does the Swanson (2021) decomposition confirm the portfolio balance channel?&lt;/h3&gt;
&lt;p&gt;A13: Following Swanson (2021), the authors decompose ECB surprises into a &amp;ldquo;target surprise&amp;rdquo; (change in 3-month OIS futures), a &amp;ldquo;forward guidance surprise&amp;rdquo; (residual from projecting 24-month futures onto the target surprise), and an &amp;ldquo;LSAP surprise&amp;rdquo; (residual from projecting French and German 10-year bond yields onto target and forward guidance). In the full sample (Table 3), LSAP shocks drive spillovers to U.S. yields exclusively at higher maturities and exclusively through the term premium; they have no statistically significant impact on the expected path of short rates. Conditioning LSAP shocks on the convenience yield (Table 4, panel c) shows that it is specifically LSAP-type announcements combined with Treasury inconvenience that generate larger medium- and long-term term-premium spillovers, confirming the portfolio balance mechanism.&lt;/p&gt;
&lt;h3 id="q14-what-are-the-implications-for-fiscal-and-monetary-policy"&gt;Q14. What are the implications for fiscal and monetary policy?&lt;/h3&gt;
&lt;p&gt;A14: The paper argues that the persistently low long-term rates and yield curve inversions observed between the GFC and the COVID-19 pandemic were driven partly by ECB LSAPs amplified by U.S. quantitative tightening, which increased net Treasury supply, reduced Fed absorption, constrained dealer balance sheets, and lowered Treasury convenience. Simultaneously, U.S. monetary tightening raised short-term rates while ongoing ECB easing depressed long rates, reshaping the yield curve in a manner consistent with the model. More broadly, the effectiveness of conventional domestic monetary policy tightening is attenuated when the convenience yield is compressed and foreign QE is ongoing — not because the short rate fails to move, but because more elastic investors reallocate around it. This suggests policy asynchronicity, combined with declining convenience, creates a constraint on monetary independence that may require more forceful or coordinated policy action.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Convenience yield (Treasury convenience premium)&lt;/strong&gt;
The non-pecuniary value that investors derive from holding U.S. Treasury securities over and above cash flows and credit risk — arising from their deep and liquid markets, broad regulatory compatibility, high-quality collateral function, and reserve-currency status. Operationalized in this paper as the spread between the n-year Treasury yield and the maturity-matched overnight index swap (OIS) rate; a positive and rising spread indicates declining convenience, not increasing yield risk.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Portfolio balance channel (of unconventional monetary policy transmission)&lt;/strong&gt;
The mechanism by which large-scale asset purchases by one central bank displace investors from their target allocations, inducing them to substitute into other assets — including foreign sovereign bonds — thereby compressing yields and term premia in those markets. Distinguished from the signaling/expected-path channel in that it operates through changes in duration risk (term premia) rather than revisions to expected future short rates, and is unique to UMP because it targets long-duration assets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Preferred habitat investors&lt;/strong&gt;
Investors with persistent, institutionally determined demand for bonds of specific maturities and issuers (e.g., insurance companies, pension funds), arising from regulatory constraints, risk management practices, or balance sheet matching. Their demand is modeled as relatively price-inelastic when assets command a convenience premium, and more elastic when that premium erodes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Demand-slope parameter α_j(τ)&lt;/strong&gt;
In the extended GRV preferred-habitat model, the parameter governing the price elasticity of preferred-habitat investor demand for country-j bonds of maturity τ. Large values imply inelastic demand (strong habitat preferences), small values imply elastic demand and greater cross-border substitutability. The paper&amp;rsquo;s key innovation is treating this parameter as time-varying — specifically, as a function of the observed Treasury convenience yield rather than a fixed structural constant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy asynchronicity&lt;/strong&gt;
The condition in which the Federal Reserve is tightening monetary policy (raising rates or conducting quantitative tightening) while other advanced-economy central banks (specifically the ECB) are simultaneously easing through LSAPs. The paper argues that asynchronicity interacts with a declining convenience yield to amplify ECB spillovers to U.S. term premia and attenuate the effectiveness of Federal Reserve tightening at the long end of the yield curve.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Swap spread (as inconvenience proxy)&lt;/strong&gt;
The spread of the sovereign bond yield over the maturity-matched OIS rate (Y − OIS). Expressed so that a larger positive value indicates greater Treasury inconvenience. Prior to the GFC, 10-year Treasury yields ran below swap rates (negative spread); post-GFC, this relationship reversed, with the spread turning persistently positive and exhibiting structural breaks consistent with Bai-Perron tests.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exorbitant privilege&lt;/strong&gt;
The benefit the United States accrues from the global dominance of its sovereign debt and currency, which structurally insulates U.S. financial markets from foreign monetary policy shocks through inelastic global demand for Treasuries. The paper argues this insulation is not structural but endogenous and state-dependent: erosion of exorbitant privilege — operationalized as a declining convenience yield — substantially increases U.S. vulnerability to foreign monetary shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Gâteaux/Malliavin functional derivative (as used in the model)&lt;/strong&gt;
Mathematical tools used to characterize how the impulse response function of the yield curve to policy shocks changes when the demand-slope parameter α_k(τ) is perturbed. The mixed Gâteaux differential ∂²_{α_k,b} log P^(τ)_{js} captures both the instantaneous amplification (direct pass-through increase) and the intertemporal propagation (dynamic multiplier) of a foreign policy shock under lower convenience, enabling a tractable decomposition of state-contingent spillover magnitudes across maturities and horizons.&lt;/p&gt;</description></item><item><title>Uniform Priors for Impulse Responses</title><link>https://macropaperwarehouse.com/papers/uniform-priors-for-impulse-responses/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/uniform-priors-for-impulse-responses/</guid><description>&lt;p&gt;Structural vector autoregressions (SVARs) identified with sign restrictions are a widely used tool for estimating dynamic causal effects in macroeconomics. Critics—notably Baumeister and Hamilton (2015) and Watson (2020)—have called for caution because the standard practice of using a uniform prior over the set of orthogonal matrices (with respect to the Haar measure) induces non-uniform marginal prior distributions over the identified sets of individual impulse responses. This paper formally challenges that caution: through an if-and-only-if theorem the authors show that the uniform prior over orthogonal matrices is not only sufficient but also necessary to induce a uniform joint prior distribution over the identified set for the &lt;em&gt;vector&lt;/em&gt; of impulse responses—a result that holds for any prior distribution over the reduced-form parameters. The paper additionally shows how to conduct posterior inference based on a uniform joint prior for the vector of impulse responses, which requires modifying the prior for the reduced-form parameters away from the standard Minnesota prior while retaining the uniform prior over orthogonal matrices. An application to Watson&amp;rsquo;s (2020) empirical example finds that joint credible sets under this new prior are similar to, but wider than, those obtained under the standard approach, and that imposing tighter identifying restrictions sharpens inference under both priors.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-core-result-and-what-does-it-imply-for-applied-work"&gt;Q1. What is the core result and what does it imply for applied work?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The central result is an if-and-only-if theorem: the uniform prior over the set of orthogonal matrices is both sufficient and necessary for the conventional Bayesian approach to induce a uniform joint prior distribution over the identified set for the vector of impulse responses, for any prior over the reduced-form parameters.&lt;/strong&gt; The critics&amp;rsquo; concern about non-uniform individual marginal priors does not extend to the joint object: when inference targets the full vector of impulse responses, the standard Haar prior is exactly appropriate. Practitioners interested in joint inference on the shape and comovement of the impulse response function need not heed the call for caution.&lt;/p&gt;
&lt;h3 id="q2-why-does-non-uniformity-of-individual-marginal-priors-not-imply-non-uniformity-of-the-joint-distribution"&gt;Q2. Why does non-uniformity of individual marginal priors not imply non-uniformity of the joint distribution?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The marginal distribution extracted from a uniform joint distribution over a compact manifold need not be uniform; marginal uniformity and joint uniformity are different properties, and only the latter is required for observationally equivalent vectors to be distinguished solely by the identifying restrictions.&lt;/strong&gt; Baumeister and Hamilton (2015) and Watson (2020) correctly note that individual impulse responses have non-uniform marginal priors under the Haar measure, but this is not the relevant criterion when the object of interest is the entire impulse response vector. The paper&amp;rsquo;s theorem shows the joint distribution is uniform, which is the property that ensures the identification restrictions—not the prior—drive the posterior shape.&lt;/p&gt;
&lt;h3 id="q3-how-does-one-implement-a-uniform-joint-prior-for-the-vector-of-impulse-responses"&gt;Q3. How does one implement a uniform joint prior for the vector of impulse responses?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The authors show that a uniform joint prior for the vector of impulse responses requires a modified prior for the reduced-form parameters: one that is independent between (B, Σ) and Q, takes a model-dependent non-standard form for (B, Σ), and retains a uniform prior over orthogonal matrices.&lt;/strong&gt; The induced reduced-form prior resembles but differs from both the standard Minnesota prior and Uhlig&amp;rsquo;s (2005) &amp;ldquo;weak prior.&amp;rdquo; Because the induced prior for (B, Σ, Q) is still a uniform-normal-inverse-Wishart (UNIW) distribution, the conventional sampling algorithm applies without modification; analysts supply the modified reduced-form prior while continuing to draw Q uniformly from the Haar measure.&lt;/p&gt;
&lt;h3 id="q4-what-does-the-empirical-illustration-show"&gt;Q4. What does the empirical illustration show?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In Watson&amp;rsquo;s (2020) empirical example, joint credible sets under the uniform-joint-prior approach are similar to but wider than those under the standard Minnesota-prior approach.&lt;/strong&gt; The widening is consistent with theory: the uniform joint prior spreads probability mass more evenly over the identified set rather than concentrating it toward regions favored by the Minnesota prior. The finding that tighter identifying restrictions sharpen inference under both approaches reinforces the conclusion of Inoue and Kilian (2022b) that many sign restrictions help when the focus is on joint distributions.&lt;/p&gt;
&lt;h3 id="q5-how-is-the-analysis-generalized"&gt;Q5. How is the analysis generalized?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper extends the results to a broader class of objects of interest—any smooth function of impulse responses, such as combinations of structural elasticities and standard deviations—with an importance-sampling correction when the induced prior over orthogonal matrices is not uniform in the extended case.&lt;/strong&gt; The generalization exploits the diffeomorphism between IR parameters and orthogonal reduced-form parameters, which allows the change-of-variables formula to apply to any smooth object of interest.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;vector of impulse responses&lt;/strong&gt; : the collection of impulse responses across all variables, shocks, and horizons, treated as a single vector object for joint inference; contrasted with individual impulse responses (the response of one variable to one shock at one horizon).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;uniform prior over orthogonal matrices (Haar measure)&lt;/strong&gt; : the unique probability measure on the set of n×n orthogonal matrices invariant under left and right multiplication; the standard prior used in Bayesian sign-restricted SVARs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;identified set&lt;/strong&gt; : the set of vectors of impulse responses that are observationally equivalent given the data and the sign restrictions; the conventional approach draws uniformly from this set under the Haar prior.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;uniform-normal-inverse-Wishart (UNIW) prior&lt;/strong&gt; : the joint prior over orthogonal reduced-form parameters consisting of the Haar prior over Q and a normal-inverse-Wishart prior over (B, Σ); conjugate and computationally tractable.&lt;/p&gt;</description></item><item><title>US Public Debt and Safe Asset Market Power</title><link>https://macropaperwarehouse.com/papers/us-public-debt-and-safe-asset-market-power/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/us-public-debt-and-safe-asset-market-power/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper asks whether the U.S. government exploits its market power as the dominant global supplier of safe assets when setting the quantity of public debt, and quantifies the macroeconomic consequences of this strategic behavior. The paper develops a two-country general equilibrium model in which U.S. public debt provides a non-pecuniary benefit to foreign holders (capturing liquidity, collateral, and safety value) and the U.S. is the monopoly provider of this asset — facing a downward-sloping demand curve for Treasuries, so that issuing more debt reduces the convenience yield. The paper then tests empirically whether the data favor this monopoly model over a price-taking benchmark, exploiting the industrial organization insight that rotations in the demand curve (changes in elasticities during high- versus low-volatility regimes) can distinguish strategic from competitive behavior. Using quarterly data from 1935 to 2020, the paper finds that the data reject price-taking behavior in favor of the monopoly model across a wide range of specifications. Quantitatively, the monopoly calibration implies that U.S. market power generates approximately 45% of the observed convenience yield as a markup (about 30 basis points out of 68 basis points on average), causes safe asset supply to be roughly half what it would be under price-taking, and generates welfare gains to the U.S. of 0.21% in permanent consumption equivalents — almost half of which is attributable to market power rather than to the non-pecuniary value itself.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-theoretical-framework-for-us-market-power-in-safe-assets"&gt;Q1. What is the theoretical framework for U.S. market power in safe assets?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper develops a deterministic, infinite-horizon, two-country model in which the U.S. is the sole provider of an asset with a non-pecuniary benefit to foreign (Rest of World) households, so the U.S. faces a downward-sloping demand curve for its public debt and acts as a monopolist in equilibrium.&lt;/strong&gt; In the model, purchasing U.S. public debt yields a non-pecuniary flow benefit captured by an increasing, concave function f(b*). Because of this benefit, the equilibrium return on U.S. debt is lower than the return on capital — the gap being the convenience yield, defined as the spread between the U.S. capital return and the return on U.S. public debt. The U.S. Ramsey government internalizes the inverse demand function for its debt when solving its optimal fiscal problem, creating a standard monopoly markup: the equilibrium markup equals the inverse of the demand elasticity, µ = 1/ε_D, where ε_D is the price elasticity of foreign demand for U.S. Treasuries. Under price-taking, the markup is zero and the convenience yield reflects only the non-pecuniary value; under the monopoly model, the convenience yield is inflated by the markup, reducing debt supply below the competitive level.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-paper-test-monopoly-versus-price-taking-behavior-empirically"&gt;Q2. How does the paper test monopoly versus price-taking behavior empirically?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper applies the conduct-testing approach of Bresnahan (1982) and the model selection test of Rivers and Vuong (2002): since rotations in the demand curve (changes in the elasticity, holding the level fixed) shift prices only if the firm exploits market power, a finding that convenience yields increase in high-elasticity regimes while quantities decrease is evidence of strategic behavior.&lt;/strong&gt; The paper uses a regime indicator for periods of high global volatility (measured by the rolling standard deviation of MSCI UK Index returns over 1935–2020) as the demand rotator: during high-volatility periods, investors&amp;rsquo; demand for safe assets is more inelastic (flight-to-safety), causing the demand curve to both shift outward and rotate (become steeper). The monopoly model predicts that the U.S. responds to the more inelastic demand by raising the convenience yield through higher markups and restricting supply, whereas the price-taking model attributes any convenience yield increase purely to shifts in marginal cost.&lt;/p&gt;
&lt;p&gt;Empirically, the data show that convenience yields are higher and debt-to-GDP ratios lower during high-volatility periods — inconsistent with the price-taking model&amp;rsquo;s prediction that both prices and quantities should rise in a demand shift, and consistent with the monopoly model&amp;rsquo;s prediction of reduced supply. The estimated demand semi-elasticities are −0.20% per log-point in low volatility and −0.59% per log-point in high volatility (OLS), implying demand elasticities of 1.07 in low volatility and 3.18 in high volatility. The Rivers-Vuong test statistics reject price-taking in favor of the monopoly model at the 1% significance level under both OLS and IV specifications and across a wide range of assumed cost elasticities.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-quantitative-magnitude-of-safe-asset-underprovision"&gt;Q3. What is the quantitative magnitude of safe asset underprovision?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Using a demand elasticity of 2.2 (the average of the OLS and IV estimates from specifications without the demand rotator, consistent with the prior literature), the paper&amp;rsquo;s calibrated monopoly model implies that the U.S. safe asset supply is approximately half as large as it would be if the U.S. acted as a price taker: the steady-state total safe assets-to-GDP ratio is 0.39 in the monopoly equilibrium versus 0.59 in the competitive equilibrium.&lt;/strong&gt; The markup accounts for approximately 45% of the average convenience yield of 68 basis points, implying a markup of about 30 basis points. The interest rate on U.S. public debt is 0.97% in the monopoly equilibrium versus 1.09% in the competitive equilibrium — a difference of 12 basis points — reflecting both the lower debt level and the higher convenience yield that the monopoly generates. These results hold across alternative parameterizations of the cost and demand elasticities.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-welfare-implications-of-safe-asset-market-power"&gt;Q4. What are the welfare implications of safe asset market power?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Market power generates significant welfare gains to the U.S. and welfare losses to the Rest of World: transitioning from the monopoly steady state to an economy with no special role for U.S. assets costs the U.S. 0.21% in permanent consumption equivalents and benefits the Rest of World by 0.34%; transitioning to a competitive equilibrium (price-taking but maintaining the special role) costs the U.S. 0.08% and benefits the Rest of World by 0.10%.&lt;/strong&gt; This decomposition implies that roughly 60% of the U.S. welfare gain from its safe asset status is attributable to the non-pecuniary value (the benefit function f), and approximately 40% is attributable to market power per se. The interpretation is that the U.S. captures surplus from global safe asset demand through both the intrinsic value of its debt and through monopoly rents from restricting supply. The paper interprets these welfare gains as a quantification of &amp;ldquo;exorbitant privilege&amp;rdquo; arising from the supply side rather than from risk premium considerations.&lt;/p&gt;
&lt;h3 id="q5-what-happens-when-safe-asset-competition-increases"&gt;Q5. What happens when safe asset competition increases?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper analyzes the effects of introducing Cournot competition among multiple sovereign safe asset suppliers, finding that while the aggregate supply of global safe assets increases substantially with more competitors, the U.S. public debt level itself is fairly stable, borrowing costs for the U.S. increase, and the Rest of World welfare improves.&lt;/strong&gt; With N=2 symmetric Cournot competitors, the aggregate safe asset supply approximately doubles relative to the monopoly baseline, but each supplier&amp;rsquo;s equilibrium quantity is roughly unchanged. As N increases further, aggregate supply continues to grow, convenience yields fall, and interest rates on U.S. debt rise. A domestic financial fringe competing with U.S. government debt is modeled differently: because the U.S. government internalizes domestic fringe profits, domestic competition results in less competitive pressure, higher markups, and smaller welfare losses for the U.S. than the same amount of competition from foreign suppliers. These results quantify the macroeconomic stakes of initiatives to create alternative safe assets, such as euro area supranational safe bonds or Chinese reserve currency aspirations.&lt;/p&gt;
&lt;h3 id="q6-what-identifies-strategic-versus-competitive-behavior-using-debt-holder-composition"&gt;Q6. What identifies strategic versus competitive behavior using debt holder composition?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;As a complementary identification strategy, the paper exploits time variation in the composition of U.S. Treasury holders: foreign investors (primarily official sector) have more inelastic demand than domestic investors (primarily financial institutions and mutual funds), and the increasing share of foreign investors since the 1970s implies a secular decline in the average demand elasticity.&lt;/strong&gt; The paper estimates demand elasticities separately for the two groups, finds the foreign investor curve is more inelastic, and uses the implied time-varying average elasticity as a second demand rotator. The conduct test under this alternative approach also rejects price-taking in favor of the monopoly model. The monopoly model explains the observed increase in long-term convenience yields since the 1970s through rising markups driven by the shift toward less elastic foreign investors, rather than through rising marginal costs of debt issuance.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;dl&gt;
&lt;dt&gt;&lt;strong&gt;safe asset market power&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the U.S. government&amp;rsquo;s ability to internalize the downward-sloping foreign demand curve for U.S. Treasuries and restrict supply to maintain a high convenience yield; the paper provides the first formal empirical test and quantification of this strategic behavior in the global safe asset market.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;convenience yield markup&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the component of the observed convenience yield on U.S. Treasuries attributable to monopoly pricing rather than to the intrinsic non-pecuniary value of the assets; estimated at approximately 45% of the total convenience yield (about 30 out of 68 basis points) under the paper&amp;rsquo;s baseline demand elasticity of 2.2.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;demand rotator (Bresnahan identification)&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;a variable that changes the elasticity of demand without shifting its level, enabling identification of strategic conduct: observing that prices rise and quantities fall when demand becomes more inelastic (as during high-volatility regimes) is evidence of monopoly pricing, since a price taker would not respond to an elasticity change alone.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;safe asset underprovision&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;the quantity distortion from monopoly pricing in the global safe asset market; the paper estimates the steady-state safe-asset-to-GDP ratio is approximately 50% lower in the monopoly equilibrium than in the competitive benchmark, reflecting the standard monopoly restriction of output to exploit the downward-sloping demand curve.&lt;/dd&gt;
&lt;/dl&gt;</description></item><item><title>Vanguard: Black Veterans and Civil Rights After World War I</title><link>https://macropaperwarehouse.com/papers/vanguard-black-veterans-and-civil-rights-after-world-war-i/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/vanguard-black-veterans-and-civil-rights-after-world-war-i/</guid><description>&lt;p&gt;This paper provides the first causal evidence on how military service shaped Black civil rights activism in the aftermath of World War I. The research question is whether random induction into the segregated National Army caused Black men to join the nascent NAACP and become prominent community leaders during the New Negro era. The authors leverage the WWI draft lottery — in which each registrant&amp;rsquo;s unique serial number was drawn from a bowl to determine induction order — as an instrument for military service, a source of exogenous variation not previously exploited in the literature.&lt;/p&gt;
&lt;p&gt;To support this analysis, Ang and Chinoy construct an unusually rich dataset by digitizing nearly one million Black draft registration cards from the first registration (June 17, 1917), linking them through the 1930 full-count census to 233,517 NAACP member observations across 227 branches from 1912 to 1940, and supplementing with Veterans Administration records, Army Transport Service passenger lists, and biographical dictionaries of prominent African Americans. The instrument — serial number percentile within draft board and race (SNP%) — is validated against all observed pre-draft registrant characteristics and yields a first-stage F-statistic of 1,051 in the preferred specification.&lt;/p&gt;
&lt;p&gt;The main finding is that Black men randomly induced to serve in the military were nearly three times more likely to join the NAACP than observably similar registrants from the same draft board (TSLS coefficient 0.0219, se = 0.0049, against a sample mean NAACP participation rate of 0.8%). The authors estimate that the draft induced more than 10,000 Black men to join the NAACP in total. Military service also raised the probability of appearing in biographical dictionaries of historically prominent African Americans by a factor of roughly 1.6 (TSLS coefficient 0.0027, se = 0.0012, sample mean 0.17%). These results are robust to alternative instruments, flexible polynomial specifications of SNP%, state-year fixed effects, and alternative veteran-status measures from VAMI and ATS records. They are also not explained by differential residential mobility: adding controls for interstate and North-South migration leaves the main coefficient essentially unchanged (0.0217-0.0218).&lt;/p&gt;
&lt;p&gt;In contrast, TSLS estimates for all socioeconomic outcomes — literacy, home ownership, employment, census-predicted income, actual 1940 income, and educational attainment — are small and insignificant, ruling out human capital acquisition as a mechanism. Club involvement measured in the census is likewise unaffected, indicating that NAACP membership reflects specifically civil rights activism rather than generically greater social participation.&lt;/p&gt;
&lt;p&gt;The mechanism the paper identifies is experienced discrimination. Effects on NAACP participation increase monotonically with the racial gap in induction rates across draft boards (significant at p = 0.01). Effects are large and significant for men assigned to camps that restricted Black soldiers&amp;rsquo; access to military training (coefficient 0.0351, se = 0.0104) and to officer promotion (coefficient 0.0360, se = 0.0111), and are large for men in both restriction types simultaneously (coefficient 0.0367, se = 0.0114). In contrast, men attending less discriminatory camps show small and insignificant effects. Among the two all-Black combat divisions, NAACP participation is highest for veterans of the 92nd Division — subjected to constant racial abuse under U.S. command — and lower for the 93rd Division, which served under more hospitable French command. Previously unstudied veteran surveys from Virginia and Connecticut corroborate this narrative: respondents from camps with training and promotion restrictions were more than twice as likely to mention racial injustice, and mentions of injustice were more predictive of postwar civic engagement than any other survey theme.&lt;/p&gt;
&lt;p&gt;The scope of the paper is Black male registrants in the first WWI draft registration (men aged 21-30 as of June 17, 1917), linked to a sample of approximately 300,000 in the 1930 census. Effects are attenuated for men from counties with greater racial hostility — proxied by Confederate state status, Confederate monument density, and county lynching rates — consistent with the interpretation that activism was more feasible in less repressive environments.&lt;/p&gt;
&lt;p&gt;Q: What is the core identification strategy and why was it not feasible to use it before this paper?
A: The paper uses each Black registrant&amp;rsquo;s serial number percentile within his draft board and racial group (SNP%) as an instrument for WWI military service. Unlike the WWII and Vietnam drafts, which used birthday-based lotteries, the WWI lottery assigned induction order by drawing unique serial numbers from a bowl, making serial number rank the source of quasi-random variation. This source had never been exploited in the literature, partly because the serial numbers had to be hand-captured from digitized draft card images.&lt;/p&gt;
&lt;p&gt;Q: How strong is the first stage, and was the lottery truly random?
A: The first-stage F-statistic is 1,051, and a ten-percentile decrease in SNP% is associated with a 34.5 percentage point increase in the probability of serving. Bivariate serial numbers show some non-random patterns — nine of 13 pre-draft characteristics correlate with raw SN% — likely because some Southern boards inflated numbers for white registrants. Conditioning on board fixed effects and using SNP% within board-race cells eliminates these correlations; Panel B of Appendix Table A1 shows the largest standardized coefficient falls to 0.006.&lt;/p&gt;
&lt;p&gt;Q: What is the magnitude of the effect on NAACP membership and how does the causal estimate compare to a naive OLS?
A: The TSLS coefficient is 0.0219 (se = 0.0049) against a sample mean of 0.8%, implying roughly a threefold increase in NAACP membership. The OLS estimate of 0.0116 understates the causal effect, consistent with the marginal man induced by the lottery being observationally weaker than infra-marginal volunteers.&lt;/p&gt;
&lt;p&gt;Q: Does the effect reflect simply that veterans moved to Northern cities where NAACP branches were more accessible?
A: No. Adding indicators for interstate migration and North-South migration leaves the TSLS coefficient essentially unchanged at 0.0218 and 0.0217, respectively. The Great Migration channel is thus not the operative mechanism.&lt;/p&gt;
&lt;p&gt;Q: Did military service improve Black veterans&amp;rsquo; economic outcomes?
A: TSLS estimates for literacy, home ownership, employment, census-predicted income, actual 1940 income, and educational attainment are all small and statistically insignificant. This contrasts sharply with evidence on Black veterans of WWII and Korea (Greenberg et al., 2022) and is consistent with the documented absence of meaningful postwar benefits or training for Black WWI soldiers.&lt;/p&gt;
&lt;p&gt;Q: If it was not human capital or migration, what mechanism does the paper establish?
A: The primary mechanism is exposure to institutional discrimination during military service. Three distinct empirical patterns converge: (1) effects increase monotonically with draft board racial disparities in induction rates; (2) effects are large and significant for men at camps that denied training and promotion, and near zero for men at less discriminatory camps; (3) veteran survey mentions of racial injustice are more common among men from discriminatory camps and are more predictive of postwar NAACP membership than any other survey theme.&lt;/p&gt;
&lt;p&gt;Q: How do the two all-Black combat divisions differ in their postwar NAACP participation, and what does this reveal?
A: Veterans of the 92nd Division, who fought under U.S. command amid constant racial abuse, show the highest NAACP participation rates. Veterans of the 93rd Division, who fought under French command and were received with relative hospitality, show lower (though not statistically significantly lower) participation. Since both divisions received similar formal training and neither group shows socioeconomic gains, the differential reflects discrimination exposure rather than skill acquisition.&lt;/p&gt;
&lt;p&gt;Q: What is the quantitative scale of the effect for the most discriminatory camps?
A: For men assigned to camps with restrictions on both training and promotion, the TSLS coefficient on NAACP membership is 0.0367 (se = 0.0114) — more than 1.5 times the average estimate of 0.0219. Men at camps without restrictions show coefficients that are small and statistically insignificant.&lt;/p&gt;
&lt;p&gt;Q: How does county-level racial hostility moderate the effect?
A: The effects of military service on NAACP membership are larger — more positive — for men from counties with fewer Confederate monuments, lower lynching rates, and non-Confederate state status. This is interpreted as evidence that activism in response to discriminatory military experiences was more feasible in less racially hostile local environments, rather than as evidence that discrimination exposure was lower.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s aggregate policy implication regarding the scale of the draft&amp;rsquo;s effect on the civil rights movement?
A: The authors estimate that the WWI draft induced more than 10,000 Black men to join the NAACP. Veterans accounted for nearly 15% of all male NAACP members, against roughly 8% of Black male adults in the population, and were significantly more likely to appear in biographical dictionaries of prominent African Americans. The draft thus constituted a sizable and measurable contribution to the organizational vanguard of the early civil rights movement.&lt;/p&gt;
&lt;p&gt;Q: How does the paper contribute to the economics of discrimination beyond documenting discriminatory behavior by majority actors?
A: Most economics research on discrimination studies the conduct of white decision-makers (e.g., racial bias in hiring, lending, or bail). This paper examines how experiences of discrimination reshape the political behavior and aspirations of the minority group itself. The results show that institutional betrayal — systematic exclusion, degradation, and denial of training — generated deep discontent that translated into aggressive political mobilization, a dynamic the authors trace through subsequent episodes including the WWII Double V campaign and responses to police killings.&lt;/p&gt;
&lt;p&gt;Serial number percentile within draft board and race (SNP%): The instrument constructed by the authors. Each WWI registrant received a serial number from 1 to the size of his draft board; those numbers were drawn in random order to determine induction priority. SNP% measures where a registrant fell in that draw relative to others in his board and racial group, and serves as the source of quasi-random variation in veteran status.&lt;/p&gt;
&lt;p&gt;New Negro era: The period of invigorated Black political and cultural assertiveness following WWI, characterized by renewed racial pride, economic independence, and progressive politics. The movement spanned the Harlem Renaissance, the Universal Negro Improvement Association, the American Negro Press, and the Brotherhood of Sleeping Car Porters, and represented a rejection of the &amp;ldquo;conservatism, parochialism, and political accommodationism&amp;rdquo; of older Black leaders.&lt;/p&gt;
&lt;p&gt;Draft board racial gap: The authors&amp;rsquo; measure of draft board discrimination, defined as the difference in induction rates between Black and white registrants within a given draft board. The interquartile range spans roughly 0 to 20 percentage points, with a notable fraction of boards exhibiting gaps exceeding 30 percentage points.&lt;/p&gt;
&lt;p&gt;Camp discrimination: The denial of military training and officer promotion opportunities to Black soldiers, documented in War Department reports by military intelligence officers tasked with monitoring the treatment of Black soldiers. The paper classifies each camp as restricted or unrestricted on each dimension and uses this classification to estimate heterogeneous treatment effects.&lt;/p&gt;
&lt;p&gt;Institutional betrayal: The paper&amp;rsquo;s characterization of the U.S. government&amp;rsquo;s treatment of Black WWI soldiers — drafting them at higher rates than whites, denying them training and promotion, and assigning them to menial labor — as generating a profound sense of injustice that motivated postwar political activism rather than loyalty or accommodation.&lt;/p&gt;
&lt;p&gt;NAACP membership as civil rights activism proxy: The paper uses dues-paying membership in local NAACP branches as its primary quantitative measure of civil rights participation. Membership involved active financial cost (annual fees of $1 to $10 at a time when median Black family income was below $500), exposure to harassment and violence in the South, and participation in local protest and legal advocacy, distinguishing it from passive civic engagement.&lt;/p&gt;</description></item><item><title>Voluntary Minimum Wages: The Local Labor Market Effects of National Retailer Policies</title><link>https://macropaperwarehouse.com/papers/voluntary-minimum-wages-the-local-labor-market-effects-of-national-retailer-policies/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/voluntary-minimum-wages-the-local-labor-market-effects-of-national-retailer-policies/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper studies the labor market effects of voluntary minimum wages (VMWs) — company-wide, publicly announced wage floors set by large private employers — in the U.S. low-wage retail and service sector from 2014 to 2023. The central questions are: (1) How do VMWs affect wages and employment at the adopting large retailers? (2) Do VMWs generate wage spillovers to other employers in shared local labor markets?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Setting&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors use anonymized payroll data obtained from a large U.S. credit bureau, covering the wage distributions and employment of over 4,000 firms and approximately 18 million hourly workers (roughly 22–24% of the U.S. hourly workforce) from January 2013 to August 2023. The database is skewed toward retail and service sectors: over a third of covered workers are in retail, and over half in retail and services combined. Critically, the data also include worker flow information — records of individual workers moving between firms — enabling the authors to define shared labor markets via actual employment transitions rather than broad geographic or industry proxies.&lt;/p&gt;
&lt;p&gt;The sample of VMW events consists of &lt;strong&gt;20 voluntary minimum wage policies across 5 large retailers&lt;/strong&gt; (each with over 150,000 employees nationally), restricted to events with no other major wage policy within six months before or after the focal event. Voluntary minimum wage announcements were identified from an inventory maintained by the National Employment Law Project and independently verified through media sources, then matched to anonymized companies using employer size, industry, and observed shifts in the wage distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Identification Strategy&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors adapt the &lt;strong&gt;gap design&lt;/strong&gt; from the national minimum wage literature. For each company-by-commuting-zone (CZ) cell, the &amp;ldquo;gap&amp;rdquo; measures the percent increase in average hourly wages that would be required to bring all workers in the area up to the company&amp;rsquo;s new voluntary minimum. The gap is averaged over months −6 to −3 before the event (months −3 to −1 serve as a built-in placebo-in-time check). This variation in bite across CZs — arising because the same nominal VMW level implies different wage increases depending on local wage distributions — is combined with a stacked event study across 20 VMW events. Spillover effects are estimated by regressing log average wages at non-policy establishments on the large retailer&amp;rsquo;s CZ-level gap measure, progressively narrowing the definition of &amp;ldquo;labor market&amp;rdquo; from: (i) all non-policy establishments in the same CZ, to (ii) establishments in industries connected to the large retailer by worker flows (15 three-digit NAICS industries), to (iii) specific establishments with documented pre-event worker flows to or from the large retailer (&amp;ldquo;connected establishments&amp;rdquo;).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Own effects:&lt;/em&gt; For $15 VMW events, moving from a CZ gap of 0 to a gap of 1 is associated with an approximately 88 log point increase in average hourly wages in the six months after adoption. Given that the average establishment-level gap for $15 VMWs is 0.11, the implied average wage increase is approximately 10.45% (the authors&amp;rsquo; estimate is 9–10%, consistent with small wage increases even in zero-gap comparison areas). Employment of workers earning under $30 per hour rose by 4.62% after $15 VMW events, 2.01% after major events (affecting ≥30% of workforce), and 1.25% across all 20 events. These employment increases are &lt;strong&gt;entirely attributable to reduced separations&lt;/strong&gt; rather than new hiring: separation rates fell by 0.42, 0.57, and 1.09 percentage points after all, major, and $15 VMW events respectively — equivalent to reductions of 6.57%, 8.73%, and 15.33% relative to pre-period means. Separations specifically to other database companies fell by 0.07–0.19 percentage points (5.63–13.48% relative to base rates). If anything, new hiring fell modestly after VMW adoption. Total monthly base pay and gross compensation both rose after VMWs, indicating increased total take-home pay without compensatory reductions in hours or bonuses. The total employment elasticity with respect to wages ranges from approximately 0.35 to 0.45, while the quit elasticity is 2.20–2.38 (consistent with dynamic monopsony models in which the labor supply elasticity is twice the quit elasticity).&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Spillover effects:&lt;/em&gt; Across all three definitions of the labor market, the paper estimates &lt;strong&gt;precise, economically negligible cross-employer wage spillovers&lt;/strong&gt; in the six months following VMW events. Cross-employer wage elasticities are statistically indistinguishable from zero across all specifications. Among the most narrowly defined sample — establishments with documented pre-event worker flows to or from the large retailer — the upper bound of the confidence interval rules out spillovers greater than 0.2% of wages. No wage spillovers are detected for new hires at non-policy establishments either. These null results are confirmed over a 12-month post-event horizon for the subsample of events with no other major policy nearby.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Mechanism:&lt;/em&gt; The reason for negligible spillovers is that VMWs reduced labor market churn rather than expanding the large retailer&amp;rsquo;s total employment. Hiring away from large retailers by connected non-policy firms falls after VMW adoption — consistent with fewer separations to recruit from — but &lt;strong&gt;overall hiring by non-policy firms does not decline&lt;/strong&gt;, as these firms substitute toward other hiring sources. This substitutability across new hire sources in a thick market is the proximate explanation for the absence of wage pressure on competitor firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results pertain to large national retailers (&amp;gt;150,000 employees) operating in U.S. commuting zones during 2014–2023. The database covers only employers large enough to participate in credit bureau income verification; smaller employers (representing over 75% of U.S. hourly workers by the BLS comparison) are not observed, and the authors caution that spillover effects on smaller firms cannot be assessed. The authors also explicitly note that their null local spillover results do not rule out national-level strategic wage-setting dynamics — the rapid sequential adoption of VMWs across major retailers may reflect national-level competition rather than local market competition.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-exactly-are-voluntary-minimum-wages-and-how-do-they-differ-from-statutory-minimum-wages"&gt;Q1. What exactly are &amp;ldquo;voluntary minimum wages&amp;rdquo; and how do they differ from statutory minimum wages?&lt;/h3&gt;
&lt;p&gt;Voluntary minimum wages (VMWs) are company-wide, publicly announced wage floors set unilaterally by private employers, typically well above the applicable statutory (federal, state, or local) minimum. Unlike statutory minimums, which bind all employers in a jurisdiction, VMWs apply only to the announcing company across all of its geographic operations in the U.S. The paper studies VMWs adopted by retailers with over 150,000 workers, which include wage floors at levels such as $9, $10, $12, and $15 per hour. $15 VMWs were adopted at a time when few states or localities had yet reached that threshold, meaning the policy bit into the company wage distribution far more deeply than prevailing statutory floors.&lt;/p&gt;
&lt;h3 id="q2-how-were-vmw-events-identified-and-matched-to-anonymized-firms-in-the-payroll-database"&gt;Q2. How were VMW events identified and matched to anonymized firms in the payroll database?&lt;/h3&gt;
&lt;p&gt;VMW events were identified from a database maintained by the National Employment Law Project and verified through an independent review of business news articles. These publicly reported announcements were then matched to the anonymized companies in the credit bureau payroll database using employer size, industry, and the timing of observed shifts in the firms&amp;rsquo; wage distributions. An additional three events were identified directly from data: months where the share of workers earning below a given wage level dropped by at least 15 percentage points (for non-$15 events) or 10 percentage points (for $15 events) while the share at exactly that wage bin jumped by at least 10–20 percentage points. The final sample of 20 events was restricted to those with no other major wage policy in the six months before or after.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-gap-design-work-and-why-does-it-improve-on-the-fraction-affected-approach"&gt;Q3. How does the gap design work and why does it improve on the fraction-affected approach?&lt;/h3&gt;
&lt;p&gt;The gap for a given company, commuting zone, and time period is defined as the total wage increase needed to bring all sub-$30 workers up to the company minimum, divided by total wage costs — formally a labor-share-weighted average shortfall from the new minimum across wage bins. The gap leverages more cross-sectional variation in treatment intensity than the simple fraction of workers below the minimum: for a $15 VMW, an area where all workers earn $10 has a gap of 0.50 while an area where all earn $12 has a gap of 0.25. The gap is averaged over months −6 to −3 before the event. The period months −3 to −1 then serve as a placebo window: genuine VMW effects should appear only after the policy&amp;rsquo;s adoption month, not during the period immediately after the gap is measured. If instead the regression picks up mean reversion in noisy wage data, spurious effects would appear in months −3 to −1 rather than at event time 0.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-magnitude-of-the-wage-effect-on-the-large-retailers-themselves"&gt;Q4. What is the magnitude of the wage effect on the large retailers themselves?&lt;/h3&gt;
&lt;p&gt;For $15 VMW events, the stacked event study estimates that moving from a gap of 0 to a gap of 1 is associated with an approximately 88 log point increase in average hourly wages beginning exactly in the month of policy adoption. Given the average establishment-level gap of 0.11 for $15 VMWs, this implies the average establishment raised wages by approximately 9–10% (the authors compute 10.45% from the average gap, consistent with a slight dampening because zero-gap CZs experienced marginally higher wages too). Wage increases are confirmed persistent at 12 months in robustness checks. For all 20 VMW events pooled, effects are somewhat smaller commensurate with the lower average bite.&lt;/p&gt;
&lt;h3 id="q5-how-did-vmws-affect-total-employment-and-its-components-at-the-large-retailers"&gt;Q5. How did VMWs affect total employment and its components at the large retailers?&lt;/h3&gt;
&lt;p&gt;After $15 VMW events, log total employment of sub-$30 workers rose by 4.62%; after major VMW events (≥30% bite), 2.01%; after all 20 events, 1.25%. The increases are entirely driven by retention gains. Separation rates fell by 1.09 percentage points after $15 VMWs, 0.57 p.p. after major events, and 0.42 p.p. after all events — translating to reductions of 15.33%, 8.73%, and 6.57% relative to pre-period means. Separations to other database companies specifically fell by 0.07–0.19 percentage points (5.63–13.48% relative to the base mean). New hiring — measured as year-on-year log change in hires to control for seasonality — fell after VMW adoption, consistent with a reduced need to replace departing workers.&lt;/p&gt;
&lt;h3 id="q6-what-do-the-labor-supply-elasticities-implied-by-the-vmw-results-look-like"&gt;Q6. What do the labor supply elasticities implied by the VMW results look like?&lt;/h3&gt;
&lt;p&gt;The total employment elasticity with respect to wages ranges from approximately 0.35 to 0.45 across the three event groupings. Under standard dynamic monopsony models, the labor supply elasticity facing the firm equals twice the quit elasticity in steady state (Manning, 2003). The quit elasticity — derived by dividing the proportional reduction in separations by the log wage increase — ranges from 2.20 to 2.38, consistent with the earlier monopsony-based case study of Ford&amp;rsquo;s $5 workday (Raff and Summers, 1987) and implying substantial firm-level wage-setting power.&lt;/p&gt;
&lt;h3 id="q7-did-vmws-increase-total-take-home-pay-or-were-wage-gains-offset-by-reductions-in-hours-or-bonuses"&gt;Q7. Did VMWs increase total take-home pay or were wage gains offset by reductions in hours or bonuses?&lt;/h3&gt;
&lt;p&gt;The paper examines log average monthly base pay and log average gross compensation (which includes bonuses and overtime) as additional outcomes. Both measures rose after $15 VMW events, indicating that the wage floor increase translated into genuine improvements in total take-home pay without compensatory reductions in hours or other non-wage compensation. The monthly gross pay series is an average over calendar year-to-date months, so increases appear gradually rather than as a sharp jump at the adoption month; nevertheless the upward trend is evident and consistent.&lt;/p&gt;
&lt;h3 id="q8-what-are-the-estimated-spillover-effects-on-wages-at-non-policy-employers"&gt;Q8. What are the estimated spillover effects on wages at non-policy employers?&lt;/h3&gt;
&lt;p&gt;Across all three definitions of the labor market — all non-policy establishments in the same CZ, establishments in the 15 connected industries in the same CZ, and establishments with documented pre-event worker flows — the estimated cross-employer wage effects are precise zeros. The stacked event study in the post-period shows coefficients centered on zero with small confidence intervals. The difference-in-differences cross-employer wage elasticity (instrumenting the large retailer&amp;rsquo;s wage change with the gap) is also indistinguishable from zero. Among the most exposed connected establishments, the point estimate is slightly positive but economically negligible; the upper confidence interval bound rules out spillovers greater than 0.2%. Results are confirmed over a 12-month horizon for the clean-event subsample.&lt;/p&gt;
&lt;h3 id="q9-could-the-null-spillover-result-reflect-mean-reversion-bias-rather-than-a-true-zero"&gt;Q9. Could the null spillover result reflect mean reversion bias rather than a true zero?&lt;/h3&gt;
&lt;p&gt;The authors address this concern explicitly. For the policy-company gap design, they build in a placebo-in-time check by measuring the gap over months −6 to −3 and checking that no wage effects appear in months −3 to −1. For the non-policy spillover analysis, they also examine an alternative treatment variable — the gap between non-policy establishments&amp;rsquo; wages and the large retailer&amp;rsquo;s new VMW — and find evidence of mean reversion: wages begin rising in the pre-period in the direction of this gap measure. They correct for this by detrending post-period estimates using a linear extrapolation of the pre-period trend. After detrending, spillover effects remain indistinguishable from zero.&lt;/p&gt;
&lt;h3 id="q10-why-are-spillover-effects-so-limited-if-the-large-retailer-is-drawing-fewer-workers-away-from-competitors"&gt;Q10. Why are spillover effects so limited if the large retailer is drawing fewer workers away from competitors?&lt;/h3&gt;
&lt;p&gt;The paper&amp;rsquo;s mechanism analysis shows that while the probability of a non-policy firm hiring a worker from the large retailer falls after a VMW event (consistent with fewer separations to recruit from the large retailer), the &lt;strong&gt;overall rate of hiring by non-policy firms does not decline&lt;/strong&gt;. Non-policy firms substitute toward other hiring sources — primarily other non-policy companies — rather than hiring fewer workers overall. This substitutability across recruiting sources in a thick labor market mutes the competitive pressure on competitor wages: since non-policy firms can replace the reduced flow from VMW companies with workers from other sources without changing total employment, they face no pressure to raise wages.&lt;/p&gt;
&lt;h3 id="q11-how-do-the-results-differ-when-focusing-on-czs-where-the-large-retailer-accounts-for-a-larger-employment-share"&gt;Q11. How do the results differ when focusing on CZs where the large retailer accounts for a larger employment share?&lt;/h3&gt;
&lt;p&gt;The authors test whether larger local market presence amplifies spillovers by splitting the sample at the median employment share of the large retailer in the CZ. They find no evidence of positive wage spillovers even in CZs where the large retailer&amp;rsquo;s employment share is above the median, confirming that neither local market size nor market concentration is a mechanism for spillover transmission in this setting.&lt;/p&gt;
&lt;h3 id="q12-how-do-these-vmw-spillover-results-compare-to-prior-evidence-on-employer-wage-setting-spillovers"&gt;Q12. How do these VMW spillover results compare to prior evidence on employer wage-setting spillovers?&lt;/h3&gt;
&lt;p&gt;The main prior U.S. evidence (Staiger et al., 2010) studied a federally mandated wage increase at Veterans Affairs hospitals and found a cross-establishment wage elasticity of approximately 0.19 for registered nurses at neighboring hospitals. The authors note two key differences: first, the VA policy increased both wages and employment at treated facilities, whereas VMWs primarily reduced separations without increasing hiring, so the supply of workers to competitor firms was not squeezed. Second, the market for low-wage retail and service workers is likely thicker (more potential hires available) than the market for registered nurses, allowing competitors to substitute hiring sources without bidding up wages.&lt;/p&gt;
&lt;h3 id="q13-what-do-the-null-local-spillover-results-imply-about-national-level-wage-dynamics"&gt;Q13. What do the null local spillover results imply about national-level wage dynamics?&lt;/h3&gt;
&lt;p&gt;The authors explicitly caution against reading the null local spillover result as implying VMWs have no broader effect on the low-wage labor market. The rapid and successive adoption of VMWs across major retailers during 2021–2022 could reflect national-level strategic wage-setting competition — firms mimicking each other&amp;rsquo;s announcements in an arms-race dynamic during tight labor markets — rather than local competitive transmission. The paper does not test for national-level strategic interactions and calls for further research on this dimension.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Voluntary Minimum Wage (VMW):&lt;/strong&gt; A company-wide, publicly announced wage floor set unilaterally by a private employer, applying across all of the firm&amp;rsquo;s geographic operations in the U.S., typically well above applicable statutory minimums. Distinct from legally mandated minimum wages in that they bind only the announcing firm and arise from the firm&amp;rsquo;s own strategic or reputational motivations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Gap Measure:&lt;/strong&gt; Borrowed from the national minimum wage literature (Card, 1992; Draca et al., 2011), this is the percent increase in a firm&amp;rsquo;s average hourly wage that would be required to bring all workers in a given commuting zone up to the company&amp;rsquo;s new voluntary minimum. Formally the labor-share-weighted average shortfall from the VMW across sub-$30 wage bins. A gap of 0 means no workers fall below the new minimum; a gap of 1 means all workers would need to be raised to the minimum, doubling the average wage. Used as a continuous treatment variable capturing the local bite of the policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stacked Event Study:&lt;/strong&gt; An empirical design in which a separate 12-month panel (6 months pre- and post-event) is constructed for each of the 20 VMW events, these datasets are stacked, and the effect of the continuous gap treatment is estimated jointly across all events, with event-specific indicators interacting all regressors to allow each event to have its own intercept.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Placebo-in-Time Check:&lt;/strong&gt; A robustness test built into the gap design by computing the gap over months −6 to −3 and verifying that wage effects do not appear in months −3 to −1 (the period between gap measurement and VMW adoption). Genuine policy effects should materialize at the adoption month; spurious effects driven by mean reversion in noisy wage data would appear in months −3 to −1 because the gap would mechanically predict wage reversion toward the mean in the period immediately following its measurement.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Connected Establishments / Poaching and Feeder Establishments:&lt;/strong&gt; Specific firm-by-CZ cells identified as sharing a labor market with the large retailer via actual worker flows. &amp;ldquo;Poaching establishments&amp;rdquo; hired at least one worker from the large retailer in the 12 months before the VMW event. &amp;ldquo;Feeder establishments&amp;rdquo; had at least one worker subsequently hired by the large retailer in the same pre-period. These are the most narrowly defined and most economically relevant labor market competitors for testing spillover effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quit Elasticity / Labor Supply Elasticity (Firm-Level):&lt;/strong&gt; The quit elasticity is the percent change in the separation rate divided by the percent change in wages induced by the VMW. Under standard dynamic monopsony models (Manning, 2003), in steady state the recruit elasticity equals the quit elasticity, and the firm-level labor supply elasticity equals twice the quit elasticity. The authors estimate quit elasticities of 2.20–2.38, implying labor supply elasticities of 4.40–4.76 to the firm — consistent with meaningful but not extreme monopsony power.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cross-Employer Wage Elasticity:&lt;/strong&gt; The percent change in wages at a non-policy employer&amp;rsquo;s establishment associated with a 1% change in wages at the large retailer in the same commuting zone, instrumented using the large retailer&amp;rsquo;s gap interacted with the post-event indicator. Estimated to be a precise zero across all market definitions and event groupings in this paper.&lt;/p&gt;</description></item><item><title>Wage growth and labor market tightness</title><link>https://macropaperwarehouse.com/papers/wage-growth-and-labor-market-tightness/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/wage-growth-and-labor-market-tightness/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Which measures of labor market tightness best predict nominal wage inflation, and do standard measures such as the unemployment rate and the vacancy-to-unemployment ratio capture the relevant slack? The paper also asks whether transitory productivity shocks affect wage growth, and whether the wage Phillips curve is nonlinear.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Motivation and Model.&lt;/strong&gt; Standard measures of labor market tightness have had mixed performance since the COVID-19 pandemic: unemployment quickly returned to pre-pandemic levels while wage growth remained persistently elevated, motivating a search for superior indicators. The paper builds on the theoretical framework of Bloesch, Lee, and Weber (2024), a tractable New Keynesian DSGE model in which firms set wages and workers search on the job. In this model, labor market tightness is well-summarized by either (a) the quits rate or (b) vacancies per effective searcher (V/ES), where effective searchers include both employed and unemployed job seekers. Unemployment enters the model&amp;rsquo;s wage Phillips curve but with a coefficient close to zero, because changes in the unemployment share do not substantially shift the composition of searchers in a way that alters firms&amp;rsquo; wage incentives. Transitory TFP shocks have theoretically ambiguous effects on nominal wage growth because the outcome depends on the central bank&amp;rsquo;s policy response.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methods.&lt;/strong&gt; The main analysis uses quarterly U.S. data from 1990:Q2 to 2024:Q2. Wage growth is measured as the 3-month log change in the Employment Cost Index (ECI) for wages and salaries of private industry workers. Quits and vacancies are drawn from JOLTS (2001:Q1 forward) and extended back to 1990:Q2 using the Davis-Faberman-Haltiwanger series and Barnichon&amp;rsquo;s composite Help Wanted Index, respectively. The authors run a &amp;ldquo;horse race&amp;rdquo; of OLS univariate regressions of wage growth on thirteen separately normalized tightness indicators. They then run bivariate regressions pairing the quits rate with each other indicator to test whether any alternative provides independent predictive power. Robustness is assessed using 12-month ECI changes. An industry-level panel with time and industry fixed effects covering 11 broad sectors from JOLTS for 2001:Q1–2024:Q2 tests whether the same ranking holds within industries. Forecasting exercises use 1-, 2-, and 4-quarter-ahead in-sample regressions plus rolling out-of-sample one-quarter-ahead predictions beginning in 2004:Q1. Nonlinearity is evaluated via threshold regressions at the 25th percentile (unemployment) or 75th percentile (other measures) and via quadratic specifications.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Horse race (aggregate, contemporaneous):&lt;/em&gt; The quits rate explains 55 percent of variation in 3-month ECI wage growth (R² = 0.55), and V/ES explains 52 percent (R² = 0.52), the two highest among all indicators tested. A one standard deviation increase in either quits (0.39 percentage points) or V/ES (0.08) is associated with 0.20 percentage points higher 3-month wage growth. By contrast, the vacancy-to-unemployment ratio (V/U) explains only 41 percent of wage growth and the unemployment rate only 34 percent. Together, quits and V/ES explain nearly two-thirds of wage growth since 1994 and 78 percent since 2020:Q2.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Bivariate regressions:&lt;/em&gt; Conditional on the quits rate, the coefficient on every other tightness indicator drops to near zero, with the sole exception of V/ES, which retains a coefficient of 0.08 (significant) while the quits coefficient remains at 0.14. This result is consistent with the model&amp;rsquo;s prediction that quits and V/ES are close to sufficient statistics for labor market tightness.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;12-month ECI results:&lt;/em&gt; The ranking is preserved at longer horizons; quits and V/ES each explain approximately two-thirds of 12-month wage growth.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Productivity:&lt;/em&gt; Regressions of 3-month ECI wage growth on 3-month changes in labor productivity, TFP, and utilization-adjusted TFP all yield small, negative, and statistically indistinguishable from zero coefficients, consistent with the model&amp;rsquo;s prediction of an ambiguous effect of transitory productivity shocks on nominal wages.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Industry-level panel:&lt;/em&gt; Quits and V/ES remain the strongest predictors of within-industry wage growth after absorbing industry and time fixed effects. A one standard deviation increase in the industry quits rate (0.93 percentage points) is associated with 0.23 percentage points higher quarterly wage growth; a one standard deviation increase in industry V/ES (0.11) is associated with 0.13 percentage points higher wage growth.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;HPW Composite Index:&lt;/em&gt; The Heise-Pearce-Weber (HPW) Index, constructed as an OLS-weighted average of quits and V/ES, achieves a correlation of 0.9 with standardized 3-month ECI wage growth. In-sample forecasting R² for the HPW Index at 1, 2, and 4 quarters ahead is 0.62, 0.74, and 0.77, respectively — the highest of all indicators at each horizon.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Out-of-sample forecasting:&lt;/em&gt; Only the quits rate and the HPW Index consistently outperform a simple AR(1) benchmark throughout the out-of-sample period from 2004:Q1 to 2024:Q1. The forecasting performance of vacancy-based measures (V/U and V/ES) deteriorated steadily after 2015, consistent with evidence of structural shifts in vacancy measurement documented by Mongey and Horwich (2023).&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Nonlinearity:&lt;/em&gt; Threshold regressions and quadratic specifications provide little evidence of meaningful nonlinearity in the wage-tightness relationship for quits, V/ES, or the HPW Index over 1990–2024. The fit improvement from adding threshold terms is marginal, and slope coefficients are broadly stable across the full range of tightness, including the extreme tightness observed after COVID.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-theoretical-mechanism-links-quits-and-ves-to-nominal-wage-growth-in-contrast-to-unemployment"&gt;Q1. What theoretical mechanism links quits and V/ES to nominal wage growth, in contrast to unemployment?&lt;/h3&gt;
&lt;p&gt;In the Bloesch-Lee-Weber (2024) model incorporated in the paper, firms use both wages and vacancies to attract and retain workers from unemployment and from other firms, conditional on the overall mass of effective searchers. Labor market tightness is defined as V/S (vacancies over total searchers), not V/U, because employed workers also search on the job. When tightness is high, workers are harder to recruit and more likely to be poached, pressuring firms to raise wages. Quits are the endogenous component of separations and rise mechanically with tightness, making them a near-equivalent sufficient statistic for V/ES. Unemployment enters the wage Phillips curve in principle because the composition of searchers (employed vs. unemployed) matters for firms&amp;rsquo; wage-setting incentives, but the coefficient on unemployment is calibrated and estimated to be approximately zero.&lt;/p&gt;
&lt;h3 id="q2-how-do-the-authors-extend-the-quits-and-vacancies-data-back-to-1990-to-cover-the-full-sample-period"&gt;Q2. How do the authors extend the quits and vacancies data back to 1990 to cover the full sample period?&lt;/h3&gt;
&lt;p&gt;JOLTS data on quits and job openings begin in 2001:Q1. The authors extend the quits rate backward to 1990:Q2 using the Davis, Faberman, and Haltiwanger (2012) series, taking a simple average of the two in overlapping quarters (2001:Q1–2010:Q2). Vacancies are extended back to 1990:Q2 using the composite Help Wanted Index constructed by Barnichon (2010), with a similar overlapping average for 2000:Q4–2021:Q3. The effective-searcher measure (V/ES) is available only from 1994:Q1 because the CPS marginally attached worker series begins then.&lt;/p&gt;
&lt;h3 id="q3-how-is-the-ves-measure-constructed-and-why-does-it-differ-from-the-standard-vu-ratio"&gt;Q3. How is the V/ES measure constructed, and why does it differ from the standard V/U ratio?&lt;/h3&gt;
&lt;p&gt;Effective searchers are constructed as ES = U_s + 0.48·U_l + 0.40·Z_want + 0.09·Z_do-not-want + 0.07·N, where U_s is short-term unemployed (less than 27 weeks), U_l is long-term unemployed (27+ weeks), Z_want is marginally attached workers not in the labor force, Z_do-not-want is non-participants not marginally attached, and N is employment. The weights reflect relative search intensities estimated by Abraham, Haltiwanger, and Rendell (2020) and translated to publicly available CPS data by Sahin (2020). Because employed workers constitute a far larger share of the population than the unemployed, including them — even at the low weight of 0.07 — substantially increases the total effective searcher count relative to V/U. This matters because the model predicts that firms&amp;rsquo; wage decisions depend on the full pool of potential recruits and retention risk, not just the unemployed.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-results-of-the-bivariate-horse-race-pairing-quits-with-each-other-tightness-measure"&gt;Q4. What are the results of the bivariate &amp;ldquo;horse race&amp;rdquo; pairing quits with each other tightness measure?&lt;/h3&gt;
&lt;p&gt;In bivariate OLS regressions of 3-month ECI wage growth on the quits rate plus one other indicator, the coefficient on quits remains approximately 0.14–0.22 percentage points per standard deviation regardless of which other variable is included, while all competing indicators&amp;rsquo; coefficients fall to near zero. The sole partial exception is V/ES, which retains a coefficient of 0.08 (significant at 5%) alongside a quits coefficient of 0.14; the combined fit is 0.60. For all other measures — including V/U (coefficient drops to 0.04), unemployment (0.00), jobs-workers gap (0.02), Conference Board availability (−0.01), and NFIB difficulty hiring (0.01) — the incremental contribution beyond quits is negligible. This result is consistent with the model&amp;rsquo;s prediction that quits and V/ES are jointly near-sufficient statistics for wage growth.&lt;/p&gt;
&lt;h3 id="q5-do-the-industry-level-panel-regressions-replicate-the-aggregate-ranking-and-why-is-this-an-important-test"&gt;Q5. Do the industry-level panel regressions replicate the aggregate ranking, and why is this an important test?&lt;/h3&gt;
&lt;p&gt;Yes. In panel regressions with industry and time fixed effects covering 11 JOLTS sectors from 2001:Q1 to 2024:Q2, the quits rate has the highest within-industry R² (0.019) and V/ES the second highest (0.010); all other indicators rank below. This within-industry test is important because it removes the possibility that the aggregate correlations are driven by unobserved macro variables that happen to co-move with quits and V/ES. The bivariate industry panel confirms that, conditional on quits, only V/ES adds substantially to the within-industry fit; all other indicators add negligible explanatory power.&lt;/p&gt;
&lt;h3 id="q6-why-might-industry-level-tfp-shocks-have-a-modest-positive-effect-on-wages-even-though-aggregate-tfp-shocks-do-not"&gt;Q6. Why might industry-level TFP shocks have a modest positive effect on wages even though aggregate TFP shocks do not?&lt;/h3&gt;
&lt;p&gt;At the industry level, the central bank does not respond to industry-specific TFP shocks. When a particular industry&amp;rsquo;s productivity rises and firms lower prices, consumer demand for that industry&amp;rsquo;s output rises. If demand rises by enough, firms must hire more workers to meet demand despite higher productivity per worker, leading them to post more vacancies and raise wages. At the aggregate level, the central bank does respond to the disinflation associated with positive TFP shocks (following a Taylor rule), which can raise overall consumption enough to require more aggregate hiring and generate a positive TFP-wage correlation — but the direction depends on monetary policy responsiveness, making the aggregate relationship ambiguous and empirically insignificant. The industry regressions find that a 1 percent increase in annual labor productivity is associated with 0.15 percent higher industry annual wage growth, significant at the 10 percent level.&lt;/p&gt;
&lt;h3 id="q7-how-is-the-hpw-index-constructed-and-what-is-its-in-sample-fit-with-wage-growth"&gt;Q7. How is the HPW Index constructed, and what is its in-sample fit with wage growth?&lt;/h3&gt;
&lt;p&gt;The HPW Index is constructed as a weighted average of the standardized quits rate and V/ES, where the weights are the OLS coefficients from a bivariate regression of 3-month ECI wage growth on both variables simultaneously (estimated over 1994:Q1–2024:Q2). The index is then normalized to have mean zero and standard deviation of one. The HPW Index achieves a correlation of 0.9 with standardized 3-month ECI wage growth. At the peak of post-pandemic inflation, the index predicted wage growth of approximately 2.6 standard deviations above the mean, corresponding to a quarterly wage growth rate of about 1.3 percent, close to realized values.&lt;/p&gt;
&lt;h3 id="q8-how-do-the-out-of-sample-forecasting-results-compare-across-indicators-and-what-accounts-for-the-deterioration-of-vacancy-based-measures"&gt;Q8. How do the out-of-sample forecasting results compare across indicators, and what accounts for the deterioration of vacancy-based measures?&lt;/h3&gt;
&lt;p&gt;Rolling out-of-sample one-quarter-ahead predictions from 2004:Q1 to 2024:Q1 show that only the quits rate and the HPW Index consistently outperform an AR(1) benchmark across the full period. V/U performed relatively well until 2015 but then deteriorated steadily, and V/ES similarly weakened after 2015, consistent with the finding by Mongey and Horwich (2023) that the relationship between job vacancies and other labor market indicators has persistently shifted since approximately 2010. The forecasting performance of the unemployment rate and several other standard measures deteriorated sharply in the post-COVID period when wage inflation surged, but quits and HPW maintained their performance throughout.&lt;/p&gt;
&lt;h3 id="q9-is-there-evidence-of-nonlinearity-in-the-wage-phillips-curve-particularly-in-the-extreme-tightness-of-the-post-covid-period"&gt;Q9. Is there evidence of nonlinearity in the wage Phillips curve, particularly in the extreme tightness of the post-COVID period?&lt;/h3&gt;
&lt;p&gt;The paper finds little evidence of meaningful nonlinearity. Threshold regressions at the 25th percentile for unemployment and 75th percentile for other measures yield marginal fit improvements: the R² for unemployment rises from 0.34 to 0.36 (a level shift rather than a slope change), and fit improvements for HPW, quits, and V/ES are essentially zero. Quadratic specifications confirm this: the coefficient on the squared term is insignificant in all specifications. The authors conclude that the relationship between labor market tightness (as measured by quits or the HPW Index) and nominal wage growth is approximately linear, including during the extreme tightness of the COVID aftermath.&lt;/p&gt;
&lt;h3 id="q10-why-does-the-paper-argue-that-the-slope-of-the-wage-phillips-curve-can-be-estimated-more-cleanly-than-the-price-phillips-curve"&gt;Q10. Why does the paper argue that the slope of the wage Phillips curve can be estimated more cleanly than the price Phillips curve?&lt;/h3&gt;
&lt;p&gt;In the model&amp;rsquo;s price Phillips curve, monetary policy endogenously responds to TFP shocks, creating an omitted variable problem that biases the estimated slope toward zero. In the wage Phillips curve, TFP and monetary policy shocks affect wages only through their general equilibrium effects on labor market tightness — they do not appear directly on the right-hand side. Consequently, the tightness variable is a sufficient statistic for wage inflation in the model, and the slope coefficient can be estimated consistently from reduced-form regressions without the identification problems that plague the price Phillips curve.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Vacancies per Effective Searcher (V/ES).&lt;/strong&gt; The paper&amp;rsquo;s preferred tightness measure, defined as job openings divided by effective searchers, where effective searchers are ES = U_s + 0.48·U_l + 0.40·Z_want + 0.09·Z_do-not-want + 0.07·N. This differs from the standard V/U ratio by including employed workers (at a weight of 0.07 reflecting their search intensity) and distinguishing between short-term and long-term unemployed and non-participants. It is the theoretically correct tightness measure in the on-the-job-search model, where the full pool of potential recruits — not only the unemployed — determines wage pressure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;On-the-Job Search.&lt;/strong&gt; The mechanism by which employed workers actively search for and receive job offers from other firms. In the Bloesch-Lee-Weber (2024) model underpinning the paper, on-the-job search implies that firms must set wages not only to attract unemployed workers but also to retain employed workers who may be poached. This changes the relevant measure of tightness from V/U to V/S and makes quits — which are the endogenous separations triggered when workers accept outside offers — a near-sufficient statistic for wage growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quits Rate.&lt;/strong&gt; The ratio of voluntary separations (quits) to total employment in private sector, sourced from JOLTS (extended to 1990 using Davis et al. 2012). In the model, quits are the endogenous component of the separation rate and are tightly linked to vacancies per effective searcher because workers quit more frequently when labor market tightness is high and outside offers are plentiful. The paper establishes quits as the single best individual predictor of 3-month ECI wage growth (R² = 0.55) and the best out-of-sample forecaster along with HPW.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;HPW Tightness Index (Heise-Pearce-Weber Index).&lt;/strong&gt; A composite indicator of labor market tightness constructed as the OLS-coefficient-weighted average of the quits rate and V/ES, estimated by regressing 3-month ECI wage growth on both variables simultaneously. The index is normalized to mean zero and standard deviation of one. The HPW Index achieves the highest in-sample forecasting fit at 1, 2, and 4 quarters ahead (R² of 0.62, 0.74, and 0.77, respectively) and consistently outperforms the AR(1) benchmark out of sample, unlike most other indicators.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage Phillips Curve.&lt;/strong&gt; The reduced-form relationship between nominal wage inflation and labor market tightness, derived in the paper from first-order conditions of the firm&amp;rsquo;s optimization problem. In the model&amp;rsquo;s representation (equation 3), wage inflation is a function of deviations of V/ES and unemployment from steady state plus expected future wage inflation. The paper argues this relationship can be estimated more cleanly than the price Phillips curve because TFP and monetary policy shocks affect wages only through the tightness term, avoiding the omitted-variable bias that flattens price Phillips curve estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sufficient Statistic for Wage Inflation.&lt;/strong&gt; As used in the paper&amp;rsquo;s model, a variable (or pair of variables) such that once it is included in the wage Phillips curve, no other labor market indicator provides additional explanatory power for wage growth. The model predicts, and the empirical horse race confirms, that quits or V/ES are individually near-sufficient statistics: conditional on the quits rate, the coefficients on all other tightness measures (including unemployment, V/U, jobs-workers gap, and survey measures) fall to approximately zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Transitory TFP Shocks and Wage Growth.&lt;/strong&gt; The paper defines these as short-lived, positive shocks to total factor or labor productivity, as measured by 3-month changes in Fernald et al. (2012) series. The theoretical prediction is that their effect on nominal wage growth is ambiguous: if the central bank&amp;rsquo;s policy response lowers real rates enough, aggregate demand rises sufficiently to require more hiring, generating positive wage effects; if the policy response is limited, lower marginal costs reduce vacancies and wages. In the data, the sign is negative across all three productivity measures but statistically indistinguishable from zero in all specifications.&lt;/p&gt;</description></item><item><title>What Works and for Whom? Effectiveness and Efficiency of School Capital Investments Across the U.S.</title><link>https://macropaperwarehouse.com/papers/what-works-and-for-whom-effectiveness-and-efficiency-of-school-capital-investments-across-the-u.s./</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/what-works-and-for-whom-effectiveness-and-efficiency-of-school-capital-investments-across-the-u.s./</guid><description>&lt;h2 id="what-works-and-for-whom-effectiveness-and-efficiency-of-school-capital-investments-across-the-us"&gt;What Works and for Whom? Effectiveness and Efficiency of School Capital Investments Across the U.S.&lt;/h2&gt;
&lt;h3 id="research-question"&gt;Research Question&lt;/h3&gt;
&lt;p&gt;This paper investigates which types of school facility investments benefit students (as measured by test scores) and are valued by homeowners (as measured by house prices), and for which student populations these investments are most effective. Prior state-level studies had reached conflicting conclusions about the returns to school capital spending, and no nationwide evidence had distinguished impacts across spending categories or student backgrounds.&lt;/p&gt;
&lt;h3 id="data-and-methodology"&gt;Data and Methodology&lt;/h3&gt;
&lt;p&gt;The authors assemble a novel panel dataset covering approximately 14,000 school bond referenda in 29 U.S. states and 10,146 districts enrolling 71% of all U.S. students, for the period 1990–2017. The dataset combines: (1) ballot-level bond election records including vote shares, proposed amounts, and ballot text; (2) district-level test scores from the Stanford Education Data Archive (SEDA) extended backward to 2003 for all states and as early as 1995 for some, normalized to a national scale via NAEP; (3) a Census-tract-level house price index (Contat and Larson, 2022) aggregated to school districts; and (4) NCES district finance and demographic data.&lt;/p&gt;
&lt;p&gt;Bond ballot texts are classified into eight spending categories using text-analysis: classroom construction/renovation; HVAC; other infrastructure (plumbing, roofs, furnaces); safety and health (pollutant removal, building safety); STEM equipment and labs; athletic facilities; land purchases; and transportation vehicles.&lt;/p&gt;
&lt;p&gt;The identification strategy exploits quasi-random variation from close bond elections, building on the dynamic regression discontinuity (DRD) framework of Cellini et al. (2010). A key methodological contribution is a stacked DRD design that addresses heterogeneous treatment effects correlated with timing: each treatment cohort (districts that narrowly authorize a bond in year c) is matched against &amp;ldquo;clean controls&amp;rdquo; — districts that also proposed a bond in the same cohort but narrowly failed to authorize it and did not authorize any bond in the following ten years. Cohorts are stacked, and a dynamic RD model is estimated controlling for cohort fixed effects and a district&amp;rsquo;s bond proposal history.&lt;/p&gt;
&lt;h3 id="main-findings-with-quantitative-magnitudes"&gt;Main Findings with Quantitative Magnitudes&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Average effects.&lt;/strong&gt; Bond authorization raises capital spending by approximately $1,650 per pupil cumulatively over five years. Test scores increase gradually, reaching 0.079 standard deviations (sd) higher five to eight years after authorization, and 0.073 sd higher nine to twelve years after. 2SLS estimates, amortizing spending over a 30-year project life at a 9% depreciation rate, imply that a $1,000 increase in the flow value of capital spending raises test scores by 0.048 sd. House prices rise by approximately 9% eight to nine years after authorization. When house price effects are estimated against only locally-financed capital spending (not state aid), the 2SLS estimate is 0.8% per $1,000 — roughly consistent with efficiency — suggesting that the larger reduced-form house price response is driven primarily by state aid that supplements local funds rather than by an inefficiently low ex ante spending level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity by spending category.&lt;/strong&gt; Category-specific estimates reveal that only certain project types raise test scores: HVAC (+0.20 sd, largest effect), safety and health (+0.15 sd), other infrastructure/plumbing/roofs (+0.15 sd), STEM equipment (+0.15 sd implied), and classroom space (+0.10 sd), all measured three to six years post-election. By contrast, bonds for athletic facilities, land purchases, and transportation produce no detectable effects on test scores. The pattern for house prices is the inverse: athletic facilities generate a 17% house price increase; classroom space generates 14%; STEM generates 11% — while HVAC and safety/health bonds produce no significant effect on house prices. The correlation between category-level test score and house price estimates is −0.07, indicating these are largely orthogonal outcomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity by student socioeconomic status.&lt;/strong&gt; Effects are concentrated in districts serving socioeconomically disadvantaged students (top tercile of the share of students eligible for free or reduced-price meals, denoted low-SES). In low-SES districts, bond authorization raises test scores by 0.13 sd after seven years and house prices by 15%; in high-SES districts, neither outcome shows a significant effect. 2SLS estimates confirm that a $1,000 increase in cumulative spending raises test scores by 0.08 sd in low-SES districts but produces no detectable change in high-SES districts. The SES gradient persists after conditioning on spending amounts, spending categories, and baseline capital stock, indicating that students in disadvantaged districts have higher marginal returns to capital improvements independent of these channels. High-minority districts (top tercile of Black and Hispanic share) similarly see a 0.12 sd test score gain and 15% house price gain after seven years, versus 0.04 sd and 3% in low-minority districts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Role of baseline capital stock.&lt;/strong&gt; Among districts with below-median capital stock, test score effects are 0.20 sd in low-SES districts seven years post-election. Even among above-median-stock districts, low-SES districts see house price effects exceeding 10% while high-SES districts see no effect. Differences by SES persist after conditioning on capital stock.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy simulation.&lt;/strong&gt; Closing the spending gap between high- and low-SES districts (approximately $1,000 over 10 years) without changing the composition of spending would raise low-SES test scores by roughly 0.08 sd, closing about 8% of the roughly 1 sd achievement gap. Targeting that same additional spending toward HVAC and safety/health (the highest-impact categories) would generate test score increases approximately three times as large, potentially closing up to 25% of the observed achievement gap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reconciling prior literature.&lt;/strong&gt; Replicating state-level estimates, the authors show that Ohio&amp;rsquo;s positive effects are explained by a high share of bonds in low-SES districts funding infrastructure, while Texas&amp;rsquo;s near-zero effects reflect a high share of bonds in higher-SES districts funding classrooms and athletic facilities.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-first-stage-effect-of-bond-authorization-on-capital-spending-and-does-it-contaminate-other-spending-categories"&gt;Q1. What is the first-stage effect of bond authorization on capital spending, and does it contaminate other spending categories?&lt;/h3&gt;
&lt;p&gt;A1: Bond authorization raises per-pupil capital spending by approximately $700 per year at two years post-election and $590 at three years, with cumulative spending $1,650 higher over five years in treated districts relative to districts that narrowly failed to authorize a bond. Bond revenues are legally restricted to capital uses, and the paper confirms that non-capital (current) spending and instructional spending are not affected following authorization. This establishes a clean first stage: bond authorization raises only capital outlays.&lt;/p&gt;
&lt;h3 id="q2-why-does-the-standard-drd-estimator-of-cellini-et-al-2010-require-refinement-and-what-problem-does-the-stacked-drd-design-solve"&gt;Q2. Why does the standard DRD estimator of Cellini et al. (2010) require refinement, and what problem does the stacked DRD design solve?&lt;/h3&gt;
&lt;p&gt;A2: The original CFR estimator assumes treatment effects are uncorrelated with the timing of treatment — an assumption potentially violated when, for example, bonds financing HVAC (high-impact) versus athletic facilities (amenity-focused) have different propensities to be proposed at different points in time. The stacked DRD design avoids &amp;ldquo;forbidden comparisons&amp;rdquo; by comparing each treatment cohort only against clean controls that propose but fail to authorize a bond in the same year and do not authorize any bond in the subsequent ten years. This ensures consistency even when treatment effects are heterogeneous across cohorts and correlated with timing.&lt;/p&gt;
&lt;h3 id="q3-how-do-the-authors-validate-the-quasi-random-assignment-assumption-of-the-regression-discontinuity-design"&gt;Q3. How do the authors validate the quasi-random assignment assumption of the regression discontinuity design?&lt;/h3&gt;
&lt;p&gt;A3: Three tests are performed. First, a McCrary (2008) density test on the vote margin distribution shows no discontinuity at the cutoff in the pooled or stacked data (p-values of 0.59 and 0.24, respectively), though discontinuities are found in Arkansas, Missouri, and Oklahoma — those three states are excluded. Second, pre-election district covariates (income, education, SES shares, enrollment, revenues, expenditures) are smooth around the cutoff in both datasets. Third, pre-election trends in test scores and house prices are flat and parallel between marginally approved and marginally rejected districts.&lt;/p&gt;
&lt;h3 id="q4-how-are-the-eight-spending-categories-constructed-and-how-many-bonds-are-successfully-classified"&gt;Q4. How are the eight spending categories constructed, and how many bonds are successfully classified?&lt;/h3&gt;
&lt;p&gt;A4: Categories are drawn from the SchoolBondFinder.com classification produced by The Amos Group, then refined by splitting capital improvements into HVAC versus other infrastructure, splitting construction/renovation into classroom versus athletic facility projects, and adding land purchases as a separate category. Keyword-based text analysis of ballot language successfully assigns 75% of the approximately 14,000 bonds to at least one of the eight categories. More than two-thirds of classified bonds receive multiple category designations, with a mean of 2.9 categories per proposed bond and 3.2 per authorized bond.&lt;/p&gt;
&lt;h3 id="q5-why-do-hvac-bonds-raise-test-scores-but-not-house-prices-while-athletic-facility-bonds-raise-house-prices-but-not-test-scores"&gt;Q5. Why do HVAC bonds raise test scores but not house prices, while athletic facility bonds raise house prices but not test scores?&lt;/h3&gt;
&lt;p&gt;A5: The authors interpret this divergence as reflecting what different types of improvements offer to different stakeholders. HVAC improvements reduce excessive heat and air pollution exposure in classrooms, directly improving students&amp;rsquo; learning experiences — consistent with Park et al. (2020) on heat and Gilraine and Zheng (2022) on air pollution. These improvements are not visibly salient to homeowners without school-age children and carry no amenity value for the broader community. Athletic facilities, by contrast, are highly visible and provide a community amenity valued in the housing market regardless of their impact on academic instruction. The near-zero correlation (−0.07) between category-level test score and house price estimates confirms that the two outcomes respond to largely distinct features of capital investments.&lt;/p&gt;
&lt;h3 id="q6-what-are-the-three-candidate-explanations-for-the-larger-effects-of-bond-authorization-in-low-ses-districts-and-which-explanations-survive-empirical-scrutiny"&gt;Q6. What are the three candidate explanations for the larger effects of bond authorization in low-SES districts, and which explanations survive empirical scrutiny?&lt;/h3&gt;
&lt;p&gt;A6: The three candidates are: (1) larger spending increases after authorization in low-SES districts; (2) a different composition of spending categories (more toward high-impact HVAC and safety); and (3) higher marginal returns per dollar for disadvantaged students, holding spending size and composition fixed. The data confirm all three operate, but the third is the residual: 2SLS estimates show a $1,000 increase raises test scores by 0.08 sd in low-SES districts versus a statistically zero effect in high-SES districts, and within-category estimates show HVAC bonds raise scores by 0.27 sd in low-SES districts but have no detectable effect in high-SES districts. Differences by SES also persist after conditioning on the estimated baseline capital stock, though low capital stock accounts for part of the gap.&lt;/p&gt;
&lt;h3 id="q7-how-does-the-role-of-state-aid-alter-the-interpretation-of-the-house-price-effect-for-spending-efficiency"&gt;Q7. How does the role of state aid alter the interpretation of the house price effect for spending efficiency?&lt;/h3&gt;
&lt;p&gt;A7: A 9% house price increase after bond authorization, if taken at face value under Brueckner&amp;rsquo;s (1979) efficiency test, would suggest the ex ante level of school capital spending was inefficiently low. However, state grants that partly match local bond revenues raise actual spending without raising local property taxes proportionally. When the 2SLS house price effect is estimated against only locally financed capital spending (using proposed bond size as the relevant measure), the implied house price increase is just 0.8% per $1,000 — consistent with rough efficiency on average across the full sample. The authors conclude that the large reduced-form house price response is driven primarily by the capitalization of state aid, not by an undersupply of capital investments at the aggregate level.&lt;/p&gt;
&lt;h3 id="q8-does-household-sorting-account-for-the-observed-test-score-and-house-price-gains-following-bond-authorization"&gt;Q8. Does household sorting account for the observed test score and house price gains following bond authorization?&lt;/h3&gt;
&lt;p&gt;A8: Bond authorization produces small but detectable compositional changes: the share of high-SES students is approximately 3 percentage points higher seven years after an election (a roughly 4% increase relative to an average share of 0.73), while enrollment and the share of white students are largely unaffected. However, controlling for district-by-year shares of each sociodemographic group only slightly attenuates the test score and house price estimates, indicating that sorting accounts for a small share of the observed gains.&lt;/p&gt;
&lt;h3 id="q9-are-the-findings-robust-to-alternative-research-designs"&gt;Q9. Are the findings robust to alternative research designs?&lt;/h3&gt;
&lt;p&gt;A9: The results are robust to five alternative estimation approaches: (1) the original one-step TOT estimator of Cellini et al. (2010); (2) a version of the stacked DRD where clean controls are districts that do not approve any bonds in the full [c−5, c+10] window; (3) a version that matches treated and control districts in each cohort based on bond history; (4) a version not controlling for future bond history; and (5) the extended two-way fixed effects (ETWFE) estimator of Wooldridge (2021). Results are also robust to linear polynomials with different slopes and quadratic polynomials of the vote margin.&lt;/p&gt;
&lt;h3 id="q10-how-does-the-capital-stock-measure-illuminate-mechanism-and-what-are-its-limitations"&gt;Q10. How does the capital stock measure illuminate mechanism, and what are its limitations?&lt;/h3&gt;
&lt;p&gt;A10: The authors construct a district-level capital stock as the 30-year depreciated sum of capital spending from Census of Governments data (1967–2017) at a 5% depreciation rate. This stock is negatively correlated with the share of low-SES students, confirming that more disadvantaged students attend schools in worse structural condition. Conditioning on this proxy, the SES gradient in bond impacts is reduced but remains. Among districts with below-median capital stock, low-SES districts see test score gains of 0.20 sd after seven years, while among above-median-stock districts the gap narrows to approximately 0.10 vs. 0.05 sd. A key limitation is that detailed school-condition data are unavailable nationally, so the capital stock is a proxy only.&lt;/p&gt;
&lt;h3 id="q11-what-is-the-quantitative-policy-implication-of-the-targeting-exercise"&gt;Q11. What is the quantitative policy implication of the targeting exercise?&lt;/h3&gt;
&lt;p&gt;A11: On average, low-SES districts receive about $97 per pupil per year less in capital spending than high-SES districts, so closing this gap over ten years implies approximately $970 in additional cumulative spending. Without changing spending composition, this would raise test scores by roughly 0.08 sd in low-SES districts, closing about 8% of the approximately 1 sd achievement gap between high- and low-SES districts. Redirecting that same additional spending toward the highest-impact categories (HVAC and safety/health) would generate test score gains roughly three times larger, potentially closing up to 25% of the observed achievement gap.&lt;/p&gt;
&lt;h3 id="q12-how-do-the-cross-state-differences-documented-in-prior-literature-map-onto-the-papers-heterogeneity-findings"&gt;Q12. How do the cross-state differences documented in prior literature map onto the paper&amp;rsquo;s heterogeneity findings?&lt;/h3&gt;
&lt;p&gt;A12: The authors replicate earlier state-level estimates and show that Ohio&amp;rsquo;s relatively large positive effects — found by Conlin and Thompson (2017) — are explained by a high concentration of bonds in low-SES districts funding infrastructure, while Texas&amp;rsquo;s near-zero effects — found by Martorell et al. (2016) — reflect a high share of bonds in higher-SES districts funding classrooms and athletic facilities. Wisconsin and Michigan, which showed null effects in earlier studies, similarly have bond compositions and student demographics that predict small impacts under the paper&amp;rsquo;s heterogeneity framework.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Stacked Dynamic Regression Discontinuity (Stacked DRD).&lt;/strong&gt; The paper&amp;rsquo;s primary estimation strategy, which combines the dynamic RD framework of Cellini et al. (2010) with a stacked-cohort design adapted from the staggered difference-in-differences literature. For each treatment cohort (year in which a bond barely passes), &amp;ldquo;clean controls&amp;rdquo; are defined as districts that also proposed a bond in the same year but narrowly failed to authorize it and did not authorize any subsequent bond within ten years. Cohort-specific datasets are stacked and estimated jointly with cohort fixed effects, ensuring that estimates are robust to treatment effect heterogeneity correlated with timing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Clean Controls.&lt;/strong&gt; Districts used as the counterfactual for treated districts in a given cohort: those that propose a bond in the same year as the treated cohort, barely fail to authorize it, and remain untreated for ten subsequent years. Their &amp;ldquo;clean&amp;rdquo; status is quasi-random because their future non-authorization results from narrow electoral loss rather than any endogenous district choice.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bond Spending Categories.&lt;/strong&gt; Eight mutually-non-exclusive classifications of bond spending derived from ballot text using keyword analysis: classroom space; HVAC; other infrastructure (plumbing, roofs, furnaces); safety and health (pollutant removal, compliance upgrades); STEM equipment and labs; athletic facilities; land purchases; and transportation. These categories are defined in the paper not by administrative accounting codes but by the stated intended use of funds in ballot language.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Treatment-on-the-Treated (TOT) Estimator.&lt;/strong&gt; The CFR estimator that captures the effect of bond authorization against the counterfactual of never authorizing a bond in the foreseeable future, achieved by including leads and lags of a district&amp;rsquo;s bond proposal history as controls. This addresses the problem that multiple elections over time make simple treated-vs-control comparisons confounded by past and future bond activity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Capital Stock (District-Level Proxy).&lt;/strong&gt; A measure of each district&amp;rsquo;s accumulated school facility capital at a given point in time, constructed as the depreciated 30-year running sum of capital expenditures from the Census of Governments, using a 5% annual depreciation rate. Used as a proxy for facility conditions in the absence of nationally available building-quality data, and confirmed to be negatively correlated with district share of low-SES students.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Brueckner Efficiency Test.&lt;/strong&gt; An application of the theoretical framework linking public good provision levels to house price responses. If a spending increase raises house prices, the initial spending level was below the efficient level; if it lowers house prices, spending was too high. In this paper, the test is refined to use only locally-financed capital spending as the explanatory variable, to strip out the capitalization of state aid and isolate the efficiency assessment for locally-determined spending.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Socio-Economic Status (SES) Terciles.&lt;/strong&gt; Districts are ranked by the share of students eligible for free or reduced-price school meals as of 1995. &amp;ldquo;Low-SES districts&amp;rdquo; refers to those in the top tercile of this share (most disadvantaged); &amp;ldquo;high-SES districts&amp;rdquo; refers to those in the bottom tercile (least disadvantaged). Effects are estimated separately for these subsamples throughout.&lt;/p&gt;</description></item><item><title>What's driving the decline in entrepreneurship?</title><link>https://macropaperwarehouse.com/papers/whats-driving-the-decline-in-entrepreneurship/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/whats-driving-the-decline-in-entrepreneurship/</guid><description>&lt;p&gt;The entrepreneurship rate in the United States—defined as the share of the labor force who own and actively manage a business with at least ten employees—declined by 26% between 1987 and 2015, a decline mirrored in the firm entry rate and not explained by compositional changes in the economy or driven by a small number of sectors. This paper addresses what caused this broad-based decline using Current Population Survey data, two new empirical facts, and a dynamic general equilibrium model of occupational choice. The first new fact is that the decline was larger for higher-education groups (35% for those with more than a college degree versus 2.4% for those without a high-school diploma), indicating that the driving force is not skill-neutral. The second new fact is that the size distribution of entrepreneur firms has been stable, so the entrepreneurship decline represents a shrinkage of the entrepreneurial sector relative to the economy. Estimating the contribution of four candidate explanations—skill-biased technical change (SBTC), increasing regulation, technology-driven increases in fixed and entry costs, and technology-driven productivity advantages for large firms—the paper finds that increasing entry costs account for most of the decline in both the entrepreneurship share and the firm entry rate, with empirical evidence pointing to both regulation and technology as sources of these higher costs.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-does-the-model-of-occupational-choice-capture-and-how-are-the-explanations-identified"&gt;Q1. What does the model of occupational choice capture, and how are the explanations identified?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The dynamic general equilibrium model allows individuals to choose between working as an employee (earning wages) and being an entrepreneur (paying fixed and entry costs, then operating a firm); the model generates predictions about the entrepreneurship rate, firm entry rate, and the distribution of entrepreneur firm sizes across groups, which the data discipline.&lt;/strong&gt; By requiring the model to match changes in entrepreneurship along multiple dimensions—including the education-gradient fact and the stable size distribution—the author can separately identify the contribution of each candidate mechanism. SBTC operates through wages (raising opportunity cost of entrepreneurship for skilled workers); entry-cost increases reduce the number of new entrepreneurs regardless of skill; productivity advantages for large firms shift the size distribution; and regulation/technology-driven fixed-cost increases reduce incumbent-entrepreneur survival.&lt;/p&gt;
&lt;h3 id="q2-why-does-skill-biased-technical-change-fail-to-explain-the-level-decline"&gt;Q2. Why does skill-biased technical change fail to explain the level decline?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;SBTC raises wages for high-skill workers, which could in principle explain why fewer of them choose entrepreneurship; and indeed SBTC is found to have tilted entrepreneurship toward less-educated people.&lt;/strong&gt; However, SBTC cannot explain the decline in the aggregate entrepreneurship rate because: it does not reduce the incentive to be an entrepreneur for lower-skill workers (who are relatively unaffected), and the stable size distribution of entrepreneur firms is inconsistent with SBTC (which would tend to shift composition rather than reduce overall entrepreneurship). The model confirms that SBTC explains the education gradient but contributes little to the overall level decline.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-role-of-entry-costs-and-what-drives-them"&gt;Q3. What is the role of entry costs, and what drives them?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Increasing entry costs are found to explain most of the decline in the share of people who are entrepreneurs and most of the decline in the firm entry rate; the data also reject the hypothesis that entry-cost increases were accompanied by large changes in entrepreneur firm size, consistent with the observed stability of the size distribution.&lt;/strong&gt; Empirical evidence suggests two sources of higher entry costs: increasing regulation (occupational licensing, tax-code complexity, zoning restrictions) and technology changes that increase the fixed investments required to operate (e.g., adoption of IT systems). The paper does not fully separate these two sources but presents evidence consistent with both operating simultaneously.&lt;/p&gt;
&lt;h3 id="q4-what-is-the-role-of-increasing-productivity-of-large-firms"&gt;Q4. What is the role of increasing productivity of large firms?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Increasing productivity of large, non-entrepreneurial (e.g., publicly listed) firms matters little for the entrepreneurship rate or the firm entry rate, but has driven most of the reallocation of labor away from entrepreneur businesses.&lt;/strong&gt; This is because the productivity advantage of large firms shifts the scale of production without necessarily changing who becomes an entrepreneur, largely leaving the extensive margin of entrepreneurship intact while reducing the share of aggregate economic activity attributable to the entrepreneurial sector.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;entrepreneurship rate&lt;/strong&gt; : the share of the labor force who own and actively manage a business with at least ten employees, the paper&amp;rsquo;s main measure of entrepreneurship, which declined 26% from 1987 to 2015 in the CPS data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;entry costs&lt;/strong&gt; : the one-time costs required to establish a new entrepreneurial business; the paper finds these rose over the sample period due to both regulation and technology, and identifies them as the primary driver of the entrepreneurship decline.&lt;/p&gt;</description></item><item><title>Who's Afraid of the Minimum Wage? Measuring the Impacts on Independent Businesses Using Matched U.S. Tax Returns</title><link>https://macropaperwarehouse.com/papers/whos-afraid-of-the-minimum-wage-measuring-the-impacts-on-independent-businesses-using-matched-u.s.-tax-returns/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/whos-afraid-of-the-minimum-wage-measuring-the-impacts-on-independent-businesses-using-matched-u.s.-tax-returns/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks how independent (pass-through) businesses in the United States accommodate minimum wage increases — specifically whether they reduce employment, compress profits, pass costs through to customers, or exit — and what happens to the low-earning workers and business owners affected by these adjustments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors construct a novel linked firm-worker-owner panel dataset from the universe of U.S. tax returns, covering approximately 235,000 pass-through firms (S-corporations, partnerships, and LLCs) per year in highly exposed industries over 2010–2019. &amp;ldquo;Highly exposed&amp;rdquo; industries are defined as those where at least 15% of workers earned below the full-time equivalent of the federal minimum wage ($15,080 per year) in 2013. The dataset links annual business income tax returns to the individual income tax returns and W-2 information reports of all workers and owners.&lt;/p&gt;
&lt;p&gt;The causal identification strategy exploits the six state minimum wage increases that took effect in 2014 (California, Connecticut, Delaware, Michigan, Minnesota, and New Jersey) relative to 24 states that did not change their wage floors at any point from 2012–2018. The empirical workhorse is a panel difference-in-differences event study (Equation 1), augmented by DFL re-weighting (DiNardo et al., 1996) to improve comparability of treatment and control firms on observables. The analysis covers cumulative effects through 2018, by which point the average minimum wage across treatment states had risen 30.6%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Employment:&lt;/strong&gt; The average exposed independent firm does not meaningfully reduce employment. The authors estimate an own-wage elasticity of -0.209 (s.e. = 0.0112). Employment adjustments manifest as moderately lower hiring rather than layoffs of existing workers. Reduced hiring is wholly concentrated among teenagers and very part-time jobs paying less than $3,900 annually (with 67% earning less than $1,000 per year).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Worker earnings:&lt;/strong&gt; Despite the hiring reduction, low-earning workers employed at exposed independent firms experience average earnings gains of approximately $2,000 per year by 2018, relative to comparable workers in untreated states. Young individuals aged 20–26 without a 2013 job earn roughly $4,000 more per year by 2018; teenagers without a 2013 job gain approximately $1,000 per year. Workers in these groups are no less likely — and in some cases slightly more likely — to be employed five years after the minimum wage increase.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Wage bills:&lt;/strong&gt; Average wage bills among surviving treated firms rose 7.03% (s.e. = 0.0153) by 2018. Earnings gains are concentrated among workers earning $15,600–$35,000 annually, with no evidence of reduced earnings for higher-paid workers. The 7% average wage bill increase amounts to only 1.4% of 2013 firm revenues, easing pass-through.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Revenue and profits:&lt;/strong&gt; Revenues of surviving treated firms grew approximately 2.1% more than control firms by 2018. On average, this revenue increase fully offsets the higher wage bill, yielding a small net profit increase of roughly $3,360 (s.e. = $1,123) per owner by 2018, or about 2.7% of mean 2013 owner income.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Firm exit:&lt;/strong&gt; On average across all highly exposed industries, minimum wages increased the five-year exit probability by 0.9 percentage points (s.e. = 0.0029), relative to a baseline raw exit rate of approximately 29%. Exit effects are driven entirely by restaurants: by 2018, restaurants in treated states were 1.85 percentage points (s.e. = 0.0039) more likely to have exited, while the exit response for non-restaurant exposed firms is a precisely estimated zero.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity by productivity within restaurants:&lt;/strong&gt; Exit is concentrated entirely in the bottom productivity quartile (coefficient = 0.0254, s.e. = 0.0079), with no significant effect in the upper three quartiles. Profits among surviving small restaurants rise by $5,941 (s.e. = $1,546) by 2018 relative to 2013. Among small restaurants, the profit gains are larger for firms in the higher productivity quartiles (Q3: +$7,915; Q4: +$9,161). Surviving restaurants also increase non-labor input spending by 2.53% (s.e. = 0.0101), consistent with expanded output following competitor exits.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Entrant characteristics:&lt;/strong&gt; Post-reform restaurant entrants in treatment states have higher wage bills (13.8% higher in logs), higher revenues (4.0% higher), higher value-added (8.4% higher), and higher productivity (net income/revenue ratio 2.24 percentage points higher) than entrants in control states, indicating the minimum wage raises the productivity floor for new entrants.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Owner outcomes after exit:&lt;/strong&gt; Owners of small restaurants forced out by the minimum wage are significantly less likely to own an independent business five years later, but earn no less on average in wages plus business income. Policy-induced exiters are significantly less likely to report negative incomes, suggesting substitution away from risky or marginally profitable business ownership.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Theoretical Framework&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors present a Cournot competition model with heterogeneous firm productivity and fixed production costs. A minimum wage cost shock raises marginal costs, narrowing margins for all firms. Firms whose cost increases exceed the market price increase cannot cover fixed costs and exit. Remaining firms gain higher markups and larger market shares as demand is reallocated from exiting firms. Selection on ex-ante productivity (the least productive firms exit) limits the distortion to market quantity and amplifies profit gains among productive survivors. The model predicts profit increases only in markets with firm exit, which matches the data: profits rise among restaurants (where exit occurs) but not among retailers (where exit is a precisely estimated zero).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Findings pertain to the short-to-medium run (up to five years post-legislation) of phased-in minimum wage increases averaging 30.6% in six U.S. states. The sample covers pass-through (independent) businesses in highly exposed industries. Longer-run effects may differ if entrants adopt production technologies that rely less on low-wage labor or incumbents reconfigure inputs. Border-county retailers appear to be less able to pass through costs than interior firms, suggesting product market competition is a key moderating factor.&lt;/p&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-why-do-the-authors-focus-on-pass-through-businesses-rather-than-publicly-traded-corporations"&gt;Q1. Why do the authors focus on pass-through businesses rather than publicly traded corporations?&lt;/h3&gt;
&lt;p&gt;Pass-throughs (S-corporations, partnerships, and LLCs) comprise 78% of non-sole-proprietorship businesses and 79% of firms with fewer than 20 employees. They represent the majority organizational form for independent businesses in virtually all two-digit NAICS industry groups except utilities and enterprise management. Because minimum wage concerns are disproportionately raised on behalf of small independent businesses, and because most minimum wage workers in restaurants are employed at pass-throughs, studying pass-throughs directly addresses the policy debate. Additionally, pass-through tax returns link business income directly to the individual tax returns of each owner, enabling the authors to separately identify employee versus owner responses.&lt;/p&gt;
&lt;h3 id="q2-how-do-the-authors-define-highly-exposed-industries-and-why-does-this-matter-for-identification"&gt;Q2. How do the authors define &amp;ldquo;highly exposed&amp;rdquo; industries and why does this matter for identification?&lt;/h3&gt;
&lt;p&gt;Highly exposed industries are defined as four-digit NAICS industries where at least 15% of workers earned below the full-time federal minimum wage equivalent ($15,080 per year) in 2013, using tax data to construct a proxy for minimum wage workers. The analysis focuses on these industries because minimum wage workers are extremely concentrated — the vast majority are in Leisure/Hospitality and Retail. Restricting to highly exposed industries allows the authors to estimate average effects within affected markets and conduct heterogeneity analysis across firm characteristics within those markets, including comparing firms with different baseline shares of low-earning workers that nonetheless all face the market-level cost shock.&lt;/p&gt;
&lt;h3 id="q3-how-do-the-employment-effects-decompose-into-hiring-versus-retention"&gt;Q3. How do the employment effects decompose into hiring versus retention?&lt;/h3&gt;
&lt;p&gt;The average firm subject to a higher wage floor does not lay off existing workers (the retention line is flat in event study estimates). By 2018, firms in treated states hire roughly one fewer worker on average than similar firms in control states, entirely through reduced hiring. This reduced hiring is wholly concentrated among teenagers in very part-time jobs: the missing hires consist entirely of workers who would have earned less than $3,900 annually, with 67% earning less than $1,000 per year. Simultaneously, workers already employed at exposed firms are 2 to 4 percentage points more likely to remain with their 2013 employer by 2016, with prime-age low-earning workers exhibiting the largest retention increases.&lt;/p&gt;
&lt;h3 id="q4-what-happens-to-low-earning-workers-and-young-people-in-individual-level-panels"&gt;Q4. What happens to low-earning workers and young people in individual-level panels?&lt;/h3&gt;
&lt;p&gt;Low-earners (those earning below $25,000 in each year from 2012–2014) at exposed independent firms experience average earnings gains of approximately $2,000 per year by 2018 relative to similar workers in untreated states, including teenage low-earners. Young individuals aged 20–26 with no job in 2013 experience a relative earnings increase of approximately $4,000 per year by 2018; teenagers without jobs in 2013 gain approximately $1,000 per year. These workers are no less likely — and often slightly more likely — to be employed relative to their counterparts in control states, so the earnings gains are not offset by employment losses at the individual level.&lt;/p&gt;
&lt;h3 id="q5-what-is-the-magnitude-of-the-cost-shock-for-firms-and-how-does-it-compare-to-revenues"&gt;Q5. What is the magnitude of the cost shock for firms and how does it compare to revenues?&lt;/h3&gt;
&lt;p&gt;By 2018, the average wage bill among surviving firms in treated states was 7.03% (s.e. = 0.0153) higher than comparable firms in control states. This is consistent with a back-of-envelope calculation: low-earning workers account for about 21% of wage bills at these firms, and states raised minimum wages by 30.6% on average (0.21 × 0.306 = 0.064). However, the 7% wage bill increase amounts to only approximately 1.4% of 2013 firm revenues, making cost pass-through relatively modest. Higher minimum wages have no discernible impact on pension contributions but slightly reduce deductions for other benefits including health insurance.&lt;/p&gt;
&lt;h3 id="q6-how-do-surviving-firms-finance-the-increased-wage-bill-and-what-happens-to-profits"&gt;Q6. How do surviving firms finance the increased wage bill, and what happens to profits?&lt;/h3&gt;
&lt;p&gt;Surviving firms finance the wage increase primarily through higher revenues. By 2018, revenues of firms in treated states grew approximately 2.1% more than revenues of firms in control states. On average, this revenue increase outpaces the higher wage bill, resulting in a net profit increase of approximately $3,360 (s.e. = $1,123) per owner by 2018, representing about 2.7% of mean 2013 owner income. There is no evidence of redistribution from middle- or high-income workers within firms; wage bill increases are concentrated among workers earning $15,600–$35,000 annually, consistent with minimum wage spillovers to workers slightly above the statutory floor.&lt;/p&gt;
&lt;h3 id="q7-why-do-restaurants-experience-exit-effects-but-retailers-do-not"&gt;Q7. Why do restaurants experience exit effects but retailers do not?&lt;/h3&gt;
&lt;p&gt;The asymmetry stems from the intensity of low-wage labor in production. While low-earning workers account for a similar share of labor costs at restaurants (41.8%) and retailers (38.5%), labor costs overall are more than twice as large at restaurants relative to retailers. Wage bills account for 39% of variable costs and 27% of revenues at restaurants, but only 16% of variable costs and 13% of revenues at retailers. As a result, raising the minimum wage raises variable costs by 5.76% at restaurants. Non-restaurant exposed firms are able to fully pass through their smaller cost shock, yielding flat profits and neither employment nor exit impacts.&lt;/p&gt;
&lt;h3 id="q8-why-is-firm-exit-concentrated-in-the-lowest-productivity-quartile-of-restaurants-rather-than-among-the-most-exposed-firms"&gt;Q8. Why is firm exit concentrated in the lowest productivity quartile of restaurants rather than among the most exposed firms?&lt;/h3&gt;
&lt;p&gt;The Cournot framework predicts exits among firms with the lowest ex-ante productivity (highest marginal costs), the largest cost shock (highest share of low-wage labor per unit of output), or a combination. Empirically, productivity is the primary determinant: restaurants across all productivity quartiles use similar shares of low-earning workers (40–44% of wage bills for Q1 through Q4). Exit rises significantly only among restaurants in the bottom productivity quartile (coefficient = 0.0254, s.e. = 0.0079), with no significant effects in Q2–Q4. Among the lowest-productivity restaurants, those most dependent on low-earning labor face the largest exit rates.&lt;/p&gt;
&lt;h3 id="q9-how-do-the-models-predictions-about-profit-heterogeneity-match-the-data"&gt;Q9. How do the model&amp;rsquo;s predictions about profit heterogeneity match the data?&lt;/h3&gt;
&lt;p&gt;The Cournot model predicts profits should rise only in markets with firm exit (via increased margins and market share reallocation to survivors). This is exactly what the data show. Among restaurants, where exit is concentrated in the bottom productivity quartile, profits among surviving small restaurants rise by $5,941 (s.e. = $1,546) by 2018. Among small restaurants specifically, profit gains increase with productivity: Q3 restaurants gain $7,915 (s.e. = $3,326) and Q4 restaurants gain $9,161 (s.e. = $2,127), while Q1 and Q2 gains are statistically indistinguishable from zero. In non-restaurant exposed industries where the exit effect is a precise zero, profits are also flat — exactly as the model predicts.&lt;/p&gt;
&lt;h3 id="q10-what-happens-to-the-characteristics-of-new-restaurant-entrants-after-the-minimum-wage-increase"&gt;Q10. What happens to the characteristics of new restaurant entrants after the minimum wage increase?&lt;/h3&gt;
&lt;p&gt;Post-reform restaurant entrants in treatment states are systematically more productive than entrants in control states. They have wage bills 13.8% higher (in logs), revenues 4.0% higher, value-added 8.4% higher, and productivity ratios (net income/revenue) 2.24 percentage points higher than new entrants in control markets. This implies the minimum wage raises the minimum viable productivity threshold for entrant restaurants, consistent with Sorkin (2015)&amp;rsquo;s insight that minimum wages shape the capital and technology choices of entering firms. The restaurant industry thus becomes more productive on average through both the exit of the least productive incumbents and the entry of more productive new firms.&lt;/p&gt;
&lt;h3 id="q11-how-do-worker-transition-patterns-reflect-the-reallocation-of-output-to-surviving-firms"&gt;Q11. How do worker transition patterns reflect the reallocation of output to surviving firms?&lt;/h3&gt;
&lt;p&gt;Workers at large independent businesses (top revenue quartile) are 3.52 percentage points more likely to remain with their 2013 employer in 2018 and 2.36 percentage points less likely to switch to another large firm. The large firms that retain more of their existing workforce also reduce their hiring of very part-time teenagers the most — in the top revenue quartile, firms shed roughly 4.5 employment relationships on average, comprising higher retention of 4.15 existing workers offset by reduced hiring of 8.67 very part-time teenage workers. Workers originally at smaller exposed firms are more likely to be found working at larger firms five years out, consistent with demand reallocation from exiting and shrinking small firms toward larger, more productive survivors.&lt;/p&gt;
&lt;h3 id="q12-what-happens-to-owners-of-restaurants-that-exit-due-to-the-minimum-wage"&gt;Q12. What happens to owners of restaurants that exit due to the minimum wage?&lt;/h3&gt;
&lt;p&gt;Policy-induced exiters of small restaurants are significantly less likely to own an independent business five years later and less likely to receive all earnings from business ownership, relative to owners of restaurants that exited for other reasons in control states. However, their average incomes (wage income plus ordinary business income) are no lower. This income stability is partly explained by the fact that policy-induced exiters are significantly less likely to report negative incomes five years out, suggesting they substitute away from potentially risky or marginally profitable business ownership toward wage employment or other activities. The utility implications are ambiguous: these former owners may have preferred business ownership even if it did not yield higher income.&lt;/p&gt;
&lt;h3 id="q13-what-is-the-role-of-product-market-competition-in-mediating-pass-through-as-evidenced-by-border-county-analysis"&gt;Q13. What is the role of product market competition in mediating pass-through, as evidenced by border-county analysis?&lt;/h3&gt;
&lt;p&gt;The border county robustness analysis reveals that product market competition is central to pass-through success. Retailers near state borders, where consumers can cross-state-border shop, face more elastic demand and are less able to finance the wage cost shock with new revenues, exhibiting reduced profits and higher exit rates (though estimates are imprecise). Further from the border, where the cost shock is more commonly felt by all potential substitutes (making market demand elasticity rather than firm demand elasticity the relevant parameter), results are very similar to the full-sample aggregate findings. This confirms that the common nature of the minimum wage cost shock — shared by all competing firms in the market — is a key reason firms can pass through costs to consumers.&lt;/p&gt;
&lt;h3 id="q14-how-do-the-findings-address-the-divide-among-independent-business-owners-on-minimum-wage-policy"&gt;Q14. How do the findings address the divide among independent business owners on minimum wage policy?&lt;/h3&gt;
&lt;p&gt;The heterogeneous outcomes rationalize why surveys consistently find business owners divided. Among restaurants, some owners (those operating the least productive small restaurants) face exit and loss of business ownership, while surviving productive restaurateurs see higher profits of $5,941–$9,161 per year. Among non-restaurant exposed businesses, owners are broadly unaffected in terms of profits and viability. Uncertainty about whether a given firm&amp;rsquo;s demand is elastic enough to bear cost pass-through — given that owners may be more familiar with the elasticity of firm-level demand from prior unilateral price changes, rather than the relevant market-level demand elasticity applying to a common cost shock — may broaden opposition to include even owners who would ultimately benefit.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Pass-through businesses (independent businesses):&lt;/strong&gt; Privately owned firms organized as S-corporations, partnerships, or LLCs, taxed by passing income through to the individual returns of owners rather than at the entity level. In 2015, these comprised 78% of non-sole-proprietorship U.S. businesses and 46% of employment. The paper uses &amp;ldquo;pass-through&amp;rdquo; and &amp;ldquo;independent business&amp;rdquo; interchangeably as the unit of analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Highly exposed industries:&lt;/strong&gt; Four-digit NAICS industries where at least 15% of workers earned below the annual full-time equivalent of the federal minimum wage ($15,080) in 2013, as measured in the authors&amp;rsquo; administrative tax data. This threshold proxies the concentration of minimum-wage workers across industries and drives the sample selection for firm-level analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Own-wage elasticity of employment:&lt;/strong&gt; The estimated percentage change in employment at a firm associated with a given percentage change in the firm&amp;rsquo;s minimum wage. The authors estimate this as -0.209 (s.e. = 0.0112), reflecting the average effect across all exposed independent businesses, conditional on the firm&amp;rsquo;s industry, size, and local market characteristics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;DFL re-weighting (DiNardo-Fortin-Lemieux):&lt;/strong&gt; A non-parametric reweighting procedure that adjusts the distribution of control-group firms to match the distribution of treatment-group firms on observables (specifically, two-year lagged value-added within three-digit NAICS industries). Used to improve pre-reform comparability of treatment and control firm samples without parametric functional form assumptions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Firm productivity (in this paper&amp;rsquo;s sense):&lt;/strong&gt; Measured as the ratio of net profits to revenues (net income/revenue) at the firm level in the base year 2013, used to assign firms to productivity quartiles for heterogeneity analysis. This is a firm-level profitability measure constructed from pass-through tax returns, not a total factor productivity estimate requiring production function estimation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Firm exit:&lt;/strong&gt; An indicator for a firm that filed a tax return in 2013 but did not file a return in a subsequent year t. The average one-year exit rate for highly exposed independent businesses is 5.2%; the cumulative five-year raw exit rate is approximately 29% across treatment and control states.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cournot competition with heterogeneous productivity and fixed costs:&lt;/strong&gt; The paper&amp;rsquo;s conceptual framework, in which N firms compete in quantities with asymmetric marginal costs (reflecting heterogeneous productivity), a common output price, and a fixed cost of production. Under this framework, a minimum wage cost shock narrows margins unevenly, induces exit among firms that cannot cover fixed costs, and generates both demand reallocation and market share gains for productive survivors — rationalizing simultaneous exit and profit increases in the same industry.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Common cost shock:&lt;/strong&gt; The property that a minimum wage increase raises production costs for all firms employing low-wage workers in the same market simultaneously. Because all competing firms face higher costs, the relevant pass-through parameter is the elasticity of market demand rather than the (higher) elasticity of individual firm demand, facilitating cost pass-through to consumers and distinguishing minimum wages from unilateral price changes by a single firm.&lt;/p&gt;</description></item><item><title>Why Is Workplace Sexual Harassment Underreported? The Value of Outside Options amid the Threat of Retaliation</title><link>https://macropaperwarehouse.com/papers/why-is-workplace-sexual-harassment-underreported-the-value-of-outside-options-amid-the-threat-of-retaliation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/why-is-workplace-sexual-harassment-underreported-the-value-of-outside-options-amid-the-threat-of-retaliation/</guid><description>&lt;h2 id="layer-1--summary"&gt;Layer 1 — Summary&lt;/h2&gt;
&lt;h3 id="research-question-and-argument"&gt;Research question and argument&lt;/h3&gt;
&lt;p&gt;Dahl and Knepper address the long-standing puzzle of why workplace sexual harassment is chronically underreported despite high estimated prevalence. Survey evidence indicates that no fewer than 1 in 28 U.S. workers report annual victimization, yet only 1 in 11,000 workers files a formal charge with the Equal Employment Opportunity Commission (EEOC). Even following the #MeToo movement, formal charges rose only about 10%, leaving an enormous gap unexplained.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central hypothesis is that employers coerce victims into silence through the credible threat of retaliatory firing. The key mechanism: because reporting constitutes a &amp;ldquo;protected activity&amp;rdquo; triggering employer notification, workers who fear losing their jobs will suppress claims. This threat is most binding when a worker&amp;rsquo;s outside options are weak — when it is hard to find a new job or when unemployment insurance (UI) benefits are thin. The paper tests this hypothesis by asking whether external shocks that reduce the value of outside options increase the threshold of harassment severity above which workers are willing to report.&lt;/p&gt;
&lt;h3 id="measurement-strategy"&gt;Measurement strategy&lt;/h3&gt;
&lt;p&gt;Measuring underreporting directly is impossible by definition. The authors&amp;rsquo; key methodological insight is to use the &lt;em&gt;selectivity&lt;/em&gt; of filed charges as an indirect proxy. Under mild assumptions, if workers become more selective about which incidents they report, the average quality of filed charges must rise. The authors measure quality using the EEOC&amp;rsquo;s own merit determination: a charge is deemed meritorious if the employer settles, the claimant withdraws upon receipt of benefits, or the EEOC finds reasonable cause after investigation. The merit rate thus serves as an observable proxy for the (unobservable) degree of underreporting.&lt;/p&gt;
&lt;p&gt;Baseline descriptive evidence supports the mechanism&amp;rsquo;s relevance: across 2000–2015, sexual harassment charges were nearly 50% more likely to be meritorious than non-harassment charges (27.0% vs. 18.6%), and more than twice as likely to involve employer retaliation (63.4% vs. 30.7%). The proportion of EEOC sexual harassment cases involving retaliation rose from 52% in 2000 to 72% in 2015, a period over which the annual volume of filed charges fell by 37%.&lt;/p&gt;
&lt;h3 id="analysis-1--labor-market-conditions-20002015"&gt;Analysis 1 — Labor market conditions (2000–2015)&lt;/h3&gt;
&lt;p&gt;The first empirical design exploits monthly variation in state-industry unemployment rates over 2000–2015 using EEOC microdata on individual charges. The regression controls for industry, state, and time fixed effects, isolating within-state-industry variation in unemployment. The identifying assumption is that a worker&amp;rsquo;s willingness to file depends only on her outside options and the severity of harassment she experiences, conditional on fixed effects.&lt;/p&gt;
&lt;p&gt;The results indicate that each one percentage point increase in a state-industry&amp;rsquo;s monthly unemployment rate is associated with a 0.5–0.7% increase in the probability that a filed charge is deemed meritorious by the EEOC. This is consistent with the hypothesis that workers become more reluctant to report as outside labor market options weaken.&lt;/p&gt;
&lt;p&gt;Heterogeneity analysis using linked EEO-1 establishment data strengthens the interpretation. The effect is amplified in industries that employ a larger fraction of men and in establishments where male managers account for a higher share of supervisory roles. Oﬀending establishments in the sample have, on average, 2.8 percentage points more male employees and 5 percentage points more male managers than non-offending establishments. The selectivity-unemployment gradient is larger in these male-dominated environments, consistent with a role for gendered power disparities in enabling employer retaliation.&lt;/p&gt;
&lt;h3 id="analysis-2--north-carolina-ui-reform-quasi-experiment"&gt;Analysis 2 — North Carolina UI reform (quasi-experiment)&lt;/h3&gt;
&lt;p&gt;The second design exploits North Carolina&amp;rsquo;s 2013 UI reform as a plausibly exogenous reduction in the value of outside options. In response to the near-insolvency of its UI trust fund following the Great Recession, North Carolina simultaneously reduced maximum weekly benefits by approximately 35% (from $535 to $350 per week) and cut maximum benefit duration from 26 to 20 weeks. Together, these changes reduced the maximum total regular UI benefit available to North Carolinians by approximately 50%, from roughly $14,000 to $7,000. These reforms also violated the Congressional non-reduction rule, making individuals ineligible for an additional 47 weeks of federal Emergency Unemployment Compensation benefits, further amplifying the effective cut. North Carolina was the only state to simultaneously reduce both the level and duration of benefits.&lt;/p&gt;
&lt;p&gt;The authors implement a difference-in-differences design comparing North Carolina to other Southern states that did not change their UI programs, controlling for state and month-year fixed effects. Pre-reform parallel trends are documented via event study. Administrative UI recipiency data show that the short-term UI recipiency rate in North Carolina fell from 33% to 10% — a 59% decline relative to control states — within roughly two years of the reform.&lt;/p&gt;
&lt;p&gt;The main finding is that the selectivity of sexual harassment charges filed in North Carolina increased by approximately 7 percentage points following the reform, representing more than a 30% increase relative to control states. This is consistent with the hypothesis that reduced UI generosity raises the cost of a retaliatory firing, causing workers to suppress all but the most severe harassment incidents.&lt;/p&gt;
&lt;p&gt;The authors note that North Carolina also reduced corporate and personal income taxes shortly after the UI reform. Because tax cuts should increase both labor demand and labor supply (insofar as substitution effects dominate income effects), this would tend to reduce the reporting threshold, leading them to interpret the 30%+ estimate as a lower bound on the causal effect of the UI reform on selectivity.&lt;/p&gt;
&lt;h3 id="formal-model"&gt;Formal model&lt;/h3&gt;
&lt;p&gt;The paper presents a threshold model of reporting behavior adapted from Boone and Van Ours (2006). Workers choose a reporting threshold: the minimum harassment severity above which they will file a charge. The threshold rises when the value of outside options falls, either because the probability of finding a new job declines (recession) or because unemployment benefits shrink. The model predicts that the merit rate of filed charges will rise as outside options weaken. The model explicitly does not predict the volume of charges, because firm behavior — which may adjust endogenously to higher reporting thresholds — is not modeled.&lt;/p&gt;
&lt;h3 id="scope-conditions"&gt;Scope conditions&lt;/h3&gt;
&lt;p&gt;All findings concern formal EEOC charges filed in the United States between 2000 and 2015 (analysis 1) and through the post-2013 reform period (analysis 2). The EEOC definition of illegal harassment requires severity sufficient to create a &amp;ldquo;hostile or offensive work environment&amp;rdquo; or an adverse employment action. The paper&amp;rsquo;s merit measure captures harassment that exceeded this legal threshold; non-meritorious charges may still involve some level of misconduct. The sample for establishment-level heterogeneity analyses covers private firms with 100 or more employees (EEO-1 filers), approximately 40% of U.S. employees. The mechanism specifically concerns retaliation-driven suppression of &lt;em&gt;formal&lt;/em&gt; reporting; effects on informal or anonymous reporting cannot be assessed.&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-q-what-is-the-core-mechanism-the-paper-proposes-to-explain-underreporting"&gt;Q1. Q: What is the core mechanism the paper proposes to explain underreporting?&lt;/h3&gt;
&lt;p&gt;A: Employers threaten workers with retaliatory firing for engaging in protected activity (filing an EEOC charge). Because the EEOC notifies the named employer within 10 days of receiving a charge, worker anonymity is rarely preserved. When a worker&amp;rsquo;s outside options are weak — because unemployment is high or UI benefits are thin — the expected cost of a retaliatory firing is higher, raising the severity threshold above which a victim is willing to report. Workers therefore &amp;ldquo;tough it out&amp;rdquo; rather than risk their current job.&lt;/p&gt;
&lt;h3 id="q2-q-how-does-the-paper-measure-something-that-is-by-definition-not-reported"&gt;Q2. Q: How does the paper measure something that is, by definition, not reported?&lt;/h3&gt;
&lt;p&gt;A: By using the quality of filed charges as a proxy for the degree of underreporting. Under the threshold model, if workers only report when harassment exceeds a higher bar, the average quality of what does get filed must rise. The EEOC&amp;rsquo;s own merit determination (settlement, withdrawal with benefits, or reasonable-cause ruling) provides an objective, externally-assessed quality measure. An increase in the merit rate signals that the population of filed charges has become more selected — that is, that the unreported fraction has grown.&lt;/p&gt;
&lt;h3 id="q3-q-what-does-the-0507-figure-mean-and-what-is-its-interpretation"&gt;Q3. Q: What does the 0.5–0.7% figure mean, and what is its interpretation?&lt;/h3&gt;
&lt;p&gt;A: Each one percentage point increase in a state-industry&amp;rsquo;s monthly unemployment rate is associated with a 0.5–0.7 percentage point increase in the probability that a filed sexual harassment charge receives a merit designation from the EEOC. This is interpreted as evidence that workers become more selective — filing only more severe cases — as outside options weaken, consistent with higher underreporting at lower harassment thresholds.&lt;/p&gt;
&lt;h3 id="q4-q-why-did-the-number-of-eeoc-sexual-harassment-charges-fall-by-37-between-2000-and-2015-even-as-retaliation-rates-rose"&gt;Q4. Q: Why did the number of EEOC sexual harassment charges fall by 37% between 2000 and 2015, even as retaliation rates rose?&lt;/h3&gt;
&lt;p&gt;A: The paper offers the interpretation that firms have become more effective at credibly threatening retaliation to suppress reporting. The 37% volume decline does not imply harassment has diminished; it may reflect a rising fraction of victims staying silent. The authors note the model does not make a prediction about volume because firm behavior is not modeled — volume depends on both worker reporting thresholds and employer conduct.&lt;/p&gt;
&lt;h3 id="q5-q-why-is-north-carolinas-ui-reform-particularly-well-suited-as-a-natural-experiment"&gt;Q5. Q: Why is North Carolina&amp;rsquo;s UI reform particularly well-suited as a natural experiment?&lt;/h3&gt;
&lt;p&gt;A: Four features make it attractive. First, the reform was motivated by trust fund insolvency rather than local labor market conditions, making it more plausibly exogenous to harassment reporting trends. Second, it was implemented during a period of historically high unemployment, when the social safety net was unusually relevant to workers considering risky actions. Third, the cuts affected both the intensive margin (benefit level, down ~35%) and the extensive margin (duration, from 26 to 20 weeks; added eligibility restrictions), with total maximum benefits cut by approximately 50%. Extensive-margin cuts are likely particularly salient for workers worried about a retaliatory firing. Fourth, the cuts to regular UI were permanent and primary, rather than affecting supplemental federal programs.&lt;/p&gt;
&lt;h3 id="q6-q-what-role-does-industry-and-establishment-gender-composition-play"&gt;Q6. Q: What role does industry and establishment gender composition play?&lt;/h3&gt;
&lt;p&gt;A: The underreporting effect — proxied by the merit-unemployment gradient — is amplified in industries with a larger fraction of male coworkers and in establishments with a higher fraction of male managers. Establishments named in sexual harassment charges have, on average, 2.8 percentage points more male employees and 5 percentage points more male managers than non-respondent establishments. The male-manager underreporting gradient is further amplified by higher unemployment, suggesting gendered power disparities interact with labor market conditions to suppress reporting.&lt;/p&gt;
&lt;h3 id="q7-q-does-the-paper-make-predictions-about-the-volume-of-charges-not-just-their-quality"&gt;Q7. Q: Does the paper make predictions about the volume of charges, not just their quality?&lt;/h3&gt;
&lt;p&gt;A: No. The threshold model explicitly does not model firm behavior and makes no prediction about charge volume. Whether volume rises or falls following a labor demand shock is theoretically ambiguous: firms may respond to higher reporting thresholds by escalating harassment (increasing both incidence and severity), or may not respond at all. The identifying assumption requires only that a worker&amp;rsquo;s willingness to file depends on her outside options and the severity of harassment she experiences — not on firm behavior.&lt;/p&gt;
&lt;h3 id="q8-q-what-is-the-value-of-a-statistical-harassment-vsh-figure-and-how-does-it-relate-to-the-papers-motivation"&gt;Q8. Q: What is the &amp;ldquo;value of a statistical harassment&amp;rdquo; (VSH) figure, and how does it relate to the paper&amp;rsquo;s motivation?&lt;/h3&gt;
&lt;p&gt;A: Hersch (2018) estimates the VSH for serious cases at approximately $7.6 million, roughly comparable to the value of a statistical life (VSL). Dahl and Knepper cite this figure to underscore the magnitude of the underreporting problem: with an estimated 5 million workers victimized annually, the social costs of suppressed reporting are substantial. The comparison to VSL motivates why closing the reporting gap matters for welfare, not just legal compliance.&lt;/p&gt;
&lt;h3 id="q9-q-what-is-the-ex-ante-moral-hazard-interpretation-of-the-ui-results"&gt;Q9. Q: What is the ex-ante moral hazard interpretation of the UI results?&lt;/h3&gt;
&lt;p&gt;A: Most UI research focuses on ex-post effects — how benefit generosity affects job search behavior for workers who have already lost their jobs. Dahl and Knepper document an ex-ante moral hazard effect: UI generosity affects the behavior of currently employed workers by changing the expected cost of actions (reporting harassment) that might trigger job loss. Lower UI generosity raises the effective cost of a retaliatory firing, discouraging reporting. This is analogous to, but in the opposite direction from, Lusher et al. (2020), who find that UI expansions reduced productivity among currently employed workers.&lt;/p&gt;
&lt;h3 id="q10-q-what-does-the-parallel-trends-evidence-show-for-the-nc-difference-in-differences"&gt;Q10. Q: What does the parallel-trends evidence show for the NC difference-in-differences?&lt;/h3&gt;
&lt;p&gt;A: The paper presents an event study documenting parallel pre-reform trends in the merit rate between North Carolina and control states. The control group is other Southern states that did not change their UI programs during the sample period, excluding AR, FL, GA, and SC (which made changes) and the West South Central division (which exhibited differential pre-trends). The UI recipiency rate tracks closely between NC and control states prior to July 2013, then diverges sharply thereafter, dropping from 33% to 10% in North Carolina within two years — a 59% decline relative to controls.&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Merit determination (EEOC):&lt;/strong&gt; The EEOC assigns a merit designation to a sexual harassment charge if the named employer settles with the employee, the claimant withdraws the charge upon receipt of benefits, or the EEOC itself determines after investigation that there is &amp;ldquo;reasonable cause&amp;rdquo; to believe harassment occurred. As used in this paper, merit designations capture cases where harassment exceeded the legal threshold of a &amp;ldquo;hostile or offensive work environment&amp;rdquo; or produced an adverse employment decision — not all cases involving some level of misconduct.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Selectivity of charges:&lt;/strong&gt; The fraction of filed EEOC sexual harassment charges that receive a merit designation. In the paper&amp;rsquo;s framework, higher selectivity (a higher merit rate) signals that workers are filing only more severe cases — i.e., that underreporting of less severe cases has increased. Selectivity is used as an observable proxy for the (unobservable) degree of underreporting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reporting threshold (ᾱ):&lt;/strong&gt; In the paper&amp;rsquo;s threshold model, the minimum level of harassment severity above which a worker will file an EEOC charge. The threshold is determined by the equality between the expected gains from reporting (probability of success times compensation plus elimination of harassment) and the expected costs (probability of retaliation times the gap between current wage and unemployment value). The threshold rises when outside options weaken — either through lower job-finding probabilities or reduced UI benefits.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Outside options:&lt;/strong&gt; In this paper, the expected value to a worker of becoming unemployed: a weighted average of the wage at a new job (weighted by job-finding probability) and unemployment benefits (weighted by the probability of not finding a job). Outside options determine the cost a worker bears if retaliatory firing follows an EEOC charge. The paper&amp;rsquo;s two empirical analyses correspond to two separate shocks to outside options: aggregate labor demand (unemployment rate) and institutional safety net generosity (UI benefits).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Retaliation:&lt;/strong&gt; Defined by the EEOC as punishment for engaging in a protected activity, such as filing a charge. Retaliation arose in 63.4% of all EEOC sexual harassment charges filed between 2000 and 2015 — more than double the rate for non-harassment charges — and rose from 52% of harassment cases in 2000 to 72% in 2015. In the paper&amp;rsquo;s model, the probability of a retaliatory firing is denoted θ, and is treated as fixed (not a function of harassment severity for tractability).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ex-ante moral hazard (UI):&lt;/strong&gt; The effect of UI benefit generosity on the behavior of currently employed workers, rather than on those already unemployed. In this paper&amp;rsquo;s context, higher UI generosity reduces the cost of a potential retaliatory firing for currently employed workers, making them more willing to report harassment. The North Carolina UI reform provides evidence of this ex-ante channel: when benefits were cut, the selectivity of harassment charges rose, consistent with workers becoming less willing to risk their jobs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;EEO-1 data:&lt;/strong&gt; A mandatory annual survey of private establishments in the United States with 100 or more employees, covering approximately 40% of all U.S. employees. Collected by the EEOC, these data report the gender, race, and occupational distribution of workers within each establishment. In this paper, the EEO-1 files are linked to EEOC charge microdata to analyze how the gender composition of co-workers and managers moderates both the incidence of reported harassment and the degree of underreporting.&lt;/p&gt;
&lt;hr&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on IZA Discussion Paper 14740. AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;</description></item></channel></rss>