D82 | Macro Paper Warehouse

Collusion with Optimal Information Disclosure

Mon, 01 Jan 0001 00:00:00 +0000

This paper asks how a third-party intermediary (an “algorithm”) that observes market demand or costs superior to competing firms should optimally disclose that information to maximize the firms’ collusive profit in a repeated Bertrand competition setting. The motivation is the rise of algorithmic pricing intermediaries such as RealPage in apartment rentals, A2i Systems in retail gasoline, and Rainmaker in hotel rooms, as well as offline cartel facilitators like AC-Treuhand.

The model extends the canonical Rotemberg–Saloner (1986) repeated Bertrand framework with stochastic demand. The key technical assumption is that firm profit is affine in the unknown state s, so expected profit depends only on the expected state. This holds for binary states, linear demand with unknown intercept (D(p,s) = s − p), and linear demand with unknown per-unit cost. The algorithm observes s and commits to a known disclosure policy mapping s to a public signal. The solution concept is pure-strategy subgame-perfect equilibrium, and the paper solves for the disclosure policy and equilibrium that jointly maximize collusive profit.

The main result (Theorem 1) is that the unique optimal disclosure policy is upper censorship: there is a cutoff ŝ such that demand states s < ŝ are disclosed and result in the corresponding monopoly price p^m(s), while demand states s ≥ ŝ are pooled — only the event {s ≥ ŝ} is disclosed — and result in the monopoly price for the mean concealed state, p^m(s*), where s* = E[s | s ≥ ŝ]. The reduction to a static information design problem (Lemma 1) is the key technical step: optimal collusive profit equals V*, the greatest fixed point of V = max_{G ∈ MPC(F)} E_G[min{π^m(s), δV/((1−δ)(n−1))}]. The “capped monopoly profit” min{π^m(s), π^max} is convex-then-concave in s, and classical results from the static information design literature (Kolotilin 2018; Dworczak and Martini 2019) then imply upper censorship is uniquely optimal.

Two features of the optimal equilibrium are notable. First, prices are rigid (constant at p^m(s*)) whenever s ≥ ŝ — the opposite of Rotemberg–Saloner’s “price wars during booms.” The logic is that pooling high demand states with a lower average state is more profitable than cutting prices, because pooling reduces the current-period deviation gain without sacrificing as much on-path profit. Second, for demand states s ∈ (ŝ, s*), the equilibrium price p^m(s*) exceeds the monopoly price p^m(s) — supra-monopoly pricing occurs for a range of intermediate states. Monopoly pricing is attainable at each such state in isolation, but recommending the higher price p^m(s*) is necessary to make the pooling incentive-compatible at states s > s*.

Comparing to full disclosure, Proposition 1 shows that optimal disclosure leads to strictly higher prices at every demand state, and hence unambiguously lower consumer surplus. Proposition 3 shows that improving the algorithm’s accuracy (a mean-preserving spread of F) reduces expected consumer surplus whenever consumer surplus under monopoly pricing is concave in s — a natural condition. This result is more pessimistic than prior work (Sugaya–Wolitzky 2018; Miklos-Thal–Tucker 2019), which found ambiguous effects because those papers assumed full disclosure.

Comparative statics (Proposition 2): fewer firms or a higher discount factor δ increases collusive profit V* and makes prices more flexible (raises ŝ). Collusion is impossible if and only if δ < (n−1)/n, the same threshold as under full disclosure.

Extensions maintain the core results. With Markov (persistent) demand (Section 4 / Theorem 2), upper censorship remains optimal but the cutoff ŝ(s) depends on last-period demand s: under positive serial correlation, ŝ(s) is decreasing in s, so the algorithm discloses less information following high demand. With differentiated products under a symmetric linear demand system (Section 5 / Theorem 3), the optimal policy censors an intermediate interval [ŝ_L, ŝ_H] and discloses both the lowest and highest demand states, because at high states the absence of an upper bound on equilibrium profit makes disclosure with price-cutting optimal.

Q: What is the core research question and why is it policy-relevant? A: The paper asks how an informed intermediary should optimally disclose demand or cost information to competing firms to maximize their collusive profit. It is directly motivated by antitrust cases against RealPage (sued by the US DOJ in August 2024), A2i Systems/Kalibrate, and Rainmaker, all of which gather market data from competing firms and recommend prices. The theory also applies to offline facilitators like AC-Treuhand, prosecuted by the European Commission for disclosing competitively sensitive information.

Q: What is the affinity assumption and why does it matter? A: The paper assumes that firm profit π(p, s) is affine (linearly increasing) in the demand or cost state s for each price p. This implies that expected profit for any distribution over states equals profit evaluated at the expected state: E[π(p,s)] = π(p, E[s]). As a consequence, any disclosure policy is equivalent, from a profit standpoint, to choosing a distribution G of the firms’ posterior mean beliefs over s, and G must be a mean-preserving contraction of the prior F (by Blackwell 1953). The assumption is satisfied for binary states, linear demand with unknown intercept, and linear demand with unknown cost.

Q: What is the key reduction result (Lemma 1) and what does it achieve? A: Lemma 1 reduces the problem of finding an optimal repeated-game equilibrium to a static information design problem. Optimal collusive profit equals V*, the greatest fixed point of V = max_{G ∈ MPC(F)} E_G[min{π^m(s), δV/((1−δ)(n−1))}], and this is attained by a symmetric, stationary, grim-trigger equilibrium. The reduction works because, under Bertrand competition, static deviation gains are proportional to on-path payoffs, creating a one-to-one correspondence that allows the repeated-game constraint to be folded into a single-period objective.

Q: Why is upper censorship the uniquely optimal disclosure policy? A: The static information design problem has a “capped monopoly profit” objective: min{π^m(s), π^max}, where π^max = δV*/((1−δ)(n−1)) is the maximum per-period profit that satisfies incentive constraints. Because π^m(s) is convex (as the maximum of affine functions) and the cap π^max is constant, the overall objective is convex for s below the cap and constant (then concave) above it — i.e., convex-then-concave in s. Classical results for linear information design (Kolotilin 2018; Dworczak and Martini 2019) imply that the unique optimal policy for a convex-then-concave objective is upper censorship.

Q: What is the supra-monopoly pricing result and why does it arise? A: For demand states s ∈ (ŝ, s*), the equilibrium price is p^m(s*) > p^m(s), meaning firms charge above the monopoly price for the current state. This arises because the pooling policy must recommend a single price for all states s ≥ ŝ, and the recommended price is p^m(s*) where s* = E[s | s ≥ ŝ]. At intermediate states s ∈ (ŝ, s*), this price exceeds the local monopoly price. The algorithm accepts lower profit at these states because it is necessary to maintain the pooled recommendation at higher states where monopoly pricing would otherwise require a price cut.

Q: How does optimal disclosure compare to full disclosure in terms of consumer surplus? A: Proposition 1 shows that collusive prices under optimal disclosure are strictly higher at every demand state compared to full disclosure (Rotemberg–Saloner). In Rotemberg–Saloner, high demand states trigger price cuts (“price wars during booms”) to deter deviation; under optimal disclosure, high states are pooled and prices are instead rigid at p^m(s*). Because prices are higher at all states, consumer surplus is unambiguously lower under optimal disclosure.

Q: What does Proposition 3 say about the effect of algorithmic accuracy on consumer surplus? A: Proposition 3 states that if consumer surplus under monopoly pricing, CS(s), is concave in s, then a mean-preserving spread of F (i.e., improved algorithmic accuracy) reduces expected consumer surplus. This result is more pessimistic than prior work by Sugaya–Wolitzky (2018) and Miklos-Thal–Tucker (2019), which found ambiguous effects. The difference is that those papers assumed full disclosure, so better accuracy tightened incentive constraints and sometimes forced price cuts. Under optimal selective disclosure, a more accurate algorithm always raises average prices because the algorithm withholds information that would have forced price cuts.

Q: What are the comparative statics with respect to the number of firms and the discount factor? A: Proposition 2 establishes that a decrease in the number of firms n or an increase in the discount factor δ increases collusive profit V* and makes collusive prices more flexible (raises ŝ). The intuition for fewer firms making prices more flexible is that with fewer firms, incentive constraints bind for a narrower range of demand states, so less pooling is needed. Collusion is impossible if and only if δ < (n−1)/n, the same threshold as under full disclosure.

Q: How does the model generate empirically testable predictions distinct from other collusion models? A: The model predicts: (1) the equilibrium price distribution has support on an interval [p^m(s_bar), p^m(ŝ)] plus a single mass point at the higher price p^m(s*); (2) prices are pro-cyclical overall but rigidly fixed at p^m(s*) for all but the lowest demand states; (3) the gap p^m(s) − p(s) is non-monotone — zero at low states, negative (supra-monopoly) at intermediate states, and positive at high states; (4) prices are more flexible when firms are more patient or fewer. The rigid high price combined with a flexible interval of lower prices is described as a distinctive collusive marker not present in other models.

Q: How does the model relate to the empirical literature testing Green–Porter versus Rotemberg–Saloner? A: Rotemberg–Saloner predicts counter-cyclical prices (price wars during booms), while Green–Porter predicts pro-cyclical prices. Empirical tests (e.g., Porter 1983, Ellison 1994) have typically found pro-cyclical prices, favoring Green–Porter. The present model generates pro-cyclical prices through a different mechanism — perfect monitoring plus selectively disclosed demand information — showing that pro-cyclical prices are consistent with perfect monitoring when the information intermediary optimally pools high demand states. The paper suggests that distinguishing the theories requires estimating the gap between price and monopoly price over the cycle: under Green–Porter, collusion succeeds better in high demand states; under this model, collusion succeeds better in low demand states.

Q: What narrative evidence from the RealPage case corroborates the model’s predictions? A: The US DOJ complaint against RealPage states that “in down markets… [RealPage] instills pricing discipline in landlords, curbing normal fully independent competitive reactions by substituting them with interdependent decision-making,” and that RealPage advertised that its AI helps clients “avoid the race to the bottom in down markets.” This is consistent with the model’s prediction of flexible monopoly prices at low demand states and a rigid, supra-monopolistic price in normal times. The Kumatori Contractors Cooperative case (studied by Kawai, Nakabayashi, and Ortner 2024) corroborates the censorship result: that organization took drastic steps to limit bidders’ information about costs on the largest projects — exactly the states where deviation is most tempting.

Q: How do results change with persistent (Markov) demand? A: Theorem 2 shows that upper censorship remains uniquely optimal with Markov demand, but the cutoff ŝ(s) now depends on last-period demand s. Under positive serial correlation, ŝ(s) is decreasing in s: the algorithm discloses less information after high demand because firms are more optimistic and thus more tempted to deviate. Under negative serial correlation, ŝ(s) is increasing. The optimal collusive price is no longer always equal to the monopoly price for the disclosed mean demand, and the expected price conditional on last-period demand can be countercyclical (similar to Rotemberg–Saloner), even though the current-period price is always monotone in current demand.

Q: How does the optimal disclosure policy change with differentiated products? A: With a symmetric linear demand system (Section 5, Theorem 3), the optimal policy censors an intermediate interval [ŝ_L, ŝ_H] and discloses both the lowest and the highest demand states. At high demand states s > ŝ_H, the algorithm discloses the state and recommends a price below monopoly (to satisfy incentive constraints), because with differentiated goods there is no upper bound on equilibrium profit and profit is convex in s at high states, making disclosure with price-cutting optimal. Mathematically, the capped monopoly profit is piecewise-convex rather than convex-then-concave, so the optimal policy is intermediate-interval censorship rather than upper censorship. The Appendix A version extends to general demand systems and capacity constraints with the same qualitative logic.

Q: What are the main limitations and directions for future work acknowledged by the authors? A: The paper identifies three main limitations. First, if profit is not affine in s (i.e., expected profit depends on more than the mean state), the information design problem becomes non-linear and upper censorship is typically suboptimal, though it remains approximately optimal when the problem is close to linear. Second, the model assumes the algorithm’s objective is to maximize industry profit; if the intermediary is a profit-maximizing seller of software (as in Harrington 2022), the objective may instead be to maximize the profit differential between adopters and non-adopters. Third, the model assumes all firms use the algorithm; allowing partial adoption would require modeling firms’ incentives to subscribe. The paper notes that incorporating these considerations “could be an interesting direction for future research.”

Upper Censorship (disclosure policy): A disclosure policy in which demand states below a cutoff ŝ are revealed to firms (along with the corresponding monopoly price recommendation), while states above ŝ are pooled — only the event {s ≥ ŝ} is disclosed — with a single monopoly price recommendation p^m(s*) for the mean concealed state s* = E[s | s ≥ ŝ]. This is the uniquely optimal disclosure policy in the baseline model.

Capped Monopoly Profit: The per-period profit objective in the reduced static information design problem: min{π^m(s), π^max}, where π^max = δV*/((1−δ)(n−1)) is the maximum industry profit attainable in a single period without violating incentive constraints. This function is convex-then-concave in s, which drives the optimality of upper censorship.

Supra-Monopoly Pricing: Equilibrium prices that exceed the monopoly price for the realized demand state. In the model, this occurs for states s ∈ (ŝ, s*), where the algorithm’s pooled recommendation p^m(s*) is above the local monopoly price p^m(s). It arises because the pooled recommendation must be incentive-compatible at the highest concealed states.

Price Rigidity: The feature of the optimal equilibrium in which the collusive price is constant at p^m(s*) for all demand states s ≥ ŝ. The algorithm achieves this by withholding information about high demand states, preventing the “price wars during booms” predicted by Rotemberg–Saloner (1986) under full disclosure.

Algorithmic Accuracy: In the paper’s terms, the informativeness of the algorithm’s signal about s, formalized as the precision of the distribution F. Improving accuracy corresponds to a mean-preserving spread of F (Blackwell 1953). A more accurate algorithm always increases collusive profit; under the concavity condition on consumer surplus, it also reduces expected consumer surplus.

Mean-Preserving Contraction (MPC(F)): The set of distributions G of firms’ posterior mean beliefs over s that are consistent with Bayesian updating of the prior F. By Blackwell (1953), a disclosure policy is feasible if and only if it induces a distribution G ∈ MPC(F). This is the feasibility constraint in the static information design problem.

Affinity in the state: The assumption that π(p, s) is affine (linearly increasing) in s for each price p. This implies E[π(p,s)] = π(p, E[s]), so expected profit is determined entirely by the expected state, enabling the reduction of the disclosure problem to choosing a distribution of posterior means.

Competing under Information Heterogeneity: Evidence from Auto Insurance

Mon, 01 Jan 0001 00:00:00 +0000

This paper studies imperfect competition in selection markets where competing firms have heterogeneous information about consumers — a layer of asymmetry distinct from the classic buyer-seller information gap. The central questions are: how do inter-firm information asymmetries shape equilibrium pricing, consumer sorting, and market efficiency; and whether a centralized bureau that aggregates and equalizes firms’ risk information can promote competition and improve welfare.

The empirical setting is the Italian mandatory motor vehicle liability insurance market (Responsabilità Civile Auto). The authors use the IPER dataset from IVASS, a nationally representative panel of matched insurer-insuree contracts covering 124,428 liability insurance contracts for new customers in the province of Rome from 2013 to 2021. The panel tracks consumers across insurer switches, enabling construction of individual-specific risk estimates from ex-post claim records using Poisson regressions for claim frequency and log-normal regressions for claim severity. The analysis focuses on the top 10 largest firms plus a composite fringe firm.

The paper’s empirical strategy proceeds in three stages. First, individual risk types are estimated from multi-year claim panels. Second, demand parameters — price sensitivity and firm-level unobserved product attributes — are recovered using a novel fixed-point algorithm (extending Berry et al. 1995) that infers the full offered-price distribution from observed transaction prices alone, without parametric restrictions on price distributions across firms. Third, supply-side parameters — pricing coefficients, signal variances, and cost parameters — are identified by exploiting the monotone mapping between offered prices and private signals, borrowing from the nonparametric auction literature.

The model features firms that each draw a private Gaussian signal about a consumer’s true risk type theta, with firm-specific signal standard deviation sigma_j. Lower sigma_j means higher information precision. Firms set prices as a linear function of their posterior risk rating: p_j = alpha_j + beta_j * E(theta | theta_j, D=j). Firms simultaneously choose pricing coefficients to maximize expected profits.

Key empirical findings: (1) Firms differ substantially in how sensitively their premiums respond to realized consumer risk — a reduced-form measure of information precision — with Figure 2 showing wide cross-firm variation in premium-to-risk coefficients. (2) Structural estimation confirms substantial heterogeneity in signal standard deviations sigma_j across all 11 firms. Firms with less accurate risk-rating algorithms (higher sigma_j) tend to have more efficient cost structures (lower claim-processing cost parameter k_j), generating distinct comparative advantages. (3) Baseline pricing coefficients alpha_j and risk-sensitivity coefficients beta_j vary dramatically across firms. (4) Senior drivers are less price sensitive; urban drivers are more price sensitive. Lower-risk consumers show stronger preferences for Firms 3 and 5, while higher-risk consumers disproportionately choose Firm 8.

Counterfactual simulations assess three information policies relative to the baseline. Under a centralized risk bureau — which collects each firm’s signal, aggregates them weighted by precision, and distributes the combined signal equally — average premiums fall by 21.6% and consumer surplus rises by 15.7%. The efficiency benchmark (firms observe true risk perfectly) yields a 25.7% premium reduction and a 16.9% consumer surplus gain, so the bureau recovers almost all the efficiency gap. The privacy benchmark (all firms restricted to the coarsest signal in the market) raises surplus for high-risk consumers by 6.9% but harms low-risk consumers.

The bureau’s price reduction operates through two channels: it eliminates the market power that accrues to firms with superior private information, and it aligns firms’ risk evaluations, enabling sharper undercutting. The bureau also reduces average costs by 12 euros per contract by enabling more efficient insurer-insuree matching — cost-efficient claim processors can better target the consumer types they have a comparative advantage in serving.

The analysis is confined to new customers in Rome’s provincial market to avoid complications from dynamic pricing and consumer-firm learning. The model abstracts away from optional contract clauses (treated as observable characteristics) and does not model the specific mechanisms generating information heterogeneity.

Q: What is the paper’s core research question? A: The paper asks how information asymmetries between competing firms (not just between buyers and sellers) shape equilibrium pricing strategies, consumer sorting, and market efficiency in a selection market, and whether a centralized bureau that equalizes firms’ access to aggregated risk information can improve competition and welfare. This extends the classic Akerlof-Rothschild-Stiglitz framework by introducing a second layer of asymmetry — across sellers themselves.

Q: Why is the Italian auto insurance market well suited for this study? A: Italy mandates liability insurance for all drivers and prohibits rejections, so the analysis focuses entirely on how consumers sort across insurers rather than on participation margins. The IPER dataset from IVASS is a nationally representative panel tracking policyholders even across insurer switches, providing both premium and ex-post claim records needed to construct individual risk types. The market has roughly 50 competing firms using demonstrably heterogeneous pricing algorithms, documented through a survey of major insurers and reduced-form regressions.

Q: How do the authors measure firm-level information precision in the reduced-form analysis? A: They estimate individual-specific risk types from a panel of claim records using Poisson regressions (claim frequency) and log-normal regressions (claim severity), then regress each firm’s premiums on those estimated risk measures. Firms whose premiums respond more sensitively to realized risk are inferred to have higher information precision. Figure 2 shows that these premium-to-risk coefficients vary significantly across firms — for example, Firm 7’s premiums are considerably more sensitive to risk than Firm 8’s — providing reduced-form evidence of heterogeneous information precision before any structural estimation.

Q: What is the structural model’s signal structure? A: Each firm j draws a private signal theta_j ~ N(theta, sigma_j^2) about a consumer’s true risk type theta, where sigma_j is the firm-specific signal standard deviation. A smaller sigma_j means higher precision. Signals are independent across firms conditional on theta, analogous to common-value auctions where firms receive noisy estimates of a shared unknown value (expected claim payouts). The parameter sigma_j is the key structural object the paper identifies and estimates.

Q: What is novel about the demand estimation strategy? A: Standard demand estimation assumes the same price is offered to all consumers or that the full price menu is observed. Here, only transaction prices are observed — the prices of unchosen insurers are not in the data. The authors apply the Wu and Xin (2024) fixed-point algorithm, which jointly estimates consumers’ sorting probabilities, offered price distributions, and demand parameters by adding an outer loop over sorting propensities to the Berry (1994) contraction mapping. No parametric restrictions are imposed on the offered price distributions, and they are allowed to vary fully across firms.

Q: How are firms’ signal variances identified separately from pricing coefficients? A: There is a one-to-one mapping between a firm’s offered price and its signal (prices increase monotonically in the signal, analogous to bids in auctions). After recovering the offered price distribution from the demand step, the authors observe price dispersion at a fixed risk level. By focusing on average prices conditional on each risk level, signal noise averages out, identifying the pricing coefficients beta_j. The residual price dispersion at fixed risk then identifies signal variance sigma_j^2.

Q: What does structural estimation reveal about the relationship between information precision and cost efficiency? A: Firms with higher signal standard deviations (less precise risk evaluation) tend to have lower claim-processing cost parameters k_j — they are more efficient at handling claims. This creates distinct comparative advantages: some firms excel at risk identification but face higher processing costs, while others process claims cheaply but evaluate risk less precisely. This heterogeneity means information-equalizing policies have differentiated firm-level impacts.

Q: What are the quantitative effects of the centralized risk bureau on premiums and consumer surplus? A: The bureau reduces average premiums by 21.6% relative to baseline and increases consumer surplus by 15.7%. The efficiency benchmark — where firms observe consumers’ true risk perfectly — produces a 25.7% premium reduction and a 16.9% consumer surplus gain. The bureau therefore closes nearly all of the gap to the first-best allocation in surplus terms (15.7% vs. 16.9%).

Q: Through what mechanisms does the bureau reduce prices? A: Two distinct channels are identified. First, equalizing information precision eliminates the informational market power held by firms with superior signals, compelling them to compete more aggressively on price. Second, when all firms share the same risk evaluation of a consumer, they can undercut each other more precisely, which intensifies price competition further. Both channels operate simultaneously under the bureau.

Q: How does the bureau affect consumer surplus distribution across risk types? A: The bureau primarily benefits low-risk consumers because improved information allows firms to price discriminate more accurately on risk type, lowering prices for those who are low risk. High-risk consumers see smaller benefits and may face relatively higher premiums. This contrasts with the privacy benchmark, where restricting all firms to the coarsest signal in the market raises high-risk consumers’ surplus by 6.9% — because it becomes harder for firms to distinguish them from low-risk consumers.

Q: What is the cost efficiency effect of the bureau? A: Under the centralized risk bureau, average costs per contract fall by 12 euros. This reflects more efficient insurer-insuree matching: when firms have equal and better information, those with cost advantages in claims processing can better identify and attract the consumer types they are relatively best equipped to serve. The authors note that given the scale of the Italian auto insurance market (approximately 31 million contracts annually), this per-contract saving implies a substantial aggregate impact.

Q: What happens to firm profits under the bureau, and is the impact uniform? A: Average profits decline overall due to lower prices. However, the impact is heterogeneous across firms. Firms that rely most heavily on superior information precision — often smaller, more specialized firms — experience greater profit losses, since the bureau most directly erodes their competitive advantage.

Q: How does the privacy benchmark differ from the bureau scenario? A: The privacy benchmark simulates a regulation that restricts all firms to using only basic consumer information, setting signal variance to the highest level observed in the market. Unlike the bureau (which improves and equalizes information), this benchmark degrades information uniformly. It produces opposite distributional effects: high-risk consumers gain 6.9% in surplus as cross-subsidization from low-risk to high-risk consumers increases, while low-risk consumers are worse off.

Q: Why does the paper focus on new customers only? A: Focusing on new customers avoids complications from dynamic pricing, where insurers update premiums based on accumulated claim history with a specific consumer, and from consumer-firm learning dynamics. This follows standard practice in the empirical asymmetric information literature, as cited in Chiappori and Salanie (2000) and Crawford et al. (2018).

Q: How does this paper relate to and extend prior work on selection markets? A: Prior empirical work on imperfect competition in selection markets — including Einav et al. (2010), Crawford et al. (2018), and related studies — assumes that competing firms have symmetric information about consumers. This paper is described as introducing the first tractable empirical framework for analyzing selection markets where firms have heterogeneous information. It also incorporates multidimensional cost heterogeneity on the supply side, adding to work by Salanié (2017) and Nelson (2025).

Q: What do the reduced-form regressions reveal about pricing heterogeneity across insurers? A: Firm-level regressions of premiums on observable risk factors show R-squared values ranging from 0.39 to 0.59. Estimated coefficients on key risk factors vary dramatically: being one year older reduces premiums by 0.25 to 1.68 euros depending on the firm; a higher bonus-malus class increases premiums by 12 to 32 euros; one additional accident in the previous five years raises premiums by 74 to 181 euros. These ranges reflect genuine differences in actuarial algorithms, not just sampling variation.

Q: What is the bonus-malus system and why does its saturation matter for the paper’s setting? A: Italy’s bonus-malus (BM) system assigns drivers to one of 18 risk classes based on accident history. Because approximately 80% of policyholders are in the best class (BM class 1), the public BM system provides limited granularity for risk evaluation. This saturation creates strong incentives for firms to develop proprietary risk-rating algorithms, which is the institutional basis for the substantial information heterogeneity that the paper documents and models.

Information Precision (sigma_j): In the paper’s model, the firm-specific parameter measuring the dispersion of a firm’s private signal about a consumer’s true risk type. Firm j draws signal theta_j ~ N(theta, sigma_j^2); 1/sigma_j is information precision. A smaller sigma_j means the firm more accurately identifies consumer risk. This is not merely a theoretical construct — the paper identifies and estimates sigma_j structurally for each of the 11 firms.

Heterogeneous Information: The condition where competing firms hold signals of different precision about the same consumer’s unobserved risk type, introducing asymmetry not just between buyers and sellers (as in Akerlof 1970) but among sellers themselves. This is the paper’s central departure from prior literature on selection markets, which assumed symmetric information among firms.

Centralized Risk Bureau: A policy institution that collects each firm’s analyzed risk signal, aggregates them weighted by each firm’s information precision (producing a combined signal more precise than any individual firm’s signal), and makes the aggregated information equally accessible to all firms. The bureau is the paper’s primary policy counterfactual, and it is modeled as equalizing both the level and heterogeneity of information precision across competitors.

Offered vs. Accepted Price Distribution: A distinction central to the paper’s identification strategy. The accepted price distribution is what is observed in transaction data — prices conditional on the consumer having chosen that firm. The offered price distribution is the full set of prices the firm would charge across all consumers, including those who did not select it. The paper recovers the offered distribution from the accepted distribution using a fixed-point algorithm, without imposing parametric restrictions.

Selection Loop: The paper’s methodological extension of the Berry (1994) BLP contraction mapping for mean utilities. An outer loop iterates over consumers’ sorting propensities to jointly recover offered price distributions, sorting probabilities, and demand parameters when only transaction prices are observed. This technique handles the endogeneity of which prices are accepted.

Risk Rating: The firm’s posterior assessment of a consumer’s expected cost, computed as the posterior mean E(theta | theta_j, D=j) — the expected true risk type conditional on the firm’s private signal and the consumer selecting that firm. Firms set prices as a linear function of their risk rating: p_j = alpha_j + beta_j * E(theta | theta_j, D=j).

Comparative Advantage (information vs. cost): The paper’s finding that firms with lower information precision (higher sigma_j) tend to have more efficient cost structures (lower k_j), and vice versa. This cross-sectional negative correlation between information advantage and cost advantage means that policy interventions that equalize information precision shift the basis of competition from information asymmetry to cost specialization.

Contract Terms, Employment Shocks, and Default in Credit Cards

Mon, 01 Jan 0001 00:00:00 +0000

Layer 1 — Overview

Research Question

This paper asks two related questions bearing on financial inclusion policy in developing countries: (1) How effective are credit card contract term changes — specifically interest rate reductions and minimum payment increases — in limiting default among new borrowers? (2) How large is the effect of formal-sector job loss on default relative to these contract term interventions, and can the difference in magnitudes be explained by differential cash flow impacts?

Setting and Data

The study is set in Mexico during 2007–2009 and exploits a large nationwide stratified randomized controlled trial implemented by a major commercial bank (“Bank A”) on its financial-inclusion credit card — a product that accounted for approximately 15% of all first-time formal-sector loans in Mexico as of 2010. The study card was targeted at borrowers with limited or no formal credit history (the bank’s “C, C- and D” customer segments); 47% of the experimental sample held it as their first formal loan product. A sample of 144,000 pre-existing cardholders was stratified into nine cells based on bank tenure (6–11 months, 12–23 months, 24+ months) and past repayment behavior, then randomly allocated to eight treatment arms combining two minimum payment levels (5% or 10% of the outstanding balance) and four annual interest rates (15%, 25%, 35%, 45%), for 26 months (March 2007 to May 2009). The study sample is representative of the bank’s national portfolio of approximately 1.3 million study card customers. Card-level data run through December 2014 — five years after the experiment ended — allowing examination of both short- and long-run effects. The experimental sample is matched to Mexico’s Social Security database (IMSS), providing monthly formal employment histories from January 2004 to December 2012 for 59% of the sample; and to credit bureau data, allowing observation of defaults across all formal financial institutions.

Main Findings with Quantitative Magnitudes

Result 1 — Interest rate effects are modest in aggregate. A 30 percentage point (pp) decrease in the annual interest rate (from 45% to 15%, a 67% reduction relative to the baseline rate) decreased cumulative default by 2.5 pp over the 26-month experiment, for a default elasticity of +0.20. Over the same 18-month horizon used for unemployment comparisons, the implied effect is 1.03 pp. These magnitudes are substantially smaller than predictions elicited from Mexican central bank regulators (mean predicted decrease: 8.6 pp) and from participants on the Social Science Prediction Platform (mean predicted decrease: 5 pp). Default continued to decline in the lower-rate arm for approximately three years after the experiment ended, reaching −1 pp by March 2012, after which effects became statistically indistinguishable from zero.

Result 2 — No effect on the newest borrowers. For the newest borrowers (those with 6–11 months of tenure when the experiment began — the group with a 36% cumulative default rate over 26 months versus 18% for those with 24+ months of tenure), the interest rate reduction has no effect on default over the 26-month period, with point estimates consistently small and statistically indistinguishable from zero. This is in contrast to older borrowers, who are meaningfully responsive.

Result 3 — Minimum payment increases increase short-run default but reduce long-run default. Doubling the minimum payment from 5% to 10% of outstanding balance increased cumulative default by 0.8 pp by the end of the experiment (26-month elasticity: +0.04; p = 0.016), driven primarily by defaults occurring within the first year. The short-run increase is concentrated among the most liquidity-constrained borrowers — those with the highest baseline debt utilization and those in the minimum-payer stratum (baseline debt utilization rate of 85%). After the experiment ended and all arms were returned to the same 4% minimum payment, the previously higher-minimum-payment arm exhibited persistently lower default, reaching a 1 pp decline by the end of the sample (p = 0.054 at end of study period), relative to a base default rate of 41% at that point.

Result 4 — Job displacement effects are seven times larger than contract term effects. Formal-sector job displacement (identified using mass layoff events at firms with 50+ employees, defined as year-on-year employment contractions exceeding 30% of prior-year average employment) increased cumulative default by 4.8 pp after 12 months and 7.6 pp after 18 months. This is seven times larger than the effect of a 30 pp interest rate decrease (1.03 pp over 18 months) and nine times larger than the effect of doubling minimum payments (0.8 pp). Formal job loss alone can explain approximately 14% of total study card default during the experiment (calculation: 19.8% of formally employed study card borrowers lose their job at least once in the first 18 months; multiplied by the 7.6 pp default increase per spell, this yields 1.5 pp of the 10.8% base default rate at 18 months). Results are corroborated using a nationally representative matched credit bureau–IMSS sample of 600,339 borrowers, which yields 8,723 mass layoff events and similar estimates.

Per-peso normalization. A back-of-the-envelope calculation normalizes all three shocks by their respective cash flow impacts. The interest rate decrease reduces cumulative required minimum payments due by 2,917 MXN pesos over 18 months; the minimum payment doubling increases them by 1,325 MXN pesos; formal job loss reduces total labor earnings by an estimated 21,328 MXN pesos (adjusting formal-sector earnings losses of 77,555 MXN pesos downward by 72.5% to reflect that 82% of workers who lose formal employment transition to informal employment in the following quarter, with total earnings falling only 27.5%). The per-peso default effects are: 0.36 pp per 1,000 MXN pesos for the interest rate intervention; 0.51 pp for the minimum payment intervention; and 0.36 pp for job displacement. The null hypothesis that all three per-peso effects are equal cannot be rejected (p = 0.78).

Interpretation

The authors present a simple two-period optimizing model emphasizing the role of previously accumulated debt and liquidity constraints. The model generates four testable predictions consistent with the data: (1) lower interest rates decrease default via reduced debt burden; (2) higher minimum payments increase short-run default by tightening liquidity constraints; (3) “surprise” minimum payment increases (where borrowers anticipated they would continue) reduce post-experiment default via debt reduction; (4) negative income shocks (modeled as first-order stochastic dominance deterioration in period-2 income) increase default. The per-peso normalization supports the interpretation that cash flow impacts — not differential per-peso susceptibility to shocks — drive the relative magnitudes of the three effects.

Layer 2 — Q&A

Q1: Why is the interest rate elasticity of default (0.20) so much lower than prior estimates in the literature?

A: The paper contrasts its 26-month elasticity of +0.20 with estimates from Karlan and Zinman (2019) (1.8) and Adams et al. (2009) (2.2), and notes it falls in the same range as Karlan and Zinman (2009) (0.27) and DeFusco et al. (2021) (0.01). The paper proposes that variation in borrower tenure may partly explain cross-study differences, as default elasticities appear to be increasing in bank tenure. The newest borrowers — the most policy-relevant subgroup — show zero elasticity, pulling the overall estimate down. The paper also argues that in this context, interest-rate-driven moral hazard (all channels: debt burden, concurrent, and dynamic) is collectively small.

Q2: What mechanism explains why newer borrowers are entirely unresponsive to interest rate changes?

A: The paper hypothesizes that newer borrowers place a higher continuation value on the card (captured by parameter v in the model) because they have fewer formal credit alternatives; at baseline, only 64% of the 6–11 month stratum held a card with another bank versus 78% of the 24+ month stratum. A higher continuation value implies more muted responses to interest rate changes (formally derived in Appendix E.3). Newer borrowers also respond more strongly to credit limit increases, consistent with tighter liquidity constraints. A regression controlling for age, gender, baseline card ownership, debt utilization, labor force attachment, and earnings cannot explain away the differential treatment effect between new and old borrowers (differential remains significant at p = 0.05), suggesting the tenure gradient in responsiveness is not simply a composition effect.

Q3: Why does increasing minimum payments raise short-run default but reduce long-run default?

A: In the short run, the doubling of minimum payments tightens liquidity constraints for already-constrained borrowers. The increase in default is concentrated among borrowers in the highest baseline debt-utilization tercile and among minimum-payers (baseline debt utilization of 85%), and is preceded by a sharp rise in delinquencies in months 3–5 (which trigger 350 MXN peso fees per occurrence, further worsening the repayment burden). In the long run, borrowers who anticipated continuing higher minimum payments (the experiment ended without advance notice, so borrowers expected the new terms to persist) chose lower debt levels during the experiment. Since all arms were returned to the same low minimum payment when the experiment ended, the lower-debt borrowers in the higher-minimum-payment arm were better positioned to weather subsequent shocks, producing the 1 pp post-experiment decline in default. The hypothesis that this is driven by habit formation in payment behavior is ruled out by the absence of any effect of past higher minimum payments on post-experimental payment levels.

Q4: How is the mass-layoff identification strategy designed and validated?

A: The paper uses the universe of IMSS formal employment records to define a mass layoff at a firm (50+ employees) as the first month in which year-on-year employment declines by more than 30% of average employment in the prior 12 months. An individual is “displaced” if they lost their job in the same quarter as their employer’s mass layoff event. The identification assumption is that, conditional on individual and time fixed effects, the exact timing of the mass layoff is uncorrelated with workers’ potential default outcomes. This is supported by: (1) mass layoffs occurring in every period, making coincidence with credit market shocks unlikely; (2) time fixed effects absorbing common trends; and (3) the absence of statistically distinguishable pre-trends in default between displaced and non-displaced workers. The paper implements both standard two-way fixed effects and the staggered DiD estimator of de Chaisemartin and D’Haultfoeuille (2024), which remains valid under heterogeneous and dynamic effects, and the results are similar across methods.

Q5: How does the paper account for informal employment when estimating the cash flow impact of job loss?

A: Formal-sector earnings losses over 18 months post-displacement are estimated at 77,555 MXN pesos using IMSS wage data in an event-study design paralleling the default equation. However, since more than 4/5 of workers who lose formal employment are informally employed in the following quarter (based on Mexico’s ENOE labor force survey panel), and total labor earnings fall by only an estimated 27.5% over the three post-displacement quarters, the paper scales the formal earnings loss down to 21,328 MXN pesos (≈ 0.275 × 77,555). This brings the estimated earnings loss closer to prior developed-country estimates of displacement costs and is treated as a lower bound relative to the raw formal-earnings loss figure.

Q6: Does the cost of default deter borrowers from defaulting, and what is the cost?

A: The paper argues that defaulters face substantial consequences. Using an instrumental variables strategy (treatment assignment as instrument for default on the study card), the probability of having a new loan one year after default is estimated to be 65 pp lower relative to the non-default counterfactual (p = 0.03). A selection-on-observables approach also shows that study card default is associated with the complete absence of any subsequent credit card for at least four years. These costs should provide strong incentives to remain current, making the high observed default rates primarily attributable to cash flow shocks rather than strategic default. The value of formal credit is further confirmed by the finding that a 100 MXN peso increase in the study card’s credit limit translates into 32 MXN pesos of additional debt (instrumental variable estimates are more than twice as large as OLS), and by the comparison of informal loan terms (annual rates averaging 291%, loan amounts of 3,658 MXN pesos, durations of 0.52 years) with formal loan terms (94 pp lower rates, 9,842 MXN peso average amounts, 1.07 year durations).

Q7: Are the default treatment effects different across the interest rate and minimum payment interventions, or do they interact?

A: The paper tests for and cannot reject separability between the two interventions at standard significance levels. At the end of the experiment (May 2009), the p-value for the null that the minimum payment effect is constant across interest rate arms is 0.44; five years later it is 0.65. The null that the interest rate effect is constant across both minimum payment arms yields p = 0.08 at end of experiment and p = 0.411 five years later. The fully saturated specification yields results indistinguishable from the parsimonious linear-separable specification.

Q8: Are there spillover effects from the contract term changes onto other loans held by study participants?

A: No spillover effects on default on other loans are found, either during the experiment or after it ended, based on credit bureau data covering all formal-sector loans held by the experimental sample. There is also no evidence of crowd-out or crowd-in from other lenders in terms of new loans or loan closures. The only minor exception is a small decrease in default (3%, or approximately 2 pp out of a 61 pp base) on other Bank A loans in the high minimum payment arm.

Q9: Why does the effect of unemployment on default exceed the model’s predictions from cash flow alone?

A: The paper’s back-of-the-envelope normalization finds that the per-peso effects of all three shocks on default are statistically indistinguishable (p = 0.78 for the null that all three λ estimates are equal), with point estimates of λ_IR = 0.36, λ_MP = 0.51, and λ_U = 0.36 pp per 1,000 MXN pesos. This implies that job loss does not have a larger per-peso effect on default than contract term changes; the larger absolute effect of displacement arises entirely from its larger cash flow impact. Additional consequences of job loss beyond cash flow (health, mental health) do not appear to generate additional default beyond what can be attributed to income loss.

Q10: How do the experimental results compare to what experts predicted?

A: Expert predictions were systematically too large. Mexican central bank regulators predicted a mean decrease of 8.6 pp from a 30 pp interest rate reduction at the 18-month horizon, versus the actual estimated effect of 1.03 pp. Social Science Prediction Platform respondents predicted a mean decrease of 5 pp. For minimum payments, regulators on average predicted a 0.4 pp decrease in default from doubling the minimum payment, whereas the actual effect was a 0.8 pp increase. Three-quarters of SSPP respondents correctly predicted the sign of the minimum payment effect (an increase in default), but the predicted mean increase was 6.4 pp, far larger than the estimated 0.8 pp.

Q11: Do the job displacement results generalize beyond the experimental sample?

A: Yes. The paper repeats the displacement event study on the intersection of the nationally representative credit bureau sample (approximately 600,339 individuals with both credit information and employment histories) with the universe of IMSS data for October 2011–March 2014, yielding 8,723 mass layoff events. This sample is representative of the population of Mexican borrowers with formal employment histories, and the estimated effects on default for any loan in the credit bureau are similar in magnitude to the experimental-sample results, providing a measure of external validity.

Q12: What do the debt dynamics during the experiment reveal about the mechanisms for interest rate effects on default?

A: The data show that purchases (net of payments) increase in response to interest rate decreases, consistent with downward-sloping demand for credit; yet total debt declines in lower-rate arms. This is consistent with the model’s prediction that the mechanical compounding effect (lower rate applied to previously accumulated debt) exceeds the behavioral new-purchase response. Confirmed empirically: the debt elasticity to the interest rate is estimated to be positive, with preferred estimates in the range [+0.18, +0.54]. The decline in default is further concentrated among borrowers with the highest baseline debt utilization rates, those for whom the debt compounding effect is strongest — consistent with the debt channel as the primary mechanism.

Key Concepts

Cumulative Default Measure: Default is defined as three consecutive monthly payments each below the required minimum payment due, at which point Bank A automatically revokes the card. The outcome variable is coded as Yit = 1 if borrower i has defaulted in any month s ≤ t and 0 otherwise, making it a cumulative (absorbing) measure. This allows estimation on an unchanging sample, avoiding attrition biases that would arise from conditioning on not having defaulted in the prior period.

Minimum Payment Due (mpd): The paper uses the required minimum payment due to avoid delinquency as its central cash-flow normalization variable. This is a comprehensive measure that incorporates not only the contractually specified fraction of outstanding balance but also interest charges, fees, and endogenous borrower responses (changes in debt and purchases). It serves as the common denominator for benchmarking the cash flow impacts of the two contract term interventions and formal job loss against one another.

Free Cash Flow / Per-Peso Normalization (λ): The paper defines per-peso default effects (λ^IR, λ^MP, λ^U) by dividing each intervention’s average treatment effect on cumulative default (in percentage points) by the cumulative change in the minimum payment due (or equivalent cash flow impact) induced by that intervention over 18 months. The resulting ratio is expressed as percentage points of default per 1,000 MXN pesos of cash flow change. This normalization is explicitly not treated as an instrumental variable estimate; it is a descriptive back-of-the-envelope calculation intended to equate the scale of the three shocks.

Mass Layoff / Displacement: A mass layoff at the firm level is defined as the first month in which year-on-year firm employment declines by more than 30% of average employment in the prior 12 months, restricted to firms with 50+ employees. An individual worker is classified as displaced if they lost formal-sector employment in the same calendar quarter as their employer’s mass layoff event. This definition follows Jacobson et al. (1993) and subsequent literature and is used to isolate plausibly involuntary (exogenous) separations from voluntary quits or individually driven terminations.

Continuation Value (v): In the paper’s two-period optimizing model, v is the reduced-form utility parameter capturing future flow of card benefits, warm glow from card ownership, or the option value of retaining access to formal credit, experienced only if the card is not in default. The paper uses v to rationalize the zero interest-rate response of newer borrowers: ceteris paribus, higher v implies that borrowers will remain current on the card even when interest rates are high, because they value continued access. Higher v thus implies more muted responses to interest rate changes.

Bank Tenure Strata: Borrowers are stratified into three groups based on length of relationship with the study card: “new customers” (6–11 months), medium-term (12–23 months), and long-term (24+ months). Tenure is used both as a stratification variable for the experiment and as a primary dimension of heterogeneity in treatment effects, reflecting differing default rates (36% vs. 18% at 26 months), labor market vulnerability (1.34× higher job loss probability for new vs. long-term), and interest rate responsiveness (zero for new, significantly positive for long-term borrowers).

Debt Burden Channel vs. Concurrent Moral Hazard: The paper distinguishes three channels through which interest rate changes can affect default: (a) the debt burden channel — higher rates mechanically increase the stock of interest-accruing debt, making repayment harder; (b) concurrent moral hazard — higher current interest rates alter the incentive to default on existing obligations, holding debt constant; and (c) dynamic moral hazard — higher future interest rates reduce the benefit of remaining current. The paper’s finding of a modest total effect (elasticity 0.20) implies that the sum of all three channels is small in this context, with the debt burden channel being the primary driver of what effect does exist.

From Doubt to Devotion: Trials and Learning-Based Pricing

Mon, 01 Jan 0001 00:00:00 +0000

This paper studies a dynamic mechanism design problem in which an informed seller sells an experience good to a skeptical buyer who learns about the product through consumption. The central question is: how does a seller leverage proprietary data about product-buyer match quality together with the buyer’s ability to learn, and what are the welfare implications in equilibrium?

The model features a seller who privately observes a binary match quality (theta in {H, L}) between their service and the buyer. The buyer does not observe match quality and has an initially unknown private value v for the good, drawn from a Myerson-regular distribution F with support [v_low, v_high] and normalized mean E[v] = 1. If the match is high, the buyer receives instantaneous utility rewards according to a Poisson process with flow rate lambda*I, where I in [0,1] is the seller-controlled access level. Upon receiving the first reward, the buyer perfectly learns both match quality theta and their own value v. The seller commits to a dynamic mechanism over time horizon T = [0, T] specifying access and prices conditional on reported histories. Both parties are risk-neutral and there is no discounting in the baseline.

Two benchmark cases show the first-best is attainable absent both key features simultaneously. If trade is static (prices set only at time 0) or if the seller is uninformed about theta, the seller achieves first-best revenue of lambdamu_0T by selling the entire service upfront. Proposition 1 establishes both cases; this implies that consumer data on theta is not required for maximizing social welfare, and it is weakly dominant for a seller to never collect consumer data in static environments.

The central result is that the combination of dynamic pricing and seller private information breaks the first-best. A high-type seller can deviate by offering a “Myersonian free trial”: provide full access up to time tM (defined as argmax_t {(1 - exp(-lambdat))(T - t)}), then offer the remaining service at post-trial price lambdavM(T - tM), where vM is the Myerson monopoly price. The buyer accepts the trial regardless of beliefs (participation is weakly dominant) and purchases the post-trial service if and only if v >= vM. This deviation yields payoff pi_F = (1 - exp(-lambdatM))(1 - F(vM))lambdavM*(T - tM). Proposition 2 states that the first-best cannot be implemented in any equilibrium if and only if pi_F > lambdamu_0T. Corollary 1 shows this condition holds for sufficiently large T, since pi_F grows proportionally with T while the first-best also grows with T but the ratio converges to a constant less than 1 only for some parameter configurations and exceeds 1 for others.

Theorem 1 (the main mechanism design result) characterizes the boundary of the IC-IR feasible payoff set: any mechanism on this boundary is outcome-uniquely implemented by a trial mechanism, defined by a triple (v0, t0, p0) — a trial length, a post-trial value threshold, and a trial price. During [0, t0] uninformed buyers receive full access; after t0 only buyers who received a reward with v >= v0 continue at a premium. Trial length t0 is weakly increasing in the weight placed on the low-type seller and in the prior mu_0; post-trial threshold v0 is weakly decreasing in the same objects (Proposition 3).

Equilibrium payoffs (Proposition 5) are precisely the IC-IR feasible pairs satisfying pi_H >= pi_F, implemented by pooling trial mechanisms in which both seller types propose identical mechanisms and the buyer updates beliefs only through private consumption signals. Under the D1 refinement (Proposition 6), only mechanisms with trial length tM and post-trial threshold vM survive. These have the shortest trial and highest post-trial price of all equilibrium mechanisms, minimize social surplus, and may leave both seller types strictly worse off than in a world without private information — directly contrasting the static informed principal result of Koessler and Skreta (2016) where data always helps the seller.

When the seller can control service quality q in addition to access I (Section 6), the relevant equilibrium mechanisms become dynamic tiered pricing rather than binary trials: a low-quality, high-ad-load free tier provides learning opportunities while reducing information rents; convinced buyers upgrade to a premium ad-free tier. Counterintuitively, enriching the seller’s screening technology can reduce both revenue and social efficiency in equilibrium because additional instruments create additional signaling opportunities that distort outcomes further.

Q: What is the core tension that prevents the first-best from being an equilibrium?

A: When the seller is privately informed and pricing is dynamic, the high-type seller anticipates a greater likelihood of the buyer receiving a utility shock than the buyer’s own prior implies. This belief gap makes it profitable for the high-type seller to deviate from a proposed first-best mechanism by offering a free trial that “proves” high match quality and then extracting rent from convinced buyers. Because this deviation is profitable — yielding pi_F > lambdamu_0T under some parameters — the first-best pooling contract unravels. The interaction of both ingredients (dynamic pricing and informed seller) is necessary: either ingredient alone is insufficient to break the first-best (Proposition 1).

Q: What exactly is the Myersonian free trial and why does the buyer always accept it?

A: The Myersonian free trial provides full service access up to time tM = argmax_t {(1 - exp(-lambdat))(T - t)} at (approximately) zero price, then offers the remaining service at price lambdavM(T - tM) where vM is the Myerson monopoly price. The buyer accepts the trial regardless of their prior belief about match quality because the trial itself is free and provides non-negative payoff. After the trial, the buyer purchases the post-trial service if and only if they received a reward with v >= vM; otherwise they exit. The deviation payoff is pi_F = (1 - exp(-lambdatM))(1 - F(vM))lambdavM*(T - tM).

Q: Under what parametric conditions can the first-best not be supported in equilibrium?

A: By Proposition 2, the first-best cannot be implemented if and only if pi_F > lambdamu_0T. Corollary 1 states that for sufficiently large T this always fails, since as T grows, pi_F grows proportionally (the post-trial term (T - tM) dominates) while tM converges to a finite value. More precisely, for large T, pi_F / (lambdamu_0T) converges to (1 - exp(-lambda*tM)) * (1 - F(vM)) * vM / mu_0, which exceeds 1 under appropriate parameter configurations. Conversely, when mu_0 is high or the service horizon is short, the first-best may remain implementable.

Q: What is a trial mechanism and how does Theorem 1 characterize it?

A: A trial mechanism is defined by a triple (v0, t0, p0): uninformed buyers receive full access on [0, t0] and no access thereafter; a buyer who reports a reward of value v >= v0 at time t receives full service for the remainder [t, T] at a price increment of lambdav0(T - t0); the trial itself is priced at p0. Theorem 1 states that any payoff pair on the boundary of the IC-IR feasible set is outcome-uniquely attained by such a trial mechanism with appropriately determined (v0, t0, p0). The proof uses a relaxed problem retaining only two key constraint families: local incentive constraints on value reporting (IC-V) and a global intertemporal constraint preventing buyers from hiding the arrival of rewards forever (IC-U).

Q: How does the trial length respond to changes in prior belief mu_0 and distributional spread?

A: Proposition 3 states that t0 is weakly increasing in mu_0: as market belief becomes more optimistic, both seller types extract higher revenue from the trial, so the mechanism designer extends the trial. Proposition 4 adds that for a uniform distribution on [1-delta, 1+delta], trial length t0 is weakly increasing in delta (greater spread). The post-trial threshold v0 is weakly decreasing in mu_0, meaning that a more optimistic prior leads to a less exclusive post-trial cutoff.

Q: What are the equilibrium payoffs and how does the high-type seller’s free-trial option constrain them?

A: Proposition 5 states that (pi_L, pi_H) is an equilibrium payoff if and only if it lies in the IC-IR feasible set and pi_H >= pi_F. The lower bound pi_H >= pi_F reflects the high-type seller’s outside option: they can always deviate to the Myersonian free trial. Corollary 4 then shows that all “reasonable” equilibrium payoffs (those with pi_H >= pi_L, surviving a mild off-path refinement) are implemented by trial mechanisms with complete pooling — both seller types propose the same mechanism and the buyer updates beliefs only through private consumption signals, not the mechanism’s structure.

Q: What does the D1 refinement select and why do it lead to worse outcomes?

A: Proposition 6 shows that the only equilibrium trial mechanisms surviving the D1 criterion have trial length tM and post-trial threshold vM — the Myersonian free trial parameters. These have the shortest trial and highest post-trial price among all equilibrium mechanisms, resulting in the minimum social surplus. The intuition is that the high-type seller signals credibly by proposing mechanisms that generate high revenue from post-trial price discrimination (which the low type cannot profit from), pushing toward maximum learning-based discrimination. All D1-surviving payoffs are Pareto dominated by the point H (the unconstrained IC-IR optimum) for any prior mu_0, and Pareto dominated by point B when mu_0 is small.

Q: Can having consumer preference data hurt the seller, and under what conditions?

A: Yes. The distortion from signaling incentives can be so large that both seller types earn strictly less in the D1-surviving equilibrium than they would if neither possessed private information (where the first-best is attained). This result holds when the condition of Proposition 2 is satisfied — i.e., when pi_F > lambdamu_0T. This contrasts sharply with the static result of Koessler and Skreta (2016), in which the ex-ante profit-maximizing mechanism is always supportable in equilibrium and data always (weakly) helps sellers.

Q: How do trial mechanisms differ from the prior literature on signaling through introductory prices?

A: The earlier literature (Milgrom and Roberts 1986; Bagwell 1987; Bagwell and Riordan 1991; Judd and Riordan 1994) uses two-period models with no seller commitment, so all pricing behavior is necessarily trial-like by model restriction. The present model instead allows the seller full flexibility to design any dynamic mechanism — including selling everything ex-ante, which would prevent buyers from gaining information rent. Trials emerge endogenously as the equilibrium outcome rather than being imposed by the model structure, and the paper provides new economic content on what determines trial length and price thresholds.

Q: What happens when the seller controls service quality in addition to access?

A: Section 6 extends the baseline by allowing the seller to choose (I, q) from a subset of [0,1]^2, where I governs the Poisson arrival rate and q scales the reward value (utility from a reward is v*q). Theorem 2 shows that the relevant equilibrium mechanisms now take the form of dynamic tiered pricing: a low-quality tier (interpreted as high ad load) provides learning opportunities while reducing information rents; once convinced, buyers upgrade to a premium high-quality tier. Enriching the screening technology in this way can reduce both revenue and social efficiency in equilibrium, because additional instruments create additional signaling opportunities that distort outcomes further from the revenue-maximizing benchmark.

Q: What are the two sources of welfare loss relative to the first-best in D1-surviving equilibria?

A: The welfare analysis in Appendix F identifies two sources. First, exclusion inefficiency: buyers with values v in [v_low, vM) who would generate positive surplus are excluded from post-trial service. Second, service truncation inefficiency: service access is cut off after trial length tM for buyers who were never convinced (theta = L type realizations and high-type buyers with v < vM), reducing total surplus below the first-best of mu_0 * lambda * T. Both losses are minimized (welfare is maximized) among trial mechanisms by longer trials and lower post-trial cutoffs, precisely the opposite of what D1 selects.

Q: Does the model extend to continuous seller types or multiple buyer types?

A: Appendix K outlines an extension to continuous seller types theta drawn from a distribution G on [theta_low, theta_high], where rewards arrive at rate lambdaItheta. The main economic forces persist: higher seller types anticipate faster buyer learning and have stronger incentives to offer trials. The main results generalize: equilibrium mechanisms are trial mechanisms, and under D1, pooling equilibria with maximum post-trial discrimination are selected. Appendix G similarly notes that the multiple-buyer-type extension preserves complete pooling and the D1 selection result.

Q: What is the role of the “global intertemporal constraint” (IC-U) in the proof of Theorem 1?

A: The canonical approach to dynamic mechanism design (Eso and Szentes 2007; Pavan, Segal, and Toikka 2014) relaxes the problem to only local incentive constraints on the initial report. This fails here because the informed seller causes buyer and seller to disagree on the evolution of buyer beliefs, making the timing of trade matter and requiring tracking of incentive constraints at every point in time. The paper identifies two key binding constraints in the relaxed problem: (IC-V) the buyer does not misreport their reward value, and (IC-U) the buyer does not remain silent about the arrival of a reward forever. Retaining only these two constraint families yields a tractable bang-bang solution for the optimal access policy, which is then verified to satisfy all original IC-IR constraints.

Q: What are the implications for platform design and data collection strategy?

A: The results imply that the value of consumer data depends critically on market dynamics. In static markets, collecting data about consumer match quality is weakly beneficial for sellers (Proposition 1, first point). In dynamic markets with buyer learning and sufficiently long service horizons, the same data can strictly reduce seller revenue by enabling a deviation that unravels first-best pricing. This suggests platforms in dynamic digital markets should weigh whether possessing and acting on proprietary match data improves or worsens their equilibrium position, and that regulatory attention to consumer data collection in dynamic markets may have welfare-ambiguous effects.

Trial mechanism: A dynamic mechanism parameterized by (v0, t0, p0) in which the seller provides full service access during [0, t0] for uninformed buyers, offers continued service after t0 only to buyers who received a reward with value v >= v0, and charges a post-trial price of p0 + lambdav0(T - t0) for those who qualify. In the paper’s usage, this is the unique outcome-implementing mechanism on the boundary of the IC-IR feasible payoff set.

Myersonian free trial: The limiting trial mechanism as the trial price epsilon approaches zero, with trial length tM = argmax_t {(1 - exp(-lambdat))(T - t)} and post-trial threshold vM equal to the Myerson monopoly price. It yields payoff pi_F = (1 - exp(-lambdatM))(1 - F(vM))lambdavM*(T - tM) to the high-type seller, and constitutes the binding outside option constraining equilibrium payoffs.

Belief gap: The divergence between the seller’s and buyer’s beliefs about the rate at which the buyer will receive Poisson rewards. Because the high-type seller knows theta = H, they anticipate a higher probability of reward arrival than the buyer’s prior implies. This gap makes the buyer’s belief process non-martingale from the seller’s perspective, breaking the standard dynamic mechanism design approach and creating profitable deviation incentives.

IC-IR feasible payoff set: The set of seller payoff pairs (pi_L, pi_H) achievable by mechanisms satisfying both incentive compatibility (for seller type reports and buyer learning reports) and individual rationality (non-negative ex-ante payoffs for all parties). Theorem 1 establishes that the boundary of this set is uniquely implemented by trial mechanisms.

Dynamic tiered pricing: The equilibrium mechanism form that emerges when the seller controls both access I and service quality q. It features a low-quality tier (high ad load) providing learning opportunities at reduced information rent, and a premium tier offering full quality to buyers convinced of high match quality. This generalizes trial mechanisms to settings with richer screening technology.

Global intertemporal constraint (IC-U): The constraint requiring that, upon receiving a Poisson reward, the buyer finds it suboptimal to remain silent about its arrival forever. Together with the local value-reporting incentive constraint (IC-V), these two constraints constitute the binding restrictions in the paper’s relaxed mechanism design problem, replacing the full continuum of incentive constraints that would otherwise be intractable.

D1 criterion: A standard equilibrium refinement from signaling games applied here to the space of mechanism proposals. Among all pooling equilibrium trial mechanisms, D1 selects only those with parameters (tM, vM) — the shortest trial length and highest post-trial threshold — because the high-type seller has a strictly larger set of buyer responses for which deviation to a high-discrimination mechanism is profitable. These surviving mechanisms Pareto dominate no other equilibrium mechanism and minimize social surplus.

Illiquid Lemon Markets and the Macroeconomy

Mon, 01 Jan 0001 00:00:00 +0000

The paper develops a quantitative capital-accumulation model in which capital trades in illiquid markets with asymmetric information — sellers know the quality of their capital but buyers do not. It combines this model with microdata on nonresidential capital units listed for trade to measure the degree of information asymmetry and quantify its macroeconomic effects.

Model: The economy features heterogeneous capital units characterized by observed quality ω (e.g., size, location, age — observable to both buyers and sellers) and unobserved quality a (known only to the seller). Capital trades in directed-search markets: sellers post a price and a target submarket; buyers direct their search; a matching function determines trade probabilities. Buyers observe announced quality and have an inspection technology that reveals true quality with probability ψ (“lemon detection probability”); with probability 1−ψ a low-quality unit goes undetected. In equilibrium, sellers of high-quality capital signal their type by listing at higher prices and accepting lower trading probabilities (the Guerrieri-Shimer-Wright 2010 competitive search separating equilibrium, adapted to the capital accumulation setting). The key model prediction is that the residual price — the component of a listed price orthogonal to observed characteristics — is positively correlated with duration on the market, with the slope increasing as the degree of asymmetric information (1−ψ) rises.

Data: Idealista, Spain’s largest online real estate platform, provides monthly listings for all nonresidential structures (retail, office, and industrial space) listed for sale from 2005 to 2018 — approximately 8.9 million property-month observations from over 1.15 million distinct capital units. The average listed price per square foot is $162 (2017 dollars); the average duration on the market is 10.5 months; each listing receives on average 800 views, 45 clicks, and 3 emails per month from prospective buyers.

Empirical facts (Section 4): Two cross-sectional regularities confirm the model’s predictions:

Predicted price (from a hedonic regression on observable characteristics) is negatively correlated with duration — units with better observable characteristics sell faster, consistent with full-information competitive search (higher buyer valuation → higher matching rate)
Residual price (orthogonal to observables) is positively correlated with duration — estimated slope coefficient ŷq ≈ 0.148 — consistent with asymmetric-information signaling (high-quality capital sellers post high residual prices to separate from low-quality sellers, accepting lower trading probabilities)
The residual-price/duration slope exhibits strong countercyclical variation, roughly doubling during the Euro crisis (peak slope ≈ 0.38, compared to baseline ≈ 0.148), consistent with asymmetric information worsening during downturns

Calibration (monthly frequency, Table 4 fixed; Table 5 fitted):

Fixed parameters: β = 0.9966 (annual rate of time preference 4%), α = 0.35 (capital share), δ = 0.0074/month (8.5% annual nonresidential depreciation), γ = 1.004 (1.6% annual TFP growth), γn = 1.0027 (1% annual population growth), ϕ = 0.0027 (3.2% annual firm exit rate), η = 0.8 (matching curvature), φ = 0.5 (seller bargaining power)
Fitted to four data moments (slope ŷq, SD of predicted prices, SD of residual prices, mean duration): ψ = 0.9795 (probability a lemon goes unnoticed = 2% per inspection); σω = 0.72 (SD observed quality); σa = 0.58 (SD unobserved quality); m̄ = 0.267 (matching efficiency)
Model-simulated moments match targets essentially exactly (Table 5); untargeted relationship between duration and predicted prices is also well-matched (Table 6)

Steady-state output effects (Table 7, relative to full-information benchmark):

Total output: −1.22% in baseline (ψ = 0.9795)
Effective capital input: −2.55% (main driver of output loss)
Capital stock: −1.12% (32% of output effect — reduced returns to producing new capital)
Capital unemployment rate: +1.0 pp above full-information rate of 5% (25% contribution — high-quality capital remains listed longer)
Allocation channel: 16% contribution — information asymmetries disproportionately reduce trading of high-quality capital, lowering average quality of employed capital
Labor input: −0.5% (26% contribution — reduced capital input lowers labor demand)
Moving to full information (ψ → 1): output gain of +1.5% — modest at baseline, indicating the baseline economy is not far from full information
Moving to Euro-crisis level (ψ = 0.96): output decline of ~2% — large response because the economy’s output elasticity to ψ is high

Crisis experiment (Section 5.3): An unexpected 2 percentage-point decline in ψ (to 0.96, calibrated to match the observed increase in the residual-price/duration slope during the Euro crisis), lasting 3 years and reverting with persistence ρψ = 0.94:

Output contraction on impact: 2%
Time to recover half the output decline: more than 5 years (slow recovery driven by persistent capital underinvestment)
Primary mechanism: lower inspection accuracy → high-quality capital sellers reduce trading probability to signal quality → capital unemployment rate rises (especially for high-quality units) → expected return to producing new capital falls → investment contracts → capital input declines persistently
Secondary interaction: at higher steady-state asymmetric information (ψ = 0.96), other shocks (TFP, exit rate, discount factor) are amplified — e.g., the cumulative output response to an exit rate shock is 26% larger than in a full-information economy

Scope conditions: The model abstracts from aggregate uncertainty (the baseline is steady-state analysis), financial intermediaries, and endogenous information technology. The dataset covers Spain’s nonresidential real estate market 2005–2018; the measurement of ψ from listed prices and duration assumes that residual prices fully reflect unobserved capital quality (Proposition 5’s small-search-cost approximation). The quantitative results are robust to alternative bargaining protocols (TIOLI), higher firm exit rates, inelastic labor supply, and narrower observable-characteristic sets.

Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.

In depth

Q1. Why does asymmetric information generate a positive correlation between residual prices and duration?

In the model’s separating equilibrium, sellers of high-quality capital choose prices and targeting strategies that prevent low-quality sellers from mimicking them; since low-quality sellers have a lower marginal cost of accepting lower trading probabilities (their capital is worth less to them in continued use), high-quality sellers can separate by listing at higher residual prices paired with lower market tightness and lower matching rates. The correlation between residual price and duration is therefore a direct measure of the degree of asymmetric information: the slope coefficient ŷq increases monotonically as ψ decreases (Proposition 5 and Figure 4), allowing the researcher to back out ψ from the micro data.

Q2. Why is the residual-price/duration slope countercyclical?

The data show that the slope roughly doubled during Spain’s 2008–2013 downturn and euro crisis, consistent with the model’s prediction that asymmetric information (1−ψ) worsens during economic contractions. The paper interprets this as evidence that buyers’ ability to evaluate capital quality deteriorates when economic uncertainty rises — for example, during crises it is harder to assess the profitability of retail or office space based on observable characteristics alone. This countercyclical pattern motivates the crisis experiment in Section 5.3, where a 2pp increase in 1−ψ (the degree of information asymmetry) replicates the observed slope dynamics.

Q3. Why is the 2% crisis output contraction slow to recover?

The sluggishness of recovery operates through the investment channel: when high-quality capital sellers reduce trading probabilities to signal their type, they slow the transfer of used capital from sellers (firms that exit) to buyers (firms that expand), reducing the effective capital input; this lower capital input reduces the expected marginal return to producing new capital, depressing investment; because capital accumulates gradually, the output recovery inherits the slow pace of investment recovery. The persistence parameter ρψ = 0.94 (monthly) adds further sluggishness from the slow normalization of the information environment itself.

Q4. Why are the steady-state output losses modest while the crisis response is large?

The economy features a moderate baseline degree of asymmetric information (ψ = 0.9795 — only 2% lemon-detection failure), so the steady-state distortion is small (−1.22% output relative to full information); however, the economy has a large elasticity of output to ψ, so even a small deterioration in information quality (2pp) generates large output effects (−2%). This high sensitivity arises because the effects of asymmetric information are highly nonlinear: at low levels of information frictions, small increases in the lemon probability generate proportionally large increases in the required signaling by high-quality sellers, sharply reducing their trading probabilities.

Q5. How does asymmetric information interact with other shocks?

At the baseline degree of asymmetric information (ψ = 0.9795), the aggregate responses to standard shocks (TFP, discount factor, exit rate) are similar to an economy with full information; however, at the Euro-crisis level (ψ = 0.96), the cumulative output response to an exit rate shock is 26% larger than under full information. The mechanism is that asymmetric information taxes the reallocation of capital: when more capital must be reallocated (due to higher firm exit), more of it passes through the illiquid, distorted lemon market, amplifying the output effect of the underlying shock.

Q6. What policies can reduce the distortions from asymmetric information?

The paper notes two broad policy directions: (1) policies that improve information transparency — making previously private capital characteristics public, e.g., mandatory disclosure or standardized quality certification — directly raise ψ and shift the economy toward full information, eliminating the signaling distortion; (2) policies that reduce the incentive for mimicking — for example, by allowing post-transaction renegotiation after quality is revealed (the TIOLI bargaining extension in Table 8) — have similar quantitative effects to the baseline. The paper leaves the welfare analysis of specific information-provision policies for future research.

Q7. What is the role of the data in identifying the model parameters?

The four targeted moments — slope of duration on residual prices, standard deviation of predicted prices, standard deviation of residual prices, and mean duration — jointly identify the four structural parameters {ψ, σω, σa, m̄} (Proposition 5); the key insight is that ψ and m̄ are separately identified because ŷq and mean duration respond differently to each: ψ and m̄ both affect ŷq positively, but m̄ reduces mean duration while ψ increases it, providing orthogonal variation. The calibration achieves an essentially exact match of the four targeted moments (Table 5) and also matches the untargeted negative slope between duration and predicted prices (Table 6), providing an overidentification check.

Key concepts

lemon market : a secondary market for heterogeneous assets in which sellers have private information about quality; following Akerlof (1970), lemons (low-quality assets) crowd out high-quality assets unless high-quality sellers can credibly signal their type; in the paper, signaling takes the form of higher listed prices paired with lower trading probabilities.

residual price : the component of a capital unit’s listed price orthogonal to its observable characteristics (the residual from a hedonic regression); the paper’s key empirical variable, theoretically shown to be positively correlated with unobserved capital quality and with duration under asymmetric information.

inspection technology : a buyer’s technology that reveals the true quality of a capital unit with probability ψ before (or after) purchase; the accuracy ψ governs the degree of asymmetric information in the economy — lower ψ implies worse information, requiring more costly signaling by high-quality sellers.

countercyclical asymmetric information : the empirical finding that the slope between residual prices and duration roughly doubles during the Euro crisis, interpreted as deterioration in buyers’ ability to evaluate capital quality during economic downturns; motivates the crisis experiment.

three channels of output loss : the three mechanisms through which asymmetric information reduces output: (i) lower capital stock (reduced investment incentives); (ii) higher capital unemployment rate (high-quality capital remains listed longer); (iii) adverse allocation effect (high-quality capital trades less frequently, lowering average quality of employed capital).

The Optimal Taxation of Couples

Mon, 01 Jan 0001 00:00:00 +0000

Layer 1 — Overview

Research Question. What is the optimal joint nonlinear earnings tax schedule for married couples? How should one spouse’s marginal tax rate depend on the other’s earnings? When is individual earnings-based (separable) taxation optimal versus family-income-based taxation, and what determines the sign and magnitude of “jointness” — the dependence of one spouse’s marginal tax on the other’s earnings?

Model. The paper studies a canonical unitary household model in which each couple consists of two spouses who jointly maximize utility subject to a joint budget constraint. Spousal productivities are drawn from a joint distribution F with arbitrary dependence structure. The planner maximizes a weighted sum of couples’ utilities, with Pareto weights that are decreasing functions of productivities. Utility takes a quasi-linear form in consumption and labor disutility with constant labor supply elasticity parameter γ (implying earnings elasticity γ/(γ-1)). The tax problem is equivalent to a two-dimensional mechanism design problem in which the planner chooses allocations as functions of reported productivity types, subject to incentive compatibility and budget feasibility. Because spousal productivities are two-dimensional, the problem is a multi-dimensional screening problem whose properties are poorly understood in general.

Methodology. The authors proceed in two directions. First, they establish conditions under which the first-order approach (FOA) — restricting attention to local incentive constraints — is valid in this bi-dimensional setting. They show, for the special case of the benchmark economy (symmetric, independent types, separable Pareto weights), that FOA validity is equivalent to convexity of a certain transformation of the value function, and derive necessary and sufficient conditions that are strictly weaker than their unidimensional analogs — so the FOA is more likely to hold in two dimensions than in one. For the general economy, they invoke an Implicit Function Theorem argument in Hölder space to show that the FOA holds for Pareto weights sufficiently close to utilitarian (i.e., when the planner is not “too redistributive”). Second, assuming FOA validity, they characterize optimal taxes via a second-order nonlinear PDE. Since this PDE cannot be solved analytically in general, they apply the Coarea Formula to derive closed-form expressions for conditional averages of optimal tax distortions over various subsets of the type space, expressed entirely in terms of structural primitives (labor supply elasticities, Pareto weights, and elasticities of the joint distribution of productivities).

Main Findings.

Average distortions and assortativeness. Average optimal distortions on married individuals are ranked by the degree of positive quadrant dependence (PQD) in spousal productivities: more assortative matching implies higher optimal tax rates. Optimal distortions on married individuals are always weakly lower than on single individuals with the same productivity, same elasticities, and same marginal productivity distribution — strictly so unless matching is perfectly positively assortative. The intuition is that when couples pool resources, intra-family redistribution already occurs, and distortionary taxation crowds this out; more random matching produces more within-family redistribution, reducing the marginal social value of public redistribution through taxation.
Optimality of separable (individual earnings-based) taxation. In the benchmark economy with independent types, optimal taxes are exactly separable (individual earnings-based), and optimal distortions on married individuals equal precisely one-half of those on comparable single individuals. With separable Pareto weights and independent types more generally, taxes remain separable. Once types are positively dependent, however, the planner optimally introduces jointness even under separable social weights.
Jointness and tail (in)dependence. Optimal jointness — whether one spouse’s marginal tax rate increases or decreases in the other’s earnings — depends critically on tail dependence of the joint productivity distribution, captured by the copula and survival copula elasticities. For right-tail dependent distributions (so that extremely productive individuals are likely to be matched with extremely productive partners), positive jointness is optimal at the top (raising taxes on high earners whose partners are also high earners) and negative at the bottom. For right-tail independent distributions (such as the Gaussian copula, which is tail-independent for any finite ρ), the distortion-reducing motive dominates: optimal jointness is negative at the top and positive at the bottom, conditional on standard convergence conditions.
Primary vs. secondary earners. The secondary earner (lower-productivity spouse) faces on average higher optimal distortions than the primary earner when the planner values redistribution to couples with a very unproductive spouse (α(w,0) ≥ 1), because the phasing out of transfers targeted to such couples generates high marginal tax rates on secondary earners. Family earnings-based taxation is optimal only when total family productivity and relative spousal productivity are independent, and when social weights are measurable only with respect to total family output.
Restricted taxation. Optimal distortions under any of the three restricted tax regimes (anonymous, separable, family earnings-based) exactly equal the relevant conditional average of unrestricted optimal distortions. This establishes that the welfare difference between the restricted and unrestricted optimum stems solely from the planner’s inability to tag taxes to individual productivity types within the restricted class.

Quantitative Findings (calibrated to 2020 CPS data on U.S. married couples, ages 25-65, worked ≥ 20 weeks). Spousal productivities are positively but not perfectly dependent, with Kendall’s tau = 0.21 and Pearson correlation = 0.25 for productivities (0.21 for earnings). The joint distribution is well approximated by a Gaussian copula (ρ = 0.33) with Pareto-lognormal marginals (a = 2.95, Gini = 0.31). The Gaussian copula is tail-independent, so consistent with analytical results, optimal jointness is positive for low earners and negative for high earners (the latter arising at earnings above approximately $8.5 million in the benchmark specification). The quantitative magnitude of optimal jointness is small — marginal taxes for one spouse change by at most several percentage points as a function of the other spouse’s earnings. Individual earnings-based taxation provides a good approximation to the unrestricted optimum. By contrast, family earnings-based (joint) taxation is a poor approximation in all specifications, with marginal taxes on family income varying substantially with the earnings share of the secondary earner, and this conclusion holds even when Pareto weights explicitly favor family earnings-based taxation (k = 0 case). The implied top marginal tax rate converges toward approximately 55 percent (corresponding to limiting distortion of ≈1.35 = 1/γa with γ = 0.25, a = 2.95) but the convergence is slow, so optimal marginal rates remain substantially below this limit even at earnings of $300,000.

Layer 2 — Q&A

Q1: What is the mechanism design formulation, and why is FOA validity a key concern in the bi-dimensional setting?

A: The planner’s problem is cast as a direct mechanism in which couples report their two-dimensional productivity type (w1, w2) and receive allocations (consumption, earnings). Incentive compatibility requires that no couple prefers to misreport. In one-dimensional models (Mirrlees 1971), restricting attention to local incentive constraints (the FOA) yields the standard ODE characterization of optimal taxes and is valid for a broad class of primitives. In two dimensions, solutions to multi-dimensional screening problems generically display “bunching” (Rochet-Choné 1998, Armstrong 1996), and the FOA may fail. The key difference exploited in this paper is the absence of participation constraints in the public finance setting, which eliminates the main force driving FOA failure in industrial organization models.

Q2: What are the necessary and sufficient conditions for FOA validity in the benchmark economy with independent types?

A: (Proposition 1) In the benchmark economy (symmetric, independent types, separable Pareto weights), FOA validity is equivalent to the condition that x·(1 + λ̃(x^{-γ})/2) is increasing in x, where λ̃(t) = [∫_t^∞ (1-α̃(w))g(w)dw] / (γtg(t)). The unidimensional analog requires x·(1 + λ̃(x^{-γ})) to be increasing. Since the bi-dimensional condition multiplies λ̃ by 1/2 rather than 1, the set of primitives satisfying it is strictly larger: every (G, α̃, γ) for which the unidimensional FOA holds also satisfies the bi-dimensional condition, but not vice versa. Economically, the FOA holds as long as the planner is not “too redistributive” — i.e., Pareto weights on low types are not so high as to violate these monotonicity conditions.

Q3: What is the Coarea Formula result (equation 27) and why is it the central technical tool?

A: Given that the optimality conditions form a PDE system that cannot generally be solved pointwise, the authors integrate the optimality condition (equation 20) over subsets of the type space defined by level sets of an arbitrary function Q(w1, w2). The Coarea Formula allows them to express the result as: E[Σ_i λ*_i γ_i (∂lnQ/∂lnw_i) | Q=t] = [1 − E[α|Q≥t]] / [−∂ln P(Q≥t)/∂ln t]. By choosing different Q functions (e.g., Q = w_i, Q = max{k_1 w_1, k_2 w_2}, Q = R(w) for total family productivity, Q = I(w) for relative productivity), the formula delivers closed-form expressions for distinct conditional averages of optimal distortions, all expressed in terms of exogenous primitives. This contrasts with variational approaches (Golosov et al. 2014, Spiritus et al. 2022) that express optimal taxes in terms of endogenous moments.

Q4: How do optimal distortions on married individuals compare to those on single individuals, and what is the exact quantitative relationship in the independent-types benchmark?

A: (Proposition 4) In the benchmark economy with independent types, the optimal distortion on spouse i with productivity t equals exactly one-half of the optimal distortion λ^{sng,}(t) in the corresponding unidimensional economy: λi(t, w{-i}) = (1/2)λ^{sng,*}(t), and this is independent of the partner’s productivity w_{-i}. The intuition: the deadweight cost of taxing any individual depends only on her own characteristics (elasticity, productivity, density), not on whom she is married to. However, the redistributive benefit of taxation depends on matching — when matching is random, every high-productivity individual is married on average to an average person, so the incremental social benefit of extracting tax revenue from her is exactly half of what it would be if she were single (since half the benefit goes to a partner who is already average). More generally (Proposition 5 and Corollary 2), average distortions are weakly lower for married individuals than for singles as long as matching is not perfectly positively assortative.

Q5: What is average jointness and how is it measured?

A: Average jointness J_i(t) is defined as the ratio of average distortions on spouse i conditional on the partner having above-t productivity to average distortions conditional on the partner having below-t productivity, minus one. Jointness is positive if the marginal tax rate on spouse i is on average increasing in the partner’s productivity, negative if decreasing, and zero for separable (individual earnings-based) taxes. The paper characterizes jointness through auxiliary functions H_i(t) (conditional distortion relative to unconditional average), whose behavior is determined by the copula elasticities η_i and survival copula elasticities η̄_i — the percentage change in the conditional quantile of the partner’s productivity when one spouse’s productivity quantile increases by 1%.

Q6: What is the role of tail dependence in determining the sign of optimal jointness?

A: (Proposition 7, Lemma 4) For right-tail dependent distributions — where the probability that an extremely productive person is married to an extremely productive partner remains bounded away from zero as productivity → ∞ — the redistributive benefit of positive jointness (targeting taxes to the richest couples) dominates its distortionary cost, so optimal average jointness is positive at the top. For right-tail independent distributions (where this probability converges to zero), the distortionary cost of positive jointness dominates, and optimal jointness is negative at the top. Exactly symmetric logic applies at the bottom using the survival copula and left-tail dependence. The bivariate lognormal/Gaussian copula is right-tail independent for any finite correlation ρ, while a distribution with perfect assortative matching in the tails would be right-tail dependent. The speed of convergence to tail independence, measured by κ = lim_{u→0} ln(u)/ln(C(u,u)) ∈ [1/2, 1), also matters: slower convergence (κ closer to 1) implies smaller optimal jointness under tail independence.

Q7: When is individual earnings-based (separable) taxation optimal, and when is family earnings-based taxation optimal?

A: (Propositions 4, 8, Corollary 1) Individual earnings-based taxation is optimal when Pareto weights are separable and spousal productivities are independent. When types are positively dependent, the planner introduces jointness even with separable social weights, because conditioning taxes on both spouses’ earnings facilitates redistribution across couple types. Family earnings-based taxation is optimal when: (i) social weights are measurable only with respect to total family productivity r (i.e., the planner cares only about total family output, not the identity or relative productivity of individual spouses), and (ii) total family productivity r and relative spousal productivity ι are statistically independent. When r and ι are not independent, even a planner with an intrinsic preference for family earnings-based taxation will find it optimal to depart from it.

Q8: What does Proposition 9 (Corollary 7) establish about the relationship between restricted and unrestricted optimal taxes?

A: (Corollary 7) For each restricted tax regime (anonymous, individual earnings-based, family earnings-based), the optimal distortions under the restricted tax equal the corresponding conditional average of unrestricted optimal distortions. Specifically: optimal individual earnings-based distortions equal E[λ*_i | w_i = t] (the average unrestricted distortion at productivity t); optimal family earnings-based distortions equal E[weighted average of λ*_i | R(w) = r]. This reveals that the unrestricted and restricted planners solve the same tradeoff between redistribution benefits and distortionary costs, but the restricted planner must apply a single tax rate to groups of couples that cannot be distinguished under the restriction. The welfare loss from restriction comes entirely from this forced bunching, not from a different objective or a different first-order condition.

Q9: What do the quantitative results say about the goodness of approximation of separable vs. family earnings-based taxation?

A: In the calibrated benchmark economy (Gaussian copula, ρ = 0.33, Pareto-lognormal marginals, γ = 0.25, m = 0.35), optimal jointness is quantitatively small — the marginal tax rate on one spouse changes by at most several percentage points as a function of the other spouse’s earnings over the plotted range. Individual earnings-based (separable) taxation therefore provides a good approximation to the unrestricted optimum across all specifications considered. By contrast, family earnings-based taxation is a poor approximation: the marginal tax rate on family income varies substantially with the earnings share of the secondary earner (the ratio min{y1,y2}/(y1+y2)), and the deviation from the optimal unrestricted tax is large. This finding is robust across different Pareto weight specifications (m ∈ {0.35, 1.5}, k ∈ {0, 1, 2}) and holds even when k = 0, i.e., when the planner’s social weights inherently prefer family earnings-based taxation.

Q10: How do the calibration results relate to the analytical comparative statics predictions?

A: The calibration validates the analytical predictions quantitatively. The analytical result (Proposition 5) that optimal distortions in the U.S. lie between those under random matching (1/2 of single-individual rates) and perfect assortative matching (same as single-individual rates) is confirmed: optimal tax rates for married individuals in the calibrated economy lie between the independence and perfect-dependence gray-line benchmarks in Figure 6. The analytical prediction (Proposition 7) that the Gaussian copula implies positive jointness at the bottom and negative at the top is confirmed, with the switch to negative jointness occurring above approximately $8.5 million in earnings. The slow convergence of the Gaussian copula to tail independence (κ = (1+ρ)/2 ≈ 0.665) explains the small magnitude of optimal jointness relative to the FGM copula (which has κ = 1/2, faster convergence, and exhibits more pronounced jointness as shown in the appendix). The analytical limiting distortion of E[λ*_i | w_i = t] → 1/(γa) ≈ 1.35 as t → ∞ (corresponding to a top marginal tax rate of approximately 55 percent) is confirmed, though convergence is slow and rates remain substantially below this limit at $300,000 in earnings.

Q11: How does the paper relate to and advance beyond Kleven, Kreiner, and Saez (2007/2009)?

A: Kleven et al. (2009) studied couples taxation but avoided the multi-dimensional screening complexity by restricting the secondary earner to binary labor supply. The working paper by Kleven et al. (2007) considered the continuous setting but noted the difficulty of the FOA and derived several special-case insights. The current paper extends KKS in several systematic ways: it provides the first formal proof that the FOA conditions are strictly weaker in bi-dimensional than unidimensional settings; generalizes the formula for average distortions to arbitrary joint distributions (not just independent types); characterizes optimal jointness under positive dependence (not just independence); establishes the role of tail (in)dependence in determining the sign of jointness; compares optimal taxes for married vs. single individuals; and derives conditions under which family earnings-based or individual earnings-based taxation is optimal. It also shows that the KKS result on jointness sign (determined by the third derivative of the SWF) applies only under independence and can be reversed even with arbitrarily small positive dependence, as demonstrated with the Gaussian copula example.

Key Concepts

First-Order Approach (FOA) in multi-dimensional taxation. The restriction of the mechanism design problem to local incentive constraints only — dropping global (non-local) incentive compatibility conditions and solving a relaxed problem. In the paper’s context, FOA validity is equivalent to convexity of a specific transformation vx* of the optimal utility function in the “linearized” type space X. The paper shows that the condition for FOA validity is strictly weaker (i.e., a strictly larger set of primitives satisfies it) in the bi-dimensional couples setting than in the corresponding unidimensional model, because the absence of participation constraints eliminates the main force driving FOA failure in industrial organization multi-dimensional screening.

Optimal tax distortion λ_i(w).* The monotone transformation of the marginal tax rate defined by λ_i(w) = [∇_i T(y(w))] / [1 − ∇_i T(y(w))], where ∇_i T is the partial derivative of the tax function with respect to spouse i’s earnings. This transformation maps [−∞, ∞] marginal tax rates to (−1, ∞) distortions. The optimal tax schedule is characterized by the function λ* satisfying a system of PDEs; the paper studies conditional averages of λ* rather than λ* pointwise.

Coarea Formula. A mathematical result from geometric measure theory that, in this context, converts an integral of the PDE optimality condition over a two-dimensional domain into an integral over the level sets of an arbitrary function Q(w). Applied to equation (20), it yields: E[Σ_i λ*_i γ_i (∂lnQ/∂lnw_i) | Q=t] = [1 − E[α|Q≥t]] / [−∂ln P(Q≥t)/∂ln t]. By choosing different Q functions, the formula delivers conditional averages of optimal distortions over different subsets of the type space, all in terms of exogenous primitives. This is the paper’s principal analytical tool for characterizing optimal taxes without solving the PDE explicitly.

Jointness (positive/negative). The dependence of the optimal marginal tax rate on one spouse’s earnings on the other spouse’s earnings. Taxes are positively jointed at w if ∂²T/∂y_1∂y_2 > 0 (so raising one spouse’s earnings increases the marginal tax rate on the other); negatively jointed if this cross-partial is negative; disjointed (separable) if it is zero. Average jointness J_i(t) at productivity t is measured as the ratio of conditional average distortions above and below the partner’s productivity threshold, minus one. Optimal jointness is the paper’s primary policy object for understanding how taxes on one spouse should respond to the other’s earnings.

Copula and survival copula elasticities (η_i, η̄_i). Defined as η_i(t) = ∂ln C(u)/∂ln u_i and η̄_i(t) = ∂ln C̄(u)/∂ln ū_i, where C is the copula of the joint productivity distribution, C̄ is the survival copula, and u_i = G_i(t_i), ū_i = 1−G_i(t_i) are the corresponding quantiles. These elasticities measure the percentage change in the conditional quantile of the partner’s productivity when one spouse’s productivity quantile increases by 1%. They quantify the additional distortionary cost introduced by jointness relative to a separable tax schedule: smaller elasticities (stronger dependence) correspond to larger distortionary costs of jointness at the boundaries of probability mass.

Tail (in)dependence. A joint distribution F is right-tail dependent if lim_{t→∞} P(w_{-i}≥t | w_i≥t) > 0, i.e., extremely productive individuals have a positive probability of being matched with equally extreme partners. It is right-tail independent if this limit is zero. The speed of convergence to tail independence is measured by κ = lim_{u→0} ln(u)/ln(C(u,u)) ∈ [1/2, 1). Tail dependence determines the sign of optimal average jointness in the tails: right-tail dependence favors positive jointness at the top; right-tail independence favors negative jointness at the top. The Gaussian copula is right-tail independent for any finite ρ; a perfectly assortative matching distribution is right-tail dependent.

Positive quadrant dependence (PQD) order. A partial ordering on joint distributions with the same marginals: F^b ≥_{PQD} F^a if F^b(w) ≥ F^a(w) for all w, equivalently if Cov(φ_1(w_1), φ_2(w_2)) ≥ 0 for any two increasing functions. The paper uses this order to rank economies by the “assortativeness” of matching, and shows that optimal average distortions are monotone in this order (Proposition 5): more assortative matching implies weakly higher optimal tax distortions on each married individual.

Pareto-lognormal (PLN) distribution. Used in the calibration to model the marginal distribution of spousal productivities. Defined as G(t) = Φ((ln t − μ)/σ) − a·exp(aμ + a²σ²/2)·Φ((ln t − μ)/σ − aσ), parameterized by location μ, scale σ, and tail parameter a. The PLN family has a lognormal body and a Pareto tail with tail parameter a, making it suitable for capturing the empirical finding of a thin left tail (implying optimal marginal taxes approaching zero as earnings → 0) and a thick right tail (implying a positive limiting marginal tax rate of approximately 1/(1 + 1/(γa)) as earnings → ∞).