Published in American Economic Review, Vol. 115, No. 10 (October 2025), pp. 3322-3366. Online 1 October 2025. doi:10.1257/aer.20211317

Consistent Evidence on Duration Dependence of Price Changes

Fernando Alvarez

Katarína Borovičková

Robert Shimer


What this paper finds — and why it matters

Layer 1 — Overview

Research Question. This paper asks two related questions. First, can one develop a robust, distribution-free estimator for the discrete-time mixed proportional hazard (MPH) model of duration with unobserved heterogeneity? Second, what does that estimator reveal about the shape of the hazard of price changes, the role of heterogeneity in shaping aggregate price dynamics, and the distinction between regular price changes and sales?

Methodology. The authors develop a linear generalized method of moments (GMM) estimator for the discrete-time MPH model, building on identification results in Honoré (1993). The model specifies that the probability a price spell ends at duration t, conditional on surviving to t, equals the product of a product-specific frailty parameter θ (unobserved, fixed over time) and a common baseline hazard bt. The estimator exploits repeated price spells per product via moment conditions that are linear in bt, making estimation and inference straightforward. It accommodates right- and left-censored data, competing risks, and spell-specific observable characteristics, without requiring any parametric assumption on the frailty distribution. The estimator is consistent as the number of products grows, even with a short time dimension. A Hansen-Sargan J-test of overidentifying restrictions and a test of the monotone-average-type prediction are also developed.

The estimator is applied to two datasets: (1) IRI weekly store data (2001–2011), covering 30 product categories and more than 21 million products, yielding 684,919,778 pairs of durations; and (2) Online Micro Price data from Cavallo (2018), comprising approximately 250,000 products at daily frequency.

Main Findings with Quantitative Magnitudes.

Baseline hazard and heterogeneity. In the pooled IRI data, the Kaplan-Meier hazard is steeply declining throughout the entire range from 2 to 60 weeks. In contrast, the estimated baseline hazard is roughly constant until week 4 and then declines only modestly, with a noticeable spike at week 52. The ratio of the Kaplan-Meier hazard to the baseline hazard — the average type, E[θ|t] — drops by approximately 60 percent within the first 20 weeks, and continues to decline, reaching roughly 0.3 of its initial value after one year. This decomposition reveals substantial unobserved heterogeneity that accounts for a large fraction of the observed decline in the Kaplan-Meier hazard.

Implications for structural models. The finding of a decreasing baseline hazard is inconsistent with canonical state-dependent pricing models (Golosov and Lucas, 2007), which predict an increasing hazard, conditional on a given firm’s type. The decreasing baseline hazard is instead broadly consistent with time-dependent pricing models, though not with a constant-hazard (Calvo, 1983) specification.

Monetary policy impulse response. In a calibrated time-dependent pricing model with strategic complementarity (α = 0, 0.5, 0.95), the aggregate price level dynamics in the estimated heterogeneous-firm MPH economy are close to those of a homogeneous-firm economy that uses the Kaplan-Meier hazard as the common price-change hazard. The homogeneous-firm approximation is substantially closer to the MPH economy than a Taylor (1979, 1980) staggered-contract economy with the same Kaplan-Meier hazard, particularly when strategic complementarity is strong (α = 0.95). The Calvo economy provides a poor approximation due to its exponential (constant-speed) price convergence structure.

Regular versus temporary price changes. Using the competing-risks extension with spell-specific observables, which classifies spells by whether they start and end with a price increase (+) or decrease (−), the authors separately estimate four baseline hazards. The baseline hazard for consecutive price increases (b++t) declines over the first 6 weeks, is then flat until about week 45, and spikes near one year, consistent with price-plan models. The baseline hazard for reversals (particularly b−+t, price decreases followed by price increases, associated with sales) is steeply declining. The J-test statistics are substantially lower for price trends (J++ = 3,920; J−− = 3,401) than for reversals (J+− = 8,737; J−+ = 7,910), and markedly lower than the pooled-model J = 10,498, indicating that the MPH structure fits regular price changes considerably better than sales.

Scope Conditions. Results are conditional on weekly store-level price data for mostly packaged consumer goods (30 IRI product categories). The analysis focuses on price spells of at least 2 weeks to avoid spurious duration-one spells from mid-week price changes. The maximum duration examined is 60 weeks. The comparison of estimation methods relies on the IRI data only; the Online Micro Price data confirm weekly decision-making through a spike in the daily hazard every 7 days. Comparisons with maximum likelihood estimates show that GMM recovers more heterogeneity (average type declines to 0.37 at 6 months by GMM versus 0.48 by continuous-time MLE), and that time aggregation explains most of the discrepancy between the two methods.

Layer 2 — Q&A

Q1. What is the mixed proportional hazard (MPH) model as used in this paper, and what does the estimator identify?

A1. The MPH model specifies that the probability that a price spell ends at duration t, conditional on surviving to t, equals θ·bt, where θ is a product-specific frailty parameter drawn from an unknown distribution G and bt is a baseline hazard common to all products. The estimator, which is linear in bt, identifies the baseline hazard up to a multiplicative constant using moment conditions derived from repeated spell data, without restricting the shape of the frailty distribution. Identification relies on comparing the joint survival probabilities of two consecutive spells for the same product and exploits the symmetry implied by the MPH structure across spells.
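
The symmetry argument can be written out explicitly. The following is a sketch derived from the model as described in this answer, assuming two spells of the same product with durations T_1 and T_2 that are independent conditional on θ. The survival notation S(t | θ) is introduced here for illustration and is not necessarily the paper's notation.

```latex
\begin{align*}
S(t \mid \theta) &= \prod_{s<t} (1 - \theta b_s), \qquad
\Pr(T = t \mid \theta) = \theta\, b_t\, S(t \mid \theta), \\
\Pr(T_1 = t_1,\ T_2 \ge t_2) &= b_{t_1}\, \mathbb{E}_G\!\left[\theta\, S(t_1 \mid \theta)\, S(t_2 \mid \theta)\right], \\
\Pr(T_1 \ge t_1,\ T_2 = t_2) &= b_{t_2}\, \mathbb{E}_G\!\left[\theta\, S(t_1 \mid \theta)\, S(t_2 \mid \theta)\right], \\
0 &= b_{t_2} \Pr(T_1 = t_1,\ T_2 \ge t_2) - b_{t_1} \Pr(T_1 \ge t_1,\ T_2 = t_2).
\end{align*}
```

The expectation over G is the same in both joint probabilities and cancels, so the last line is linear in the baseline hazards and holds for any frailty distribution. Multiplying every b_t by the same constant leaves it unchanged, which is why the baseline hazard is identified only up to scale.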

Q2. How does the Kaplan-Meier hazard relate to the baseline hazard, and what does this relationship imply about heterogeneity?

A2. The paper proves that the Kaplan-Meier hazard Ht equals bt times E[θ|t], the mean frailty among spells surviving to duration t. Because higher-type products (those with a higher propensity to change prices) exit the pool of surviving spells earlier, E[θ|t] is strictly decreasing in t — a form of dynamic selection. The ratio Ht/bt, normalized to 1 at the start, falls to approximately 0.4 by week 20 in the pooled IRI data and to approximately 0.3 after one year, documenting that a large share of the decline in the Kaplan-Meier hazard reflects heterogeneity rather than structural negative duration dependence.
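
The relationship Ht = bt · E[θ|t] can be illustrated numerically. The sketch below assumes a hypothetical two-type frailty distribution and a flat baseline hazard, both chosen for illustration only and unrelated to the paper's estimates. It computes the average type among surviving spells exactly and shows that the Kaplan-Meier hazard declines even though the baseline hazard does not.

```python
import numpy as np

def km_hazard_and_average_type(b, thetas, weights):
    """Return (H, avg_type) for a discrete frailty distribution.

    b[t] is the baseline hazard in period t (index 0 is the first period).
    thetas and weights describe a hypothetical discrete distribution G.
    """
    b = np.asarray(b, dtype=float)
    thetas = np.asarray(thetas, dtype=float)
    weights = np.asarray(weights, dtype=float)
    hazard = np.outer(thetas, b)  # type-specific hazard theta * b_t
    # Probability of surviving to the start of each period, by type.
    ones = np.ones((len(thetas), 1))
    surv = np.cumprod(np.hstack([ones, 1.0 - hazard[:, :-1]]), axis=1)
    mass = weights[:, None] * surv  # mass of each type still at risk
    avg_type = (mass * thetas[:, None]).sum(axis=0) / mass.sum(axis=0)
    H = b * avg_type  # Kaplan-Meier hazard implied by the mixture
    return H, avg_type

# Illustrative values only: flat baseline, two equally likely types.
b = np.full(20, 0.2)
H, avg = km_hazard_and_average_type(b, thetas=[0.5, 2.0], weights=[0.5, 0.5])
ratio = (H / b) / (H[0] / b[0])  # average type normalized to 1 at the start
print(np.round(ratio[:5], 3))
```

With a flat baseline, all of the decline in H here comes from dynamic selection: the high-θ type leaves the pool of surviving spells faster, so the normalized ratio falls below 1 and keeps falling.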

Q3. What does the estimated baseline hazard imply about structural models of price setting?

A3. A decreasing baseline hazard is inconsistent with the canonical state-dependent model of Golosov and Lucas (2007), in which a firm’s hazard of price change is increasing in the time since the last change, because larger deviations from the desired price accumulate with duration. The decreasing baseline hazard is instead consistent with time-dependent pricing models and with price-plan models where within-plan switches are costless. The mild spike at week 52 in the baseline hazard is consistent with Taylor-type annual pricing rules.

Q4. What is the approximate aggregation result for monetary policy, and how quantitatively accurate is it?

A4. In the time-dependent pricing model without strategic complementarity (α = 0), the impulse response of the aggregate price level to a monetary shock in a heterogeneous-firm economy is exactly the same as in a homogeneous-firm economy whose single firm uses the Kaplan-Meier survival function. This extends Carvalho and Schwartzman (2015) to an approximation in the case with strategic complementarity (α = 0.5 and α = 0.95). Numerically, the path of aggregate prices in the estimated MPH economy is close to that in the homogeneous-firm Kaplan-Meier economy and much farther from that in the Taylor-contract economy. The gap is most pronounced at horizons beyond about half a year when α = 0.95, where the Taylor economy shows notably slower initial convergence and faster later convergence than the MPH and homogeneous economies.

Q5. How do the paper’s results differ from those obtained using maximum likelihood estimation of the continuous-time MPH model?

A5. The GMM estimator recovers substantially more heterogeneity than maximum likelihood (MLE) applied to the continuous-time model with continuous records (assumed gamma frailty). The average type falls from 1 to 0.37 at six months under GMM, versus only 0.48 under MLE. The authors investigate two sources of this discrepancy: the assumed frailty distribution family (gamma) and time aggregation. They conclude that time aggregation is quantitatively more important in the IRI weekly data — that is, the continuous-time MLE approach fails to properly account for the discrete nature of the data-generating process, leading it to understate heterogeneity and recover a steeper baseline hazard.

Q6. How does the paper distinguish regular price changes from sales without directly observing a sales flag?

A6. The competing-risks extension classifies each spell by whether it starts with a price increase or decrease (observable characteristic χ ∈ {+, −}) and by whether it ends with a price increase or decrease (competing risk ρ ∈ {+, −}). Price trends — spells where the direction is the same at both the start and end (++ or −−) — are interpreted as regular price changes; price reversals (especially −+, i.e., price decrease followed by increase) are associated with sales. This approach is consistent with the statistical model used for estimation, avoids the bias from simply dropping suspected sales spells before estimation, and allows the MPH structure to hold only for the risks of interest even if it fails for others.
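
The classification can be sketched in a few lines. The function name, the toy price series, and the decision to skip censored first and last spells are assumptions made for this illustration. The paper's estimator accommodates censored spells rather than discarding them.

```python
def classify_spells(prices):
    """Label each complete spell in a price series.

    Returns a list of (duration, chi, rho), where chi is the direction of
    the price change that started the spell and rho the direction of the
    change that ended it. Censored first and last spells are skipped.
    """
    # Periods in which the price differs from the previous period.
    changes = [i for i in range(1, len(prices)) if prices[i] != prices[i - 1]]
    spells = []
    for start, end in zip(changes, changes[1:]):
        chi = "+" if prices[start] > prices[start - 1] else "-"
        rho = "+" if prices[end] > prices[end - 1] else "-"
        spells.append((end - start, chi, rho))
    return spells

# Toy weekly series: a temporary cut (a sale) followed by a regular increase.
prices = [2.0, 2.0, 1.5, 1.5, 2.0, 2.0, 2.0, 2.5, 2.5]
spells = classify_spells(prices)
print(spells)  # [(2, '-', '+'), (3, '+', '+')]
```

The first spell is a −+ reversal of the kind associated with sales. The second is a ++ trend, which the summary interprets as a regular price change.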

Q7. How well does the MPH model fit regular price changes versus sales?

A7. The J-test of overidentifying restrictions yields test statistics of J++ = 3,920 for consecutive price increases and J−− = 3,401 for consecutive price decreases, compared with J = 10,498 for the pooled model and J+− = 8,737 and J−+ = 7,910 for the reversal hazards. All five statistics exceed the 5% critical value of 1,749, so the model is rejected in every case, but the rejection is substantially milder for price trends than for price reversals. For individual product categories, the model cannot be rejected for 8 categories (out of 30) for b++ and 21 categories for b−−, suggesting the MPH structure is a much better description of regular price changes than of sales.

Q8. What role do one-week price spells play in the data, and why are they excluded?

A8. In the IRI data, prices are measured as the ratio of weekly revenue to quantity, so a price change occurring mid-week generates a spurious price spell of duration one week. If all spells including one-week spells are retained, the autocorrelation of spell durations is only 0.029 in levels and even negative (−0.042) in logs, which is inconsistent with a mixture model. Once one-week spells are excluded, the autocorrelation rises to 0.235 in levels and 0.233 in logs, and is stable when two-week spells are also excluded (0.248 and 0.256). The paper therefore sets the lower duration bound at T̲ = 2 weeks.
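
A small arithmetic example shows how unit-value prices create a spurious one-week spell. The daily prices and the constant daily quantity below are made-up numbers for illustration, not figures from the IRI data.

```python
def weekly_unit_value(daily_prices, daily_quantities):
    """Weekly price measured as revenue divided by quantity."""
    revenue = sum(p * q for p, q in zip(daily_prices, daily_quantities))
    quantity = sum(daily_quantities)
    return revenue / quantity

q = [10] * 7  # hypothetical: 10 units sold every day

week1 = weekly_unit_value([2.0] * 7, q)              # old price all week
week2 = weekly_unit_value([2.0] * 3 + [3.0] * 4, q)  # price rises mid-week
week3 = weekly_unit_value([3.0] * 7, q)              # new price all week

print(week1, round(week2, 4), week3)
```

The true price changes once, from 2.00 to 3.00, but the measured weekly series takes three distinct values. The intermediate value in the second week appears as a separate price spell lasting exactly one week.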

Q9. What does the daily Online Micro Price data add relative to the weekly IRI data?

A9. The daily data reveal a sharp spike in the price-change hazard every seven days, suggesting that even when prices are observed daily, the decision to change prices is made at the weekly frequency. This justifies the use of a discrete-time model with a one-week period. The estimates from daily and weekly aggregations of the same data are broadly similar, though weekly data recovers somewhat less heterogeneity than daily data. Aggregating IRI weekly data to monthly frequency understates heterogeneity even more, confirming that frequency matters for measuring heterogeneity.

Q10. What are the computational advantages of the GMM estimator relative to maximum likelihood?

A10. Because the moment conditions are linear in the baseline hazard bt, the GMM estimator is obtained in closed form, making estimation fast and inference straightforward. On the pooled IRI sample, GMM estimation (including standard errors) required 70 minutes on a machine with 60 GB memory, whereas the maximum likelihood estimator required 15 hours on a machine with 256 GB memory and failed entirely on the 60 GB machine. The GMM approach also avoids the need to specify the frailty distribution family and guarantees a global solution (proved by the identification result), whereas the likelihood function is non-linear in bt and may have multiple local maxima.
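
To make the role of linearity concrete, the sketch below simulates pairs of spells from an MPH model and recovers the baseline hazard by solving a linear least-squares problem. It is a simplified illustration, not the paper's estimator. It uses identity weighting instead of an efficient GMM weighting matrix, imposes the normalization b_1 = 1, ignores the paper's treatment of censoring and standard errors, and uses the moment b_c · Pr(T_1 = a, T_2 ≥ c) = b_a · Pr(T_1 ≥ a, T_2 = c) implied by the MPH structure. All function names and parameter values are hypothetical.

```python
import numpy as np

def simulate_pairs(b, thetas, weights, n, rng):
    """Draw two durations per product; spells outlasting len(b) are coded len(b) + 1."""
    theta = rng.choice(thetas, size=n, p=weights)
    hazard = theta[:, None] * np.asarray(b)[None, :]  # n x T matrix of theta * b_t

    def draw():
        ends = rng.random(hazard.shape) < hazard      # would the spell end in period t?
        first = ends.argmax(axis=1) + 1               # first such period, 1-based
        return np.where(ends.any(axis=1), first, len(b) + 1)

    return draw(), draw()

def estimate_baseline(t1, t2, T):
    """Identity-weighted linear estimate of b_1..b_T, normalized so that b_1 = 1."""
    rows = []
    for a in range(1, T + 1):
        for c in range(1, T + 1):
            if a == c:
                continue  # this moment carries no information about relative hazards
            row = np.zeros(T)
            row[c - 1] += np.mean((t1 == a) & (t2 >= c))
            row[a - 1] -= np.mean((t1 >= a) & (t2 == c))
            rows.append(row)
    A = np.array(rows)
    # A @ b = 0 with b_1 = 1 becomes A[:, 1:] @ b_rest = -A[:, 0].
    sol, *_ = np.linalg.lstsq(A[:, 1:], -A[:, 0], rcond=None)
    return np.concatenate([[1.0], sol])

rng = np.random.default_rng(0)
b_true = np.array([0.30, 0.30, 0.20, 0.10])           # illustrative baseline hazard
t1, t2 = simulate_pairs(b_true, thetas=[0.5, 1.5], weights=[0.5, 0.5],
                        n=200_000, rng=rng)
b_hat = estimate_baseline(t1, t2, T=len(b_true))
print(np.round(b_hat, 3), b_true / b_true[0])
```

Because every moment is linear in the baseline hazards, the estimate comes from a single call to a linear solver, with no numerical optimization and no assumption about the frailty distribution used in the simulation.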

Q11. What is the shape of the b++ baseline hazard for regular price increases, and what models does it support?

A11. The baseline hazard for spells starting and ending with a price increase (b++) is decreasing during the first 6 weeks — dropping by almost 50% — and then flat until approximately week 45, with a pronounced spike at around one year. This shape is consistent with price-plan models (Eichenbaum, Jaimovich, and Rebelo, 2011) with Calvo-type switching between plans, where within-plan changes are costless and the hazard of between-plan switching is approximately constant. The annual spike is consistent with Taylor-type pricing. Approximately 76.8% of complete spells starting after a price increase last at most 6 weeks.

Key Concepts

Baseline hazard (bt). The component of the MPH hazard that is common to all products and may vary arbitrarily with elapsed duration t. It represents structural duration dependence — the tendency for a given product to be more or less likely to change price as a function of how long its current spell has lasted — net of heterogeneity. It is identified only up to a multiplicative constant.

Frailty parameter (θ) / frailty distribution (G). The product-specific scaling factor in the MPH model, fixed over all spells for a given product, that captures permanent unobserved differences in price-change frequency across products. The paper treats G as a nuisance parameter and does not require a parametric assumption on its shape. A higher θ means the product has a higher baseline propensity to change its price.

Average type (E[θ|t]). The mean frailty parameter among spells that have survived to at least duration t. Because high-type products change price earlier and exit the pool of surviving spells first, the average type is provably strictly decreasing in t under the MPH model. It is measured as the ratio of the Kaplan-Meier hazard to the baseline hazard, and its rate of decline measures the importance of dynamic selection.

Kaplan-Meier hazard (Ht). The probability that a randomly drawn spell ends at duration t, conditional on having lasted at least t periods. It mixes together structural duration dependence (captured by bt) and dynamic selection (captured by changes in the average type). It can be estimated without imposing the MPH structure, requiring only stationarity of the duration process.

Competing risks. The framework in which a price spell can end for multiple distinct reasons — here, ending with a price increase or a price decrease — each with its own hazard function. The paper’s GMM approach allows the MPH structure to hold for only a subset of risks and observables, without imposing any structure on the remaining risks.

Price trends vs. price reversals. A classification of spells based on the direction of the surrounding price changes. Price trends are spells where the direction of the price change at the start and end of the spell is the same (++ or −−), interpreted as regular price changes. Price reversals are spells where the direction switches (e.g., −+, a price decrease followed by a price increase), associated with sales and other temporary price changes.

Strategic complementarity in pricing (α). The degree to which a firm’s target price responds to the average price set by other firms. Parameterized by α ∈ [0, 1), where α = 0 yields the exact aggregation result (only the Kaplan-Meier hazard matters) and higher α increases aggregate price stickiness by making firms reluctant to deviate from the average price when few others are adjusting.

Dynamic selection. The mechanism by which the composition of the pool of surviving price spells shifts toward lower-type (more price-sticky) products as duration increases, because higher-type products change price sooner and exit the pool. This is the source of the gap between the steeply declining Kaplan-Meier hazard and the more modestly declining baseline hazard.

How this summary was made. Bibliographic fields are pulled from Crossref and OpenAlex and are not model-generated. The summary was drafted from the open-access manuscript, checked by a claim-grounding and calibration review pass, and approved before publishing.