Macro Paper Warehouse Forthcoming macro & monetary research
Online First [Journal of Political Economy] doi:10.1086/741623 Online 2 Apr 2026

Robust Estimation and Inference in Panels with Interactive Fixed Effects

Timothy B. Armstrong — University of Southern California

Martin Weidner — University of Oxford

Andrei Zeleneev — University College London

What this paper finds — and why it matters

This paper develops new estimation and inference tools for the coefficient on a covariate of interest in large panel regressions whose unobserved heterogeneity has an interactive fixed effects (factor) structure. The authors demonstrate that standard tools for this model — the least-squares estimator of Bai (2009) and the common correlated effects estimator of Pesaran (2006) — can be heavily biased and severely size-distorted when some of the factors are “weak,” i.e., when factor loadings and factors lack enough variation to be distinguished from noise; in their Monte Carlo designs conventional confidence intervals built on the LS estimator can have almost zero coverage. They propose a debiased estimator together with a bias-aware confidence interval that, given only an upper bound on the number of factors, remains valid uniformly over a class of data-generating processes allowing weak, strong, or nonexistent factors. The construction applies minimax linear estimation to debias a preliminary estimate of the effects matrix, using a nuclear-norm bound on that preliminary estimate’s error, and the estimator attains a faster uniform rate of convergence than existing approaches when weak factors are allowed (reaching the parametric √(NT) rate when N and T grow at the same rate). In 5,000-replication Monte Carlo experiments and an empirical illustration calibrated to the divorce-law studies of Friedberg (1998) and Wolfers (2006), the debiased estimator substantially reduces weak-factor bias without inflating variance and performs comparably to LS when factors are strong, though the bias-aware CIs are often conservative — their oracle length is slightly less than half their actual length, bounding how much the critical value could be tightened. The method requires that the covariate of interest not itself be fully explained by a low-dimensional factor model, which rules out, for example, a treatment indicator that switches on for a subset of units in a single period.

Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.


In depth

Q1. What problem with interactive fixed effects panels does the paper address?

The paper addresses the failure of conventional estimators and confidence intervals for the regression coefficient β when the factors in an interactive fixed effects model are weak rather than strong. The model is a linear panel regression Yit = Xitβ + Σk Zk,it δk + Γit + Uit in which the unobserved component Γit has a factor structure Γit = Σr λir ftr (factor loadings λir, factors ftr), studied under large-N, large-T asymptotics. The standard least-squares estimator of Bai (2009) is √(NT)-consistent and asymptotically normal under a “strong factor assumption” requiring the loadings and factors to have sufficient variation. When that assumption fails — when factors are present but too weak to separate from the noise term Uit — the estimator cannot recover the true loadings and factors, leaving omitted-variables bias from Γit and producing substantial bias and misleading inference.

Q2. How bad is the problem for conventional methods, and what shows it?

In the authors’ Monte Carlo study, weak factors leave the LS estimator heavily biased and non-normal, and conventional CIs based on it can have almost zero coverage. Their finite-sample distribution plots show the LS estimator centered at the true value when factors are nonexistent or strongly identified, but heavily biased and non-normally distributed at intermediate (“weak”) factor strengths. The simulation tables (5,000 replications) report that the LS estimator is heavily biased and the associated 5%-level tests and 95% CIs are heavily size-distorted unless all factors are strong. The common correlated effects estimator of Pesaran (2006) does not even apply in their designs because the cross-sectional averages of the loadings equal zero.

Q3. What is the paper’s proposed estimator, and how is it constructed?

The paper proposes a debiased estimator that applies the theory of minimax linear estimation to a preliminary estimate of the effects matrix, using a nuclear-norm bound on that estimate’s error. Starting from a preliminary estimate Γ̂pre of the effects matrix Γ together with a bound Ĉ on the nuclear norm ‖Γ − Γ̂pre‖* of its estimation error, the authors form augmented outcomes Ỹit = Yit − Γ̂pre,it and treat the residual effect Γ̃ = Γ − Γ̂pre as a nuisance satisfying the convex constraint ‖Γ̃‖* ≤ Ĉ. They then derive linear weights Ait that optimally use this constraint via minimax linear estimation (Ibragimov and Khas’minskii, 1985; Donoho, 1994; Armstrong and Kolesár, 2018), so that the weights control the remaining omitted-variables bias due to weak factors not captured by Γ̂pre. Bounding the nuclear norm is a convex relaxation of the rank constraint rank(Γ) ≤ R, connecting the approach to the matrix-completion and debiased-LASSO literatures.

Q4. What makes the confidence interval “bias-aware,” and what does it require?

The confidence interval is bias-aware because it uses the nuclear-norm bound Ĉ to explicitly account for the remaining bias in the debiased estimator, and it requires only an upper bound on the number of factors. Rather than assuming the bias is negligible, the CI incorporates the worst-case remaining bias permitted by the bound, which is what allows it to remain valid even under weak factors. The authors show the CI is valid uniformly over a large class of DGPs that allows weak, strong, or nonexistent factors up to the specified upper bound on the number of factors.

Q5. What are the convergence-rate results?

The authors derive rates of convergence that hold uniformly over the DGP class, and show their estimator achieves a faster uniform rate than existing approaches when weak factors are allowed. When N and T grow at the same rate, the estimator attains the parametric √(NT) rate. This improvement is established for the regime that explicitly permits weak factors that cannot be consistently estimated; the authors note their results also apply to the strong and “semi-strong” regimes studied elsewhere, where factors can be consistently estimated and conventional estimators are already asymptotically unbiased and normal.

Q6. What is the key scope condition on the covariate of interest?

An important condition is that the covariate of interest Xit must not itself be entirely explained by a low-dimensional factor model. The method leverages variation in Xit that cannot be explained by a small number of factors. As the authors illustrate, if Xit is the state-year minimum wage, the design requires that states change their minimum-wage laws in different years and often enough to generate such variation. The condition rules out settings where Xit is an indicator for a policy that affects a subset of units starting in a single time period, because then Xit is collinear with the factor model (Xit = λi·ft with λi a treated-unit indicator and ft a post-period indicator) — a fundamental identification problem that other literatures address with additional assumptions.

Q7. Under what error and model conditions do the results hold?

The results hold under conditions similar to Bai (2009) and Moon and Weidner (2015), with the error term mean zero conditional on the regressors and effects but allowed to be heteroskedastic and weakly dependent. Uit is assumed mean zero conditional on X, the controls Z, and Γ, while heteroskedasticity (possibly depending on Xit and Γit) and some weak dependence are permitted. The number of factors R is unknown but assumed small relative to N and T. Unlike some related work, the authors deliberately avoid imposing extra structure such as homoskedasticity or full independence of the errors from the effects and regressor, because such structure would supply additional identifying information and lead to a fundamentally different analysis.

Q8. How does the debiased estimator perform in the Monte Carlo experiments?

In the simulations, the debiased estimator effectively reduces the weak-factor bias without inflating variance, and performs comparably to LS when all factors are strong. Across designs with one and two factors (5,000 replications each), the tables report bias, standard deviation, RMSE, test size, and average CI length. The efficiency gains from debiasing can be very large when a weak factor is present, especially at larger sample sizes, while the cost when factors are strong is minimal. Because LS CIs under weak factors can have zero coverage (being centered on the biased LS estimator and too short), the authors benchmark length against identification-robust “oracle” CIs that invert the LS-based t-statistic using least-favorable critical values; the bias-aware CI’s actual length is at least comparable to, and mostly shorter than, the LS oracle CI length.

Q9. How conservative are the bias-aware confidence intervals?

The bias-aware CIs are often conservative: across most designs their oracle length is slightly less than half their actual length, which bounds how much the critical value could be reduced. This implies the bias-aware critical value cannot be decreased by more than about a factor of two without sacrificing coverage in these Monte Carlos. The authors attribute the conservativeness to two possible sources — the bias bound in their main theorem may be conservative, or there may be additional structure in the initial error or its correlation with the data that the nuclear-norm debiasing does not exploit — and note they cannot rule out that other DGPs would make the critical value non-reducible.

Q10. What does the empirical illustration show?

In an experiment calibrated to the unilateral-divorce-law studies of Friedberg (1998) and Wolfers (2006), a weak factor can severely distort conventional inference, and in the actual data the potential presence of one weak factor is likely to be sufficient to nullify the significance of previously obtained non-robust estimates. Using Kim and Oka (2014) data on a balanced panel of N = 48 states and T = 33 years, with the divorce rate as outcome and a unilateral-divorce-law dummy as the covariate (controlling for state-specific quadratic trends and time effects), the calibrated simulation reproduces the pattern from the abstract design: LS is heavily biased and size-distorted under a weak factor, while the debiased estimator has substantially smaller bias, standard deviation, and RMSE and competitive performance under a strong factor. Applied to the real data, allowing up to one weak factor produces bias-aware CIs substantially wider than the non-robust ones, and the authors find that this potential weak factor is likely sufficient to nullify the significance of the earlier non-robust estimates.

Q11. How does this work relate to existing approaches to weak or rank-deficient factors?

The paper is distinguished by providing inference that remains valid under arbitrary weak factors without assuming the factors can be consistently estimated, which prior approaches generally do not. Robustness results in Bai (2009) and Moon and Weidner (2015) cover the special case where some factors are exactly zero while the rest are strong, but not more general weak factors. Chetverikov and Manresa (2022) also achieve a faster rate under weak factors but assume strong factors when constructing CIs and place a factor structure on the covariate matrix. Lower bounds of Zhu (2019) show no CI can be asymptotically valid under weak factors while matching the performance of Bai’s (2009) CI when factors are strong, underscoring that some cost is unavoidable. The minimax-debiasing strategy parallels debiased-LASSO methods (e.g., Javanmard and Montanari, 2014) for omitted-variable bias in high-dimensional regression.

Key concepts

Interactive fixed effects (factor structure)
A model of unobserved panel heterogeneity in which the effect Γit is the sum over factors of a loading times a factor, Γit = Σr λir ftr; equivalently, the matrix of unobserved effects Γ has rank at most R. It generalizes additive fixed effects (αi + γt) and nests the grouped unobserved heterogeneity model as a special case.
Weak factors
Factors whose loadings and/or factors lack sufficient variation across units or over time, so that they cannot be reliably distinguished from the noise term Uit. Under weak factors the strong factor assumption fails and the least-squares estimator’s omitted-variables bias and inference distortions appear; the paper allows arbitrary sequences of such factors, including the nonexistent-factor case.
Strong factor assumption
The condition (as in Bai 2009) that all factor loadings and factors have sufficient variation across i and over t, under which the LS estimator of β is √(NT)-consistent and asymptotically normal. The paper’s contribution is to provide valid estimation and inference without requiring it.
Bias-aware confidence interval
A confidence interval that explicitly incorporates a bound on the estimator’s remaining bias (here via the nuclear-norm bound Ĉ on the preliminary estimate’s error) rather than assuming the bias is asymptotically negligible, enabling uniform validity across factor strengths given an upper bound on the number of factors.
Minimax linear estimation
A method (Ibragimov and Khas’minskii 1985; Donoho 1994; Armstrong and Kolesár 2018) that chooses linear weights minimizing worst-case mean-squared error over a parameter space defined by a convex constraint; here it produces the debiasing weights Ait that optimally use the nuclear-norm constraint ‖Γ̃‖* ≤ Ĉ on the residual effects matrix.
Nuclear norm
The sum of the singular values of a matrix, used as a convex relaxation of the (non-convex) rank constraint rank(Γ) ≤ R; bounding ‖Γ̃‖* constrains the residual effects matrix and is the device through which the bias bound and bias-aware CI are constructed.

Key concepts

Interactive fixed effects (factor structure)
A model of unobserved panel heterogeneity in which the effect Γit is the sum over factors of a loading times a factor, Γit = Σr λir ftr; equivalently, the matrix of unobserved effects Γ has rank at most R. It generalizes additive fixed effects (αi + γt) and nests the grouped unobserved heterogeneity model as a special case.
Weak factors
Factors whose loadings and/or factors lack sufficient variation across units or over time, so that they cannot be reliably distinguished from the noise term Uit. Under weak factors the strong factor assumption fails and the least-squares estimator’s omitted-variables bias and inference distortions appear; the paper allows arbitrary sequences of such factors, including the nonexistent-factor case.
Strong factor assumption
The condition (as in Bai 2009) that all factor loadings and factors have sufficient variation across i and over t, under which the LS estimator of β is √(NT)-consistent and asymptotically normal. The paper’s contribution is to provide valid estimation and inference without requiring it.
Bias-aware confidence interval
A confidence interval that explicitly incorporates a bound on the estimator’s remaining bias (here via the nuclear-norm bound Ĉ on the preliminary estimate’s error) rather than assuming the bias is asymptotically negligible, enabling uniform validity across factor strengths given an upper bound on the number of factors.
Minimax linear estimation
A method (Ibragimov and Khas’minskii 1985; Donoho 1994; Armstrong and Kolesár 2018) that chooses linear weights minimizing worst-case mean-squared error over a parameter space defined by a convex constraint; here it produces the debiasing weights Ait that optimally use the nuclear-norm constraint ‖Γ̃‖* ≤ Ĉ on the residual effects matrix.
Nuclear norm
The sum of the singular values of a matrix, used as a convex relaxation of the (non-convex) rank constraint rank(Γ) ≤ R; bounding ‖Γ̃‖* constrains the residual effects matrix and is the device through which the bias bound and bias-aware CI are constructed.
How this summary was made. Bibliographic fields are pulled from Crossref and OpenAlex and are not model-generated. The summary was drafted from the open-access manuscript , checked by a claim-grounding and calibration review pass, and approved before publishing. Found an error or a misrepresentation? Flag it here — corrections are welcome, especially from the authors.