The Impact of Incarceration on Employment, Earnings, and Tax Filing
What this paper finds — and why it matters
Overview
This paper estimates the causal effect of incarceration on employment, wage earnings, self-employment, and tax filing behavior using administrative criminal justice data linked to Internal Revenue Service (IRS) records for approximately half a million felony defendants in two U.S. states: North Carolina and Ohio. The study period covers cases filed from the early 2000s through 2014, with outcomes tracked through 2020 using IRS W-2 and 1040 records.
Research Question
The central question is whether incarceration itself — as distinct from arrest, conviction, and other criminal justice interactions that precede or accompany it — causes lasting reductions in defendants’ labor market outcomes. The paper explicitly holds fixed upstream interactions (conviction, arrest) to isolate the effect of the incarceration sentence.
Data and Sample
Criminal justice records from Ohio (Common Pleas courts in Franklin, Cuyahoga, and Hamilton counties, covering Columbus, Cleveland, and Cincinnati) and North Carolina (Administrative Office of the Courts and Department of Public Safety) are linked to de-identified IRS records via name, date of birth, sex, address, and partial Social Security Numbers. Match rates are 92% in Ohio and 95% in North Carolina. The sample is restricted to defendants aged 18–50 at time of offense with cases filed 2002–2014. IRS records include employer-reported W-2 wages (regardless of individual tax filing), self-employment income from Schedule C/SE, non-employee compensation (1099-MISC), and gig-economy earnings from 1099 returns. All dollar figures are adjusted to 2016 dollars using the PCE deflator.
Empirical Strategy
Two independent quasi-experimental research designs are used:
North Carolina — Sentencing guideline discontinuities: North Carolina’s structured sentencing guidelines map offense class (E through I, the five least severe felony classes) and prior record points (a numerical criminal history score) into permissible punishment types (incarceration vs. probation) and sentence lengths. Allowable punishment types change discretely at five cell boundaries, generating discontinuities in incarceration sentences for otherwise similar defendants. The paper uses these five boundary discontinuities as excluded instruments in a parameterized regression discontinuity design stacked across offense classes. First-stage F-statistic = 115.
Ohio — Random assignment to judges: Cases are randomly assigned by computer to judges at arraignment in the three counties studied. Judge leave-out mean sentence length is used as an instrument for individual sentence length. The design follows Norris et al. (2021) and yields F-statistic = 321. The instrument shifts sentences along both the extensive margin (any vs. no incarceration) and intensive margin (longer vs. shorter sentences).
Both designs produce complier populations for whom at least 37–45% are shifted on the extensive margin (from no incarceration to some incarceration), based on partial identification bounds using linear programming.
Main Findings
The paper’s central finding is that incarceration generates large short-run reductions in labor market activity during the incapacitation period, but no detectable long-run reductions in annual employment or earnings once defendants have been released and the incapacitation effects have dissipated.
- In the first year after case filing, when incarceration rates peak (roughly 75–100 additional days incarcerated for a 12-month sentence), employment falls by approximately 10 percentage points and total W-2 earnings contract commensurately.
- Within 3–4 years of filing, employment effects return to near zero and are statistically insignificant in both states.
- Five to nine years after filing, when effects on contemporaneous incarceration have dissipated, the estimated effect of a 12-month sentence on annual earnings is positive or near zero in both states. The combined 95% confidence interval rules out reductions in annual wages greater than $231 (approximately 5% of the untreated complier mean) and rules out any adverse employment effects.
- Despite no long-run level effects, losses during incapacitation are never recouped. A one-year sentence reduces cumulative earnings over five years by approximately $2,914 — a 13% reduction relative to the complier mean.
- Effects on self-employment, independent contracting, 1040 filing, adjusted gross income, EITC take-up, and interstate migration are similarly null in the long run.
Incapacitation vs. Post-Release Scarring
The paper provides two tests for whether short-run earnings losses reflect incapacitation alone or also post-release scarring (e.g., human capital depreciation, employer discrimination, or discouragement effects):
- A “visual IV” regression of year-t earnings effects on year-t days-incarcerated effects yields an R² of 0.83–0.85 across states, with the intercept near zero (positive and small), indicating that virtually all dynamic earnings impacts flow through contemporaneous incapacitation and not through a post-release channel.
- Constructed outcomes that impose the null of pure incapacitation (scaling pre-case average earnings or covariate-predicted earnings by the share of the year free from prison) closely track actual earnings effects in both states, further confirming that incapacitation is the dominant mechanism.
Pre-existing Labor Market Detachment
A key scope condition is defendants’ severe labor market disadvantage prior to their case. Fewer than 50–60% of defendants are employed in the year before filing; average pre-case W-2 earnings (including zeros) are below $6,000. Among employed defendants, only 10% earn more than $22,000 per year. Untreated complier means for earnings in the year after case filing are below $4,000, with virtually no earnings or employment growth over the following nine years. The paper concludes that returning to pre-filing earnings levels is sufficient for incarcerated defendants to match their non-incarcerated peers — a low bar that is readily met.
Policy Implications
Back-of-envelope aggregation implies incapacitation losses of approximately $6.16 billion per year in foregone earnings for the U.S. prison population, concentrated in communities heavily affected by incarceration. However, a marginal reduction in incarceration rates would increase average earnings by only $51 for white men and $213 for black men, suggesting incarceration’s direct contribution to labor market inequality is modest relative to the $21,100 black-white earnings gap estimated by Bayer and Charles (2018). The paper concludes that upstream factors — other criminal justice interactions, human capital deficits, and broader socioeconomic disadvantage — are more plausibly responsible for low earnings among the formerly incarcerated.
Q&A
Q1: What is the exact treatment variable, and what is the counterfactual?
The treatment variable is months of incarceration sentenced in the focal case (a continuous, weakly positive ordered treatment). The counterfactual for non-incarcerated defendants in North Carolina is probation (all defendants are convicted by construction under structured sentencing guidelines). In Ohio, the authors cannot reject that all compliers who do not receive a prison sentence are still convicted, implying the counterfactual is also conviction and probation. All compliers therefore acquire a criminal record regardless of sentence. The treatment effect is thus the effect of incarceration conditional on conviction, holding fixed the criminal record.
Q2: How are effects interpreted given multiple instruments and continuous treatment?
Under a “weakly positive ordered treatment” assumption and standard LATE conditions, the 2SLS estimates can be interpreted as Average Causal Responses (ACRs) — weighted averages of the marginal dose effects (12 vs. 11 months, 6 vs. 5 months, 1 vs. 0 months, etc.) for complier subgroups shifted by each instrument. In North Carolina with five parameterized RD instruments, the estimate averages ACRs weighted by first-stage strength. In Ohio with a leave-out mean instrument, the estimate is a convex average of ACRs under the assumption that the linear first-stage model is a good approximation. Dosage weights for both states put mass on a wide range of sentence lengths including both extensive and intensive margins, though Ohio’s weights are more skewed toward shorter sentences.
Q3: How large are the first-stage effects, and how strong is the instrument?
In North Carolina, sentences jump by 50% or more at sentencing guideline cell boundaries where allowable punishment types change to include incarceration. The first-stage F-statistic is 115. In Ohio, defendants assigned to the most severe judge receive incarceration sentences approximately six months longer than those assigned to the least severe judge (roughly 30% of the average non-zero sentence), with a slope of approximately 0.8 in the first-stage regression; F-statistic = 321. At least 37% of compliers in North Carolina and 45% in Ohio are shifted on the extensive margin (from no incarceration to some positive incarceration), with upper bounds as high as 95%.
Q4: What evidence supports instrument validity (exclusion restriction and independence)?
Instrument validity is tested by estimating 2SLS “effects” on pre-case outcomes measured 2–4 years before the focal case. In both states, the instruments show no relationship with pre-case employment, W-2 wages, total days previously incarcerated, or binary severe prior incarceration. The probability of being matched to IRS records and the quality of the match are also uncorrelated with the instruments. In Ohio, potential exclusion restriction violations from judges affecting conviction (not just sentence) are addressed empirically: nearly 90% of defendants are convicted, the most severe judge is only 0.7 p.p. more likely to convict than the least severe judge (t-stat = 1.53), and the estimated conviction rate among untreated compliers is 0.972 (s.e. 0.018), so one cannot reject that all non-incarcerated compliers are convicted.
Q5: How does the paper test for the incapacitation mechanism against post-release scarring?
Two complementary exercises are conducted. First, a “visual IV” plot regresses year-t earnings effects on year-t days-incarcerated effects across all post-filing years. If incapacitation is the sole channel, all points should lie on a line through the origin. The R² is 0.83 in North Carolina and 0.85 in Ohio, the estimated intercept is near zero (positive and small) in both states, and the slope (earnings lost per day incarcerated) is approximately $12. This implies cumulative earnings losses of $12 × 268 days = $3,216, very close to the directly estimated $2,914. Second, constructed outcomes that scale pre-case earnings or covariate-predicted earnings by the share of the year not incarcerated closely track actual earnings effects throughout the post-filing period, and both converge to zero as incapacitation effects fade — consistent with pure incapacitation and no net scarring.
Q6: What are the long-run (5–9 years) earnings and employment estimates, and how precisely are null effects ruled out?
Averaged across both states using inverse-variance weights, the estimated effect of a 12-month sentence on annual W-2 earnings five to nine years after filing is positive but statistically indistinguishable from zero. The 95% confidence interval rules out reductions in annual wages greater than $231 (approximately 5% of the untreated complier mean of roughly $4,500–$5,000). The 95% CI also rules out any adverse employment effects. The untreated complier mean for employment 5–9 years post-filing is approximately 40% in North Carolina and slightly above 40% in Ohio.
Q7: What happens to cumulative earnings over five years despite null long-run level effects?
Even though long-run annual earnings are unaffected, earnings losses during incapacitation are never made up. A one-year sentence reduces cumulative employment (measured as years with any W-2) and cumulative earnings over five years by approximately $2,914 — a 13% reduction relative to the complier mean. This reflects the mechanical loss of earnings during the period of physical incapacitation, without a subsequent compensating period of higher earnings after release.
Q8: Do defendants with stronger pre-case labor market attachment show different long-run patterns?
The sample is split between defendants employed in at least 2 of the 4 years prior to the case (53–57% of the sample across states) and those less attached. Both groups show zero long-run earnings and employment effects. Previously employed defendants experience much larger short-run earnings drops — more than three times larger in the first year post-filing — and their earnings recover more slowly, reaching zero effect approximately six years after filing (vs. three years for the previously unemployed). For a stricter cut (pre-case average earnings above $15,000, representing only 12–15% of the sample), the long-run earnings effect is −$1,426 (8% of the untreated complier mean), significant only at the 10% level, and partly attributable to residual incapacitation (19.6 additional days incarcerated 5–9 years post-filing). For defendants with pre-case earnings below $15,000, incarceration slightly increases long-run employment (2.4 pp, p = 0.01) and earnings ($400, p = 0.03), possibly reflecting rehabilitative benefits (GED or educational programs) for labor-market-detached individuals.
Q9: Does first-time incarceration (extensive-margin exposure) have larger long-run effects than repeat exposure?
The paper tests this by splitting the sample into defendants with and without prior incarceration history. Among defendants with no prior incarceration, the instruments generate large differences in lifetime exposure: a 12-month sentence increases the probability of ever being incarcerated over the next 5–9 years by 26 p.p. (North Carolina) and 41 p.p. (Ohio). Among those not receiving a sentence, 48% (North Carolina) and 19% (Ohio) are eventually incarcerated anyway, implying treatment causes a 52 and 81 p.p. increase in lifetime incarceration probability for extensive-margin compliers. Despite these large differences in lifetime exposure, long-run earnings and employment effects remain small and statistically insignificant in both subsamples. The difference in long-run effects between previously and never incarcerated defendants is not statistically significant (p = 0.29 for employment, p = 0.82 for earnings).
Q10: Are there heterogeneous effects by race, sex, or criminal history?
There is no evidence of long-run scarring for any demographic or criminal history subgroup. Effects for black and non-black defendants are both positive for long-run earnings and employment. Non-black defendants show somewhat larger cumulative losses (consistent with marginally higher counterfactual earnings), but differences are not statistically significant. Estimates for women are imprecise due to small sample size. Among defendants with and without prior felony charges in the four years preceding the case, there are neither economically nor statistically significant long-run earnings or employment effects. Cumulative losses are somewhat larger for defendants without prior felony charges (p = 0.07), reflecting their higher pre-case earnings.
Q11: How does the paper handle potential migration bias in outcomes?
Tax filing and W-2 receipt in the state of sentencing are used to proxy for whether defendants remain in the same state. Among untreated compliers, 88% of those with a tax footprint maintain it in the state of sentencing. No statistically significant effects of incarceration on migration (measured as filing or receiving a W-2 in North Carolina or Ohio) are detected, suggesting prior studies of recidivism measured within-state are unlikely to be severely biased by migration responses.
Q12: What effect does incarceration have on mortality?
Incarceration reduces five-year mortality by approximately 0.8 percentage points (about 20% of the untreated mean). The authors note this is too small to explain the null long-run labor market effects: even if all defendants whose death was averted were employed, removing them from the employment count would reduce the employment effect of a 12-month sentence only to approximately zero.
Q13: How do the paper’s findings compare to prior studies, particularly Mueller-Smith (2015)?
Mueller-Smith (2015) finds large and persistent negative incarceration effects on labor market outcomes in Texas using a structural decomposition and Lasso-based judge-covariate interactions as instruments. The paper argues methodological differences are the likely explanation: the Lasso-selected interacted instruments can be susceptible to many-weak instruments bias toward OLS. It notes that Mueller-Smith’s simpler 2SLS specifications (analogous to those used here) show no statistically significant earnings effects. North Carolina and Ohio are documented to be broadly similar to Texas (and the U.S. average) in rehabilitation program participation, recidivism rates, and incarceration rates, reducing the likelihood that genuine geographic heterogeneity explains the divergence.
Q14: What is the paper’s aggregate extrapolation of incapacitation earnings losses?
Scaling the estimated $2,914 cumulative loss per 12-month sentence by the ratio of days exposed to total days in a year gives a per-day loss of approximately $12. Applied to the 1,435,500 people incarcerated in U.S. prisons on any given day in 2019 (excluding the more than 700,000 in jail), the implied aggregate yearly earnings loss from incapacitation is approximately $6.16 billion.
Key Concepts
Incapacitation effect: The mechanical reduction in earnings and employment that occurs while a defendant is physically confined in prison and unable to work, as distinct from any post-release scarring effect. The paper shows this is the dominant — and essentially sole — causal channel through which incarceration affects labor market outcomes in their sample.
Post-release scarring: Persistent reductions in earnings or employment that persist after a defendant is released from prison, caused by mechanisms such as employer discrimination based on incarceration history, human capital depreciation, loss of job contacts, or psychological discouragement effects. The paper finds no evidence of scarring in either state.
Average Causal Response (ACR): The weighted average of the marginal dose effects of incarceration (e.g., effect of 12 vs. 11 months, 1 vs. 0 months) for groups of defendants whose sentence lengths are shifted by a given instrument. Contrasted with a binary LATE, the ACR averages across the full dosage distribution for compliers.
Complier: An individual whose incarceration sentence is shifted by the instrument — either from zero to some positive sentence (extensive margin) or from a shorter to a longer sentence (intensive margin). Counterfactual outcome means for compliers sentenced to zero months provide the baseline for evaluating effect magnitudes.
Sentencing guideline discontinuity: The discrete jump in permissible punishment types and minimum sentence lengths at specific criminal history score thresholds within North Carolina’s structured sentencing grid. Defendants just above a threshold are more likely to be incarcerated than otherwise similar defendants just below, generating quasi-experimental variation exploited as an instrument.
Leave-out mean judge instrument: In Ohio, each defendant’s assigned judge’s average incarceration sentence length computed over all other cases that judge handles (excluding the defendant’s own case), residualized on court-by-month fixed effects. Because judges are randomly assigned to cases, this measure is conditionally independent of defendant potential outcomes and serves as an instrument for sentence length.
Control complier mean: The estimated mean potential outcome for compliers under the counterfactual of receiving zero months of incarceration. Used as a benchmark to evaluate the magnitude of treatment effects and to characterize how low the earnings baseline is for the population driving the causal estimates.
Extensive vs. intensive margin of incarceration: The extensive margin refers to the binary shift from receiving no prison sentence to receiving any prison sentence; the intensive margin refers to increasing sentence length conditional on some incarceration. The paper argues that neither margin appears to produce long-run labor market scarring, and uses linear programming bounds to estimate that at least 37–45% of compliers in each state are shifted on the extensive margin.