An irrelevance theorem for risk aversion and time-varying risk
What this paper finds — and why it matters
Layer 1: Overview
Chen and Palomino prove a general irrelevance theorem identifying when risk aversion and time-varying risk are irrelevant for key model dynamics in representative-agent macroeconomic models. The central research question is why advances in risk modeling — Epstein-Zin (EZ) recursive preferences, long-run risk, disaster risk — generate rich asset price behavior in endowment economies but fail to produce commensurate effects in standard production economies. The paper resolves this puzzle by characterizing the precise structural conditions under which risk parameters become irrelevant, and provides a taxonomy for how models can escape those conditions.
The theoretical framework is a representative-agent model with EZ preferences, which separate the elasticity of intertemporal substitution (EIS, parameter psi) from risk aversion (gamma). The remaining economic structure — production technology, resource constraints, government policy, financial sector — is assumed to exhibit an analogous separation: variables that control expected values (“first moment states,” such as capital and productivity) are separated from variables that control higher central moments (“higher moment states,” such as stochastic volatility of productivity). The paper proceeds through three settings of increasing generality: a two-period illustrative model, a dynamic stochastic growth model with capital adjustment costs (Jermann 1998) and heteroskedastic AR(1) productivity, and a fully abstract general model covering a broad class of rational-expectations equilibrium systems.
The central result is Theorem 1: if (1) intertemporal and risk preferences are separated (EZ-style), (2) first and higher moment drivers of the remaining model structure are separated, and (3) constraints are approximately linear, then risk aversion gamma and higher-moment parameters theta_h are irrelevant for the elasticity of any endogenous variable — including all asset prices — with respect to first moment states and lagged endogenous variables. Formally, in the solution z_t = z + Z_zz_{t-1} + Z_xx_t + Z_h*h_t, the elasticity matrices Z_z and Z_x are independent of gamma and theta_h. Risk parameters affect only model intercepts and steady states (the constant z) and the elasticity with respect to higher moment states (Z_h). Thus augmenting a stochastic growth model with shocks to volatility or risk aversion has no effect on impulse responses to productivity shocks or other first-moment disturbances.
In the homoskedastic special case (constant volatility), risk aversion is irrelevant for the impulse response of every variable, including all asset prices. This clarifies the Tallarini (2000) separation: it is not a separation between macroeconomic and financial variables, but between means (average equity premium, steady-state levels) and volatilities and impulse responses. Risk aversion affects the level of the equity premium but not stock price volatility or impulse responses.
Numerical verification using projection methods (Caldara et al. 2012) confirms irrelevance holds even at risk aversion of 100 and unconditional volatility of volatility of 80% of baseline. A second, richer model class — with EIS of 0.3, capital adjustment cost elasticity of 3, and left-skewed gamma-distributed productivity shocks calibrated to match Bekaert and Engstrom (2017) quarterly consumption growth moments (kurtosis 4.04, skewness -0.399, matching model kurtosis of 4 and skewness of -0.82) — produces an equity premium more than three times larger than the baseline class and a stock price elasticity with respect to productivity about three times larger, yet continues to display irrelevance: risk aversion and time-varying risk have essentially no effect on the stock price elasticity with respect to productivity.
The theorem extends to smooth ambiguity preferences (Klibanoff, Marinacci, Mukerji 2005) and multiplier preferences (Hansen and Sargent 2001) as long as risk adjustments remain functions of higher-moment state variables. The paper also derives the Barro-King (1984) comovement restriction under recursive preferences (Appendix C), showing that in the neoclassical structure only productivity shocks generate positive comovement of consumption, investment, and labor. This interacts with the irrelevance theorem to explain why production-economy asset pricing models face a compounded difficulty: volatility and risk-aversion shocks cannot break irrelevance within the standard structure, and they also cannot generate the required comovement without additional mechanisms.
The paper provides a unified taxonomy for generating a meaningful role for risk in production economies. One can “break” irrelevance by removing one of the three assumptions: (1) allowing risk aversion to vary with economic conditions as in Campbell-Cochrane (1999) habit formation or heterogeneous agents; (2) introducing non-separability between first and higher moments in production, as in Di Tella and Hall (2022) where entrepreneurial idiosyncratic risk makes aggregate volatility endogenous; or (3) incorporating sufficient nonlinearity via occasionally binding constraints, as in Brunnermeier-Sannikov (2014) or Gourio-Ngo (2020) near the zero lower bound. Alternatively, one can “adapt” to irrelevance by driving dynamics with higher-moment shocks — volatility shocks (Basu-Bundick 2017, combined with nominal rigidities to preserve comovement) or risk-aversion shocks (Basu et al. 2024, combined with an investment reallocation channel).
Layer 2: Deep Dive
What is the core intuition behind the irrelevance theorem?
The Euler equation under EZ preferences decomposes into an Intertemporal Term (characterizing expected consumption-return tradeoffs, driven by EIS) and a Risk Term (characterizing tradeoffs across unexpected future states, driven by risk aversion). In standard models, the production technology is ‘a perfect foresight model with shocks tacked on’: transformation across time is separated from transformation across future states. Because constraints are approximately linear, innovations to endogenous variables with respect to first-moment shocks (productivity, capital) do not contain investment or other endogenous variables, so the Risk Term is a function only of higher-moment states. Differentiating the Euler equation with respect to a first-moment state therefore eliminates the Risk Term entirely, leaving only the Intertemporal Term and making the solution for that elasticity independent of gamma and sigma.
How is the Tallarini (2000) result clarified and extended?
Tallarini (2000) shows that risk aversion is irrelevant for quantity dynamics in a homoskedastic real business cycle model. This is widely interpreted as a separation between macroeconomic (quantity) and financial (price) variables. The paper shows this interpretation is incorrect. When shocks are homoskedastic, risk aversion is irrelevant not just for quantities but for all asset price dynamics, including stock price volatility. The actual separation is between means (steady states, intercepts, average equity premium — all of which depend on risk aversion) and volatilities and impulse responses (which do not). The paper extends Tallarini’s result by showing irrelevance holds for all endogenous variables including stock prices, by showing it persists under heteroskedasticity for elasticities with respect to first-moment states specifically, and by generalizing to abstract models beyond the neoclassical RBC framework.
What are the three conditions required for irrelevance and what is the role of each?
The three conditions are: (1) Separation of intertemporal and risk preferences — EZ-style preferences ensure risk aversion gamma enters only the Risk Term of the Euler equation, not the Intertemporal Term. If preferences are non-separable (e.g., power utility, habit formation), gamma enters the intertemporal tradeoff and affects first-moment elasticities. (2) Separation of first and higher moment drivers in the remaining model structure — production technology and all other constraints must not link transformation of goods across time to transformation across states. If higher-moment variables appear in the production function or resource constraint (e.g., idiosyncratic risk in entrepreneurial production as in Di Tella-Hall 2022), first-moment states appear in the Risk Term and irrelevance breaks. (3) Approximate linearity of constraints — nonlinearities create interactions between current state values and forward-looking volatility. Strong enough nonlinearities (such as those introduced by occasionally binding constraints near the zero lower bound or in financial crisis models) can cause irrelevance to fail even when conditions (1) and (2) hold.
What is the formal mathematical structure of the general model and theorem?
The general model consists of a system of expectational equilibrium conditions E[f(z_{t+1}, x_{t+1} | z_t, x_t, h_t, z_{t-1}; Theta)] = 0, where z_t are endogenous variables, x_t are first-moment exogenous states following a heteroskedastic AR(1) with shock distribution conditional on h_t, and h_t are higher-moment states with an independent AR(1) process. The equilibrium conditions split into constraints (f0, depending only on theta_0, not gamma or theta_h) and asset-pricing Euler equations (depending on the EZ SDF, hence on gamma). The proof uses a risk-adjusted affine approximation (Assumptions 1 and 2): constraints are approximated as conditionally affine in states; the CGF of shocks is conditionally affine in h_t. Conjecturing a linear solution z_t = z + Z_zz_{t-1} + Z_xx_t + Z_h*h_t and applying the method of undetermined coefficients in separate layers shows that Z_z satisfies a quadratic matrix equation depending only on theta_0 (Proposition 2, Equation 171), and Z_x satisfies a Sylvester equation also depending only on theta_0 and Z_z (Equation 172). Since neither equation involves gamma or theta_h, those parameters are irrelevant for Z_z and Z_x. Z_h and z do depend on all parameters including gamma and theta_h.
How does the irrelevance theorem interact with the Barro-King (1984) comovement constraint?
Barro and King (1984) show that, in the neoclassical structure, shocks other than productivity shocks fail to generate the observed positive comovement of consumption, investment, and labor. The paper derives this result under recursive preferences in Appendix C, confirming it extends to the EZ case. The comovement constraint implies that, within the neoclassical structure, the magnitude of higher-moment shocks must be limited to preserve comovement — production-economy asset pricing models typically drive business cycles with productivity shocks rather than volatility or risk-aversion shocks. But the irrelevance theorem implies that productivity shock impulse responses are independent of risk. Together, these results explain why modeling asset prices in production economies is non-trivial: one must simultaneously address comovement (ruling out large higher-moment shocks as the primary business cycle driver) and irrelevance (meaning productivity shocks cannot be enriched with risk dynamics). A successful model must either break irrelevance or adapt to it with mechanisms that also solve the comovement problem.
What does it mean to ‘break’ irrelevance and what are the main examples?
Breaking irrelevance means removing one of the three conditions so that risk aversion or risk parameters enter the elasticity with respect to first-moment states. Examples: (1) Campbell-Cochrane (1999) external habit: risk aversion varies over time as consumption approaches habit, creating time-varying links between the intertemporal and risk terms of the Euler equation. Heterogeneous households (Guvenen 2009) produce similar effects. (2) Di Tella and Hall (2022): entrepreneurs face uninsurable idiosyncratic shocks, making the aggregate production function incorporate risk. Volatility is endogenous and affects how the economy responds to first-moment shocks. Colacito et al. (2014), Decker et al. (2016), and Belo (2010) similarly incorporate production risk-return tradeoffs. (3) Brunnermeier-Sannikov (2014) financial frictions and Gourio-Ngo (2020) zero lower bound: occasionally binding constraints introduce strong enough nonlinearities to break the affine approximation and generate large endogenous volatility far from the steady state. A non-separable production example is also given: if k_{t+1} = (k+i)*1{epsilon >= 0}, investment appears in the consumption innovation and hence in the Risk Term, causing gamma and sigma to enter the first-moment elasticity.
What does it mean to ‘adapt’ to irrelevance and what are the main examples?
Adapting to irrelevance means staying within the class of models covered by the theorem but driving business cycle dynamics with shocks to higher-moment states rather than first-moment states. In this approach, risk aversion and risk parameters remain irrelevant for how the model responds to first-moment shocks (productivity, capital), but they do affect the elasticity with respect to higher-moment shocks and thus drive important dynamics. Basu and Bundick (2017) drive cycles with shocks to the volatility of time preference and maintain positive comovement of consumption, investment, and labor by incorporating nominal rigidities (New-Keynesian frictions break the Barro-King constraint). Basu et al. (2024) drive cycles with shocks to risk aversion and recover comovement via a novel investment reallocation channel between labor and capital. Dupor and Mehkari (2014) document other mechanisms that can overcome the comovement problem, including consumption-investment complementarities and externalities in leisure preferences.
How does the paper extend irrelevance beyond Epstein-Zin preferences?
The paper shows irrelevance holds for a broader family of preferences as long as the log SDF can be written as a base component m*{t+1} plus additional risk adjustments m{i,t+1} = f_tilde_i(Lambda, theta_0) * A_i * z_{t+1}, where Lambda is a generalized risk parameter vector (encompassing ambiguity aversion and other attitudes), and the associated certainty equivalent condition E_{i,t}[A_iz_{t+1}] = -H_{i,t}[f_hat_i * A_iz_{t+1}] holds. This formulation covers smooth ambiguity preferences (Klibanoff et al. 2005, illustrated via Ju-Miao 2012 generalized smooth ambiguity with ambiguity aversion parameter eta) and multiplier preferences (Hansen-Sargent 2001). The key property for irrelevance to hold is that the risk adjustments are solely functions of higher-moment state variables h_t. For smooth ambiguity, irrelevance holds if belief dynamics are exogenous, as in Ilut-Schneider (2014).
What numerical exercises are conducted to validate the approximate linearity assumption?
Two classes of models are solved using projection methods (Caldara et al. 2012), which provide the highest accuracy among available solution methods and capture time variation in risk premiums that second-order perturbation methods cannot. Class 1 replicates Tallarini (2000): EIS = 1, elasticity of investment = 10, normally distributed shocks (gamma shape parameter = 600), calibrated to HP-filtered output volatility of about 1.5% per quarter. Class 2 introduces larger frictions: EIS = 0.3, elasticity of investment = 3, left-skewed gamma shocks with shape parameter 6 (implying kurtosis = 4, skewness = -0.82, consistent with Bekaert-Engstrom 2017 empirical moments of quarterly consumption growth: kurtosis 4.04, skewness -0.399). For both classes, risk aversion is varied up to 100 and the unconditional volatility of volatility up to 80% of the baseline volatility. In both classes, the stock price elasticity with respect to productivity shows essentially no variation with risk aversion or volatility-of-volatility (though a slight negligible median decline is noted), while the equity premium and the stock price elasticity with respect to volatility respond clearly to those risk parameters. The exercise also shows Class 2 produces an equity premium more than three times larger than Class 1 and a stock price elasticity with respect to productivity about three times larger, yet irrelevance persists.
How does the paper relate to and differ from Backus, Ferriere, and Zin (2015)?
Backus, Ferriere, and Zin (2015) is the closest predecessor, providing irrelevance results for several specific models of time-varying risk and time-varying ambiguity. However, the paper argues they share the common misinterpretation of the Tallarini property as a separation between quantities and prices. The present paper extends their results into a fully abstract, general model structure with arbitrary equilibrium conditions and arbitrary shock distributions, proving irrelevance without tying it to specific model structures. This generality allows the paper to clarify that the separation is between means and volatilities, not between macro and finance variables. The paper also provides a clearer account of how models generate meaningful risk dynamics by breaking or adapting to the three theorem conditions.
What is the relationship between the paper’s results and risk-adjusted affine approximations in the prior literature?
The proof builds directly on the risk-adjusted affine approximation methodology of Jermann (1998), Malkhozov (2014), and Lopez, Lopez-Salido, and Vazquez-Grande (2018). These approximations preserve exact equality for the nonlinear expectation and certainty equivalent equations (not linearizing them) while linearizing other constraints. Special cases of the irrelevance result appear in the second- and third-order perturbation solutions of Schmitt-Grohe and Uribe (2004) and Van Binsbergen et al. (2012), which this paper unifies and generalizes. The use of entropy (the conditional cumulant generating function operator) to summarize higher-order terms is motivated by Backus et al. (2014), who show entropy effectively summarizes asset pricing properties of pricing kernels. The conditionally affine CGF assumption (Assumption 2) generalizes the normal-shock setting where CGFs are exactly affine in h_t.
What are the scope conditions and limitations of the theorem?
The theorem applies under three maintained assumptions: (1) separation of preferences (EZ-style or the broader class in Section 4.4), (2) separation of first and higher moment drivers in all model constraints including government, financial sector, labor markets, and endowment processes, and (3) approximate linearity — formally, that the affine approximation (Assumptions 1 and 2) is accurate. The theorem does NOT apply when: constraints are strongly nonlinear due to occasionally binding constraints (ZLB, financial crisis regimes); production incorporates endogenous risk-return tradeoffs; risk aversion varies endogenously with the state (habit formation, wealth distribution with heterogeneous agents); or belief dynamics are endogenous in the ambiguity case. The paper cannot provide a complete characterization of when nonlinearities are ‘strong enough’ to break irrelevance — numerical evidence suggests simply increasing risk aversion or vol-of-vol is insufficient, but occasionally binding constraints in the literature have been shown to be sufficient. The theorem also assumes the first and higher moment state shocks are independent (Equation 54), a modeling assumption that drives the separation.
What do the results imply for how the field should model asset prices in production economies?
The theorem implies that meaningful risk modeling in production economies is fundamentally more demanding than in endowment economies. In endowment economies, adding EZ preferences with high risk aversion or stochastic volatility directly affects how asset prices respond to the endowment process. In production economies, these same additions have no effect on impulse responses to productivity shocks — the primary drivers of business cycles in the neoclassical structure — because productivity is a first-moment state. Successful production-economy asset pricing models must therefore either: incorporate mechanisms that connect intertemporal and risk tradeoffs in production (endogenous volatility, incomplete markets, idiosyncratic risk); introduce sufficient structural nonlinearity; or drive business cycles with higher-moment shocks combined with additional mechanisms to preserve comovement. The paper suggests that the limited success of long-run risk and disaster risk models in production economies is not a failure of calibration but a logical consequence of the theorem’s conditions being satisfied.
Key Concepts
First moment states: Exogenous state variables that affect expected values of the model structure (e.g., productivity level, capital stock) but not the higher central moments of the shock distributions. In the general model, x_t with shock distribution having zero mean conditional on h_t but variance and higher moments controlled entirely by h_t, not x_t itself.
Higher moment states: Exogenous state variables that control the conditional higher central moments (variance, skewness, kurtosis) of the shock distributions but not their means — e.g., stochastic volatility of productivity h_t. Risk aversion and parameters governing higher moments (theta_h) are irrelevant for elasticities with respect to first-moment states but are critical for elasticities with respect to higher-moment states.
Irrelevance (in this paper’s sense): The property that risk aversion gamma and higher-moment parameters theta_h do not enter the matrices Z_z and Z_x in the solution z_t = z + Z_zz_{t-1} + Z_xx_t + Z_h*h_t. These parameters are irrelevant for impulse responses and dynamic elasticities with respect to first-moment states, though they do affect steady states (z), model intercepts, and elasticities with respect to higher-moment states (Z_h).
Breaking irrelevance: Removing one of the three theorem conditions — separability of preferences, separability of first and higher moment drivers in constraints, or approximate linearity — so that risk aversion or risk parameters enter the first-moment elasticities. Requires economically substantive modifications such as endogenous risk-return tradeoffs in production, habit formation, or occasionally binding constraints.
Adapting to irrelevance: Staying within the class of models covered by the theorem — accepting that risk parameters do not affect first-moment impulse responses — but driving business cycle dynamics primarily with shocks to higher-moment states (volatility, risk aversion). Requires additional mechanisms (nominal rigidities, reallocation channels) to maintain positive comovement of consumption, investment, and labor, which higher-moment shocks cannot generate in the neoclassical structure alone.
Risk-adjusted affine approximation: A solution method that preserves the nonlinear expectation and certainty equivalent equations exactly (not linearizing them, thereby retaining all risk effects) while log-linearizing the remaining constraints. The resulting solution is affine in the state variables, with the CGF of shocks assumed to be conditionally affine in the higher-moment states h_t. This approach captures higher-order risk terms while maintaining analytical tractability.
Entropy operator: The conditional matrix operator H_t[u] = log E_t[exp(u - E_t[u])], equivalent to the vectorized conditional cumulant generating function (CGF) evaluated at 1. Used to represent all higher-order terms in the equilibrium conditions compactly; the key technical tool enabling the proof to separate expectational terms (independent of risk parameters) from entropy terms (functions of higher-moment states).
Means-volatilities separation: The corrected characterization of Tallarini (2000)’s result: risk aversion affects model means (intercepts, steady states, average equity premium) but not volatilities or impulse responses of any variable — including asset prices — when shocks are homoskedastic. This reinterpretation replaces the widely held but incorrect view that Tallarini establishes a separation between macroeconomic and financial variables.