Stochastic Volatility: Heston Model Explained

Options traders know that implied volatility varies across strikes — deep out-of-the-money puts typically trade at higher implied volatility than at-the-money options. This pattern, known as the volatility smile or skew, cannot be explained by the Black-Scholes model, which assumes a single constant volatility. The Heston model (1993) addresses this limitation by treating volatility as a random process with its own dynamics, correlated with the underlying asset price. This article explains how stochastic volatility models work, what each Heston parameter controls, and why negative correlation generates the characteristic equity skew.

What Is a Stochastic Volatility Model?

A stochastic volatility model is a class of options pricing framework where volatility is itself a random variable driven by its own stochastic process, rather than a fixed parameter. Instead of specifying a single number for volatility as Black-Scholes does, these models give volatility its own equation — it can rise, fall, cluster during crises, and revert toward a long-run level over time.

Key Concept

A stochastic volatility model treats variance as a second random process, correlated with — but separate from — the asset price process. This correlation is what allows the model to generate realistic volatility smiles and skews.

The Heston model (1993) is the workhorse of stochastic volatility modeling because it offers semi-closed-form pricing via characteristic functions. While the pricing formula still requires numerical integration, this is far more tractable than full Monte Carlo simulation. Other stochastic volatility models include SABR (popular in interest rate markets), the Bates model (Heston plus jumps), and more recent rough volatility models.

The key intuition is that in real markets, fear (volatility) itself rises and falls — it clusters, spikes during market stress, and slowly reverts back to normal levels. Stochastic volatility models capture this behavior directly, while constant-volatility models like Black-Scholes cannot.

Why Constant Volatility Fails

The Black-Scholes model assumes volatility is a single, constant number. Under this assumption, all options on the same underlying with the same expiration should have the same implied volatility — a perfectly flat volatility surface.

Important Limitation

Using a single implied volatility to price or hedge a portfolio of options across different strikes systematically misprices tail risk. The market’s actual volatility surface is not flat.

Empirical reality tells a different story. When you back out implied volatility from market prices using the Black-Scholes formula, you get different numbers for different strikes — proving the constant-volatility assumption is wrong. Specifically:

  • Fat tails: Black-Scholes uses a lognormal distribution, but actual returns have fatter tails — extreme moves happen more frequently than the model predicts.
  • Leverage effect: Equity volatility tends to rise when prices fall. Black-Scholes has no mechanism for this correlation.
  • Volatility clustering: High volatility begets high volatility. Periods of calm and periods of turbulence cluster together. Black-Scholes ignores this persistence entirely.
The Post-1987 Skew

After the 1987 stock market crash, S&P 500 options permanently developed a steep volatility skew — out-of-the-money puts now trade at significantly higher implied volatility than at-the-money options. No single constant volatility can reproduce this pattern in Black-Scholes.

March 2020: COVID Crash and VIX Spike

When the S&P 500 fell over 30% in March 2020, the VIX (a measure of expected S&P 500 volatility) spiked from around 15 to over 80 — its highest level since 2008. This simultaneous crash in equities and spike in volatility is exactly what the Heston model’s negative ρ captures. Black-Scholes, with no mechanism for correlated stock-volatility moves, cannot explain why volatility quadrupled precisely when prices collapsed.

Heston Model Formula: Mean-Reverting Variance

Steven Heston published “A Closed-Form Solution for Options with Stochastic Volatility” in 1993. The key innovation was finding an analytically tractable solution using characteristic functions, despite volatility being random. The model uses two coupled stochastic differential equations (SDEs).

Stock Price Process
dS = (r – q)S dt + √v · S · dWS
The stock follows geometric Brownian motion with instantaneous variance v(t) replacing constant σ²

Where (r – q) is the risk-free rate minus dividend yield — the risk-neutral drift used for pricing. The diffusion term uses √v instead of constant σ, where v(t) is the instantaneous variance at time t.

Variance Process (CIR-type)
dv = κ(θ – v) dt + ξ√v · dWv
Variance mean-reverts toward long-run level θ at speed κ, with randomness scaled by vol-of-vol ξ

This is a Cox-Ingersoll-Ross (CIR) type process. When variance v is above its long-run level θ, the drift is negative (pulling v down). When v is below θ, the drift is positive (pulling v up). The √v scaling in the diffusion term ensures variance remains non-negative.

Key Concept

The Heston model’s critical innovation is not just that variance is random — it is that variance and stock price move together through a correlation parameter ρ. This correlation is what generates realistic volatility skews.

The two Brownian motions are correlated:

Correlation Structure
E[dWS · dWv] = ρ dt
Stock and variance shocks are correlated with coefficient ρ (typically negative for equities)

The Feller condition provides a sufficient condition for variance to stay strictly positive:

Feller Condition
2κθ > ξ²
When satisfied, variance remains strictly positive from a positive start; when violated, variance stays non-negative but zero becomes attainable

Think of variance as a rubber band stretched toward its long-run level θ. The parameter κ controls how hard the band pulls back, and ξ controls how much it quivers along the way. The stock price responds to wherever the rubber band currently sits.

Heston Model Parameters

The Heston model has five parameters. Understanding what each one controls is essential for interpreting calibration results and model behavior.

Parameter Name Intuition Typical Equity Range
κ Mean reversion speed How quickly variance snaps back to θ 1 to 5 (annualized)
θ Long-run variance Level variance gravitates toward; √θ is long-run vol 0.04 to 0.09 (20-30% vol)
ξ Vol of vol How much variance itself fluctuates 0.2 to 0.5
ρ Correlation Link between stock and variance shocks -0.9 to -0.5
v0 Initial variance Starting variance level Calibrated to current ATM IV
Key Concept

ρ controls skew (the tilt of the smile); ξ controls kurtosis (the curvature of the wings). Both matter for fitting a realistic volatility surface, but they do fundamentally different jobs.

κ (kappa) — Mean reversion speed: High κ means variance snaps back to θ quickly; the process is tightly anchored. Low κ means variance can drift far from its long-run level for extended periods.

θ (theta) — Long-run variance: This is the level variance gravitates toward over time. If θ = 0.04, then long-run volatility is √0.04 = 20%.

ξ (xi) — Volatility of volatility: Controls how much variance itself fluctuates. Higher ξ produces fatter tails and more pronounced wings in the volatility surface. Note that ξ governs curvature; it does not by itself create skew.

ρ (rho) — Correlation: For equities, ρ is empirically negative (typically -0.5 to -0.8). When ρ < 0, stock price falls tend to coincide with variance rises — the leverage effect. This is the single most important parameter for generating realistic equity skew. Positive ρ can occur in some commodity markets where price rallies are associated with rising volatility.

v0 — Initial variance: The starting value for the variance process, typically calibrated jointly with other parameters to fit the current volatility surface.

Pro Tip

When calibrating Heston, always check the Feller condition (2κθ > ξ²). Calibrations that violate it may fit the current surface well but can produce numerical instabilities or require special handling when simulating paths.

Important: Calibrated Heston parameters are risk-neutral pricing parameters, not historical estimates. They are derived from option prices under the risk-neutral measure Q, not from historical return data under the real-world measure P. The risk-neutral parameters embed a volatility risk premium that may cause them to differ significantly from what you would estimate from realized returns.

How the Heston Model Explains the Volatility Smile

The volatility smile and skew article documents what traders observe empirically — the shape of the implied volatility surface across strikes and maturities. This section explains why the Heston model produces these patterns mechanistically.

The mechanism depends primarily on the correlation parameter ρ (though skew magnitude also depends on ξ and maturity):

  • ρ = 0: Heston produces a symmetric smile — both wings (deep OTM puts and calls) trade at higher implied volatility than ATM. This is pure kurtosis from ξ > 0.
  • ρ < 0: The smile tilts left into a skew — OTM puts become more expensive (higher IV) than OTM calls. This is the characteristic equity skew.
Key Concept

Setting ρ = 0 in Heston produces a symmetric smile from kurtosis alone. It is negative correlation (ρ < 0) that tilts the smile into the characteristic equity skew.

The economic intuition: investors price crash risk. When markets fall sharply, volatility spikes simultaneously (because ρ < 0 links downward stock moves to upward variance moves). Put options that pay off in crashes therefore cost more — they pay off precisely when volatility is elevated, making them more valuable as insurance.

Illustrative Example: How Negative ρ Creates Equity Skew

Consider an equity index at 4,000 with Heston parameters: ρ = -0.70, κ = 2.0, θ = 0.04, ξ = 0.30, v0 = 0.04.

During a sudden 10% drop to 3,600:

  • The stock process experiences a large negative shock (dWS < 0)
  • Because ρ = -0.70, the variance shock is positive with high probability — variance v rises sharply
  • The market is now in a state of elevated variance precisely when put options are deep in the money

Result: Put options priced under this Heston calibration cost significantly more than Black-Scholes (or Heston with ρ = 0) would suggest. The implied volatility curve shows a pattern like: ATM ~20%, 10% OTM put ~26-28%, 10% OTM call ~18% — the classic left-skewed smile. (Exact levels depend on maturity and calibration specifics.)

The term structure of the smile is governed by κ and θ jointly. High κ means short-dated smiles are steep but long-dated smiles flatten toward the long-run level θ. Low κ means the smile shape persists more uniformly across maturities.

Stochastic Volatility Model vs Local Volatility vs GARCH

Stochastic volatility is one of several approaches to modeling non-constant volatility. Each framework has distinct strengths and use cases.

Dimension Stochastic Volatility (Heston) Local Volatility (Dupire) GARCH
Time framework Continuous time Continuous time Discrete time
Volatility process Separate stochastic process with own Brownian motion Deterministic function σ(S,t) Conditional variance updated each period
Smile fit Generates smile via ρ and ξ; requires calibration Fits any observed surface exactly by construction Produces fat tails; option-pricing variants exist but less common
Forward dynamics Realistic — smile can shift, widen, narrow Less realistic — forward smile tends to flatten Not the primary tool for options dynamics
Calibration 5 parameters; nonlinear optimization Non-parametric; solved from market prices Estimated from historical return series
Primary use case Options pricing and risk management Exotic options pricing matching vanilla surface Risk modeling, VaR, econometric forecasting

Stochastic vs Local Volatility: Dupire’s local volatility model fits any observed vanilla surface exactly by construction — which sounds ideal. The problem is that its forward smile dynamics are unrealistic; in local vol, the smile tends to flatten as time passes in ways that don’t match observed behavior. Heston’s smile can shift and persist more realistically. In practice, many trading desks use Stochastic Local Volatility (SLV) models that blend both approaches.

Stochastic Volatility vs GARCH: GARCH models live in discrete time and are estimated from historical price data. They excel at econometric forecasting of realized volatility and Value at Risk calculations. Stochastic volatility models live in continuous time and are calibrated to option prices under the risk-neutral measure. They answer different questions: GARCH asks “what will volatility be next week?” while Heston asks “what is the risk-neutral distribution for pricing options today?”

Pro Tip

In practice, the industry often uses Stochastic Local Volatility (SLV) models, which combine a local volatility backbone with a stochastic volatility overlay. This preserves exact vanilla calibration while improving forward smile dynamics.

Practical Challenges with Stochastic Volatility Models

While Heston improves significantly on Black-Scholes, it comes with practical difficulties:

Calibration complexity: Five parameters must be jointly optimized to fit the market volatility surface. The optimization problem is non-convex — multiple parameter sets can produce similar vanilla fits but differ significantly in their exotic pricing and hedging behavior.

Computational cost: Unlike Black-Scholes, Heston does not have a simple closed-form price. The semi-analytical solution requires numerical integration of a characteristic function (Fourier inversion). This is slower than Black-Scholes but far faster than full Monte Carlo simulation.

Greeks and hedging: In Heston, you need to hedge not just delta (∂V/∂S) but also vega (∂V/∂v). Because variance v is not directly observable or tradeable, vega hedging requires using other traded options as instruments. The hedge ratios are more complex than Black-Scholes.

Parameter instability: Heston parameters calibrated to today’s surface may differ significantly from parameters calibrated to yesterday’s surface. The model is not necessarily stable across time — this is the “recalibration problem.”

Recalibration Warning

Heston parameters calibrated to the current surface can shift significantly from day to day. A calibration that looks perfect today may produce unstable Greeks tomorrow. Monitor recalibration stability as part of model risk management.

Limitations of the Heston Model

Even with stochastic volatility, the Heston model has fundamental limitations:

Single-factor variance: Heston uses one stochastic variance factor. Real volatility surfaces exhibit multi-factor behavior — short-dated and long-dated smiles can move independently. Extensions like time-dependent Heston, double Heston, or the Bates model address some of these limitations.

No jumps: Heston assumes continuous price paths. Equity returns exhibit sudden jumps from overnight gaps, earnings announcements, or market events. The Bates model (1996) extends Heston with a Poisson jump component to capture this behavior.

Rough volatility: Recent empirical research by Gatheral and Rosenbaum finds that realized volatility has fractal-like “rough” behavior not captured by the smooth CIR process. Rough volatility models like rBergomi better fit certain features of the ATM skew term structure.

Feller condition trade-off: Calibrations that impose the Feller condition for theoretical cleanliness often produce worse fits to the observed surface. Practitioners frequently allow Feller violations, accepting the limitation for better empirical accuracy.

Model Risk

Even a well-calibrated Heston model will misprice some exotic options. Fitting the marginal distributions (vanilla surface) does not constrain the joint distribution across time. Barrier options and path-dependent exotics need separate validation.

Common Mistakes

Understanding these common errors helps avoid pitfalls when working with stochastic volatility models:

1. Confusing ρ and ξ effects on the smile
Mistake: Assuming that vol-of-vol (ξ) is responsible for the skew.
Correction: ξ controls the curvature (wings) of the smile, not the tilt. The tilt — the asymmetry between put and call implied vols — comes from ρ < 0. A model with ξ > 0 but ρ = 0 produces only a symmetric smile.

2. Confusing risk-neutral and real-world parameters
Mistake: Using historically estimated variance parameters directly for option pricing, or interpreting calibrated parameters as forecasts of future realized volatility.
Correction: Calibrated Heston parameters are risk-neutral (Q-measure) quantities derived from option prices. They embed a volatility risk premium and may differ substantially from P-measure parameters estimated from historical returns. Use the right parameters for the right purpose.

3. Calibrating to a single expiry only
Mistake: Calibrating Heston parameters to fit the smile for one expiry and using the result for all maturities.
Correction: Heston has a specific term structure of volatility implied by κ and θ. Calibrating to a single slice can produce parameters that wildly misprice options at other maturities. Always calibrate to a multi-expiry surface.

4. Ignoring the Feller condition in simulation
Mistake: Allowing the Feller condition to be violated in Monte Carlo simulation without implementing a fix for near-zero or negative variance.
Correction: Even if calibration violates Feller for better fit, the simulation discretization must handle near-zero variance paths explicitly. Common fixes include the full truncation scheme or the QE (Quadratic Exponential) scheme from Andersen (2007).

Path-Dependent Exotics Warning

Fitting the vanilla surface does not guarantee correct exotic pricing. Two Heston calibrations with similar vanilla prices can produce materially different barrier or Asian option prices. Always validate exotic pricing independently.

Frequently Asked Questions

A stochastic volatility model is an options pricing framework where volatility is treated as a random variable that evolves over time, rather than being fixed. Instead of specifying a single number for volatility (as Black-Scholes does), the model gives volatility its own stochastic process — it can rise, fall, cluster, and revert to a long-run level. The Heston model is the most widely used example because it offers tractable pricing via characteristic functions.

The Heston model generates the volatility skew through its correlation parameter ρ. When ρ is negative (typically -0.5 to -0.8 for equities), stock price declines tend to coincide with variance increases — the leverage effect. This means put options that pay off during market crashes are more valuable because volatility spikes at the same time. The result is higher implied volatility for out-of-the-money puts than for out-of-the-money calls, producing the characteristic downward-sloping skew observed in equity markets.

Black-Scholes treats volatility as a fixed constant for the life of the option. Heston replaces that constant with a dynamic process: variance follows a mean-reverting CIR-type process with its own randomness, correlated with the stock price. This makes Heston capable of generating the volatility smile and skew that Black-Scholes cannot produce. The trade-off is added complexity: five parameters instead of one, and numerical integration instead of a simple closed-form formula.

The Feller condition is the inequality 2κθ > ξ². It is a sufficient condition for the variance process to stay strictly positive from a positive starting point. When satisfied, variance can approach zero but never reach it. When violated, variance remains non-negative but zero becomes attainable. In practice, calibrated Heston parameters sometimes violate this condition because the market smile requires high vol-of-vol (ξ). Practitioners accept this trade-off but must handle near-zero variance carefully in Monte Carlo simulation to avoid numerical errors.

Heston is calibrated to implied volatility — specifically, to option prices across strikes and maturities. The calibrated parameters are risk-neutral (Q-measure) quantities, not historical estimates. This is fundamentally different from GARCH models, which are estimated from historical return data. The distinction matters: Heston parameters embed a volatility risk premium and are designed for pricing, while historical volatility estimates are designed for forecasting. Using one in place of the other leads to errors.

Disclaimer

This article is for educational and informational purposes only and does not constitute investment or trading advice. Stochastic volatility models including the Heston model involve significant mathematical complexity. Model outputs depend on calibration choices and assumptions; no model perfectly represents market behavior. Past calibration accuracy does not guarantee future performance. Always consult qualified professionals before making trading or risk management decisions.