Stochastic Volatility: Heston Model Explained
Options traders know that implied volatility varies across strikes — deep out-of-the-money puts typically trade at higher implied volatility than at-the-money options. This pattern, known as the volatility smile or skew, cannot be explained by the Black-Scholes model, which assumes a single constant volatility. The Heston model (1993) addresses this limitation by treating volatility as a random process with its own dynamics, correlated with the underlying asset price. This article explains how stochastic volatility models work, what each Heston parameter controls, and why negative correlation generates the characteristic equity skew.
What Is a Stochastic Volatility Model?
A stochastic volatility model is a class of options pricing framework where volatility is itself a random variable driven by its own stochastic process, rather than a fixed parameter. Instead of specifying a single number for volatility as Black-Scholes does, these models give volatility its own equation — it can rise, fall, cluster during crises, and revert toward a long-run level over time.
A stochastic volatility model treats variance as a second random process, correlated with — but separate from — the asset price process. This correlation is what allows the model to generate realistic volatility smiles and skews.
The Heston model (1993) is the workhorse of stochastic volatility modeling because it offers semi-closed-form pricing via characteristic functions. While the pricing formula still requires numerical integration, this is far more tractable than full Monte Carlo simulation. Other stochastic volatility models include SABR (popular in interest rate markets), the Bates model (Heston plus jumps), and more recent rough volatility models.
The key intuition is that in real markets, fear (volatility) itself rises and falls — it clusters, spikes during market stress, and slowly reverts back to normal levels. Stochastic volatility models capture this behavior directly, while constant-volatility models like Black-Scholes cannot.
Why Constant Volatility Fails
The Black-Scholes model assumes volatility is a single, constant number. Under this assumption, all options on the same underlying with the same expiration should have the same implied volatility — a perfectly flat volatility surface.
Using a single implied volatility to price or hedge a portfolio of options across different strikes systematically misprices tail risk. The market’s actual volatility surface is not flat.
Empirical reality tells a different story. When you back out implied volatility from market prices using the Black-Scholes formula, you get different numbers for different strikes — proving the constant-volatility assumption is wrong. Specifically:
- Fat tails: Black-Scholes uses a lognormal distribution, but actual returns have fatter tails — extreme moves happen more frequently than the model predicts.
- Leverage effect: Equity volatility tends to rise when prices fall. Black-Scholes has no mechanism for this correlation.
- Volatility clustering: High volatility begets high volatility. Periods of calm and periods of turbulence cluster together. Black-Scholes ignores this persistence entirely.
After the 1987 stock market crash, S&P 500 options permanently developed a steep volatility skew — out-of-the-money puts now trade at significantly higher implied volatility than at-the-money options. No single constant volatility can reproduce this pattern in Black-Scholes.
When the S&P 500 fell over 30% in March 2020, the VIX (a measure of expected S&P 500 volatility) spiked from around 15 to over 80 — its highest level since 2008. This simultaneous crash in equities and spike in volatility is exactly what the Heston model’s negative ρ captures. Black-Scholes, with no mechanism for correlated stock-volatility moves, cannot explain why volatility quadrupled precisely when prices collapsed.
Heston Model Formula: Mean-Reverting Variance
Steven Heston published “A Closed-Form Solution for Options with Stochastic Volatility” in 1993. The key innovation was finding an analytically tractable solution using characteristic functions, despite volatility being random. The model uses two coupled stochastic differential equations (SDEs).
Where (r – q) is the risk-free rate minus dividend yield — the risk-neutral drift used for pricing. The diffusion term uses √v instead of constant σ, where v(t) is the instantaneous variance at time t.
This is a Cox-Ingersoll-Ross (CIR) type process. When variance v is above its long-run level θ, the drift is negative (pulling v down). When v is below θ, the drift is positive (pulling v up). The √v scaling in the diffusion term ensures variance remains non-negative.
The Heston model’s critical innovation is not just that variance is random — it is that variance and stock price move together through a correlation parameter ρ. This correlation is what generates realistic volatility skews.
The two Brownian motions are correlated:
The Feller condition provides a sufficient condition for variance to stay strictly positive:
Think of variance as a rubber band stretched toward its long-run level θ. The parameter κ controls how hard the band pulls back, and ξ controls how much it quivers along the way. The stock price responds to wherever the rubber band currently sits.
Heston Model Parameters
The Heston model has five parameters. Understanding what each one controls is essential for interpreting calibration results and model behavior.
| Parameter | Name | Intuition | Typical Equity Range |
|---|---|---|---|
| κ | Mean reversion speed | How quickly variance snaps back to θ | 1 to 5 (annualized) |
| θ | Long-run variance | Level variance gravitates toward; √θ is long-run vol | 0.04 to 0.09 (20-30% vol) |
| ξ | Vol of vol | How much variance itself fluctuates | 0.2 to 0.5 |
| ρ | Correlation | Link between stock and variance shocks | -0.9 to -0.5 |
| v0 | Initial variance | Starting variance level | Calibrated to current ATM IV |
ρ controls skew (the tilt of the smile); ξ controls kurtosis (the curvature of the wings). Both matter for fitting a realistic volatility surface, but they do fundamentally different jobs.
κ (kappa) — Mean reversion speed: High κ means variance snaps back to θ quickly; the process is tightly anchored. Low κ means variance can drift far from its long-run level for extended periods.
θ (theta) — Long-run variance: This is the level variance gravitates toward over time. If θ = 0.04, then long-run volatility is √0.04 = 20%.
ξ (xi) — Volatility of volatility: Controls how much variance itself fluctuates. Higher ξ produces fatter tails and more pronounced wings in the volatility surface. Note that ξ governs curvature; it does not by itself create skew.
ρ (rho) — Correlation: For equities, ρ is empirically negative (typically -0.5 to -0.8). When ρ < 0, stock price falls tend to coincide with variance rises — the leverage effect. This is the single most important parameter for generating realistic equity skew. Positive ρ can occur in some commodity markets where price rallies are associated with rising volatility.
v0 — Initial variance: The starting value for the variance process, typically calibrated jointly with other parameters to fit the current volatility surface.
When calibrating Heston, always check the Feller condition (2κθ > ξ²). Calibrations that violate it may fit the current surface well but can produce numerical instabilities or require special handling when simulating paths.
Important: Calibrated Heston parameters are risk-neutral pricing parameters, not historical estimates. They are derived from option prices under the risk-neutral measure Q, not from historical return data under the real-world measure P. The risk-neutral parameters embed a volatility risk premium that may cause them to differ significantly from what you would estimate from realized returns.
How the Heston Model Explains the Volatility Smile
The volatility smile and skew article documents what traders observe empirically — the shape of the implied volatility surface across strikes and maturities. This section explains why the Heston model produces these patterns mechanistically.
The mechanism depends primarily on the correlation parameter ρ (though skew magnitude also depends on ξ and maturity):
- ρ = 0: Heston produces a symmetric smile — both wings (deep OTM puts and calls) trade at higher implied volatility than ATM. This is pure kurtosis from ξ > 0.
- ρ < 0: The smile tilts left into a skew — OTM puts become more expensive (higher IV) than OTM calls. This is the characteristic equity skew.
Setting ρ = 0 in Heston produces a symmetric smile from kurtosis alone. It is negative correlation (ρ < 0) that tilts the smile into the characteristic equity skew.
The economic intuition: investors price crash risk. When markets fall sharply, volatility spikes simultaneously (because ρ < 0 links downward stock moves to upward variance moves). Put options that pay off in crashes therefore cost more — they pay off precisely when volatility is elevated, making them more valuable as insurance.
Consider an equity index at 4,000 with Heston parameters: ρ = -0.70, κ = 2.0, θ = 0.04, ξ = 0.30, v0 = 0.04.
During a sudden 10% drop to 3,600:
- The stock process experiences a large negative shock (dWS < 0)
- Because ρ = -0.70, the variance shock is positive with high probability — variance v rises sharply
- The market is now in a state of elevated variance precisely when put options are deep in the money
Result: Put options priced under this Heston calibration cost significantly more than Black-Scholes (or Heston with ρ = 0) would suggest. The implied volatility curve shows a pattern like: ATM ~20%, 10% OTM put ~26-28%, 10% OTM call ~18% — the classic left-skewed smile. (Exact levels depend on maturity and calibration specifics.)
The term structure of the smile is governed by κ and θ jointly. High κ means short-dated smiles are steep but long-dated smiles flatten toward the long-run level θ. Low κ means the smile shape persists more uniformly across maturities.
Stochastic Volatility Model vs Local Volatility vs GARCH
Stochastic volatility is one of several approaches to modeling non-constant volatility. Each framework has distinct strengths and use cases.
| Dimension | Stochastic Volatility (Heston) | Local Volatility (Dupire) | GARCH |
|---|---|---|---|
| Time framework | Continuous time | Continuous time | Discrete time |
| Volatility process | Separate stochastic process with own Brownian motion | Deterministic function σ(S,t) | Conditional variance updated each period |
| Smile fit | Generates smile via ρ and ξ; requires calibration | Fits any observed surface exactly by construction | Produces fat tails; option-pricing variants exist but less common |
| Forward dynamics | Realistic — smile can shift, widen, narrow | Less realistic — forward smile tends to flatten | Not the primary tool for options dynamics |
| Calibration | 5 parameters; nonlinear optimization | Non-parametric; solved from market prices | Estimated from historical return series |
| Primary use case | Options pricing and risk management | Exotic options pricing matching vanilla surface | Risk modeling, VaR, econometric forecasting |
Stochastic vs Local Volatility: Dupire’s local volatility model fits any observed vanilla surface exactly by construction — which sounds ideal. The problem is that its forward smile dynamics are unrealistic; in local vol, the smile tends to flatten as time passes in ways that don’t match observed behavior. Heston’s smile can shift and persist more realistically. In practice, many trading desks use Stochastic Local Volatility (SLV) models that blend both approaches.
Stochastic Volatility vs GARCH: GARCH models live in discrete time and are estimated from historical price data. They excel at econometric forecasting of realized volatility and Value at Risk calculations. Stochastic volatility models live in continuous time and are calibrated to option prices under the risk-neutral measure. They answer different questions: GARCH asks “what will volatility be next week?” while Heston asks “what is the risk-neutral distribution for pricing options today?”
In practice, the industry often uses Stochastic Local Volatility (SLV) models, which combine a local volatility backbone with a stochastic volatility overlay. This preserves exact vanilla calibration while improving forward smile dynamics.
Practical Challenges with Stochastic Volatility Models
While Heston improves significantly on Black-Scholes, it comes with practical difficulties:
Calibration complexity: Five parameters must be jointly optimized to fit the market volatility surface. The optimization problem is non-convex — multiple parameter sets can produce similar vanilla fits but differ significantly in their exotic pricing and hedging behavior.
Computational cost: Unlike Black-Scholes, Heston does not have a simple closed-form price. The semi-analytical solution requires numerical integration of a characteristic function (Fourier inversion). This is slower than Black-Scholes but far faster than full Monte Carlo simulation.
Greeks and hedging: In Heston, you need to hedge not just delta (∂V/∂S) but also vega (∂V/∂v). Because variance v is not directly observable or tradeable, vega hedging requires using other traded options as instruments. The hedge ratios are more complex than Black-Scholes.
Parameter instability: Heston parameters calibrated to today’s surface may differ significantly from parameters calibrated to yesterday’s surface. The model is not necessarily stable across time — this is the “recalibration problem.”
Heston parameters calibrated to the current surface can shift significantly from day to day. A calibration that looks perfect today may produce unstable Greeks tomorrow. Monitor recalibration stability as part of model risk management.
Limitations of the Heston Model
Even with stochastic volatility, the Heston model has fundamental limitations:
Single-factor variance: Heston uses one stochastic variance factor. Real volatility surfaces exhibit multi-factor behavior — short-dated and long-dated smiles can move independently. Extensions like time-dependent Heston, double Heston, or the Bates model address some of these limitations.
No jumps: Heston assumes continuous price paths. Equity returns exhibit sudden jumps from overnight gaps, earnings announcements, or market events. The Bates model (1996) extends Heston with a Poisson jump component to capture this behavior.
Rough volatility: Recent empirical research by Gatheral and Rosenbaum finds that realized volatility has fractal-like “rough” behavior not captured by the smooth CIR process. Rough volatility models like rBergomi better fit certain features of the ATM skew term structure.
Feller condition trade-off: Calibrations that impose the Feller condition for theoretical cleanliness often produce worse fits to the observed surface. Practitioners frequently allow Feller violations, accepting the limitation for better empirical accuracy.
Even a well-calibrated Heston model will misprice some exotic options. Fitting the marginal distributions (vanilla surface) does not constrain the joint distribution across time. Barrier options and path-dependent exotics need separate validation.
Common Mistakes
Understanding these common errors helps avoid pitfalls when working with stochastic volatility models:
1. Confusing ρ and ξ effects on the smile
Mistake: Assuming that vol-of-vol (ξ) is responsible for the skew.
Correction: ξ controls the curvature (wings) of the smile, not the tilt. The tilt — the asymmetry between put and call implied vols — comes from ρ < 0. A model with ξ > 0 but ρ = 0 produces only a symmetric smile.
2. Confusing risk-neutral and real-world parameters
Mistake: Using historically estimated variance parameters directly for option pricing, or interpreting calibrated parameters as forecasts of future realized volatility.
Correction: Calibrated Heston parameters are risk-neutral (Q-measure) quantities derived from option prices. They embed a volatility risk premium and may differ substantially from P-measure parameters estimated from historical returns. Use the right parameters for the right purpose.
3. Calibrating to a single expiry only
Mistake: Calibrating Heston parameters to fit the smile for one expiry and using the result for all maturities.
Correction: Heston has a specific term structure of volatility implied by κ and θ. Calibrating to a single slice can produce parameters that wildly misprice options at other maturities. Always calibrate to a multi-expiry surface.
4. Ignoring the Feller condition in simulation
Mistake: Allowing the Feller condition to be violated in Monte Carlo simulation without implementing a fix for near-zero or negative variance.
Correction: Even if calibration violates Feller for better fit, the simulation discretization must handle near-zero variance paths explicitly. Common fixes include the full truncation scheme or the QE (Quadratic Exponential) scheme from Andersen (2007).
Fitting the vanilla surface does not guarantee correct exotic pricing. Two Heston calibrations with similar vanilla prices can produce materially different barrier or Asian option prices. Always validate exotic pricing independently.
Frequently Asked Questions
Disclaimer
This article is for educational and informational purposes only and does not constitute investment or trading advice. Stochastic volatility models including the Heston model involve significant mathematical complexity. Model outputs depend on calibration choices and assumptions; no model perfectly represents market behavior. Past calibration accuracy does not guarantee future performance. Always consult qualified professionals before making trading or risk management decisions.