Volatility Estimation: EWMA and GARCH Models Explained

Q: What does GARCH stand for and what does it model?

GARCH stands for Generalized Autoregressive Conditional Heteroskedasticity . It models the time-varying (conditional) variance of financial asset returns. “Autoregressive” means today’s variance depends on past values, “conditional” means the variance changes based on available information, and “heteroskedasticity” means the variance is not constant over time. The GARCH(1,1) model — the most widely used specification — uses three parameters (ω, α, β) to capture how volatility responds to market shocks and reverts to a long-run level. It was introduced by Tim Bollerslev in 1986 as a generalization of Robert Engle’s original ARCH model (1982).

Q: What is the difference between EWMA and GARCH?

The key difference is mean reversion . EWMA has one parameter (λ) and no mean reversion — once volatility rises, the EWMA forecast stays elevated indefinitely (IGARCH behavior). GARCH has three parameters (ω, α, β) and mean-reverts to a long-run variance level V L = ω / (1 – α – β). This means GARCH predicts that elevated volatility will eventually decline back to normal, and depressed volatility will eventually rise. For short-horizon (1-day) forecasts, the two models often produce similar results. For multi-day forecasts, GARCH is generally more realistic because it accounts for mean reversion.

Q: What do alpha and beta mean in a GARCH(1,1) model?

In the GARCH(1,1) equation σ n ² = ω + αr n-1 ² + βσ n-1 ², α measures shock sensitivity — how strongly the model reacts to yesterday’s squared return (new market information). A higher α means volatility spikes more sharply after a large return. β measures persistence — how much of yesterday’s conditional variance carries forward to today. A higher β means elevated volatility decays more slowly. The sum α + β determines overall persistence: typical equity values are around 0.95-0.99. The half-life (days for a shock to decay by half) is approximately ln(0.5) / ln(α + β). For example, with α + β = 0.98, the half-life is about 34 trading days.

Q: What is the GARCH long-run variance?

The long-run variance (V L ) is the unconditional variance that the GARCH model reverts to over time. It is calculated as V L = ω / (1 – α – β). For example, if ω = 0.000002, α = 0.08, and β = 0.90, then V L = 0.000002 / 0.02 = 0.0001, corresponding to a long-run daily volatility of 1.0% (about 15.9% annualized). When current volatility is above V L , the model forecasts a decline back toward this level. When current volatility is below V L , the model forecasts an increase. This mean-reversion property is what distinguishes GARCH from EWMA and makes it more suitable for multi-day volatility forecasts.

Q: Can GARCH predict market crashes?

No. GARCH models estimate and forecast volatility , not the direction or magnitude of future returns. A GARCH model can tell you that volatility is currently elevated (indicating a higher-risk environment) or that it is likely to mean-revert, but it cannot predict when or whether a crash will occur. GARCH is a backward-looking model that uses past return patterns to estimate current conditional variance. It is a risk measurement tool, not a market timing tool. That said, GARCH-based volatility forecasts are valuable inputs for risk management — they help quantify how much you could lose under current conditions, which is precisely what Value at Risk models are designed to do.

Published: March 2, 2026

Article by Ryan O'Connell, CFA, FRM

The GARCH model is the industry standard for estimating and forecasting financial market volatility. Unlike simple historical measures that treat all past observations equally, conditional volatility models like EWMA and GARCH recognize a fundamental truth about financial markets: volatility changes over time, and recent returns carry more information about current risk than distant ones. This guide covers the full progression — from basic historical variance to the GARCH(1,1) model used by banks, risk managers, and derivatives traders worldwide.

Understanding how to estimate volatility is essential for anyone involved in risk management, option pricing, or portfolio construction. A Value at Risk (VaR) model is only as good as its volatility input. A derivatives trader who misprices volatility will misprice options. This article explains the key models, their formulas, and when to use each one — with a worked example you can follow step by step.

What Is Volatility Estimation?

Volatility measures the magnitude of price fluctuations in a financial asset. Unlike price or return, volatility is not directly observable — it must be estimated from historical return data or inferred from option prices (implied volatility). This article focuses on the first approach: estimating volatility from observed returns.

Key Concept

Conditional volatility models (EWMA, GARCH) recognize that volatility is not constant — it changes over time in predictable ways. Periods of large price swings tend to cluster together, a phenomenon called volatility clustering. These models give more weight to recent observations, producing volatility estimates that adapt to current market conditions.

The distinction between unconditional and conditional volatility is central to this topic. Unconditional volatility is a single long-run average — treat all data equally and compute one number. Conditional volatility changes at each point in time, reflecting the information available up to that moment. For risk management, conditional estimates are far more useful because they capture the current volatility regime.

An important simplification used throughout: for daily returns, the expected return is approximately zero (the daily drift is negligible compared to daily volatility). This means we can use observed returns r_t directly in variance formulas without subtracting the mean — a standard convention in volatility modeling.

Simple Historical Volatility

The simplest approach to estimating volatility is to compute the realized variance — the equally weighted average of squared daily returns over a lookback window of n days.

Realized Variance (Zero-Mean)

σ² = (1/n) × Σ r_t²

Average of squared returns, assuming daily mean return ≈ 0

This is the realized variance form under the zero-mean assumption. It differs from the textbook sample variance formula (which uses 1/(n-1) and subtracts the sample mean), but for daily data with hundreds of observations and a near-zero mean, the difference is negligible. To annualize, multiply the daily variance by 252 (trading days per year) and take the square root:

Annualized Volatility

σ_annual = σ_daily × √252

Convert daily volatility to an annualized figure

The key limitation: every observation receives equal weight, regardless of how recent it is. A return from six months ago has the same influence as yesterday’s return. Consider the March 2020 COVID crash — the S&P 500 fell over 30% in weeks, then recovered sharply. A 1-year historical volatility estimate would remain elevated for months after markets had calmed, overstating the true current risk. This is the motivation for weighted approaches.

EWMA (Exponentially Weighted Moving Average)

The Exponentially Weighted Moving Average (EWMA) model addresses the equal-weighting problem by assigning exponentially declining weights to past observations. Recent returns matter more; distant returns fade away.

EWMA Variance

σ_n² = λσ_n-1² + (1 – λ)r_n-1²

Today’s variance = λ × yesterday’s variance + (1 – λ) × yesterday’s squared return

Where λ (lambda) is the decay factor, typically between 0.90 and 0.99. JPMorgan’s RiskMetrics system, introduced in 1994, popularized λ = 0.94 for daily data and λ = 0.97 for monthly data. The weight assigned to an observation k days ago is (1 – λ)λ^k, which declines geometrically — at λ = 0.94, data from 30 trading days ago receives only about 16% of the weight given to yesterday’s return.

Pro Tip

EWMA is mathematically equivalent to an IGARCH(1,1) model with ω = 0 and α + β = 1. This means EWMA has no finite unconditional variance and no mean reversion — volatility shocks persist indefinitely in the forecast. This is both a strength (simplicity, responsiveness) and a weakness (no long-run anchor). The GARCH model addresses this limitation.

To illustrate, consider Apple (AAPL) stock with yesterday’s EWMA variance estimate of σ_n-1² = 0.00025 (1.58% daily vol) and yesterday’s return of r_n-1 = -4.5% following an earnings miss. Using λ = 0.94: σ_n² = 0.94 × 0.00025 + 0.06 × 0.002025 = 0.000235 + 0.000122 = 0.000357, giving σ_n = 1.89% daily. The single large return immediately increased the volatility estimate from 1.58% to 1.89% — and under EWMA, this elevated level will persist in all future forecasts until offset by smaller returns.

The GARCH(1,1) Model — Formula and Parameters

The Generalized Autoregressive Conditional Heteroskedasticity (GARCH) model, introduced by Tim Bollerslev in 1986, extends EWMA by adding a crucial third component: a constant term ω that pulls volatility back toward a long-run level.

GARCH(1,1) Variance Equation

σ_n² = ω + αr_n-1² + βσ_n-1²

Conditional variance = constant + shock reaction + variance persistence

Where ω > 0, α ≥ 0, β ≥ 0, and α + β < 1 (the covariance stationarity condition).

Parameter interpretation:

α (alpha) — shock sensitivity: how strongly yesterday’s squared return (new information) affects today’s variance estimate. Higher α means volatility reacts more sharply to market surprises.
β (beta) — persistence: how much of yesterday’s conditional variance carries forward. Higher β means volatility decays slowly after a shock.
ω (omega) — baseline contribution: ensures variance has a positive floor and drives mean reversion to the long-run level.

Long-Run Variance

V_L = ω / (1 – α – β)

The unconditional variance the model reverts to over time

The sum α + β is called persistence — it determines how slowly volatility decays back toward V_L after a shock. The volatility half-life (trading days for a shock to decay by half) is approximately ln(0.5) / ln(α + β). This is the key advantage of GARCH over EWMA: mean reversion. Elevated volatility will gradually return to V_L, and depressed volatility will gradually rise back to V_L.

The “(1,1)” in GARCH(1,1) refers to 1 lag of the squared return and 1 lag of the conditional variance. This specification is sufficient for the vast majority of financial applications — higher-order models like GARCH(2,1) rarely provide meaningful improvement.

Explore the Portfolio Analytics & Risk Management Course

GARCH Model Example

S&P 500 — 1-Day Ahead Volatility Forecast

A risk manager has estimated the following GARCH(1,1) parameters for the S&P 500:

ω = 0.000002
α = 0.08
β = 0.90
Persistence = α + β = 0.98

Yesterday’s estimated daily variance was σ_n-1² = 0.0004 (corresponding to 2% daily volatility), and yesterday’s return was r_n-1 = -3% (a sharp sell-off).

Step 1: Calculate today’s conditional variance:

σ_n² = 0.000002 + 0.08 × (-0.03)² + 0.90 × 0.0004

= 0.000002 + 0.08 × 0.0009 + 0.90 × 0.0004

= 0.000002 + 0.000072 + 0.000360 = 0.000434

Step 2: Convert to daily volatility:

σ_n = √0.000434 = 2.08% daily (≈ 33.0% annualized)

Step 3: Compare to long-run volatility:

V_L = 0.000002 / (1 – 0.98) = 0.0001 → σ_L = 1.0% daily (≈ 15.9% annualized)

Step 4: Interpret:

Current volatility (2.08% daily) is well above the long-run level (1.0% daily). The -3% return increased the variance estimate from 0.0004 to 0.000434. With persistence of 0.98, the half-life is approximately 34 trading days (about 7 weeks) — meaning this elevated volatility will gradually mean-revert back to the long-run level over the coming weeks.

Maximum Likelihood Estimation

GARCH parameters (ω, α, β) cannot be calculated with a simple formula — they are estimated using maximum likelihood estimation (MLE). The idea is to find the parameter values that make the observed return data most probable under the assumed model.

Gaussian Log-Likelihood (Proportional Form)

L ∝ -½ Σ [ln(σ_t²) + r_t² / σ_t²]

Maximize L by choosing ω, α, β subject to stationarity constraints

There is no closed-form solution — the log-likelihood must be maximized numerically using optimization algorithms (such as Nelder-Mead or BFGS). The process requires initial parameter guesses and convergence criteria. In practice, software handles this automatically: Python’s arch package, R’s rugarch, or MATLAB’s garch function.

A reliable GARCH fit typically requires 500 to 1,000+ daily observations (roughly 2-4 years of trading data). Fewer than 250 observations often produces unstable or unreliable parameter estimates. After fitting, always check standardized residuals (r_t / σ_t) for remaining ARCH effects — if the model is well-specified, these residuals should behave like independent draws. If standardized residuals show excess kurtosis (fat tails), a Student-t distribution may be more appropriate than the Gaussian assumption.

EWMA vs GARCH Model — Key Differences

Both EWMA and GARCH capture volatility clustering, but they differ in fundamental ways that affect when each model is appropriate.

EWMA

1 parameter (λ, typically 0.94)
Minimal calibration — λ typically set to 0.94 (can be optimized)
IGARCH: no mean reversion
No finite unconditional variance
Multi-day forecasts are flat (current vol persists)
RiskMetrics industry standard
Best for: 1-day VaR, simplicity, speed

GARCH(1,1)

3 parameters (ω, α, β)
Requires MLE estimation from data
Covariance-stationary: mean-reverts to V_L
Finite unconditional variance (V_L)
Multi-day forecasts converge toward V_L
Academic and regulatory standard
Best for: multi-day horizons, term structure, formal risk models

For short-horizon applications like 1-day VaR, EWMA and GARCH often produce similar results. The difference becomes significant for multi-day forecasts — GARCH’s mean reversion produces a declining volatility term structure after a shock, while EWMA forecasts a constant level indefinitely. For risk managers building multi-day or multi-week VaR models, GARCH provides a more realistic volatility path.

Volatility Forecasting

One of the most practical applications of GARCH is multi-step-ahead volatility forecasting. The GARCH model’s mean-reversion property creates a natural term structure of volatility.

GARCH forecasting: The h-day-ahead forecast gradually converges toward the long-run variance V_L as the forecast horizon h increases. If volatility is currently elevated (as in our S&P 500 example), the model predicts it will decay back toward V_L. If volatility is currently depressed, the model predicts it will rise. At sufficiently long horizons, the GARCH forecast equals V_L regardless of the starting point.

EWMA forecasting: Because EWMA has no unconditional variance, its multi-day forecast stays flat at the current estimated level — it has no long-run anchor to converge toward. This is the IGARCH property: every shock has a permanent effect on the forecast.

This forecasting behavior directly affects risk models. Parametric VaR scales the volatility input across the holding period. Using EWMA for a 10-day VaR will produce a higher estimate (during stressed periods) than GARCH, because GARCH predicts the volatility will partially revert during those 10 days. The choice of model can meaningfully impact capital requirements and risk limits.

Explore the Value at Risk Course

Common Mistakes with GARCH Models

Volatility estimation involves statistical complexity that creates several common pitfalls. Avoiding these mistakes is essential for reliable risk management.

1. Using simple historical volatility for risk management. Equally weighted variance ignores volatility clustering — the most important stylized fact of financial returns. After a market shock, simple historical volatility is slow to react (if the lookback window is long) or noisy (if the window is short). Conditional models like EWMA and GARCH are designed specifically for this problem.

2. Blindly using λ = 0.94 without validation. JPMorgan’s RiskMetrics default was calibrated for a broad set of assets in the 1990s. It may not be optimal for all asset classes, time periods, or market regimes. Validate the decay factor against your specific data or use GARCH to estimate the effective weighting from the data itself.

3. Ignoring mean reversion for long-horizon forecasts. Using an EWMA-based volatility estimate for a 30-day or 90-day risk horizon implicitly assumes volatility stays at its current level indefinitely. If volatility is elevated after a crisis, this overstates risk; if depressed during calm markets, it understates risk. GARCH’s mean reversion provides more realistic multi-day forecasts.

4. Estimating GARCH with too few data points. Reliable MLE estimation requires at least 500 daily observations (~2 years). With fewer data points, parameter estimates become unstable and can produce misleading results. The long-run variance V_L is especially sensitive to sample size.

5. Confusing conditional and implied volatility. GARCH estimates conditional volatility from historical return data — it is backward-looking. Implied volatility is extracted from option prices and reflects the market’s forward-looking expectation. These two measures serve different purposes and often diverge, especially during market stress.

6. Failing to validate model residuals. Fitting a GARCH model is not the final step. Always check standardized residuals for remaining ARCH effects (using Ljung-Box tests on squared residuals). If the model is well-specified, residuals should be approximately independent. Backtest any VaR model built on GARCH output against realized returns.

7. Assuming normally distributed errors when tails are heavy. Financial returns typically exhibit excess kurtosis — extreme events occur more frequently than a normal distribution predicts. Using Gaussian GARCH when returns have fat tails will underestimate tail risk. Consider Student-t or skewed-t innovations when standardized residuals show excess kurtosis.

GARCH Model Limitations

Symmetry Assumption

GARCH(1,1) treats positive and negative returns of the same magnitude identically — a +3% return and a -3% return produce the same volatility update. In equity markets, this is unrealistic: negative returns typically increase volatility more than positive returns of equal size (the leverage effect). Asymmetric models like EGARCH and GJR-GARCH address this limitation.

Leverage effect not captured. The asymmetric response of volatility to negative versus positive returns is well-documented in equity markets. After sharp declines, volatility tends to spike more than it would after an equally large rally. Extensions such as EGARCH (Nelson, 1991) and GJR-GARCH (Glosten, Jagannathan, and Runkle, 1993) add a leverage term to capture this asymmetry. For equity applications, these models often provide a better fit than standard GARCH(1,1).

Parameter sensitivity to sample period. GARCH parameters estimated over a calm period (e.g., 2017) can differ significantly from those estimated over a volatile period (e.g., 2008-2009). This sensitivity means the long-run variance V_L itself is an estimate, not a fixed constant. Practitioners should re-estimate parameters periodically and conduct sensitivity analysis.

Distributional assumptions affect tail risk. The standard Gaussian GARCH model underestimates the probability of extreme returns. Using a Student-t distribution for innovations improves tail fit but introduces an additional parameter (degrees of freedom) that must be estimated. The choice of distribution directly affects risk measures like Value at Risk and parametric VaR at high confidence levels.

Higher-order models add complexity with marginal benefit. GARCH(2,1) or GARCH(1,2) models add more lag terms, but empirical evidence consistently shows that GARCH(1,1) captures the vast majority of conditional variance dynamics. The additional parameters rarely justify the added estimation complexity.

Despite these limitations, GARCH(1,1) remains the workhorse of volatility estimation. Its combination of parsimony (three parameters), mean reversion, and strong empirical performance makes it the default choice for practitioners across risk management, derivatives pricing, and portfolio construction. Volatility estimation also plays a role beyond trading desks — real options analysis requires estimating project cash flow uncertainty, and employee stock option valuation under FASB ASC 718 depends on reliable volatility inputs.

Frequently Asked Questions

GARCH stands for Generalized Autoregressive Conditional Heteroskedasticity. It models the time-varying (conditional) variance of financial asset returns. “Autoregressive” means today’s variance depends on past values, “conditional” means the variance changes based on available information, and “heteroskedasticity” means the variance is not constant over time. The GARCH(1,1) model — the most widely used specification — uses three parameters (ω, α, β) to capture how volatility responds to market shocks and reverts to a long-run level. It was introduced by Tim Bollerslev in 1986 as a generalization of Robert Engle’s original ARCH model (1982).

The key difference is mean reversion. EWMA has one parameter (λ) and no mean reversion — once volatility rises, the EWMA forecast stays elevated indefinitely (IGARCH behavior). GARCH has three parameters (ω, α, β) and mean-reverts to a long-run variance level V_L = ω / (1 – α – β). This means GARCH predicts that elevated volatility will eventually decline back to normal, and depressed volatility will eventually rise. For short-horizon (1-day) forecasts, the two models often produce similar results. For multi-day forecasts, GARCH is generally more realistic because it accounts for mean reversion.

In the GARCH(1,1) equation σ_n² = ω + αr_n-1² + βσ_n-1², α measures shock sensitivity — how strongly the model reacts to yesterday’s squared return (new market information). A higher α means volatility spikes more sharply after a large return. β measures persistence — how much of yesterday’s conditional variance carries forward to today. A higher β means elevated volatility decays more slowly. The sum α + β determines overall persistence: typical equity values are around 0.95-0.99. The half-life (days for a shock to decay by half) is approximately ln(0.5) / ln(α + β). For example, with α + β = 0.98, the half-life is about 34 trading days.

The long-run variance (V_L) is the unconditional variance that the GARCH model reverts to over time. It is calculated as V_L = ω / (1 – α – β). For example, if ω = 0.000002, α = 0.08, and β = 0.90, then V_L = 0.000002 / 0.02 = 0.0001, corresponding to a long-run daily volatility of 1.0% (about 15.9% annualized). When current volatility is above V_L, the model forecasts a decline back toward this level. When current volatility is below V_L, the model forecasts an increase. This mean-reversion property is what distinguishes GARCH from EWMA and makes it more suitable for multi-day volatility forecasts.

No. GARCH models estimate and forecast volatility, not the direction or magnitude of future returns. A GARCH model can tell you that volatility is currently elevated (indicating a higher-risk environment) or that it is likely to mean-revert, but it cannot predict when or whether a crash will occur. GARCH is a backward-looking model that uses past return patterns to estimate current conditional variance. It is a risk measurement tool, not a market timing tool. That said, GARCH-based volatility forecasts are valuable inputs for risk management — they help quantify how much you could lose under current conditions, which is precisely what Value at Risk models are designed to do.

Disclaimer

This article is for educational and informational purposes only and does not constitute investment or financial advice. The GARCH parameters and example calculations are illustrative and do not represent actual fitted model parameters for any specific security. Volatility models involve statistical assumptions that may not hold in all market conditions. Always conduct your own analysis and consult qualified financial professionals before making risk management or investment decisions.

Explore Top Finance Certificates

Access official certificates from Wharton Online & Columbia Business School Executive Education, powered by Wall Street Prep. Save up to $500 with code RYAN.

Volatility Estimation: EWMA and GARCH Models Explained

What Is Volatility Estimation?

Simple Historical Volatility

EWMA (Exponentially Weighted Moving Average)

The GARCH(1,1) Model — Formula and Parameters

GARCH Model Example

Maximum Likelihood Estimation

EWMA vs GARCH Model — Key Differences

EWMA

GARCH(1,1)

Volatility Forecasting

Common Mistakes with GARCH Models

GARCH Model Limitations

Frequently Asked Questions

What does GARCH stand for and what does it model?

What is the difference between EWMA and GARCH?

What do alpha and beta mean in a GARCH(1,1) model?

What is the GARCH long-run variance?

Can GARCH predict market crashes?

Disclaimer

Table of Contents

UWorld CFA Prep

Explore Top Finance Certificates

Contact Me

Contact Me