Simulation Methods: Historical, Bootstrap, and Monte Carlo

A single expected return or point forecast tells you almost nothing about the range of outcomes your portfolio might experience. Simulation methods address this limitation by generating many possible scenarios and aggregating the results into probability distributions. This guide compares the three most common simulation approaches in finance — historical simulation, bootstrap simulation, and Monte Carlo simulation — covering when to use each method, their key assumptions, and the pitfalls that lead to unreliable results.

What Are Simulation Methods?

Simulation methods generate multiple scenarios to estimate probability distributions of future outcomes. Rather than producing a single forecast, they reveal the range of possibilities — including tail events that simple averages would hide.

Key Concept

Simulation replaces point estimates with distributions. Instead of “the portfolio should return 7%,” simulation tells you “there is a 5% chance the portfolio loses more than 12% and a 50% chance it returns between 4% and 10%.”

Three major simulation approaches dominate finance applications:

  • Historical simulation uses actual past outcomes directly as possible future scenarios
  • Bootstrap simulation resamples historical observations with replacement to create new synthetic scenarios
  • Monte Carlo simulation generates random draws from a specified probability distribution or model

Each method makes fundamentally different assumptions about how the future relates to the past. Choosing correctly matters: the wrong method can produce misleading risk estimates, especially in the tails.

Simulation vs Scenario Analysis vs Stress Testing

These three terms are often confused. Simulation generates many random scenarios to build probability distributions. Scenario analysis evaluates a small number of specific, predetermined scenarios (e.g., “what if inflation rises to 8%?”). Stress testing focuses specifically on extreme or adverse scenarios to evaluate portfolio resilience under crisis conditions. Simulation methods can be used for all three purposes, but the goals differ.

Historical Simulation

Historical simulation uses actual observed outcomes as possible future scenarios. The logic is simple: if you want to know what returns might occur tomorrow, look at what returns actually occurred on past days and assume any of them could repeat.

The method works by collecting a history of returns (or risk factor changes), ranking them from worst to best, and reading percentiles directly. For Value at Risk (VaR), the 5th percentile of historical returns becomes the 95% VaR estimate. No distributional assumptions are imposed — the data speaks for itself.

Pro Tip

Historical simulation is transparent and easy to explain to stakeholders. When a risk manager says “the worst day in our sample was a 4.2% loss,” that is immediately understandable without statistical jargon.

Key assumptions:

  • The past is representative of the future — the distribution of returns is stable over time
  • The historical sample includes relevant tail events
  • Market structure and correlations have not fundamentally changed

Strengths:

  • No distributional assumption (non-parametric)
  • Captures fat tails if they are present in the sample
  • Preserves historical cross-asset correlations automatically
  • Simple to implement and explain

Weaknesses:

  • Cannot extrapolate beyond observed history — if a 10% single-day crash never happened, it will not appear
  • Equally weights all historical observations, including stale data from different market regimes
  • Limited scenarios — with 500 days of history, you have exactly 500 scenarios
  • Ignores time-series dependence like volatility clustering

For a detailed methodology, see our guide to historical simulation VaR.

Bootstrap Simulation

Bootstrap simulation extends historical simulation by resampling with replacement. Instead of using each historical return exactly once, the bootstrap randomly draws observations from the historical sample — potentially selecting some observations multiple times and others not at all — to create new synthetic scenarios.

The key insight is that resampling generates more scenario combinations than the original sample while preserving the empirical distribution of returns. With 500 historical observations, you can generate 10,000 bootstrap samples, each representing a plausible path.

Key Concept

Bootstrap resampling does not create new information — it creates new combinations from existing information. The underlying risk factors and tail events are still bounded by what actually occurred in history.

Key assumptions:

  • Historical observations are independent and identically distributed (i.i.d.)
  • The past distribution is representative of the future
  • Sampling with replacement adequately captures uncertainty

Strengths:

  • Generates more scenarios than pure historical simulation
  • Preserves the empirical distribution without parametric assumptions
  • Enables standard error estimation for risk metrics (by resampling entire samples and computing VaR for each)
  • Multi-period paths can produce cumulative outcomes not directly observed in history

Weaknesses:

  • Individual-period returns are still bounded by historical observations
  • Standard bootstrap assumes i.i.d. observations, which ignores volatility clustering and serial correlation
  • Resampling individual assets independently destroys historical correlations — must resample entire cross-sections together

Block bootstrap and stationary bootstrap variants address the time-dependence weakness by resampling blocks of consecutive observations rather than individual returns, preserving short-term serial dependence.

Monte Carlo Simulation

Monte Carlo simulation generates random scenarios from a specified probability distribution or model rather than directly from historical data. The analyst defines the expected return, volatility, correlations, and distributional form, then draws random values to simulate many possible paths.

The method is named after the Monte Carlo casino — a reference to its reliance on randomness. It was formalized in the 1940s for nuclear weapons research and is now the dominant simulation approach in finance for portfolio risk, derivatives pricing, and retirement planning.

Key assumptions:

  • The specified model and distribution are correct — if you assume normality but returns have fat tails, results will understate tail risk
  • Estimated parameters (mean, volatility, correlations) are accurate
  • The correlation structure you model reflects true dependence

Strengths:

  • Can model scenarios that never occurred in history — stress scenarios, regime changes, or hypothetical events
  • Can explicitly model correlations, copulas, and complex dependence structures when included in the model
  • Handles path-dependent instruments (options, structured products) via full revaluation
  • Scales to thousands or millions of scenarios with modern computing

Weaknesses:

  • “Garbage in, garbage out” — results are only as good as the model specification
  • Distribution misspecification is the primary source of model risk
  • Requires parameter estimation, which introduces estimation error
  • Tail convergence requires many simulations (10,000+ for stable VaR estimates)

Historical Simulation vs Bootstrap vs Monte Carlo

The three methods differ along several critical dimensions. The right choice depends on your data, your model confidence, and your risk measurement goals.

Historical Simulation

  • Uses observed data directly
  • No distributional assumptions
  • Limited to historical scenarios
  • Best for: transparent, explainable risk estimates when history is representative

Bootstrap Simulation

  • Resamples from observed data
  • More scenarios, same information
  • Still bounded by history
  • Best for: uncertainty quantification and scenario expansion

Monte Carlo Simulation

  • Generates from a model
  • Can extrapolate beyond history
  • Model risk is primary concern
  • Best for: stress testing, derivatives, path-dependent analysis
Dimension Historical Bootstrap Monte Carlo
Source of scenarios Observed data Resampled data Model/distribution
Distributional assumption None None (empirical) Required
Extrapolates beyond history? No Cumulative paths only Yes
Handles changing volatility? No (equal weights) No (standard) Yes (GARCH, etc.)
Handles serial dependence? No (one-period); partially for multi-period paths Block bootstrap variants If explicitly modeled
Correlation treatment Implicit in data Implicit if cross-section resampled Explicit (Cholesky, copulas)
Main source of model risk Non-representative sample i.i.d. assumption Distribution misspecification
Best for nonlinear portfolios? Full revaluation needed Full revaluation needed Native strength

How to Choose a Simulation Method

Start with three questions:

  1. Do you trust a parametric model? If yes, Monte Carlo gives flexibility. If not, use historical or bootstrap.
  2. Do you need to model scenarios that never occurred? If yes, you must use Monte Carlo. Historical methods cannot extrapolate.
  3. Is time dependence critical? If volatility clustering matters, use filtered historical simulation (which applies GARCH to historical data) or Monte Carlo with a GARCH component.

Example: VaR Estimation Using All Three Methods

Consider a simple portfolio allocated 60% to U.S. large-cap stocks (SPY, the S&P 500 ETF) and 40% to aggregate bonds (AGG, the Bloomberg U.S. Aggregate Bond ETF), evaluated using 250 trading days of history.

Comparing 95% VaR Estimates

Historical Simulation: Rank the 250 daily portfolio returns from worst to best. The 5th percentile is approximately the 13th worst day. Suppose that return was -1.8%. The 95% one-day VaR is 1.8%.

Bootstrap Simulation: Resample 10,000 daily returns with replacement from the same 250-day sample. Take the 5th percentile of the resampled distribution. The point estimate will be similar to historical (-1.75% to -1.85%). To estimate a confidence interval for VaR, repeat the entire process: draw many bootstrap samples of 250 days each, compute the 5% VaR for each sample, then take percentiles of those VaR estimates.

Monte Carlo Simulation: Estimate the portfolio’s mean return (0.04% daily) and volatility (0.9% daily). Assuming normal returns, simulate 10,000 draws. The 5th percentile of a normal distribution is approximately mean – 1.645 × volatility = 0.04% – 1.645 × 0.9% = -1.44%. (This closed-form result is also the analytical variance-covariance VaR; Monte Carlo should converge to it as the simulation count increases.)

Why the estimates differ: Historical and bootstrap preserve whatever fat tails exist in the sample, while Monte Carlo with normality underestimates tail risk if returns have excess kurtosis. In this example, the historical sample contained several days with losses larger than a normal distribution would predict, pushing the historical VaR estimate above the parametric estimate.

For comprehensive VaR methodology, see our Value at Risk guide and detailed method articles on historical VaR and Monte Carlo VaR.

2008 Crisis: How Correlation Spikes Break Simulation Models

Before the 2008 financial crisis, the correlation between U.S. stocks (S&P 500) and investment-grade corporate bond returns (represented by LQD, the iShares investment-grade corporate bond ETF) was moderate — both respond to economic growth, but corporate bond yields also include credit spread and interest rate components. During the Lehman Brothers collapse in September-October 2008, both sold off sharply as credit spreads widened dramatically and risk assets fell together. (Treasuries, by contrast, rallied as a flight-to-quality asset.)

Historical simulation (pre-crisis sample): A risk model calibrated on 2005-2007 data for a stock/corporate-bond portfolio would show moderate VaR because the calm-period sample contained no crisis-level credit spread blowout or simultaneous selloff.

Monte Carlo (pre-crisis parameters): A model using the illustrative calm-period correlation of approximately 0.3 between stock and corporate bond returns would similarly understate risk. When return correlations spiked during the crisis (approaching 0.6-0.8 in some windows), the model’s diversification assumptions failed.

Lesson: Both historical and Monte Carlo methods can understate tail risk when calibrated to a regime that no longer applies. Stress testing with scenario-based correlation shocks — or using filtered historical simulation with crisis periods — helps capture these regime-dependent risks.

Common Mistakes

Simulation methods are powerful but easily misapplied. These are the most frequent errors:

1. Using stale data: A historical sample dominated by a calm market regime will understate risk when volatility spikes. Equally weighting observations from 2017 (low volatility) and 2008 (crisis) may not reflect current conditions.

2. Distribution misspecification: Assuming normality when returns have fat tails leads Monte Carlo to underestimate extreme losses. Financial returns often exhibit excess kurtosis and negative skewness.

3. Ignoring correlation changes: Correlations between assets spike during market stress. A model calibrated to calm-period correlations will understate portfolio risk precisely when diversification benefits disappear.

4. Resampling assets independently: In bootstrap simulation, drawing stock returns and bond returns independently destroys their historical correlation structure. Always resample entire cross-sections (all assets on the same day) together.

5. Insufficient scenarios for tail estimation: Estimating the 99% VaR requires enough simulations that the 1st percentile is stable. With 1,000 simulations, only 10 observations define the 1% tail — too few for reliability. Use 10,000+ for VaR and 100,000+ for Expected Shortfall.

6. Confusing precision with accuracy: Monte Carlo can produce estimates to many decimal places, but that precision is meaningless if the underlying model is wrong. A stable, precise estimate of the wrong number is still wrong.

7. Mixing regimes without adjustment: Combining historical data from different volatility regimes (e.g., pre- and post-crisis) without adjustment can produce estimates that reflect neither regime accurately.

Limitations of Simulation Methods

Important Limitation

All simulation methods are models, not reality. They produce probability estimates, not guarantees. Actual outcomes can — and will — fall outside simulated ranges, especially during unprecedented market events.

Regime changes: The past may not predict the future. Structural changes in markets (new regulations, changed central bank policy, technological disruption) can invalidate historical relationships.

Model risk: Monte Carlo is only as good as its assumptions. The “correct” distribution is unknowable, and small changes in distributional assumptions can produce large changes in tail estimates.

Time horizon limitations: One-day VaR estimates do not scale cleanly to longer horizons. The square-root-of-time rule assumes i.i.d. returns, which is empirically false. Multi-period path simulation is more appropriate for longer horizons.

Nonlinear instruments: Options and structured products require full revaluation at each scenario — delta-normal approximations can severely understate risk when positions are far from the money or when volatility shifts dramatically.

Frequently Asked Questions

No method is universally most accurate — it depends on whether the method’s assumptions match your data and scenario. Historical simulation is accurate when history is representative. Monte Carlo is accurate when the model is correctly specified. The “best” method is the one whose assumptions most closely match reality for your specific portfolio and time horizon.

The number depends on the statistic you are estimating. For 95% VaR, 10,000 Monte Carlo simulations typically provide stable estimates. For 99% VaR or Expected Shortfall, 50,000 to 100,000 simulations may be needed because fewer observations define the extreme tail. For bootstrap, the effective sample size is limited by your historical data length, regardless of how many resamples you generate.

Yes — hybrid methods combine strengths of multiple approaches. Filtered historical simulation applies a GARCH model to standardize historical returns, then rescales them by current volatility estimates. Block bootstrap preserves time-series dependence by resampling blocks of consecutive observations. Copula Monte Carlo combines modeled marginal distributions with an explicitly specified dependence structure.

In Monte Carlo, correlations are explicitly modeled using Cholesky decomposition of the correlation matrix to generate correlated random draws. In bootstrap simulation, correlation is preserved by resampling entire cross-sections — all assets on the same historical date — rather than individual asset returns. Historical simulation preserves correlations automatically since scenarios are actual historical cross-sections.

Use parametric methods (Monte Carlo) when you have confidence in your distributional assumptions and need to model scenarios beyond historical experience — stress tests, derivatives pricing, or hypothetical market conditions. Use non-parametric methods (historical or bootstrap) when you want to avoid imposing distributional assumptions and believe historical observations adequately represent the relevant risk scenarios.

No. Historical simulation uses past returns to estimate future risk distributions. Backtesting evaluates how well a risk model would have performed historically — comparing predicted VaR to actual losses over a past period. Backtesting is a validation technique, not a simulation method. You can backtest any simulation method, including Monte Carlo.

Bootstrap resamples from actual historical observations — it cannot produce scenarios outside the historical range (at the individual-period level). Monte Carlo generates new random draws from a specified distribution, which can produce any scenario the distribution allows, including extreme events that never occurred historically. Bootstrap makes no distributional assumption; Monte Carlo requires one.
Disclaimer

This article is for educational and informational purposes only and does not constitute investment advice. Simulation results are probability estimates based on models and assumptions that may not reflect future market conditions. Always conduct your own research and consult a qualified financial advisor before making investment decisions.