Simulation Methods: Historical, Bootstrap, and Monte Carlo

Published: May 9, 2026

Article by Ryan O'Connell, CFA, FRM

A single expected return or point forecast tells you almost nothing about the range of outcomes your portfolio might experience. Simulation methods address this limitation by generating many possible scenarios and aggregating the results into probability distributions. This guide compares the three most common simulation approaches in finance — historical simulation, bootstrap simulation, and Monte Carlo simulation — covering when to use each method, their key assumptions, and the pitfalls that lead to unreliable results.

What Are Simulation Methods?

Simulation methods generate multiple scenarios to estimate probability distributions of future outcomes. Rather than producing a single forecast, they reveal the range of possibilities — including tail events that simple averages would hide.

Key Concept

Simulation replaces point estimates with distributions. Instead of “the portfolio should return 7%,” simulation tells you “there is a 5% chance the portfolio loses more than 12% and a 50% chance it returns between 4% and 10%.”

Three major simulation approaches dominate finance applications:

Historical simulation uses actual past outcomes directly as possible future scenarios
Bootstrap simulation resamples historical observations with replacement to create new synthetic scenarios
Monte Carlo simulation generates random draws from a specified probability distribution or model

Each method makes fundamentally different assumptions about how the future relates to the past. Choosing correctly matters: the wrong method can produce misleading risk estimates, especially in the tails.

Simulation vs Scenario Analysis vs Stress Testing

These three terms are often confused. Simulation generates many random scenarios to build probability distributions. Scenario analysis evaluates a small number of specific, predetermined scenarios (e.g., “what if inflation rises to 8%?”). Stress testing focuses specifically on extreme or adverse scenarios to evaluate portfolio resilience under crisis conditions. Simulation methods can be used for all three purposes, but the goals differ.

Historical Simulation

Historical simulation uses actual observed outcomes as possible future scenarios. The logic is simple: if you want to know what returns might occur tomorrow, look at what returns actually occurred on past days and assume any of them could repeat.

The method works by collecting a history of returns (or risk factor changes), ranking them from worst to best, and reading percentiles directly. For Value at Risk (VaR), the 5th percentile of historical returns becomes the 95% VaR estimate. No distributional assumptions are imposed — the data speaks for itself.

Pro Tip

Historical simulation is transparent and easy to explain to stakeholders. When a risk manager says “the worst day in our sample was a 4.2% loss,” that is immediately understandable without statistical jargon.

Key assumptions:

The past is representative of the future — the distribution of returns is stable over time
The historical sample includes relevant tail events
Market structure and correlations have not fundamentally changed

Strengths:

No distributional assumption (non-parametric)
Captures fat tails if they are present in the sample
Preserves historical cross-asset correlations automatically
Simple to implement and explain

Weaknesses:

Cannot extrapolate beyond observed history — if a 10% single-day crash never happened, it will not appear
Equally weights all historical observations, including stale data from different market regimes
Limited scenarios — with 500 days of history, you have exactly 500 scenarios
Ignores time-series dependence like volatility clustering

For a detailed methodology, see our guide to historical simulation VaR.

Bootstrap Simulation

Bootstrap simulation extends historical simulation by resampling with replacement. Instead of using each historical return exactly once, the bootstrap randomly draws observations from the historical sample — potentially selecting some observations multiple times and others not at all — to create new synthetic scenarios.

The key insight is that resampling generates more scenario combinations than the original sample while preserving the empirical distribution of returns. With 500 historical observations, you can generate 10,000 bootstrap samples, each representing a plausible path.

Key Concept

Bootstrap resampling does not create new information — it creates new combinations from existing information. The underlying risk factors and tail events are still bounded by what actually occurred in history.

Key assumptions:

Historical observations are independent and identically distributed (i.i.d.)
The past distribution is representative of the future
Sampling with replacement adequately captures uncertainty

Strengths:

Generates more scenarios than pure historical simulation
Preserves the empirical distribution without parametric assumptions
Enables standard error estimation for risk metrics (by resampling entire samples and computing VaR for each)
Multi-period paths can produce cumulative outcomes not directly observed in history

Weaknesses:

Individual-period returns are still bounded by historical observations
Standard bootstrap assumes i.i.d. observations, which ignores volatility clustering and serial correlation
Resampling individual assets independently destroys historical correlations — must resample entire cross-sections together

Block bootstrap and stationary bootstrap variants address the time-dependence weakness by resampling blocks of consecutive observations rather than individual returns, preserving short-term serial dependence.

Monte Carlo Simulation

Monte Carlo simulation generates random scenarios from a specified probability distribution or model rather than directly from historical data. The analyst defines the expected return, volatility, correlations, and distributional form, then draws random values to simulate many possible paths.

The method is named after the Monte Carlo casino — a reference to its reliance on randomness. It was formalized in the 1940s for nuclear weapons research and is now the dominant simulation approach in finance for portfolio risk, derivatives pricing, and retirement planning.

Key assumptions:

The specified model and distribution are correct — if you assume normality but returns have fat tails, results will understate tail risk
Estimated parameters (mean, volatility, correlations) are accurate
The correlation structure you model reflects true dependence

Strengths:

Can model scenarios that never occurred in history — stress scenarios, regime changes, or hypothetical events
Can explicitly model correlations, copulas, and complex dependence structures when included in the model
Handles path-dependent instruments (options, structured products) via full revaluation
Scales to thousands or millions of scenarios with modern computing

Weaknesses:

“Garbage in, garbage out” — results are only as good as the model specification
Distribution misspecification is the primary source of model risk
Requires parameter estimation, which introduces estimation error
Tail convergence requires many simulations (10,000+ for stable VaR estimates)

Learn Monte Carlo Implementation Details

Historical Simulation vs Bootstrap vs Monte Carlo

The three methods differ along several critical dimensions. The right choice depends on your data, your model confidence, and your risk measurement goals.

Historical Simulation

Uses observed data directly
No distributional assumptions
Limited to historical scenarios
Best for: transparent, explainable risk estimates when history is representative

Bootstrap Simulation

Resamples from observed data
More scenarios, same information
Still bounded by history
Best for: uncertainty quantification and scenario expansion

Monte Carlo Simulation

Generates from a model
Can extrapolate beyond history
Model risk is primary concern
Best for: stress testing, derivatives, path-dependent analysis

Dimension	Historical	Bootstrap	Monte Carlo
Source of scenarios	Observed data	Resampled data	Model/distribution
Distributional assumption	None	None (empirical)	Required
Extrapolates beyond history?	No	Cumulative paths only	Yes
Handles changing volatility?	No (equal weights)	No (standard)	Yes (GARCH, etc.)
Handles serial dependence?	No (one-period); partially for multi-period paths	Block bootstrap variants	If explicitly modeled
Correlation treatment	Implicit in data	Implicit if cross-section resampled	Explicit (Cholesky, copulas)
Main source of model risk	Non-representative sample	i.i.d. assumption	Distribution misspecification
Best for nonlinear portfolios?	Full revaluation needed	Full revaluation needed	Native strength

How to Choose a Simulation Method

Start with three questions:

Do you trust a parametric model? If yes, Monte Carlo gives flexibility. If not, use historical or bootstrap.
Do you need to model scenarios that never occurred? If yes, you must use Monte Carlo. Historical methods cannot extrapolate.
Is time dependence critical? If volatility clustering matters, use filtered historical simulation (which applies GARCH to historical data) or Monte Carlo with a GARCH component.

Example: VaR Estimation Using All Three Methods

Consider a simple portfolio allocated 60% to U.S. large-cap stocks (SPY, the S&P 500 ETF) and 40% to aggregate bonds (AGG, the Bloomberg U.S. Aggregate Bond ETF), evaluated using 250 trading days of history.

Comparing 95% VaR Estimates

Historical Simulation: Rank the 250 daily portfolio returns from worst to best. The 5th percentile is approximately the 13th worst day. Suppose that return was -1.8%. The 95% one-day VaR is 1.8%.

Bootstrap Simulation: Resample 10,000 daily returns with replacement from the same 250-day sample. Take the 5th percentile of the resampled distribution. The point estimate will be similar to historical (-1.75% to -1.85%). To estimate a confidence interval for VaR, repeat the entire process: draw many bootstrap samples of 250 days each, compute the 5% VaR for each sample, then take percentiles of those VaR estimates.

Monte Carlo Simulation: Estimate the portfolio’s mean return (0.04% daily) and volatility (0.9% daily). Assuming normal returns, simulate 10,000 draws. The 5th percentile of a normal distribution is approximately mean – 1.645 × volatility = 0.04% – 1.645 × 0.9% = -1.44%. (This closed-form result is also the analytical variance-covariance VaR; Monte Carlo should converge to it as the simulation count increases.)

Why the estimates differ: Historical and bootstrap preserve whatever fat tails exist in the sample, while Monte Carlo with normality underestimates tail risk if returns have excess kurtosis. In this example, the historical sample contained several days with losses larger than a normal distribution would predict, pushing the historical VaR estimate above the parametric estimate.

For comprehensive VaR methodology, see our Value at Risk guide and detailed method articles on historical VaR and Monte Carlo VaR.

2008 Crisis: How Correlation Spikes Break Simulation Models

Before the 2008 financial crisis, the correlation between U.S. stocks (S&P 500) and investment-grade corporate bond returns (represented by LQD, the iShares investment-grade corporate bond ETF) was moderate — both respond to economic growth, but corporate bond yields also include credit spread and interest rate components. During the Lehman Brothers collapse in September-October 2008, both sold off sharply as credit spreads widened dramatically and risk assets fell together. (Treasuries, by contrast, rallied as a flight-to-quality asset.)

Historical simulation (pre-crisis sample): A risk model calibrated on 2005-2007 data for a stock/corporate-bond portfolio would show moderate VaR because the calm-period sample contained no crisis-level credit spread blowout or simultaneous selloff.

Monte Carlo (pre-crisis parameters): A model using the illustrative calm-period correlation of approximately 0.3 between stock and corporate bond returns would similarly understate risk. When return correlations spiked during the crisis (approaching 0.6-0.8 in some windows), the model’s diversification assumptions failed.

Lesson: Both historical and Monte Carlo methods can understate tail risk when calibrated to a regime that no longer applies. Stress testing with scenario-based correlation shocks — or using filtered historical simulation with crisis periods — helps capture these regime-dependent risks.

Common Mistakes

Simulation methods are powerful but easily misapplied. These are the most frequent errors:

1. Using stale data: A historical sample dominated by a calm market regime will understate risk when volatility spikes. Equally weighting observations from 2017 (low volatility) and 2008 (crisis) may not reflect current conditions.

2. Distribution misspecification: Assuming normality when returns have fat tails leads Monte Carlo to underestimate extreme losses. Financial returns often exhibit excess kurtosis and negative skewness.

3. Ignoring correlation changes: Correlations between assets spike during market stress. A model calibrated to calm-period correlations will understate portfolio risk precisely when diversification benefits disappear.

4. Resampling assets independently: In bootstrap simulation, drawing stock returns and bond returns independently destroys their historical correlation structure. Always resample entire cross-sections (all assets on the same day) together.

5. Insufficient scenarios for tail estimation: Estimating the 99% VaR requires enough simulations that the 1st percentile is stable. With 1,000 simulations, only 10 observations define the 1% tail — too few for reliability. Use 10,000+ for VaR and 100,000+ for Expected Shortfall.

6. Confusing precision with accuracy: Monte Carlo can produce estimates to many decimal places, but that precision is meaningless if the underlying model is wrong. A stable, precise estimate of the wrong number is still wrong.

7. Mixing regimes without adjustment: Combining historical data from different volatility regimes (e.g., pre- and post-crisis) without adjustment can produce estimates that reflect neither regime accurately.

Limitations of Simulation Methods

Important Limitation

All simulation methods are models, not reality. They produce probability estimates, not guarantees. Actual outcomes can — and will — fall outside simulated ranges, especially during unprecedented market events.

Regime changes: The past may not predict the future. Structural changes in markets (new regulations, changed central bank policy, technological disruption) can invalidate historical relationships.

Model risk: Monte Carlo is only as good as its assumptions. The “correct” distribution is unknowable, and small changes in distributional assumptions can produce large changes in tail estimates.

Time horizon limitations: One-day VaR estimates do not scale cleanly to longer horizons. The square-root-of-time rule assumes i.i.d. returns, which is empirically false. Multi-period path simulation is more appropriate for longer horizons.

Nonlinear instruments: Options and structured products require full revaluation at each scenario — delta-normal approximations can severely understate risk when positions are far from the money or when volatility shifts dramatically.

Frequently Asked Questions

No method is universally most accurate — it depends on whether the method’s assumptions match your data and scenario. Historical simulation is accurate when history is representative. Monte Carlo is accurate when the model is correctly specified. The “best” method is the one whose assumptions most closely match reality for your specific portfolio and time horizon.

The number depends on the statistic you are estimating. For 95% VaR, 10,000 Monte Carlo simulations typically provide stable estimates. For 99% VaR or Expected Shortfall, 50,000 to 100,000 simulations may be needed because fewer observations define the extreme tail. For bootstrap, the effective sample size is limited by your historical data length, regardless of how many resamples you generate.

Yes — hybrid methods combine strengths of multiple approaches. Filtered historical simulation applies a GARCH model to standardize historical returns, then rescales them by current volatility estimates. Block bootstrap preserves time-series dependence by resampling blocks of consecutive observations. Copula Monte Carlo combines modeled marginal distributions with an explicitly specified dependence structure.

In Monte Carlo, correlations are explicitly modeled using Cholesky decomposition of the correlation matrix to generate correlated random draws. In bootstrap simulation, correlation is preserved by resampling entire cross-sections — all assets on the same historical date — rather than individual asset returns. Historical simulation preserves correlations automatically since scenarios are actual historical cross-sections.

Use parametric methods (Monte Carlo) when you have confidence in your distributional assumptions and need to model scenarios beyond historical experience — stress tests, derivatives pricing, or hypothetical market conditions. Use non-parametric methods (historical or bootstrap) when you want to avoid imposing distributional assumptions and believe historical observations adequately represent the relevant risk scenarios.

No. Historical simulation uses past returns to estimate future risk distributions. Backtesting evaluates how well a risk model would have performed historically — comparing predicted VaR to actual losses over a past period. Backtesting is a validation technique, not a simulation method. You can backtest any simulation method, including Monte Carlo.

Bootstrap resamples from actual historical observations — it cannot produce scenarios outside the historical range (at the individual-period level). Monte Carlo generates new random draws from a specified distribution, which can produce any scenario the distribution allows, including extreme events that never occurred historically. Bootstrap makes no distributional assumption; Monte Carlo requires one.

Disclaimer

This article is for educational and informational purposes only and does not constitute investment advice. Simulation results are probability estimates based on models and assumptions that may not reflect future market conditions. Always conduct your own research and consult a qualified financial advisor before making investment decisions.

Explore Top Finance Certificates

Access official certificates from Wharton Online & Columbia Business School Executive Education, powered by Wall Street Prep. Save up to $500 with code RYAN.

Simulation Methods: Historical, Bootstrap, and Monte Carlo

What Are Simulation Methods?

Simulation vs Scenario Analysis vs Stress Testing

Historical Simulation

Bootstrap Simulation

Monte Carlo Simulation

Historical Simulation vs Bootstrap vs Monte Carlo

Historical Simulation

Bootstrap Simulation

Monte Carlo Simulation

How to Choose a Simulation Method

Example: VaR Estimation Using All Three Methods

Common Mistakes

Limitations of Simulation Methods

Frequently Asked Questions

Which simulation method is most accurate?

How many simulations are needed for reliable results?

Can I combine simulation methods?

How do I handle correlation in simulation?

When should I use parametric vs non-parametric simulation?

Is historical simulation the same as backtesting?

What is the difference between bootstrap and Monte Carlo?

Disclaimer

Table of Contents

MarketXLS Excel Add-in

Explore Top Finance Certificates

Contact Me

Contact Me