The bell curve, z-scores, the 68-95-99.7 rule, and the law of large numbers
The normal distribution (or Gaussian distribution) is the single most important probability distribution in quantitative finance. Its symmetric, bell-shaped density curve arises naturally whenever a quantity is influenced by many small, independent effects. Daily stock returns, measurement errors, and portfolio P&L fluctuations all tend to cluster around a central value and thin out in the tails — precisely the shape the normal distribution captures.
A continuous random variable follows a normal distribution with mean and standard deviation if its probability density function is
The two parameters fully characterize the distribution:
We write to denote that is normally distributed.
For any normal distribution, the probability of falling within one, two, or three standard deviations of the mean is fixed:
This rule gives quick mental estimates. If someone tells you a hedge fund's daily P&L has mean $0 and standard deviation $1M, you immediately know that a $3M loss day should occur roughly of the time — about once every 750 trading days.
Any normal variable can be converted to a standard normal via the transformation
This -score tells you how many standard deviations sits above or below the mean. Standardization lets you use a single standard normal table (or function ) for every normal problem.
Key standard normal values worth memorizing:
| | | |-----|-----------| | 1.00 | 0.8413 | | 1.65 | 0.9505 | | 1.96 | 0.9750 | | 2.33 | 0.9901 |
Before asking what distribution the sample mean follows, we should ask: does it converge at all? The Law of Large Numbers (LLN) answers yes.
If are iid with finite mean , then the sample mean converges to as . The weak LLN says convergence is in probability; the strong LLN says it is almost sure.
In finance, LLN is why backtests work: if you evaluate a strategy over enough independent trades, the average P&L converges to the true expected P&L. The catch is "enough" — for heavy-tailed distributions (which real returns exhibit), convergence can be slow.
The Central Limit Theorem (covered in the next lesson) goes further: not only does converge to (LLN), but its distribution is approximately normal for large . Because many financial quantities — a portfolio return, for instance — are sums of many small, roughly independent contributions, the normal approximation is remarkably useful.
A stock's daily log-return is modeled as . What is the probability that the return on a given day exceeds ?
Step 1 — Standardize. Compute the -score for :
Step 2 — Look up the probability. From the standard normal table, .
Step 3 — Complement. We want :
There is roughly a 7% chance the stock gains more than 3% on any given day.
Interview tip: Many quant interviews test whether you can quickly standardize and use the normal CDF. Memorize , , and — these three values cover most back-of-the-envelope calculations.
The normal (Gaussian) distribution is the most important continuous distribution.
68-95-99.7 rule: ~68% of data within 1σ, ~95% within 2σ, ~99.7% within 3σ.
Central Limit Theorem: The sum of many independent random variables tends toward normal regardless of their individual distributions.