Question 1

Does this rule apply to non-normal distributions?

Accepted Answer

Only approximately, and only for distributions that resemble a normal — symmetric, bell-shaped, with light tails.  For skewed distributions (income, waiting times) the rule overestimates the central mass.  For heavy-tailed distributions (stock returns, earthquake magnitudes) the rule dramatically underestimates the tail probabilities.

Question 2

Why these particular percentages?

Accepted Answer

They come from integrating the normal density between ±1σ, ±2σ, ±3σ.  The exact values are 68.27%, 95.45%, 99.73% — but the rounded 68, 95, 99.7 are easier to memorise and accurate enough for back-of-envelope work.

Question 3

Where does the normal distribution come from?

Accepted Answer

From the Central Limit Theorem: the sum (or average) of many independent random variables, each with finite variance, converges to a normal distribution.  That's why measurement errors, test scores, and many natural quantities turn out to be approximately normal.

Question 4

What's beyond 3σ?

Accepted Answer

About 0.27% — roughly 1 in 370 observations.  Beyond 4σ: 1 in 16,000.  Beyond 5σ: 1 in 1.7 million.  The tails of a normal are spectacularly thin, which is why real-world data with rare extreme events (financial crashes, viral content) is poorly modelled as normal.

The 68–95–99.7 rule for the normal distribution

What this shows

Where it shows up

Frequently asked questions

Related topics