🎯
fundamentalsintermediate18-22 min

Confidence vs Prediction vs Tolerance Intervals: Three Different Questions

A confidence interval describes a parameter, a prediction interval describes the next observation, and a tolerance interval describes a proportion of the population. The math looks similar; the meaning is completely different. Here is exactly how each is constructed, with worked examples.

What You'll Learn

  • Distinguish what each interval covers: a parameter, a future value, or a population proportion.
  • Compute each from a sample of normal data by hand.
  • Pick the right interval for the question the audience is actually asking.

1. Direct Answer: Three Different Targets

A confidence interval (CI) is a range of plausible values for an UNKNOWN PARAMETER — typically the population mean μ. A 95% CI means that if you repeated the sampling many times, 95% of such intervals would contain the true parameter. A prediction interval (PI) is a range of plausible values for the NEXT SINGLE OBSERVATION drawn from the same population. A 95% PI means that 95% of new observations will fall inside it, in the long run. A tolerance interval (TI) covers a SPECIFIED PROPORTION OF THE POPULATION with a stated confidence — a 95/95 TI is a range expected to contain at least 95% of the population, with 95% confidence. CIs shrink as sample size grows because the parameter sits at a fixed point; PIs converge to the population’s ± z·σ width because the observation itself has variance that does not shrink; TIs converge similarly. Misusing one for another is the single most common interval-related mistake in applied work.

Key Points

  • CI: where the PARAMETER probably is.
  • PI: where the NEXT OBSERVATION probably is.
  • TI: where a SPECIFIED PROPORTION of the population is, with stated confidence.

2. Confidence Interval Construction

For the mean of a normal population with unknown variance, the 95% CI is x̄ ± t_{0.025, n−1} × (s / √n). The standard error s/√n shrinks like 1/√n, so doubling the sample roughly cuts the CI width by √2. The width depends on sample size n, variability s, and the chosen confidence level. The CI does NOT tell you where new observations will fall; it tells you where μ is likely to be. A frequent misuse is computing a tight CI from n = 1000 and concluding that 95% of customers will pay within that range — the CI is for the average customer, not the next customer.

Key Points

  • Formula: x̄ ± t_{α/2, n−1} × (s / √n).
  • Width shrinks like 1/√n; doubling n cuts width ~1.4×.
  • CI is about μ, not about new observations.

3. Prediction Interval Construction

For the next single observation from a normal population, the 95% PI is x̄ ± t_{0.025, n−1} × s × √(1 + 1/n). The extra “1 +” inside the square root accounts for the observation’s own variance, which is the dominant term for large n. As n → ∞ the t-multiplier converges to 1.96 and the term √(1+1/n) converges to 1, so the PI width approaches ±1.96σ — about ±1.96 standard deviations regardless of sample size. This is the right number when you need to say “95% of next units will measure between A and B,” but it is wider than a CI and many practitioners do not realize they should be reporting it.

Key Points

  • Formula: x̄ ± t_{α/2, n−1} × s × √(1 + 1/n).
  • Width → ±1.96σ as n grows; does NOT shrink to zero.
  • Use when the question is about a single future observation.

4. Tolerance Interval Construction

A two-sided (p, 1−α) tolerance interval covers at least proportion p of the population with confidence 1 − α. For a normal population, the form is x̄ ± k × s, where k is a tolerance factor that depends on n, p, and α and is read from a table or computed from the non-central t distribution. A 95/95 TI requires a larger k than the 1.96 of a CI or PI — for n = 30 the factor is approximately 2.55 and for n = 100 it drops to about 2.23 (converging to the z-quantile that captures p of a normal, namely 1.96 for p = 0.95, plus a confidence buffer). TIs are the right answer to regulatory and engineering questions like “what range will contain 99% of manufactured parts with 95% confidence?”

Key Points

  • Formula: x̄ ± k × s with tolerance factor k from a table.
  • A 95/95 TI is WIDER than a 95% CI or PI for any finite n.
  • Right tool for regulatory specs and proportion-of-population claims.

5. Worked Example 1: Battery Life Sample (All Three Intervals)

A sample of n = 25 AA batteries shows mean lifetime x̄ = 12.4 hours and s = 1.6 hours, with the data approximately normal. 95% CI for μ: t_{0.025, 24} = 2.064; CI = 12.4 ± 2.064 × (1.6 / √25) = 12.4 ± 0.661 = (11.74, 13.06). 95% PI for next battery: 12.4 ± 2.064 × 1.6 × √(1 + 1/25) = 12.4 ± 2.064 × 1.6 × 1.0198 = 12.4 ± 3.37 = (9.03, 15.77). 95/95 TI: k_{25, 0.95, 0.95} ≈ 2.631; TI = 12.4 ± 2.631 × 1.6 = 12.4 ± 4.21 = (8.19, 16.61). The CI is narrow because n is moderate; the PI is much wider; the TI is widest because it must contain 95% of all batteries with 95% confidence. A manager asking “how long does an average battery last?” gets the CI; an end-user asking “how long will MY next battery last?” gets the PI; a regulator asking “what range covers 95% of batteries?” gets the TI.

Key Points

  • CI (11.74, 13.06): narrow, about the mean.
  • PI (9.03, 15.77): wider, about the next observation.
  • TI (8.19, 16.61): widest, about 95% of the population.

6. Worked Example 2: Concrete Compressive Strength

Twenty cured concrete cylinders have mean compressive strength 4,200 psi with s = 250 psi. A structural engineer needs to certify that the design strength will be met by at least 95% of cylinders with 95% confidence — a textbook tolerance-interval question. With n = 20 and a one-sided 95/95 TI, the tolerance factor k₁ ≈ 2.396. The one-sided lower TI is 4,200 − 2.396 × 250 = 4,200 − 599 = 3,601 psi. The engineer reports that with 95% confidence at least 95% of cylinders will exceed 3,601 psi. A CI of (4,083, 4,317) would be irrelevant — it speaks to the mean, not to individual cylinders, and would over-promise the lower bound of any single cylinder.

Key Points

  • Engineering and regulatory claims about individuals require TIs, not CIs.
  • One-sided TI factor differs from two-sided; tables specify which.
  • A CI here would mislead — it speaks to the mean, not to spec compliance.

7. Reporting and Common Mistakes

Three errors keep recurring. First, reporting a tight CI and treating it as a forecast for individuals — “the 95% CI for income is $48k–$52k” said as if 95% of customers earn in that range. The PI or TI is the right answer. Second, growing the sample size and trumpeting a narrower CI as evidence of more precise individual predictions — only the parameter estimate got tighter; individual variance is unchanged. Third, conflating one-sided and two-sided tolerance factors. A 95% lower one-sided TI (acceptable to spec) is not the same as a 95/95 two-sided TI; the tolerance factor and the conclusion differ. Always state which interval and which side(s) explicitly.

Key Points

  • CIs about parameters, PIs about a single new value, TIs about a population proportion.
  • Large n narrows CIs but NOT PIs or TIs to zero.
  • Specify one-sided versus two-sided for TIs.

8. Running All Three Intervals in StatsIQ

Provide a sample and a confidence level, plus a population proportion for tolerance, and StatsIQ produces the CI, PI, and TI side by side with the right t-multiplier and tolerance factor, choosing the one-sided or two-sided form you need. It also explains which interval answers which framing of the question — parameter, next observation, or population proportion. This content is for educational purposes only.

Key Points

  • All three intervals computed side by side from the same sample.
  • One-sided versus two-sided TI handled explicitly with the correct factor.
  • Wording prompts help match the right interval to the audience’s question.

Key Takeaways

  • CI = x̄ ± t × s/√n shrinks like 1/√n; targets the parameter.
  • PI = x̄ ± t × s × √(1+1/n) converges to ±1.96σ; targets the next observation.
  • TI = x̄ ± k × s with k from non-central t; covers a population proportion at stated confidence.
  • A 95/95 TI is wider than a 95% PI which is wider than a 95% CI for the same data.
  • One-sided versus two-sided TI changes the tolerance factor — state which.

Practice Questions

1. Your CI for mean weekly hours studied is (8.2, 9.4) with n = 200. Why is it wrong to tell a student “you will study between 8.2 and 9.4 hours next week”?
The CI describes where the population MEAN is, not what an individual student will do. The right interval for one student’s next week is a prediction interval, which for the same data would be much wider — perhaps (3.5, 14.1) — reflecting the variance of individual behavior, not the parameter estimate.
2. A vendor reports a 95/95 tolerance interval of (3.4 mm, 3.8 mm) for screw diameter. What does that mean operationally?
With 95% confidence, at least 95% of screws have diameter between 3.4 and 3.8 mm. It does NOT mean exactly 95% of every batch will be in range — only that, across repeated sampling and TI construction, 95% of such intervals will contain at least 95% of the population.
3. For a fixed sample, which interval gets wider FASTEST as you go from 90% to 99%?
The tolerance interval. Both the t-multiplier (or z) and the implicit tail probability for the population proportion change, so the TI factor k grows more steeply than the CI or PI multiplier. The CI grows the slowest because only the t-multiplier changes.

Study with AI

Get personalized help and instant answers anytime.

Download StatsIQ

FAQs

Common questions about this topic

In strict frequentist interpretation, no. The parameter is fixed; the interval is random. The 95% refers to the long-run proportion of similarly constructed intervals that contain the parameter. The common informal usage — “95% chance μ is in here” — is a Bayesian credible-interval interpretation that requires a prior. In Bayesian analysis a 95% credible interval does mean exactly that, conditional on the prior.

Because the next observation has its own intrinsic variance σ², which is a property of the population, not of how much you have measured it. Estimating σ better does not reduce σ. The CI for μ shrinks because μ is a single point you can pin down more precisely with more data; the PI must always be at least ±1.96σ wide for a 95% interval.

When you need to make a claim about a PROPORTION of the population rather than the NEXT observation. Regulatory specifications, engineering quality limits, and process-capability claims (Cpk, Ppk) are tolerance-interval questions. A pharmaceutical batch-release spec saying “at least 95% of tablets weigh between X and Y” is a one-sided or two-sided 95/95 TI claim.

For CIs use bootstrap percentile or BCa methods, or robust alternatives like the median-of-medians. For PIs there are non-parametric versions based on order statistics or quantile regression. For TIs use non-parametric tolerance limits (Wilks, 1941) based on order statistics — for n samples the kth largest and (n−k+1)th smallest order statistics form a non-parametric tolerance interval with calculable coverage probability.

Yes. Phrase your question and StatsIQ matches it to the right interval — parameter (CI), next observation (PI), or proportion of population (TI) — and computes the appropriate form, including non-parametric versions when normality is doubtful. This content is for educational purposes only.

Related Study Guides

Browse All Study Guides