Understanding Statistical Distributions

A guide to the most important probability distributions in statistics. Learn the shapes, parameters, and applications of the normal, binomial, t, chi-square, and other key distributions.

What You'll Learn

✓ Identify and describe the key properties of major statistical distributions.
✓ Apply the normal and binomial distributions to solve probability problems.
✓ Understand how sampling distributions underpin statistical inference.

1. The Normal Distribution

The normal (Gaussian) distribution is the most important distribution in statistics. It is symmetric and bell-shaped, completely described by its mean and standard deviation. The empirical rule and z-scores provide quick probability estimates.

Key Points

• About 68% of values fall within one standard deviation of the mean, 95% within two, and 99.7% within three (the 68-95-99.7 rule).
• A z-score converts any normal value to the standard normal scale: z = (x - mu) / sigma.
• The Central Limit Theorem says that, for a population with finite variance, the sampling distribution of the mean is approximately normal when the sample size is large.
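
The empirical rule and the z-score formula can be checked numerically. The sketch below uses Python's standard-library `statistics.NormalDist`. The helper names `prob_within` and `z_score`, and the example values (130, 100, 15), are illustrative assumptions and not part of the guide.

```python
from statistics import NormalDist

# Standard normal distribution: mean 0, standard deviation 1.
std_normal = NormalDist(mu=0.0, sigma=1.0)

def prob_within(k: float) -> float:
    """Probability that a standard normal value lies within k SDs of the mean."""
    return std_normal.cdf(k) - std_normal.cdf(-k)

def z_score(x: float, mu: float, sigma: float) -> float:
    """Convert a raw value to a z-score: z = (x - mu) / sigma."""
    return (x - mu) / sigma

# Reproduce the 68-95-99.7 rule.
for k in (1, 2, 3):
    print(f"within {k} SD: {prob_within(k):.4f}")

# Hypothetical example: a value of 130 when the mean is 100 and the SD is 15.
print(z_score(130.0, mu=100.0, sigma=15.0))
```

The printed probabilities should come out close to 0.68, 0.95, and 0.997, which is where the rule's name comes from.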

2. The Binomial Distribution

The binomial distribution models the number of successes in a fixed number of independent trials, each with the same probability of success. It is the foundation for inference about proportions.

Key Points

• Parameters: n (number of trials) and p (probability of success on each trial).
• Mean = np and standard deviation = sqrt(np(1-p)).
• The normal distribution approximates the binomial well when np >= 10 and n(1-p) >= 10 (a common rule of thumb).
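
As a minimal sketch of the formulas above, the following Python computes binomial probabilities, the mean and standard deviation, and the rule-of-thumb check for the normal approximation. The function names and the example values n = 50, p = 0.3 are assumptions chosen for illustration.

```python
from math import comb, sqrt

def binomial_pmf(k: int, n: int, p: float) -> float:
    """P(X = k) for X ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

def binomial_mean_sd(n: int, p: float) -> tuple[float, float]:
    """Mean np and standard deviation sqrt(np(1-p))."""
    return n * p, sqrt(n * p * (1 - p))

def normal_approx_ok(n: int, p: float, threshold: float = 10) -> bool:
    """Rule of thumb: both np and n(1-p) should be at least the threshold."""
    return n * p >= threshold and n * (1 - p) >= threshold

# Hypothetical example values.
n, p = 50, 0.3
mean, sd = binomial_mean_sd(n, p)
print(f"mean = {mean:.2f}, sd = {sd:.2f}")
print("normal approximation reasonable:", normal_approx_ok(n, p))
print(f"P(X = 15) = {binomial_pmf(15, n, p):.4f}")
```

The threshold of 10 follows the rule stated above; some textbooks use 5 instead, so it is left as a parameter.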

3. Sampling Distributions and the t, Chi-Square, and F Distributions

Sampling distributions describe how a statistic varies across repeated samples. The t-distribution is used for means with unknown sigma, the chi-square for variance and categorical tests, and the F-distribution for comparing variances in ANOVA.

Key Points

• The t-distribution has heavier tails than the normal, accounting for extra uncertainty with small samples.
• The chi-square distribution is right-skewed and used in goodness-of-fit tests and tests of independence.
• The F-distribution is the distribution of the ratio of two independent chi-square variables, each divided by its degrees of freedom, and is the basis of the ANOVA F-test.
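
One way to see how these distributions relate is to build them from standard normal draws. The simulation below is a sketch using only the standard library. It relies on the textbook constructions (a chi-square variable as a sum of squared standard normals, t as a normal divided by the square root of a scaled chi-square, F as a ratio of two scaled chi-squares). The degrees of freedom, seed, and sample size are arbitrary assumptions.

```python
import random
from math import sqrt
from statistics import NormalDist, fmean

def chi_square_draw(rng: random.Random, df: int) -> float:
    """Sum of df squared standard normal draws."""
    return sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(df))

def t_draw(rng: random.Random, df: int) -> float:
    """Standard normal divided by sqrt(chi-square / df)."""
    return rng.gauss(0.0, 1.0) / sqrt(chi_square_draw(rng, df) / df)

def f_draw(rng: random.Random, df1: int, df2: int) -> float:
    """Ratio of two independent chi-squares, each divided by its df."""
    return (chi_square_draw(rng, df1) / df1) / (chi_square_draw(rng, df2) / df2)

rng = random.Random(0)   # fixed seed so the run is repeatable
n_draws = 20_000         # arbitrary simulation size

chi_draws = [chi_square_draw(rng, 5) for _ in range(n_draws)]
t_draws = [t_draw(rng, 5) for _ in range(n_draws)]
f_draws = [f_draw(rng, 5, 20) for _ in range(n_draws)]

# Share of draws more than 2 units from zero: t (df = 5) versus standard normal.
t_tail = sum(abs(t) > 2 for t in t_draws) / n_draws
normal_tail = 2 * (1 - NormalDist().cdf(2))

print("smallest chi-square draw:", min(chi_draws))
print("chi-square sample mean (theory: df = 5):", fmean(chi_draws))
print("t tail share vs normal tail share:", t_tail, normal_tail)
print("F sample mean (theory: df2 / (df2 - 2)):", fmean(f_draws))
```

The chi-square draws are never negative, and the t tail share should be noticeably larger than the normal one, matching the heavier-tails point above.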

Key Takeaways

★ The normal distribution is symmetric and unimodal, so the mean, median, and mode are all equal.
★ A binomial random variable counts successes in a fixed number of trials; a geometric random variable counts trials up to and including the first success.
★ The t-distribution approaches the normal as degrees of freedom increase.
★ Chi-square values are always non-negative because they are sums of squared terms.

Practice Questions

1. Scores on a test are normally distributed with mean 500 and SD 100. What percentage of test-takers score above 700?

z = (700 - 500) / 100 = 2.0. Using the standard normal table, P(Z > 2) is approximately 0.0228, so about 2.28% of test-takers score above 700.
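
The table lookup can be cross-checked in code. This is a short sketch with the standard-library `NormalDist`; the variable names are illustrative.

```python
from statistics import NormalDist

# Test scores: normal with mean 500 and standard deviation 100.
scores = NormalDist(mu=500, sigma=100)

z = (700 - scores.mean) / scores.stdev   # z-score for a score of 700
p_above = 1 - scores.cdf(700)            # upper-tail probability P(X > 700)

print(f"z = {z:.1f}, P(X > 700) = {p_above:.4f}")
# prints: z = 2.0, P(X > 700) = 0.0228
```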

2. A fair coin is flipped 20 times. What is the probability of exactly 10 heads?

Using the binomial formula: P(X=10) = C(20,10) * (0.5)^10 * (0.5)^10 = 184,756 / 1,048,576, which is approximately 0.176 or 17.6%.
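
The same arithmetic can be verified with `math.comb`, as a minimal sketch of the calculation above (variable names are illustrative):

```python
from math import comb

n, k, p = 20, 10, 0.5

ways = comb(n, k)                          # number of ways to choose 10 heads from 20 flips
prob = ways * p**k * (1 - p) ** (n - k)    # binomial probability P(X = 10)

print(ways, 2**n, round(prob, 4))
# prints: 184756 1048576 0.1762
```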

FAQs

Common questions about this topic

Why is the normal distribution so important?

The normal distribution appears throughout statistics for two reasons: many natural phenomena are approximately normal, and the Central Limit Theorem says that means of sufficiently large samples are approximately normal for almost any population shape (the population needs a finite variance). This enables inference even when the original data are not normal.

How do I choose which distribution to use?

Match the distribution to the context. Counting successes in fixed trials? Binomial. Measuring a continuous variable? Often normal. Testing means with unknown sigma? t-distribution. Comparing variances or testing categorical data? Chi-square or F. The type of data and research question determine the distribution.

Related Study Guides

Probability Foundations

Introduction to Hypothesis Testing

Expected Value and Variance: Formulas + 6 Worked Examples

Central Limit Theorem: Definition, Formula + 4 Examples