Statistics Study Guides
76 comprehensive guides to help you master statistics concepts, from fundamentals to exam prep.
AP Statistics Exam Prep
A targeted study guide for the AP Statistics exam. Covers exploring data, sampling and experimentation, probability, and statistical inference with exam-specific strategies.
Introduction to Hypothesis Testing
A beginner-friendly guide to the logic and mechanics of hypothesis testing. Learn how to formulate hypotheses, calculate test statistics, interpret p-values, and draw conclusions.
Regression Analysis Complete Guide
A comprehensive guide to regression analysis, from simple linear regression to multiple regression. Covers model fitting, diagnostics, interpretation of coefficients, and common pitfalls.
Probability Foundations
Build a solid understanding of probability, the mathematical language underlying all of statistics. Covers basic rules, conditional probability, Bayes theorem, and common counting methods.
Understanding Statistical Distributions
A guide to the most important probability distributions in statistics. Learn the shapes, parameters, and applications of the normal, binomial, t, chi-square, and other key distributions.
ANOVA and Experimental Design
A comprehensive guide to analysis of variance (ANOVA) and the design of experiments. Covers one-way ANOVA, two-way ANOVA, blocking, randomization, and post-hoc comparisons.
Data Visualization and Descriptive Statistics
Learn how to summarize and visualize data effectively. Covers measures of center, spread, shape, graphical displays, and best practices for communicating data clearly.
Bayesian vs Frequentist Statistics
An exploration of the two major paradigms of statistical inference. Frequentist methods rely on long-run frequencies and fixed parameters. Bayesian methods incorporate prior beliefs and update them with data.
What Is a P-Value? Definition, Interpretation, and Examples
Understand what a p-value actually means, how to interpret it correctly, common misconceptions that lead to wrong conclusions, and how p-values connect to hypothesis testing.
What Is Standard Deviation and How Do You Calculate It?
Learn what standard deviation measures, how to calculate it by hand and interpret it, and why it is the most important measure of spread in statistics.
Correlation vs Causation: What Is the Difference?
Understand why correlation does not imply causation, learn to identify confounding variables and spurious correlations, and know what study designs can establish causal relationships.
Central Limit Theorem: Definition, Formula + 4 Examples
The Central Limit Theorem (CLT) is the single most important theorem in statistics β it is the reason we can make inferences about populations from samples, and it underpins virtually every confidence interval and hypothesis test you will ever encounter. This guide explains what the CLT actually says, why it works, and how to apply it to real problems.
Confidence Intervals: What They Mean, How to Calculate Them, and What They Do NOT Tell You
A confidence interval gives you a range of plausible values for a population parameter based on sample data β and it is one of the most misinterpreted concepts in all of statistics. This guide explains what confidence intervals actually mean (and what they do not mean), how to calculate them for means and proportions, and how to interpret them correctly on exams and in practice.
P-Values and Statistical Significance: What They Actually Mean
A p-value is the probability of observing data as extreme as (or more extreme than) your sample data, assuming the null hypothesis is true. It is NOT the probability that the null hypothesis is true, NOT the probability that your results are due to chance, and NOT the probability of making an error. Getting this definition right is the foundation of all statistical inference.
Chi-Square Tests Explained: Goodness of Fit and Test of Independence
A clear walkthrough of both chi-square tests β goodness of fit (does a distribution match what we expected?) and test of independence (are two categorical variables related?) β with worked examples, assumptions, and interpretation guidelines.
Type I and Type II Errors Explained: Power, Sample Size, and the Trade-Off
Understand the two kinds of mistakes in hypothesis testing, how they relate to each other, what statistical power actually means, and how sample size affects your ability to detect real effects.
Sampling Methods Explained: Random, Stratified, Cluster, and When to Use Each
A practical guide to the four major sampling methods β simple random, stratified, cluster, and systematic β covering how each works, when to use it, common mistakes, and how sampling method affects the conclusions you can draw.
Introduction to Logistic Regression: When and Why Linear Regression Fails for Binary Outcomes
A clear introduction to logistic regression for students who understand linear regression and need to extend to binary outcomes β covering why linear regression breaks down for yes/no predictions, how the logit transformation works, and how to interpret odds ratios.
Effect Size Measures: Cohen's d, Eta-Squared, and Why P-Values Are Not Enough
A practical guide to effect size β what it measures, why statistical significance alone is misleading, how to calculate and interpret Cohen's d and eta-squared, and how reporting effect sizes makes your research more honest and more useful.
Multiple Regression: How to Handle Multiple Predictors and Avoid Multicollinearity
A clear guide to multiple regression for students who understand simple regression and need to extend to two or more predictors β covering the model equation, how to interpret each coefficient, what multicollinearity is and why it wrecks your analysis, and how to detect and fix it.
Non-Parametric Tests: When to Use Mann-Whitney, Wilcoxon, and Kruskal-Wallis
A practical guide to the three most common non-parametric tests covering when parametric assumptions fail, how rank-based tests work, and step-by-step procedures for Mann-Whitney U (two independent groups), Wilcoxon signed-rank (paired data), and Kruskal-Wallis (three or more groups).
How to Handle Outliers in Your Data: Detection Methods and Decision Framework
A practical guide to outlier detection and treatment covering how to identify outliers using statistical methods (IQR rule, z-scores) and visual methods (boxplots, scatterplots), the decision framework for keeping, transforming, or removing them, and the consequences of getting it wrong.
A/B Testing Done Right: Experiment Design, Sample Size, and Avoiding False Discoveries
A practical guide to A/B testing covering how to design valid experiments, calculate the sample size you actually need, choose the right statistical test, interpret results without fooling yourself, and avoid the most common mistakes that produce false discoveries in industry A/B tests.
Data Cleaning and Preprocessing: The Unglamorous Work That Determines Whether Your Analysis Is Worth Anything
A practical guide to data cleaning covering how to diagnose data quality problems (missing values, duplicates, inconsistencies, outliers), the decision framework for handling each type, common preprocessing transformations, and why data cleaning is where most analyses go wrong β not in the modeling.
Survival Analysis: Time-to-Event Data, Kaplan-Meier Curves, and Cox Proportional Hazards Regression
A practical guide to survival analysis covering censored data, Kaplan-Meier estimation, log-rank tests, and Cox proportional hazards regression β the core toolkit for analyzing how long it takes for events to occur.
Introduction to Causal Inference: Why Correlation Is Not Enough and What Actually Establishes Causation
A rigorous introduction to causal inference covering why observational correlations mislead, the potential outcomes framework, confounding, DAGs, and the key methods β randomized experiments, difference-in-differences, instrumental variables, and regression discontinuity β that let researchers make causal claims from imperfect data.
Time Series Analysis: How to Decompose Trend, Seasonality, and Noise β and Why Your Forecast Depends on Getting It Right
A practical guide to time series analysis covering the components of a time series (trend, seasonality, cyclicality, noise), decomposition methods, stationarity and differencing, and the core forecasting models (ARIMA, exponential smoothing) with enough worked examples to actually use them.
Principal Component Analysis (PCA): Reducing Dimensions Without Losing What Matters
A practical guide to PCA covering why high-dimensional data is hard to work with, how PCA finds the directions of maximum variance, the mechanics of eigenvalues and eigenvectors in plain language, how to choose the right number of components, and common mistakes that produce misleading results.
How to Choose the Right Statistical Test: A Decision Flowchart for Every Common Scenario
The most practical guide in statistics: given your data type and research question, which test do you use? A decision flowchart covering t-tests, ANOVA, chi-square, correlation, regression, and non-parametric alternatives β with the criteria for choosing each one.
Two-Sample t-Test Step by Step: Hypotheses, Calculation, and Interpretation With a Worked Example
A complete step-by-step walkthrough of the independent two-sample t-test β from stating hypotheses through calculating the test statistic, finding the p-value, and writing the conclusion. Includes a fully worked numerical example that you can follow along with.
How to Interpret Regression Output: Coefficients, R-Squared, p-Values, and What They Mean
A practical guide to reading regression output from any statistical software β covering what each number means, how to interpret coefficients (slope and intercept), RΒ² and adjusted RΒ², the F-test, coefficient p-values, confidence intervals, and the mistakes students make when interpreting results on exams.
Paired vs Independent t-Test: When to Use Which and Why It Matters for Your Results
A clear guide to choosing between the paired (dependent) t-test and the independent (two-sample) t-test β covering the key distinction (same subjects vs different subjects), how each test calculates the t-statistic differently, why using the wrong test inflates your error rate, and worked examples showing when each applies.
How to Read ANOVA Output: Sum of Squares, Mean Square, F-Statistic, and Post-Hoc Tests
ANOVA tables show up in every statistics course and every research paper that compares group means, but most students stare at the rows and columns without knowing what any of them actually mean. This guide walks through every piece of ANOVA output β sum of squares, degrees of freedom, mean square, the F-statistic, and the p-value β and then explains what to do after a significant result with post-hoc tests.
Z-Scores and the Standard Normal Table: How to Calculate and Look Up Probabilities
Z-scores convert any normally distributed value into a standard scale, and the standard normal table (z-table) turns those scores into probabilities. This guide covers the full workflow: calculating z-scores, reading the z-table correctly, handling left-tail and right-tail areas, working between two z-values, and applying z-scores to real problems involving percentiles and probability.
R Squared (RΒ²) Explained: Formula, Interpretation, and Worked Examples
A complete guide to the coefficient of determination (RΒ²) β covering the formula 1 - SS_res / SS_tot, how to compute the sum of squared residuals and total sum of squares, what RΒ² means in regression analysis, and how to interpret values from 0 to 1 with worked examples.
Binomial Probability Formula P(X=k): Worked Examples
A complete guide to the binomial probability formula β covering the four conditions for a binomial experiment, the formula derivation, how to compute binomial coefficients, worked examples for exact and at-least probabilities, and how to use the binomial in hypothesis testing.
Expected Value and Variance: Formulas + 6 Worked Examples
A complete guide to calculating expected value E(X) and variance Var(X) for discrete random variables β covering the formulas, step-by-step worked examples, the shortcut formula for variance, and applications in probability and statistics courses.
Standard Error vs Standard Deviation: What's the Difference and When to Use Each
A clear explanation of the difference between standard deviation (SD) and standard error (SE) β two concepts that are commonly confused but measure completely different things. Covers what each one represents, how they are calculated, when to report each, and the common mistakes students make.
Margin of Error and Sample Size: How to Calculate Each and Why They're Connected
A clear guide to margin of error and sample size calculations β covering what margin of error means, how to calculate it for proportions and means, how to determine the sample size needed for a desired margin of error, and the practical tradeoffs in survey and experiment design.
Contingency Tables and Two-Way Tables: How to Build, Read, and Test for Association
A practical guide to contingency tables (two-way tables) β covering how to construct them from raw data, how to calculate row and column percentages, how to test for association between two categorical variables using the chi-square test, and the common mistakes students make when interpreting two-way tables.
Poisson Distribution: Formula, When to Use, and Worked Examples for Students
A complete guide to the Poisson distribution β covering the formula, the conditions that make a Poisson distribution appropriate, worked examples for counts of events in time/space, and the relationship to the binomial distribution.
Cohen's d Effect Size: Formula, 0.2 / 0.5 / 0.8 Thresholds
A complete guide to Cohen's d β covering the formula, how to calculate it from typical study data, the standard interpretation thresholds, and why effect sizes are essential alongside p-values in modern statistical reporting.
Pearson vs Spearman vs Kendall Correlation Coefficient: Formulas, Differences, and Which to Use
Three correlation coefficients dominate statistical practice β Pearson, Spearman, and Kendall. They measure different things and work with different data types. Learn the formulas, the assumptions, when each is appropriate, and the worked examples that show exactly when they agree and disagree.
One-Tailed vs Two-Tailed Hypothesis Tests: When to Use Each with Worked Examples
Choosing between a one-tailed and two-tailed hypothesis test is one of the most consequential and most commonly botched decisions in applied statistics. Learn the formal definitions, the conditions under which each is appropriate, the penalty for choosing incorrectly, and worked examples across t-tests, z-tests, and proportion tests.
Normal Distribution Probability: Z-Score to Area Worked Examples (Step-by-Step)
Every intro statistics class revolves around the normal distribution and the z-score. But the worked-example side β given a real-world problem, produce the probability β is where students get stuck. This guide walks through the full translation from real-world question to z-score to area to probability, with examples for left tail, right tail, between-value, and outside ranges.
Linear Regression Assumptions: How to Check Residuals, Homoscedasticity, and Normality
Linear regression gives you coefficients and p-values β but those are only trustworthy if the underlying assumptions hold. This guide walks through the five key assumptions, how to check each using residual plots and diagnostic tests, and what to do when assumptions are violated.
Mean vs Median: When to Use Each Measure of Central Tendency (Worked Examples)
The mean and median both describe the center of a distribution, but they give different answers on skewed data β sometimes dramatically different. This guide walks through when to use each, how outliers affect them, and worked examples from income, test scores, housing prices, and clinical data.
Confidence vs Prediction vs Tolerance Intervals: Which to Use and Worked Examples
Three different intervals quantify different types of uncertainty: confidence intervals for parameter estimation, prediction intervals for single future observations, and tolerance intervals for proportions of the population. This guide walks through each with worked examples and shows when to choose which.
Two-Way ANOVA: Main Effects, Interactions, and Worked Examples
Two-way ANOVA tests two factors simultaneously. This guide walks through main effects, interaction effects, and the F-tests β with numerical examples you can reproduce step-by-step.
Statistical Power and Sample Size: Beating Type II Error
Power is the probability of detecting a real effect. Sample size planning uses power analysis to decide how much data you need. This guide walks through the full relationship between alpha, beta, effect size, and sample size with worked examples.
Conditional Probability and Bayes Theorem: Worked Examples
Conditional probability quantifies how an event probability changes given new information; Bayes theorem inverts the conditioning. This guide walks through the formulas with 6 worked examples covering medical screening, courtroom probability, and the classic Monty Hall problem.
Permutations vs Combinations: Worked Examples and the Decision Framework
Permutations count arrangements where order matters; combinations count selections where order doesn't. Choosing the wrong formula is the most common error in counting problems. This guide gives the decision framework and 6 worked examples covering passwords, lottery tickets, committee selection, and arrangements with repetition.
Log Transformations in Regression: Linear-Log, Log-Linear, and Log-Log Interpretation (Worked Examples)
How to interpret coefficients when one or both variables in a regression are log-transformed. Covers linear-linear, log-linear (semi-log), linear-log (semi-log), and log-log (double-log / elasticity) models with worked examples and the percent-change interpretations.
Simpson's Paradox: Confounding Variables and Stratification (Worked Examples)
How to recognize, diagnose, and resolve Simpson's Paradox β when an aggregated trend reverses or disappears within subgroups. Covers the classic Berkeley admissions case, kidney stone treatment data, and the stratification analysis that reveals the true relationship.
Hypothesis Testing: The Complete Guide With 6 Worked Tests
A pillar guide to statistical hypothesis testing covering the seven-step framework, all major tests (z-test, t-test, paired and two-sample t-tests, ANOVA, chi-square, Mann-Whitney), p-values, type I and type II errors, statistical power, and a decision tree for choosing the right test. Includes worked examples for each test family and a power-and-sample-size lookup table.
Probability Distributions: The Complete Guide With Decision Tree
A pillar guide to the most important probability distributions in applied statistics β Bernoulli, binomial, Poisson, geometric, normal, t, chi-square, F, uniform, exponential, beta β with worked probability calculations, a decision tree for choosing the right distribution, and a Central Limit Theorem walkthrough that ties them all together.
Central Limit Theorem: Worked Examples and Simulation
A focused cluster guide on the Central Limit Theorem with multiple worked sampling examples at n = 5, 30, and 100 from skewed and uniform populations, demonstrating convergence to normality of the sample mean. Includes the connection to bootstrap methods.
Type I vs Type II Errors: Worked Examples and Tradeoffs
A focused walkthrough of Type I and Type II errors in hypothesis testing: definitions, probability notation, the alpha/beta tradeoff, statistical power, and four worked examples in different fields (medical screening, A/B testing, manufacturing QA, criminal trials).
P-Value Interpretation: Common Mistakes and Correct Reading
A focused walkthrough of how to correctly interpret p-values: the technical definition, the four most common misinterpretations, why p-values are not the probability that H0 is true, the relationship to confidence intervals, and worked examples showing correct versus incorrect interpretation.
One-Tailed vs Two-Tailed Tests: When to Use Each
A complete comparison of one-tailed and two-tailed hypothesis tests: the technical difference, when each is appropriate, the criteria for choosing pre-specified direction, the danger of post-hoc direction selection, and worked examples showing identical data analyzed under both choices.
Binomial vs Poisson Distribution: When to Use Each
A focused comparison of two of the most common discrete probability distributions: the binomial (fixed number of trials, two outcomes) and the Poisson (count of events in a fixed interval, rate-based). Covers the assumptions, formulas, when each fits the data, and the Poisson-as-binomial-limit relationship.
Normal Distribution and Z-Scores: Worked Examples
A focused walkthrough of the normal distribution and z-scores: properties of the bell curve, the empirical rule, computing z-scores, looking up probabilities, and worked examples covering exam scores, manufacturing tolerances, and quality control charts.
Discrete vs Continuous Distributions: How to Choose
A practical guide to choosing between discrete and continuous probability distributions: the conceptual difference, four common discrete distributions, four common continuous distributions, decision criteria for each, and a flowchart for matching data to distribution.
Chi-Square Goodness-of-Fit Test: Step-by-Step Worked Examples
A walkthrough of the chi-square goodness-of-fit test from null hypothesis through expected counts, the chi-square statistic, degrees of freedom, and decision rule β with three worked examples (fair die, genetic ratios, Benfordβs law) and the assumption checks most students skip.
Mann-Whitney U Test vs t-Test: When to Use Which (Worked Examples)
A practical comparison of the Mann-Whitney U test (Wilcoxon rank-sum) and the independent-samples t-test β when normality assumptions justify the t-test, when ranks are the safer choice, and how to compute and interpret both with two worked examples.
Exponential Distribution: Formula, Applications, and Worked Examples
The exponential distribution explained β PDF, CDF, mean, variance, the memoryless property, and how to use it for time-between-events problems (call center waiting, equipment failure, queue arrivals) with three worked examples.
Geometric vs Negative Binomial Distribution: When to Use Each
Side-by-side comparison of the geometric (trials until first success) and negative binomial (trials until r-th success) distributions β formulas, mean and variance, two worked examples, and the conceptual link to the binomial.
Wilcoxon Signed-Rank Test vs Paired t-Test: When to Use Each
The paired t-test and the Wilcoxon signed-rank test both compare two related measurements. Here is exactly how each works, when the non-parametric version wins, and two fully worked examples.
Kruskal-Wallis Test vs One-Way ANOVA: When to Use Each
The Kruskal-Wallis test is the non-parametric alternative to one-way ANOVA for three or more independent groups. Here is how each works, when to switch, the H statistic by hand, and the correct post-hoc.
Tukey HSD Post-Hoc Test After ANOVA: Worked Examples
A significant ANOVA tells you the group means differ β Tukeyβs HSD tells you which pairs. Here is the studentized range, the HSD formula, a full comparison table, and how Tukey stacks up against Bonferroni, ScheffΓ©, and Dunnett.
Relative Risk vs Odds Ratio: 2Γ2 Table Worked Examples
Relative risk and the odds ratio both measure association in a 2Γ2 table, but they answer different questions and apply to different study designs. Here is how to compute each, when each is valid, and why the odds ratio overstates risk for common outcomes.
Friedman Test vs Repeated-Measures ANOVA: When to Use Each
Repeated-measures ANOVA tests means across three or more related conditions; the Friedman test does the same job non-parametrically with ranks. Here is exactly how each works, when ranks beat means, and two fully worked examples.
Spearman vs Pearson Correlation: When to Use Each
Pearson measures linear association on interval data; Spearman measures monotonic association using ranks. Here is exactly how each works, when to switch, and two fully worked examples β including the cases where r and rho disagree dramatically.
Bonferroni vs Holm vs FDR: Multiple Comparisons Correction Worked
When you run many hypothesis tests, some will be βsignificantβ by pure chance. Here is exactly how Bonferroni, Holm-Bonferroni, and the Benjamini-Hochberg FDR procedure differ β with worked examples of each on the same 10 p-values.
Confidence vs Prediction vs Tolerance Intervals: Three Different Questions
A confidence interval describes a parameter, a prediction interval describes the next observation, and a tolerance interval describes a proportion of the population. The math looks similar; the meaning is completely different. Here is exactly how each is constructed, with worked examples.
A-Priori vs Post-Hoc Power Analysis: Sample-Size Planning Done Right
A-priori power analysis tells you how many subjects to collect; post-hoc power analysis is widely misused. Here is exactly how each works, what observed power actually means, and four worked sample-size calculations across t-test, ANOVA, correlation, and chi-square.