Introduction to Hypothesis Testing

A beginner-friendly guide to the logic and mechanics of hypothesis testing. Learn how to formulate hypotheses, calculate test statistics, interpret p-values, and draw conclusions.

What You'll Learn

✓ Understand the logical framework behind hypothesis testing.
✓ Learn to set up null and alternative hypotheses correctly.
✓ Interpret p-values and draw proper conclusions about statistical significance.

1. The Logic of Hypothesis Testing

Hypothesis testing follows a logic similar to proof by contradiction, with probabilities in place of certainties. We assume the null hypothesis is true and then ask how likely data at least as extreme as ours would be under that assumption. If such data would be very unlikely, we reject the null.

Key Points

• The null hypothesis (H0) represents no effect or no difference; the alternative (Ha) represents the effect or difference we are looking for evidence of.
• We never prove the alternative; we either find enough evidence to reject the null or we fail to reject it.
• Failing to reject the null does not mean the null is true; it means we lack sufficient evidence against it.
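
The reasoning above can be sketched with a coin-flip example. This is a minimal illustration, not a definitive implementation: the data (60 heads in 100 flips) and the function names are hypothetical, and the two-sided p-value is computed by summing the probability of every outcome that is no more likely than the observed one under H0, which is one common convention for exact tests.

```python
from math import comb


def binomial_pmf(k: int, n: int, p: float) -> float:
    """Probability of exactly k successes in n independent trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def two_sided_p_value(successes: int, trials: int, p_null: float = 0.5) -> float:
    """Exact two-sided p-value: total probability, under H0, of every
    outcome that is no more likely than the one observed."""
    observed = binomial_pmf(successes, trials, p_null)
    tolerance = 1 + 1e-9  # guards against floating-point ties
    probs = [binomial_pmf(k, trials, p_null) for k in range(trials + 1)]
    return sum(prob for prob in probs if prob <= observed * tolerance)


# Hypothetical data: 60 heads in 100 flips. H0: the coin is fair (p = 0.5).
p_value = two_sided_p_value(60, 100)
print(f"p-value = {p_value:.4f}")
```

Assuming the coin is fair, the code asks how much probability sits on outcomes at least as surprising as 60 heads. A small value counts as evidence against H0; a large value means only that the data are compatible with H0, not that H0 is true.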

2. P-values and Significance

The p-value quantifies how surprising the observed data would be if the null hypothesis were true. A small p-value suggests the data are inconsistent with the null. The significance level alpha, chosen before looking at the data, is the threshold for making a decision.

Key Points

• The p-value is the probability of obtaining results at least as extreme as those observed, assuming H0 is true.
• If p-value <= alpha, reject H0; if p-value > alpha, fail to reject H0.
• The most common significance level is alpha = 0.05, but this is a convention, not a universal rule.

3. Drawing Conclusions

A proper conclusion states the decision in the context of the problem. Statistical significance does not always imply practical significance, and the size of the effect matters as much as whether it is statistically detectable.

Key Points

• Always state conclusions in the context of the original research question, not just in terms of rejecting or failing to reject.
• Statistical significance depends on sample size; a trivially small effect can be significant with a huge sample.
• Report effect sizes and confidence intervals alongside p-values for a fuller picture.
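
To illustrate the points above, the sketch below reports a p-value, a standardized effect size, and a confidence interval for the same small mean difference at two sample sizes. All values are hypothetical (a difference of 0.3 units with a known standard deviation of 15), and the helper name summarize is invented for this example.

```python
from math import sqrt
from statistics import NormalDist


def summarize(mean_diff: float, sigma: float, n: int, confidence: float = 0.95) -> dict:
    """Return effect size, two-sided z-test p-value, and confidence interval."""
    standard_error = sigma / sqrt(n)
    z = mean_diff / standard_error
    p_value = 2 * NormalDist().cdf(-abs(z))
    # Critical value for the requested confidence level (about 1.96 for 95%)
    margin = NormalDist().inv_cdf(0.5 + confidence / 2) * standard_error
    return {
        "effect_size": mean_diff / sigma,  # standardized difference
        "p_value": p_value,
        "ci": (mean_diff - margin, mean_diff + margin),
    }


# Same tiny effect, very different sample sizes (hypothetical values)
small = summarize(mean_diff=0.3, sigma=15, n=100)
large = summarize(mean_diff=0.3, sigma=15, n=100_000)

for label, result in (("n = 100", small), ("n = 100,000", large)):
    low, high = result["ci"]
    print(f"{label}: effect size = {result['effect_size']:.2f}, "
          f"p = {result['p_value']:.2g}, 95% CI = ({low:.2f}, {high:.2f})")
```

The effect size is identical in both runs; only the sample size changes. The large sample makes the difference statistically detectable without making it any larger in practical terms.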

Key Takeaways

★ A p-value of 0.03 means there is a 3% chance of observing data at least this extreme if the null hypothesis is true.
★ Rejecting the null at alpha = 0.05 does not mean there is only a 5% chance the null is true.
★ The power of a test is the probability of correctly rejecting a false null hypothesis.
★ All else being equal, increasing sample size increases power without changing the significance level.
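
The relationship between sample size and power can be sketched for a simple case. The code assumes a one-sided, one-sample z-test with known standard deviation, where effect_size is the true mean shift divided by that standard deviation; the value 0.3 and the function name are hypothetical choices for illustration.

```python
from math import sqrt
from statistics import NormalDist


def power_one_sided_z(effect_size: float, n: int, alpha: float = 0.05) -> float:
    """Power of a one-sided z-test: P(reject H0) when the true
    standardized effect is effect_size."""
    z_critical = NormalDist().inv_cdf(1 - alpha)
    # Under the alternative, the test statistic is shifted by effect_size * sqrt(n)
    return 1 - NormalDist().cdf(z_critical - effect_size * sqrt(n))


# Hypothetical standardized effect of 0.3 at several sample sizes
powers = {n: power_one_sided_z(0.3, n) for n in (10, 50, 200)}
for n, power in powers.items():
    print(f"n = {n:>3}: power = {power:.3f}")
```

Alpha stays at 0.05 throughout; only n changes. When the true effect is zero, the same formula returns alpha, which is the false positive rate.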

Practice Questions

1. A test yields a p-value of 0.08. At alpha = 0.05, what is the conclusion?

Fail to reject the null hypothesis. Since 0.08 > 0.05, there is not sufficient evidence at the 5% significance level to reject the null. This does not show that the null is true; a p-value this close to the threshold may warrant further investigation with a larger sample.

2. What is the difference between statistical significance and practical significance?

Statistical significance means the observed result would be unlikely if the null hypothesis were true (small p-value). Practical significance means the effect is large enough to matter in the real world. A study with a very large sample might find a statistically significant but practically trivial difference.

FAQs

Common questions about this topic

Can a p-value tell me the probability that my hypothesis is true?

No. The p-value is the probability of data at least as extreme as those observed, given that the null hypothesis is true. It is not the probability that the null hypothesis is true or false. Bayesian methods are needed to make probability statements about hypotheses.

Why is 0.05 used as the significance level?

The 0.05 threshold is a convention popularized by Ronald Fisher. It corresponds to a 1-in-20 chance of a false positive when the null hypothesis is true, which was deemed a reasonable balance between being too strict and too lenient. Different fields may use stricter thresholds (e.g., the five-sigma standard in particle physics, a one-sided p-value of about 3 × 10^-7).