How to Choose the Right Statistical Test: A Decision Flowchart for Every Common Scenario
The most practical guide in statistics: given your data type and research question, which test do you use? A decision flowchart covering t-tests, ANOVA, chi-square, correlation, regression, and non-parametric alternatives โ with the criteria for choosing each one.
What You'll Learn
- โIdentify the correct statistical test based on the number of variables, data type, and research question
- โDistinguish between parametric tests (require normality) and non-parametric alternatives
- โApply the decision flowchart to common research scenarios in homework and exam questions
- โRecognize when a test's assumptions are violated and select the appropriate alternative
1. The Direct Answer: 3 Questions That Determine the Test
You can identify the correct statistical test by answering three questions about your data: **Question 1: What is your research question?** Are you comparing groups (is there a difference?), testing a relationship (are these variables associated?), or predicting an outcome (can X predict Y)? **Question 2: What type of data do you have?** Continuous/numerical (height, weight, test scores, income), categorical/nominal (gender, treatment group, yes/no), or ordinal (Likert scales, rankings)? **Question 3: How many groups or variables?** One group vs a known value, two groups, three+ groups, or two continuous variables? The answers map directly to specific tests: - Comparing 2 group means (continuous data): independent samples t-test - Comparing 3+ group means (continuous data): one-way ANOVA - Comparing 2 related/paired means: paired t-test - Testing association between 2 categorical variables: chi-square test of independence - Testing relationship between 2 continuous variables: Pearson correlation - Predicting a continuous outcome from one or more predictors: linear regression - Predicting a categorical outcome (yes/no): logistic regression Snap a photo of any stats problem and StatsIQ identifies the correct test, explains why that test applies, and walks through the solution step by step.
Key Points
- โขThree questions determine the test: research question type, data type, and number of groups/variables
- โขComparing means: t-test (2 groups) or ANOVA (3+ groups). Testing associations: chi-square (categorical) or correlation (continuous).
- โขPredicting outcomes: regression (linear for continuous, logistic for categorical)
- โขThis flowchart covers ~90% of introductory statistics problems
2. The Complete Decision Flowchart
Here is the full flowchart. Start at the top and follow the branches. **Branch 1: Comparing group means (continuous dependent variable)** โ How many groups? 2 groups โ Are they independent or paired? Independent โ Independent t-test. Paired/matched โ Paired t-test. 3+ groups โ One-way ANOVA. If significant, follow with post-hoc tests (Tukey HSD) to determine which groups differ. 2+ groups with 2+ factors โ Two-way (factorial) ANOVA. **Branch 2: Testing association between categorical variables** โ Both variables categorical? Yes โ Chi-square test of independence (or Fisher's exact test if any expected cell count < 5). One categorical, one continuous โ go to Branch 1 (the categorical variable defines your groups). **Branch 3: Testing relationship between continuous variables** โ 2 continuous variables, testing strength of linear relationship โ Pearson correlation (r). Testing whether one variable predicts another โ Simple linear regression. Multiple predictors โ Multiple regression. Non-linear relationship โ consider transformation or non-parametric (Spearman rank correlation). **Branch 4: Predicting a categorical outcome** โ Outcome is binary (yes/no, pass/fail) โ Logistic regression. Outcome has 3+ categories โ Multinomial logistic regression. **Branch 5: Non-parametric alternatives (when normality is violated)** โ Instead of independent t-test โ Mann-Whitney U test. Instead of paired t-test โ Wilcoxon signed-rank test. Instead of one-way ANOVA โ Kruskal-Wallis test. Instead of Pearson correlation โ Spearman rank correlation. StatsIQ identifies which branch your problem falls on and selects the correct test automatically โ it even checks whether parametric assumptions are met and recommends the non-parametric alternative when needed.
Key Points
- โขBranch 1: comparing means โ t-test (2 groups) or ANOVA (3+ groups)
- โขBranch 2: categorical association โ chi-square. Branch 3: continuous relationship โ correlation or regression.
- โขBranch 4: predicting binary outcome โ logistic regression
- โขBranch 5: non-parametric alternatives for when normality assumption fails
3. When to Use Non-Parametric Tests: Checking Assumptions
Parametric tests (t-test, ANOVA, Pearson correlation, linear regression) assume the data is approximately normally distributed and has equal variances across groups. When these assumptions are violated, the test may produce unreliable p-values and misleading conclusions. How to check normality: visual inspection (histogram should be roughly bell-shaped, Q-Q plot points should fall near the diagonal line) and formal tests (Shapiro-Wilk test โ if p < 0.05, normality is rejected). In practice, parametric tests are robust to mild violations of normality, especially with larger samples (n > 30 per group). The Central Limit Theorem means that sample means are approximately normal even when the underlying data is not โ so t-tests and ANOVA remain valid with mild skew if sample sizes are adequate. When to switch to non-parametric: the data is severely skewed or has extreme outliers that cannot be justified, sample sizes are small (n < 15 per group) AND the data is non-normal, the data is ordinal (rankings, Likert scales) rather than truly continuous, or the variances are dramatically unequal across groups (Levene's test p < 0.05). The trade-off: non-parametric tests make fewer assumptions but have less statistical power โ they are less likely to detect a real effect. This means a non-parametric test might give you p = 0.08 (not significant) where the parametric equivalent would give p = 0.03 (significant). Only switch to non-parametric when the parametric assumptions are clearly violated, not just because you are unsure. StatsIQ checks normality and equal variances for you when solving problems โ if assumptions are violated, it recommends the appropriate non-parametric alternative and explains why.
Key Points
- โขCheck normality: histogram shape, Q-Q plot, Shapiro-Wilk test. Mild violations are OK with n > 30.
- โขSwitch to non-parametric when: severe skew, small n + non-normal, ordinal data, or dramatically unequal variances
- โขNon-parametric tests have less power โ only use when parametric assumptions are clearly violated
- โขMann-Whitney replaces t-test, Kruskal-Wallis replaces ANOVA, Spearman replaces Pearson
4. Applying the Flowchart: 5 Practice Scenarios
Scenario 1: A researcher wants to know if a new medication reduces blood pressure more than a placebo. 50 patients are randomly assigned to medication (n=25) or placebo (n=25). Blood pressure (mmHg) is measured after 8 weeks. โ Comparing 2 independent group means with a continuous DV โ Independent t-test. Scenario 2: A survey asks 200 people their political party (Democrat, Republican, Independent) and whether they support a specific policy (yes/no). Is there an association between party and policy support? โ Two categorical variables โ Chi-square test of independence. Scenario 3: A professor wants to know if study hours predict exam scores. She measures hours studied and exam score (0-100) for 40 students. โ Predicting a continuous outcome from a continuous predictor โ Simple linear regression (or Pearson correlation if she only wants the strength of association). Scenario 4: Patients rate their pain before and after physical therapy on a 1-10 scale. Is there a significant reduction? โ Two related measurements (same patients, before/after) with ordinal data โ Wilcoxon signed-rank test (non-parametric alternative to paired t-test, because Likert pain scales are ordinal, not truly continuous). Scenario 5: Three different teaching methods are compared. Students are randomly assigned to Method A (n=20), Method B (n=20), or Method C (n=20). Final exam scores are compared. โ Comparing 3 independent group means โ One-way ANOVA. If significant, follow with Tukey HSD to determine which methods differ. The pattern: identify the research question type, check the data type, count the groups/variables, verify assumptions, and the test chooses itself. StatsIQ applies this exact logic โ snap a photo of any scenario and it walks through each decision point.
Key Points
- โข2 independent groups, continuous DV โ independent t-test
- โข2 categorical variables โ chi-square. Continuous predictor/outcome โ regression.
- โขBefore/after on same subjects โ paired t-test (or Wilcoxon if ordinal)
- โข3+ groups โ ANOVA, then post-hoc (Tukey) if significant
Key Takeaways
- โ 3 questions choose the test: research question type + data type + number of groups/variables
- โ Independent t-test: 2 independent groups, continuous DV. Paired t-test: same subjects measured twice.
- โ Chi-square: 2 categorical variables. Pearson: 2 continuous variables. Regression: predicting outcomes.
- โ Non-parametric alternatives: Mann-Whitney (t-test), Kruskal-Wallis (ANOVA), Spearman (Pearson), Wilcoxon (paired t)
- โ Parametric tests are robust to mild normality violations with n > 30 โ do not switch to non-parametric unnecessarily
Practice Questions
1. A researcher measures anxiety scores (continuous, normally distributed) in three groups: therapy only (n=30), medication only (n=30), and therapy+medication (n=30). Which test should be used?
2. A survey of 500 people records gender (male/female) and preference for coffee vs tea. Is there an association? Which test?
FAQs
Common questions about this topic
First check how severe the violation is. Mild skew with n > 30 per group is usually fine โ parametric tests are robust. For severe violations (bimodal distributions, extreme outliers, very small samples with clear non-normality), use the non-parametric equivalent: Mann-Whitney for t-test, Kruskal-Wallis for ANOVA, Spearman for Pearson. For ordinal data (Likert scales, rankings), non-parametric tests are always preferred.
Yes. Snap a photo of any statistics problem or scenario and StatsIQ identifies the research question type, data type, and number of groups โ then selects the appropriate test, checks assumptions, and solves the problem step by step. If assumptions are violated, it recommends the non-parametric alternative.