Claude-skill-registry hypothesis-test

Guide selection and interpretation of statistical hypothesis tests. Use when: (1) Choosing appropriate test for research data, (2) Checking assumptions before analysis, (3) Interpreting test results correctly, (4) Reporting statistical findings, (5) Troubleshooting assumption violations.

install

source · Clone the upstream repo

git clone https://github.com/majiayu000/claude-skill-registry

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/majiayu000/claude-skill-registry "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/data/hypothesis-test" ~/.claude/skills/majiayu000-claude-skill-registry-hypothesis-test && rm -rf "$T"

manifest: skills/data/hypothesis-test/SKILL.md

Hypothesis Testing Skill

Purpose

Guide appropriate selection and interpretation of statistical hypothesis tests for research data analysis.

Test Selection Decision Tree

Step 1: How many variables?

One variable:

Categorical → Chi-square goodness of fit
Continuous → One-sample t-test

Two variables:

Both categorical → Chi-square test of independence
One categorical, one continuous → T-test or ANOVA
Both continuous → Correlation or regression

Three+ variables:

Multiple predictors → Multiple regression or ANOVA
Complex designs → Mixed models or advanced methods

Step 2: Check assumptions

For t-tests:

Independence of observations
Normality (especially for small N)
Homogeneity of variance

Violations?

Non-normal → Mann-Whitney U (non-parametric)
Unequal variance → Welch's t-test
Dependent observations → Paired t-test or mixed models

For ANOVA:

Independence
Normality
Homogeneity of variance
No outliers

Violations?

Non-normal → Kruskal-Wallis test
Unequal variance → Welch's ANOVA
Outliers → Robust methods or transformation

Step 3: Interpret results

Always report:

Test statistic (t, F, χ²)
Degrees of freedom
p-value
Effect size with CI
Descriptive statistics

Example:

Independent samples t-test showed a significant difference between 
groups, t(98) = 3.45, p < .001, d = 0.69, 95% CI [0.29, 1.09]. 
The experimental group (M = 45.2, SD = 8.3) scored higher than 
control (M = 37.8, SD = 9.1).

Common Tests Reference

Research Question	Test	Assumptions
2 groups, continuous outcome	Independent t-test	Normality, equal variance
2 measurements, same people	Paired t-test	Normality of differences
3+ groups, one factor	One-way ANOVA	Normality, homogeneity
3+ groups, multiple factors	Factorial ANOVA	Normality, homogeneity
Relationship between variables	Pearson correlation	Linearity, normality
Predict continuous outcome	Linear regression	Linearity, normality of residuals
2 categorical variables	Chi-square test	Expected frequencies ≥5
Ordinal data, 2 groups	Mann-Whitney U	None (non-parametric)
Ordinal data, paired	Wilcoxon signed-rank	None (non-parametric)

Assumption Checking

Normality

Visual: Q-Q plot, histogram
Statistical: Shapiro-Wilk test (N < 50), Kolmogorov-Smirnov (N ≥ 50)
Guideline: Robust to moderate violations if N ≥ 30

Homogeneity of Variance

Visual: Box plots, residual plots
Statistical: Levene's test, Bartlett's test
Guideline: Ratio of largest/smallest variance < 4

Independence

Check: Research design, data collection
Red flags: Time series, clustered data, repeated measures
Solution: Use appropriate model (mixed effects, GEE)

Integration

Use with data-analyst agent for complete statistical analysis workflow and experiment-designer agent for planning appropriate analyses.

Version: 1.0.0