Create A/B tests by chatting with AI and launch them on your website within minutes.

Try it for FREE now

Null Hypothesis Significance Testing

Quick answer

Null Hypothesis Significance Testing (NHST) is a statistical method used to determine whether observed differences between test variations are statistically significant or likely due to random chance. It involves testing a null hypothesis that assumes no difference exists between variations against an alternative hypothesis that a difference does exist.

Key takeaways

  • Null Hypothesis Significance Testing helps evaluate whether an experiment result is reliable enough to act on.
  • It should be reviewed together with sample size, duration, effect size, and business impact.
  • It is most useful when the hypothesis and primary metric are defined before the test starts.

Definition

Null Hypothesis Significance Testing (NHST) is a statistical method used to determine whether observed differences between test variations are statistically significant or likely due to random chance. It involves testing a null hypothesis that assumes no difference exists between variations against an alternative hypothesis that a difference does exist.

What Null Hypothesis Significance Testing means in A/B testing

In A/B testing, NHST begins with the assumption that both variants perform identically (the null hypothesis). Statistical tests calculate the probability (p-value) of observing the measured difference if the null hypothesis were true. If this probability falls below a predetermined threshold (typically 0.05), the null hypothesis is rejected, suggesting a statistically significant difference exists.

Why Null Hypothesis Significance Testing matters

NHST provides the mathematical foundation for determining whether A/B test results are reliable or merely random fluctuations. It helps prevent false conclusions by quantifying the likelihood that observed performance differences are genuine, enabling data-driven decisions about which variation to implement. Without NHST, experimenters risk making costly changes based on statistical noise.

Example of Null Hypothesis Significance Testing

If Variation B shows a 3% conversion rate increase over Control A, NHST helps determine whether this 3% lift is a real improvement or could have occurred by chance. A p-value of 0.02 would indicate only a 2% probability the difference is random, supporting the conclusion that Variation B truly performs better.

How to use Null Hypothesis Significance Testing

Use Null Hypothesis Significance Testing after you have chosen a primary metric and collected enough traffic for a reliable read. Avoid checking it in isolation; compare it with effect size, confidence, practical impact, and whether the test ran long enough to cover normal traffic patterns.

Common mistake

A common mistake is treating Null Hypothesis Significance Testing as a yes-or-no shortcut while ignoring sample size, test duration, and practical business impact. A statistically interesting result can still be too small, too noisy, or too risky to ship.

Related A/B testing terms

FAQ

What does null hypothesis significance testing mean in A/B testing?

Null Hypothesis Significance Testing (NHST) is a statistical method used to determine whether observed differences between test variations are statistically significant or likely due to random chance. It involves testing a null hypothesis that assumes no difference exists between variations against an alternative hypothesis that a difference does exist.

Why does null hypothesis significance testing matter for experiments?

NHST provides the mathematical foundation for determining whether A/B test results are reliable or merely random fluctuations. It helps prevent false conclusions by quantifying the likelihood that observed performance differences are genuine, enabling data-driven decisions about which variation to implement. Without NHST, experimenters risk making costly changes based on statistical noise.

How should teams use null hypothesis significance testing in an experiment?

Use Null Hypothesis Significance Testing after you have chosen a primary metric and collected enough traffic for a reliable read. Avoid checking it in isolation; compare it with effect size, confidence, practical impact, and whether the test ran long enough to cover normal traffic patterns.

Download our free 100 point Ecommerce CRO Checklist

This comprehensive checklist covers all critical pages, from homepage to checkout, giving you actionable steps to boost sales and revenue.