This question evaluates understanding of statistical inference and experimental analysis, focusing on the limitations of p-values and the identification of complementary metrics and practices in A/B testing within the Statistics & Math domain.
Your team is running an A/B test for a new product feature and stakeholders rely on p-values to decide whether to ship.
Explain at least three key limitations of p-values in hypothesis testing and suggest complementary measures or approaches that mitigate these issues. Use A/B testing context and brief examples where helpful.
Consider topics such as: effect size, dependence on sample size, multiple testing, practical significance, prior beliefs, and p-hacking/data peeking.
Login required