This question evaluates data-science competencies in online experiment design and analysis, including statistical power and sample-size estimation, variance-reduction methods, guardrail and SRM monitoring, randomization and contamination handling, sequential monitoring and stopping rules, heterogeneity/subgroup analysis, and result communication within the analytics & experimentation domain. It is commonly asked to assess practical application of statistical inference and experimental rigor in product and business contexts, primarily testing practical application while requiring conceptual understanding of underlying statistical principles and operational trade-offs.
Context: You will run a 2-arm online experiment on a pricing page. The primary metric is user-level paid conversion within 7 days of a user's first pricing-page view. Baseline conversion is 6.0%, the minimum detectable effect (MDE) is a +5% relative lift (to 6.3%), with α = 0.05 (two-sided) and power = 0.80. Daily traffic is ~200,000 unique users (60% new, 40% returning; 10% tablet, 30% desktop, 60% mobile). Some users switch devices; a bot filter is applied; a portion of traffic is from paid ads with different intent.
Tasks
Login required