This question evaluates competency in clustered randomized experiment analysis, including calculation of design effect and effective sample size, cluster-robust inference for differences in proportions, sequential alpha spending (O’Brien–Fleming-style) and comparisons with Bonferroni, Holm–Bonferroni adjustments for multiple guardrail metrics, and Bayesian ROPE interpretation. It is in the Statistics & Math domain and is commonly asked to probe how candidates handle intra-cluster correlation, control Type I error across interim looks and multiple metrics, and demonstrate both conceptual understanding and practical application of power, duration, and multiplicity trade-offs.
Context: A creator-level (cluster) randomized experiment evaluates a tipping UI. Creators are clusters; viewers are units within clusters. The outcome is a binary purchase at the viewer-session level.
Given:
Tasks:
Login required