This question evaluates a data scientist's competency in experimental design and causal inference for networked features, covering metrics selection (primary and guardrail metrics), strategies for mitigating interference via randomization or quasi-experimental approaches, exposure and contamination controls, rollout planning, and pre-registration and power analysis. Commonly asked in Analytics & Experimentation interviews for Data Scientist roles, it assesses reasoning about bias–variance trade-offs, engineering constraints, and validity diagnostics in interconnected user environments, requiring both conceptual understanding of identification and practical application of experiment implementation.

You are adding a Group Calls feature to an existing 1:1 calling app. Design a robust experiment to measure the feature's impact when users can influence each other (network interference is likely).
Assume a large, global user base and that group calls allow a user to create a group and invite multiple contacts. Some users might only join via invites. You may gate creation or visibility to mitigate contamination.
Address the following:
(a) Metrics
(b) Randomization Unit and Rollout
(c) If Cluster Randomization Is Not Feasible
(d) Exposure, Triggers, and Contamination Controls
(e) Pre-registration and Power
Login required