This question evaluates a data scientist's competencies in online experiment design, causal inference, metric selection, assessment of statistical and practical significance, identification and handling of heterogeneous treatment effects, data-quality and marketplace interference mitigation, and ethical rollout decision-making.

You operate an ads platform with an existing recommender/ranking model. Engineers built a new ML ranker that is hypothesized to improve outcomes. You need to run an online controlled experiment (A/B) to validate performance, choose appropriate metrics, assess significance, and make a rollout decision, including handling heterogeneous effects and ethics.
Login required