This question evaluates experimental-design and product-analytics competencies—A/B testing, traffic allocation and ramping, metric definition and guardrails, statistical power and significance, and handling marketplace/auction interference—within the Analytics & Experimentation domain for a Data Scientist role.

You have trained a new ad-recommendation model and must decide whether it should replace the incumbent model that currently ranks/serves ads in a large-scale, auction-based ads system.
Design an experiment to evaluate the new model against the incumbent and answer:
Login required