You work as a data scientist for a ride-hailing marketplace. The company wants to launch a new pricing model that may change the price shown to riders and the earnings or trip value shown to drivers in the request prompt.
Design an experiment to evaluate whether the new pricing model should be launched.
Address the following questions:
-
Before designing the experiment, how would you reason through the driver's mental state when receiving a trip request prompt?
-
What primary metrics and guardrail metrics would you use for riders, drivers, and the marketplace?
-
If the experiment is run in only one city using ordinary treatment and control groups, what problems can arise from marketplace network effects or interference?
-
How would you design a switchback experiment for one city?
-
How would you handle limited sample size in a switchback test?
-
What are the tradeoffs of using full-day switchback windows in one city?
-
What are the tradeoffs between larger and smaller treatment groups?
-
What are the tradeoffs between longer and shorter treatment windows?
-
If the company later rolls the test out to 12 cities, how would you increase sample size and improve the experimental design?