How do I approach Statistics & Math interview questions?

Statistics & Math questions require understanding of core concepts and practice. PracHub provides solutions with explanations to help you master statistics & math interviews.

What difficulty level is this interview question?

This is a medium difficulty Statistics & Math question, commonly asked during Take-home Project rounds at Waymo.

What role is this question designed for?

This question is commonly asked for Data Scientist candidates at Waymo during technical interviews.

Assess Routing Experiment Validity | Waymo Interview Question

Q: Assess Routing Experiment Validity

This question evaluates experimental analysis and statistical hypothesis testing skills, including data preparation and joining, interpretation of p-values, and understanding of causal inference and threats to validity in A/B tests.

A ride-hailing team runs an A/B test in San Francisco in July 2024 for a new routing algorithm intended to reduce time to pickup, abbreviated TTP.

You are given two pandas DataFrames:

df_users:

user_id : unique user identifier
variant : experiment assignment, either control or treatment

df_rides:

ride_id : unique ride identifier
user_id : user identifier
ride_date : ride timestamp or date
city : ride city
time_to_pickup : numeric TTP in minutes

Tasks:

Write Python code to filter to San Francisco rides in July 2024, join ride records to experiment assignments, run a Welch two-sample t-test comparing treatment versus control TTP, and return the p-value.
Given the returned p-value, how would you decide whether the result is statistically significant?
Is a statistically significant p-value conclusive proof that the new routing algorithm is better? If not, explain the main threats to validity.
Propose a stronger experiment design and analysis plan for this routing algorithm.

A ride-hailing team runs an A/B test in San Francisco in July 2024 for a new routing algorithm intended to reduce time to pickup, abbreviated TTP.

You are given two pandas DataFrames:

df_users:

user_id : unique user identifier
variant : experiment assignment, either control or treatment

df_rides:

ride_id : unique ride identifier
user_id : user identifier
ride_date : ride timestamp or date
city : ride city
time_to_pickup : numeric TTP in minutes

Tasks:

Write Python code to filter to San Francisco rides in July 2024, join ride records to experiment assignments, run a Welch two-sample t-test comparing treatment versus control TTP, and return the p-value.
Given the returned p-value, how would you decide whether the result is statistically significant?
Is a statistically significant p-value conclusive proof that the new routing algorithm is better? If not, explain the main threats to validity.
Propose a stronger experiment design and analysis plan for this routing algorithm.

Assess Routing Experiment Validity

Quick Overview

Solution

Submit Your Answer to Earn 20XP

Assess Routing Experiment Validity

Quick Overview

Solution

Submit Your Answer to Earn 20XP