Design real-time grid ETA for drivers
Company: Uber
Role: Machine Learning Engineer
Category: ML System Design
Difficulty: hard
Interview Round: Onsite
Design a real-time system to maintain the remaining ETA for every driver within each cell of a city grid. You may only use the data provided in the prompt (no manual features or external data). Describe, step by step, how you would:
(
1) ingest and partition GPS pings and trip events;
(
2) denoise GPS signals and correct drift to robustly localize drivers within grid cells;
(
3) compute and continuously update each driver's remaining ETA;
(
4) select and train a model suitable for online serving;
(
5) perform online feature computation strictly from the approved data;
(
6) meet low-latency, high-throughput SLAs;
(
7) monitor, alert, and rollback models; and
(
8) evaluate offline and online. Specify concrete algorithms for GPS error reduction (e.g., filtering, map matching), data schemas, state stores and keys, windowing strategy, update frequency, failure handling, and your validation methodology.
Quick Answer: This question evaluates a candidate's competency in real-time ML system design, covering streaming data ingestion, GPS noise handling and denoising, online feature computation, low-latency model serving, and operational monitoring.