This question evaluates a data scientist's competency in time-series demand forecasting, covering temporal target definition, temporal feature engineering, model family selection, evaluation metric alignment with business priorities, and handling data-quality issues like missing data, cold-starts, and distribution shifts.
You are working on a docked bike-sharing system. Build a model that predicts how many bikes will be checked out from a specific dock in the next hour.
Assume you have access to:
trips(trip_id, start_time, start_station_id, end_station_id, user_type)
station_status(station_id, ts, bikes_available, docks_available, capacity)
weather(ts, temperature, precipitation, wind_speed)
calendar(date, is_holiday, is_weekend, special_event)
Discuss: