This question evaluates a data scientist's competency in metric design, causal experimentation, instrumentation, segmentation, and trade-off analysis between short-term engagement and long-term retention, as well as reasoning about threats to validity like interference and selection bias.

You are a Data Scientist partnering with a product team at Facebook/Meta that owns push/in-app notifications.
The team’s goal is to send fewer but higher-quality notifications—i.e., notifications that users find relevant and that improve long-term product outcomes—while avoiding spammy experiences.
Assume you can instrument notification events (send, deliver, open/click, dismiss, mute/disable notifications, uninstall), downstream engagement (sessions, time spent, content actions), and longer-term outcomes (retention). You also have user/device attributes and can join events by user_id and time.
The team wants to test a new feature: geographic/nearby-event notifications (e.g., “A concert is happening near you tonight”).
Provide your reasoning and any assumptions you need to make.
Login required