Define and apply Gmail user segments
Company: Google
Role: Data Scientist
Category: Analytics & Experimentation
Difficulty: hard
Interview Round: Technical Screen
##### Question
Gmail wants to create actionable user segments to drive both product improvements and marketing/lifecycle outcomes. Propose a segmentation scheme and show how it would be built, validated, used, and measured.
1. **Propose actionable segmentation axes.** Define a small set of segments as rules you can compute from logs (e.g., daily active senders, newsletter-heavy consumers, attachment-heavy users, mobile-only users, power users, workplace vs. personal domains, frequent spam reporters, lifecycle/recency stage, engagement depth, mailbox state, storage utilization, feature adoption, and security posture). For each, give a concrete, computable definition.
2. **Validate stability and separability over ~8 weeks.** Show how you would confirm that segments are stable week-over-week and meaningfully distinct from one another.
3. **Choose three segments and design targeted interventions.** For each chosen segment, propose specific product and/or lifecycle-marketing interventions (e.g., Smart Compose/autocomplete tuning, faster attachment upload, bulk-unsubscribe UX, inbox cleanup, storage upsell, security checkup), with primary success metrics and guardrails.
4. **Design the experiment to measure lift vs. the global average.** Cover assignment/randomization, analysis of heterogeneous effects, and how you handle small segments via k-anonymity thresholds and differential-privacy noise to mitigate privacy and fairness risks.
5. **Connect segments to lifecycle messaging and roadmap prioritization.** Explain how the segments inform onboarding/activation/win-back messaging and how you would prioritize the resulting feature/intervention backlog.
##### Constraints
Use only behavioral metadata (sessions, sends, opens, searches, label/filter edits), device, settings, and aggregate mailbox state. Do **not** use email content, and do not infer sensitive attributes; respect opt-outs and admin policies for managed (Workspace/education) accounts.
Quick Answer: A Google data scientist analytics screen: propose actionable Gmail user segments from behavioral metadata, validate their stability and separability over eight weeks, design targeted interventions with metrics and guardrails, and run a privacy-aware experiment to measure per-segment lift. Covers segmentation axes, validation metrics, k-anonymity and differential privacy, CUPED, and RICE roadmap prioritization.