Design an email flu-shot experiment
Company: CVS Health
Role: Data Scientist
Category: Analytics & Experimentation
Difficulty: hard
Interview Round: Technical Screen
Design an end-to-end email campaign to increase verified flu vaccinations within 30 days among eligible members. Specify (a) targeting and exclusion rules, (b) the randomization unit (individual vs household vs clinic) and how you will limit spillover/interference, (c) holdout structure (A/A, A/B, multi-arm) and seeding, (d) primary and guardrail metrics and how you will obtain ground-truth outcomes (e.g., claims/EHR, pharmacy feeds) and handle missing outcomes when shots occur outside our network, (e) power analysis: estimate required sample size if baseline vaccination is 2.0% over the window and you expect a 20% relative uplift; use alpha=0.05 and power=0.8 with option for unequal allocation, (f) cadence/frequency capping and multivariate subject-line/creative testing without contaminating the main test, (g) seasonality/holiday controls and overlapping campaigns, (h) compliance/privacy constraints (opt-outs, HIPAA) and their analytical implications, and (i) the analysis plan (intention-to-treat vs per-protocol), heterogeneity reads, and a backstop causal method if randomization is partially broken.
Quick Answer: This question evaluates end-to-end experiment design and causal inference skills for a Data Scientist, including A/B testing and randomization choices, interference control, metric definition and power/sample-size calculation, outcome sourcing from claims/EHR/pharmacy feeds, cadence/MVT design, seasonality controls, and compliance/privacy considerations. It is commonly asked in analytics and experimentation interviews because it tests both practical application and conceptual understanding of field experiment planning, real-world data challenges (missing outcomes, claim lag), and analysis decisions (ITT vs per-protocol, multiple testing) within the Analytics & Experimentation domain.