Cvs Health Data Scientist Interview Questions
Preparing for CVS Health Data Scientist interview questions means getting ready for interviews that blend rigorous technical screening with heavy emphasis on healthcare impact. Expect your SQL and Python skills (pandas, data-wrangling, window functions and CTEs) and core ML/statistics knowledge to be tested alongside your ability to translate models into measurable patient- or cost-outcome improvements. Interviewers typically evaluate coding accuracy, experimental-design intuition, model evaluation and fairness, and how you communicate findings to clinical, product, and business partners. For effective interview preparation focus on three things: sharpen hands-on skills with timed SQL and pandas exercises, rehearse end-to-end ML case studies (feature engineering, validation, deployment considerations), and prepare concise STAR stories that show ownership and cross-functional influence in regulated settings. Be ready for a recruiter screen, one or more technical rounds (live coding or take-home), a case-style or A/B testing conversation, and behavioral/hiring-manager interviews. Tailor examples to pharmacy, claims, or care-management contexts and practice explaining trade-offs and privacy/regulatory implications clearly.
Handle challenges in MMM/MMX
MMM Fragility Diagnosis and Remediation Plan (Weekly, 156 Weeks) Context You inherit a weekly Marketing Mix Model (MMM/MMX) with 156 weeks of data. Th...
Design classification under missingness and imbalance
30-Day Readmission Classifier: End-to-End Plan Context: You are building a binary classifier to predict 30-day readmission using claims and EHR featur...
Use pandas to aggregate, pivot, and label
Given two pandas DataFrames, write code to: (1) merge and aggregate revenue; (2) produce a 2x2 pivot; (3) compute per-state counts with value_counts, ...
Compute A/B significance, CI, and power
You run an A/B test for 7 days. Control A: 520 conversions out of 10,000 sessions. Variant B: 630 conversions out of 11,500 sessions. Use a calculator...
Diagnose a failing campaign
Vaccination-Lift Experiment Triage Plan Context: You ran a randomized CRM experiment (e.g., email/SMS/push) to increase vaccination uptake. The overal...
Explain p-value and choose correct test
Technical Screen: P-values and Robust Two-Sample/Paired Tests Context: You are a data scientist evaluating healthcare interventions. Answer in clear, ...
Lead structured response to accuracy incident
Incident Response Plan: Spike in Incorrect Payments for CA Users Aged 18–24 Scenario Since 00:00 today, the share of incorrect payments for users aged...
Tune classifier and compute key metrics
Payment Error Classifier — Evaluation, Thresholding, and Cost-Sensitive Design Context You built a binary classifier to flag incorrect payments (posit...
Implement R² and Compare PCA With/Without Scaling
NumPy-only implementation: R² and PCA (Data Scientist take-home) Implement from scratch using only NumPy (no scikit-learn). Use float64 throughout and...
Calculate CI and Test Correlation Under Normality
Inference on a Mean, Significance of a Correlation, and a Normal Quantile Assume standard parametric conditions (normality as stated). Show formulas, ...
Assess Work Authorization and Professional Experience for Job Change
Initial HR Phone Screen — Behavioral Questions (Data Scientist) Context You are in an initial HR/phone screen for a Data Scientist role. The goal is t...
Explain Causal-Inference Techniques in Your Machine Learning Project
Technical Deep-Dive: ML Project With Causal Inference Prompt Walk me through one machine-learning project you led and explain any causal-inference tec...
Build a leak-free sklearn churn pipeline
Take‑Home ML Task: Reproducible Subscription Classification Pipeline You are given a daily user-level dataset and must build a reproducible Python (sc...
Create and query an e-commerce schema
PostgreSQL only. 1) Create these tables with appropriate types and constraints (choose minimal correct types): products(product_id PK, name NOT NULL, ...
Explain your top strengths concretely
Behavioral Prompt: Strengths, STAR Evidence, and 90-Day Application Instructions State your top one or two strengths most relevant to a Senior Data Sc...
Launch and measure a TV campaign
6-Week Linear TV Experiment to Increase Flu Vaccinations Design a 6-week linear TV campaign and its measurement plan to causally estimate incremental ...
Design an email flu-shot experiment
End-to-End Experiment Design: Email Campaign to Increase Verified Flu Vaccinations Context: You are designing a 30-day email campaign to increase veri...
Aggregate radiology spend and derive fiscal month
Using Python/pandas, complete the tasks below. Assume the following CSV-like input, where service_dt is object dtype and amounts may be negative (adju...
Compute age-band spend and YoY in Georgia
Using ANSI SQL only, answer the three parts below on the following schema and samples. Assume dates are YYYY-MM-DD, paid_amt can be negative (adjustme...
Test payment-accuracy lift with p-value and power
AB Test of Payment Accuracy (Two Proportions) Assume transactions in the two periods are independent and large enough for normal approximations. Defin...