This question evaluates incident leadership, operational decision-making, and data-driven troubleshooting skills for a data scientist, focusing on triage, hypothesis formation, mitigation choice, cross-functional communication, and recurrence prevention.
Since 00:00 today, the share of incorrect payments for users aged 18–24 in CA has risen from a historical 2.0% to 3.1% across approximately 120,000 transactions. You are the on-call lead on a payment-accuracy team with access to:
Finance is escalating and requests a mitigation within 24 hours.
Assume the system uses a real-time ML risk score to route transactions (approve vs send to review/decline). "Incorrect payment" is measured by a near-real-time audit/labeling pipeline.
Describe, in a structured, time-boxed plan, exactly how you would:
Be explicit about risks you will accept vs avoid and how you would measure success by end of day.
Login required