This question evaluates a data scientist's competency in cost-sensitive threshold tuning, calibrated probability modeling, and operational decision-making for fraud triage, including formulating expected costs under review capacity and monetary trade-offs.

You have a fraud model that outputs a calibrated score s ∈ [0, 1] per account, where s ≈ P(fake | features). Each day you must triage 2,000,000 accounts into one of three actions:
Constraints and costs:
Assumption: Manual review does not incorrectly block real users (false blocks via review are negligible compared to auto-block false positives). If this is not true in your system, add that cost explicitly.
(a) Formulate the expected daily cost as a function of thresholds r and t given calibrated scores. Describe how to estimate it from historical labeled data using isotonic or Platt calibration and empirical score distributions.
(b) Optimize r and t under the review budget. Explain how you would choose the operating point on the precision–recall (PR) curve and verify the choice with an online interleaved test.
(c) Describe a drift monitoring plan and a weekly threshold re-tuning process with backtesting and safety rails.
Login required