Evaluate and monitor a credit risk model
Company: Capital One
Role: Data Scientist
Category: Machine Learning
Difficulty: hard
Interview Round: Technical Screen
You are shipping a credit-risk model (probability of default within 12 months) for a consumer lender with a 1% default rate. Misclassification costs: a false negative (approve a defaulter) costs $1,200; a false positive (decline a good applicant) costs $60. The regulator expects stability and explainability over pure accuracy. Pick the three most important evaluation priorities for this context and justify them. Then design an end-to-end plan covering: (1) offline evaluation (temporal cross-validation, class imbalance handling, choice of metrics such as AUC-PR, expected cost, calibration error, KS, and reason for each); (2) threshold selection to minimize expected cost subject to max 4% decline rate; (3) backtesting on out-of-time cohorts and stress periods; (4) a champion–challenger live test with guardrails; (5) production monitoring (data/label drift, calibration monitoring, PSI thresholds, stability by segment, and an alert/runbook). Be specific about calculations, acceptance criteria, and what you would do if calibration drifts while rank ordering remains stable.
Quick Answer: This question evaluates competency in designing, evaluating, and monitoring cost-sensitive consumer credit probability-of-default (PD) models, emphasizing calibration, rank ordering, threshold selection, backtesting, champion–challenger experimentation, and production monitoring under regulatory stability and explainability constraints.