This question evaluates a data scientist's skills in building reproducible machine learning pipelines for imbalanced binary classification, covering temporal splitting to avoid leakage, class imbalance handling, feature preprocessing, probability calibration, threshold selection, and monitoring for calibration drift.
You are given an event‑level dataset for a binary classification problem with severe class imbalance (positive rate ≈ 1%). The goal is to build a reproducible modeling pipeline, evaluate it with appropriate metrics, and propose a principled operating point for production.
(a) Build a reproducible training pipeline that:
(b) Report on the test split:
(c) Describe how to choose the operating point for a production system with a hard requirement of at most 2 false positives per 1,000 predictions.
(d) Discuss how calibration might drift over time and one technique to monitor and re‑calibrate without label leakage.
Login required