Handle highly imbalanced classification data
Company: Google
Role: Data Scientist
Category: Machine Learning
Difficulty: Medium
Interview Round: Technical Screen
Quick Answer: This question evaluates a candidate's competency in handling highly imbalanced binary classification problems, including data splitting and leakage prevention, imbalance mitigation techniques, appropriate metric selection and threshold calibration, algorithm selection for scalability, robust validation, and deployment monitoring.