Improve Model Generalization with Cross-Validation and Feature Engineering
Company: Boston Consulting Group
Role: Data Scientist
Category: Machine Learning
Difficulty: Medium
Interview Round: Take-home Project
Quick Answer: This question evaluates a data scientist's practical competence in supervised machine learning. It covers stratified train/test splitting, reproducible preprocessing pipelines that standardize numeric features and robustly encode categorical features, training gradient-boosted models, and measuring how well the model separates the classes using ROC AUC.
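
Three of the steps named above (stratified splitting, standardization fitted on training data only, and ROC AUC) can be sketched in plain Python to show what each one computes. This is a minimal illustration rather than a reference solution: the function names `stratified_split`, `fit_standardizer`, and `roc_auc` are hypothetical, the toy data is made up for the example, and model training is left out. In a real take-home a library would normally handle these steps.

```python
import random
from collections import defaultdict


def stratified_split(labels, test_fraction=0.25, seed=0):
    """Return (train_idx, test_idx) so each class is split in the same proportion."""
    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)
    train_idx, test_idx = [], []
    for y in sorted(by_class):
        idx = by_class[y]
        rng.shuffle(idx)
        n_test = round(len(idx) * test_fraction)
        test_idx.extend(idx[:n_test])
        train_idx.extend(idx[n_test:])
    return sorted(train_idx), sorted(test_idx)


def fit_standardizer(train_values):
    """Learn mean and standard deviation from training values only."""
    mean = sum(train_values) / len(train_values)
    var = sum((v - mean) ** 2 for v in train_values) / len(train_values)
    std = var ** 0.5 or 1.0  # fall back to 1.0 for a constant column
    return lambda v: (v - mean) / std


def roc_auc(y_true, scores):
    """Probability that a random positive outscores a random negative (ties count 0.5)."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("ROC AUC needs at least one positive and one negative")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data (made up for illustration): 8 negatives and 4 positives.
labels = [0] * 8 + [1] * 4
train_idx, test_idx = stratified_split(labels, test_fraction=0.25, seed=0)

# The scaler sees only training rows, so no test-set statistics leak into it.
feature = [float(i) for i in range(len(labels))]
scale = fit_standardizer([feature[i] for i in train_idx])
test_scaled = [scale(feature[i]) for i in test_idx]

auc = roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
```

Fitting the scaler on the training indices only is the design point worth noting: statistics computed on the full dataset would leak information from the test set into preprocessing and inflate the reported ROC AUC.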