This question evaluates proficiency in supervised machine learning model development and evaluation, including use of scikit-learn pipelines, GradientBoostingClassifier configuration, stratified 5-fold cross-validation, ROC-AUC performance measurement, and model serialization.
Assume the notebook already contains a prepared feature matrix X and a binary target y (0/1), with any necessary preprocessing completed. Your goal is to evaluate and train a final model.
Provide the full Python code that performs all steps above.
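One possible shape for such a solution is sketched below. The detailed step list is not part of this excerpt, so the hyperparameters, the random seeds, the pipeline step name `clf`, and the output filename `gb_model.joblib` are all illustrative assumptions, not values required by the question. The synthetic data from `make_classification` only stands in for the notebook's existing `X` and `y` so that the sketch runs on its own.

```python
import os
import tempfile

import joblib
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import Pipeline

# Stand-in data (assumption): in the notebook, X and y already exist,
# so this line would be dropped there.
X, y = make_classification(
    n_samples=300, n_features=10, n_informative=5, random_state=42
)

# Single-step pipeline; preprocessing is assumed to be done already.
# Hyperparameter values are illustrative, not prescribed by the question.
pipeline = Pipeline([
    ("clf", GradientBoostingClassifier(
        n_estimators=100,
        learning_rate=0.1,
        max_depth=3,
        random_state=42,
    )),
])

# Stratified 5-fold CV keeps the class ratio roughly equal in every fold.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)

# scoring="roc_auc" scores each fold on predicted probabilities.
scores = cross_val_score(pipeline, X, y, cv=cv, scoring="roc_auc")
print(f"ROC-AUC per fold: {np.round(scores, 4)}")
print(f"Mean ROC-AUC: {scores.mean():.4f} (std {scores.std():.4f})")

# Train the final model on all available data.
pipeline.fit(X, y)

# Serialize the fitted pipeline; path and filename are hypothetical.
model_path = os.path.join(tempfile.gettempdir(), "gb_model.joblib")
joblib.dump(pipeline, model_path)

# Reload to confirm the saved artifact is usable.
loaded = joblib.load(model_path)
```

Wrapping the classifier in a `Pipeline` even with a single step means any later preprocessing stage would be refit inside each fold, which avoids leaking validation data into training. The cross-validated scores estimate generalization; the final `fit` on the full data produces the model that is saved.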