This question evaluates ML system design and applied modeling competencies, including data assumptions, preprocessing, feature engineering, model selection, validation strategies, temporal leakage handling, and end-to-end pipeline construction for a binary housing-affordability classifier and a time-series regression for wind-farm power prediction.
Given applicant and market data, design a binary classifier to predict whether an applicant can buy a house (labels: can buy, cannot buy). Specify the following:
You are provided three files: train.csv, test.csv, sample_submission.csv. The target column is power output in train.csv. For each record in test.csv, predict power output and produce a submissions.csv with exactly two columns and a header: id, power output.
Describe your end-to-end approach:
Login required