This question evaluates competencies in data quality assessment, feature engineering and selection, modeling choice between classification and regression, diagnosing multicollinearity (including variance inflation factors), and experimental design for model validation in an applied flight-delay prediction context.

You have historical flight operations and weather data and need to build a model that predicts whether a flight will be delayed (e.g., more than 15 minutes late) at departure or arrival.
Assume you have tables such as: Flights (schedule, actuals, carrier, route), Weather (station, time, conditions), Airports (metadata), and possibly Air Traffic Control (ATC) constraints.
Hints: Mention imputation, data validation, one-hot encoding, feature selection, regularization, variance inflation factors, and A/B or switchback tests.
Login required