This question evaluates a data scientist's competency in statistical modeling, hypothesis testing, confounding control, and uncertainty quantification for observational data, with attention to issues like seasonality, heteroskedasticity, and time-aware validation.

You are analyzing a flight-level dataset to identify which factors most impact delays. Assume you have one row per flight with columns such as:
Propose a statistical approach to determine which factors most impact delays. Specifically:
Hints: Consider regression, ANOVA/Type II/III tests, confidence intervals, seasonality modeling, heteroskedasticity checks, and time-aware validation.
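One way the hinted steps could fit together is sketched below. It is a minimal illustration under stated assumptions, not a definitive solution: the data are synthetic, and every column name (`dep_delay_min`, `carrier`, `dep_hour`, `distance_mi`, `date`) is hypothetical, since the question's actual column list is not shown here. The sketch fits an OLS regression with harmonic seasonality terms, runs a Breusch-Pagan heteroskedasticity check, computes Type II ANOVA F-tests, reports HC3-robust confidence intervals, and scores the model on a chronological hold-out.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf
from statsmodels.stats.anova import anova_lm
from statsmodels.stats.diagnostic import het_breuschpagan

# Synthetic stand-in for the flight table; all column names are hypothetical.
rng = np.random.default_rng(0)
n = 4000
dates = pd.to_datetime("2022-01-01") + pd.to_timedelta(rng.integers(0, 730, n), unit="D")
df = pd.DataFrame({
    "date": dates,
    "carrier": rng.choice(["AA", "BB", "CC"], n),
    "dep_hour": rng.integers(5, 23, n),
    "distance_mi": rng.uniform(100, 2500, n),
})

# Seasonality as one annual harmonic, so the model can score dates it has not seen.
doy = df["date"].dt.dayofyear
df["season_sin"] = np.sin(2 * np.pi * doy / 365.25)
df["season_cos"] = np.cos(2 * np.pi * doy / 365.25)

# Simulated delays: noise grows with departure hour (heteroskedastic by construction).
carrier_effect = df["carrier"].map({"AA": 0.0, "BB": 4.0, "CC": 8.0}).to_numpy()
noise_sd = (5 + 0.8 * (df["dep_hour"] - 5)).to_numpy()
df["dep_delay_min"] = (
    5 + carrier_effect + 0.9 * (df["dep_hour"] - 5)
    + 6 * df["season_sin"] + rng.normal(0, noise_sd, n)
)

# Time-aware validation: train on the earliest 80% of flights, test on the rest.
df = df.sort_values("date").reset_index(drop=True)
split = int(0.8 * n)
train, test = df.iloc[:split], df.iloc[split:]

formula = "dep_delay_min ~ C(carrier) + dep_hour + distance_mi + season_sin + season_cos"
ols_fit = smf.ols(formula, data=train).fit()

# Breusch-Pagan test: a small p-value suggests non-constant residual variance.
bp_lm, bp_pvalue, _, _ = het_breuschpagan(ols_fit.resid, ols_fit.model.exog)

# Type II ANOVA: each factor tested after adjusting for the other main effects.
# These default F-tests assume constant error variance.
anova_table = anova_lm(ols_fit, typ=2)

# Refit with HC3 heteroskedasticity-robust standard errors for the intervals.
robust_fit = smf.ols(formula, data=train).fit(cov_type="HC3")
ci = robust_fit.conf_int()

# Out-of-time error on the chronological hold-out.
test_mae = float(np.mean(np.abs(test["dep_delay_min"] - robust_fit.predict(test))))

print(anova_table)
print(ci)
print(f"Breusch-Pagan p-value: {bp_pvalue:.3g}, hold-out MAE: {test_mae:.2f} min")
```

With observational data, the coefficients describe associations after adjusting for the included covariates, so the choice of controls matters more than the fitting routine. If the Breusch-Pagan test rejects, as it should for this simulated data, the plain F-tests are less trustworthy; `anova_lm` also accepts a `robust` argument for that case.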