This question evaluates model selection and diagnostic skills in supervised learning, specifically assessing feature engineering, interaction detection, handling heteroskedastic residuals, incorporation of monotonicity or interaction constraints in tree-based models, and fair cross-validation-based comparison between linear and tree approaches.

You have 100,000 i.i.d. observations with features x1 (range 0–100), x2, x3, and target y. The true data-generating process (unknown to you) is piecewise linear with a hinge at x1 = 50 and an interaction between x2 and x3:
Design an analysis to decide between linear regression and a decision tree. Specify:
Login required