This question evaluates proficiency in exploratory data analysis, feature preprocessing, baseline linear regression and small neural network training, as well as detecting and mitigating overfitting and selecting appropriate train/validation/test protocols and evaluation metrics for tabular regression.
You are given a tabular regression dataset with numeric and categorical features and a continuous target.
Describe (at a code-and-practice level) how you would:
Be explicit about the train/validation/test protocol, metrics, and common pitfalls.