This question evaluates a candidate's ability to construct a leakage-safe preprocessing and modeling pipeline for tabular data, including imputation, categorical encoding, feature scaling, ordinary least squares linear regression, and interpretation of coefficients and performance metrics.
You are given a tabular dataset in a pandas DataFrame df. The goal is to predict a continuous target column target. Build a leakage-safe baseline linear regression pipeline and report performance and coefficients.
Provide clean, runnable Python code plus a short interpretation of the metrics and coefficients.
Login required