This question evaluates understanding of gradient-based optimization and linear model training, including competency in deriving gradients for mean squared error and implementing stochastic (or mini-batch) gradient descent for parameter estimation.
You are given a dataset of 1D samples , where and are real numbers.
We want to fit a linear model:
by minimizing the mean squared error (MSE).
After training, return the learned parameters and that approximately minimize the chosen loss on the provided data.
Login required