This question evaluates understanding of linear regression theory and statistical inference, specifically how duplicating observations affects OLS coefficient estimates, estimated standard errors, t-statistics, R^2, and adjusted R^2.
You fit a standard linear regression model (with intercept) using ordinary least squares (OLS). Suppose you have:
You now duplicate every observation once, forming a new dataset by stacking the original data under itself:
You refit the same regression model on this duplicated dataset using OLS and compute the usual summary statistics.
How do the following quantities change, if at all, compared with the original fit?
Explain your reasoning mathematically (you may use matrix notation) and also interpret the result intuitively.
Login required