Estimate b when features exceed samples
Company: Google
Role: Data Scientist
Category: Machine Learning
Difficulty: Medium
Interview Round: Technical Screen
Quick Answer: This question evaluates proficiency in linear regression theory, including identifiability and the sampling distribution of OLS, together with high-dimensional competencies such as regularization, variable selection, dimensionality reduction, properties of the Moore–Penrose pseudoinverse, and the statistical consequences of naive upsampling.