PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Statistics & Math/Other

Diagnose and fix linear regression violations

Last updated: Mar 29, 2026

Quick Overview

This question evaluates mastery of linear regression diagnostics and inference, including Gauss–Markov assumptions and their role for BLUE versus hypothesis testing, OLS residual properties and orthogonality, multicollinearity and VIF, the effects of predictor scaling, and robustness to heavy tails and heteroskedasticity.

  • Medium
  • Other
  • Statistics & Math
  • Data Scientist

Diagnose and fix linear regression violations

Company: Other

Role: Data Scientist

Category: Statistics & Math

Difficulty: Medium

Interview Round: Onsite

Given a linear model y = Xβ + ε on 10,000 observations: (a) State all Gauss–Markov assumptions and which are needed for BLUE vs inference. (b) Show why OLS residuals sum to zero and why fitted residuals are orthogonal to the column space of X. (c) You observe multicollinearity among three predictors; define and compute VIF, and propose two remedies. (d) If you scale one predictor by a factor of 100, how do coefficients, standard errors, R^2, and predictions change? (e) Interpret a QQ-plot and residual-vs-fitted plot that show heavy tails and heteroskedasticity; propose robust alternatives (e.g., HC3, WLS) and appropriate hypothesis tests.

Quick Answer: This question evaluates mastery of linear regression diagnostics and inference, including Gauss–Markov assumptions and their role for BLUE versus hypothesis testing, OLS residual properties and orthogonality, multicollinearity and VIF, the effects of predictor scaling, and robustness to heavy tails and heteroskedasticity.

Related Interview Questions

  • Compute counts and pacing for verbal section - Other (medium)
  • Simulate via inverse transform and Gibbs - Other (Medium)
  • Compute p-values and adjust multiplicity - Other (Medium)
  • Apply Bayes theorem with conjugate priors - Other (Medium)
  • Compare LLN and CLT with heavy tails - Other (Medium)
Other logo
Other
Oct 13, 2025, 9:49 PM
Data Scientist
Onsite
Statistics & Math
2
0
Loading...

Given a linear model y = Xβ + ε on 10,000 observations: (a) State all Gauss–Markov assumptions and which are needed for BLUE vs inference. (b) Show why OLS residuals sum to zero and why fitted residuals are orthogonal to the column space of X. (c) You observe multicollinearity among three predictors; define and compute VIF, and propose two remedies. (d) If you scale one predictor by a factor of 100, how do coefficients, standard errors, R^2, and predictions change? (e) Interpret a QQ-plot and residual-vs-fitted plot that show heavy tails and heteroskedasticity; propose robust alternatives (e.g., HC3, WLS) and appropriate hypothesis tests.

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Statistics & Math•More Other•More Data Scientist•Other Data Scientist•Other Statistics & Math•Data Scientist Statistics & Math
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.