PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Machine Learning/Google

Detect Overfitting or Underfitting in Logistic Regression Models

Last updated: Mar 29, 2026

Quick Overview

This question evaluates understanding of bias–variance trade-offs, model generalization and diagnostics for logistic regression applied to high‑dimensional, sparse, and imbalanced datasets, including feature sparsity and regularization considerations.

  • medium
  • Google
  • Machine Learning
  • Data Scientist

Detect Overfitting or Underfitting in Logistic Regression Models

Company: Google

Role: Data Scientist

Category: Machine Learning

Difficulty: medium

Interview Round: Technical Screen

##### Scenario Building a large-scale binary classifier with hundreds or thousands of features for Google Display ads performance prediction. ##### Question Does logistic regression typically underfit or overfit in this setting? Describe the conditions that drive each, how you would detect the problem, and the techniques you would use to address it. ##### Hints Cover regularization strength, feature selection, high-dimensional sparsity, learning curves, cross-validation.

Quick Answer: This question evaluates understanding of bias–variance trade-offs, model generalization and diagnostics for logistic regression applied to high‑dimensional, sparse, and imbalanced datasets, including feature sparsity and regularization considerations.

Related Interview Questions

  • Explain ranking cold-start strategies - Google (medium)
  • Explain LLM fine-tuning and generative models - Google (medium)
  • Compare NLP tokenization and LLM recommendations - Google (medium)
  • Explain LLM lifecycle and trade-offs - Google (medium)
  • Build a bigram next-word predictor with weighted sampling - Google (medium)
Google logo
Google
Aug 4, 2025, 10:55 AM
Data Scientist
Technical Screen
Machine Learning
21
0

Logistic Regression Bias–Variance in High‑Dimensional Ads Prediction

Scenario

You are building a large‑scale binary classifier (e.g., click/conversion prediction for Google Display ads) with hundreds to thousands of mostly sparse, high‑cardinality features (one‑hot categorials, text/ids, and some numerics). The dataset is large and exhibits class imbalance.

Question

  1. In this setting, does logistic regression typically underfit or overfit? Describe the conditions that drive each outcome.
  2. How would you detect underfitting vs overfitting in practice (e.g., learning curves, cross‑validation)?
  3. What techniques would you use to address each case (consider regularization strength, feature selection, high‑dimensional sparsity, and related tooling)?

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Google•More Data Scientist•Google Data Scientist•Google Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.