How do I approach Machine Learning interview questions?

Machine Learning questions require understanding of core concepts and practice. PracHub provides solutions with explanations to help you master machine learning interviews.

What difficulty level is this interview question?

This is a medium difficulty Machine Learning question, commonly asked during Onsite rounds at Google.

What role is this question designed for?

This question is commonly asked for Data Scientist candidates at Google during technical interviews.

Compare Logistic Regression and Random Forest in Limited Data Scenarios

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a data scientist's understanding of binary classification model selection, covering logistic regression fundamentals, L1/L2 regularization effects, overfitting detection, and comparisons between Random Forest and boosting within the Machine Learning domain.

Google

Jul 12, 2025, 6:59 PM

Data Scientist

Onsite

Machine Learning

Model Selection for Binary Classification with Limited Data and Potential Non-Linearities

Scenario

You are designing a binary classifier with limited labeled data. The signal may be partly non-linear, and you care about generalization and interpretability.

Questions

What is logistic regression, and what is its loss function? Briefly note its optimization properties (convexity).
When can logistic regression outperform a Random Forest?
Explain L1 and L2 regularization and their effects (e.g., sparsity, multicollinearity).
How would you detect and mitigate overfitting in logistic regression?
Compare Random Forest and Boosting (e.g., Gradient Boosting) in terms of bias, variance, interpretability, and typical use cases. Include thoughts on ensemble diversity and probability calibration.

Solution

Show

Submit Your Answer to Earn 20XP

Loading comments...

Browse More Questions

More Machine Learning•More Google•More Data Scientist•Google Data Scientist•Google Machine Learning•Data Scientist Machine Learning