PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/Machine Learning/Amazon

Handle Missing Values and Choose ML Algorithms Wisely

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a data scientist's competencies in data preprocessing (handling missing values), algorithm selection and justification, model interpretation (contrasting Random Forests with linear regression), and understanding of model generalization (overfitting and underfitting).

  • medium
  • Amazon
  • Machine Learning
  • Data Scientist

Handle Missing Values and Choose ML Algorithms Wisely

Company: Amazon

Role: Data Scientist

Category: Machine Learning

Difficulty: medium

Interview Round: Technical Screen

##### Scenario General ML theory and practice questions during a technical interview. ##### Question a) How do you handle missing values before model training and why? b) Given a business scenario, how would you choose an appropriate ML algorithm and justify it? c) Explain Random Forests in lay terms and contrast them with linear regression. d) Define overfitting vs. underfitting and methods to detect/mitigate each. ##### Hints Cover imputation, algorithm bias-variance trade-off, ensemble intuition, cross-validation and regularization.

Quick Answer: This question evaluates a data scientist's competencies in data preprocessing (handling missing values), algorithm selection and justification, model interpretation (contrasting Random Forests with linear regression), and understanding of model generalization (overfitting and underfitting).

Related Interview Questions

  • Explain Core ML Interview Concepts - Amazon (hard)
  • Evaluate NLP Classification Models - Amazon (easy)
  • Explain overfitting, regularization, and LLM techniques - Amazon (medium)
  • Explain NLP/RL concepts used in LLM agents - Amazon (hard)
  • Design and evaluate a RAG system - Amazon (easy)
Amazon logo
Amazon
Jul 12, 2025, 6:59 PM
Data Scientist
Technical Screen
Machine Learning
58
0

ML Interview: Core Modeling Concepts

Context: Technical phone screen for a Data Scientist role. Assume primarily tabular datasets; address both classification and regression where relevant.

Questions

(a) How would you handle missing values before model training, and why?

(b) Given a business scenario, how would you choose an appropriate ML algorithm and justify your choice?

(c) Explain Random Forests in lay terms and contrast them with linear regression.

(d) Define overfitting and underfitting, and describe methods to detect and mitigate each.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Amazon•More Data Scientist•Amazon Data Scientist•Amazon Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.