PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Machine Learning/Amazon

Optimize Feature Selection and Handling in Machine Learning Models

Last updated: Mar 29, 2026

Quick Overview

This question evaluates feature engineering competencies—scaling and normalization effects on different algorithms, strategies for handling missing or zero-inflated numeric predictors, and approaches to detect and remediate multicollinearity for feature selection in predictive models.

  • medium
  • Amazon
  • Machine Learning
  • Data Scientist

Optimize Feature Selection and Handling in Machine Learning Models

Company: Amazon

Role: Data Scientist

Category: Machine Learning

Difficulty: medium

Interview Round: Technical Screen

##### Scenario Feature engineering for a customer-propensity machine-learning model. ##### Question When do we need to standardize or normalize variables? How would you handle numeric predictors that contain many null or zero values? If several features are highly correlated, how would you decide which one(s) to keep? ##### Hints Discuss scaling impact, imputation vs flagging, and multicollinearity remedies.

Quick Answer: This question evaluates feature engineering competencies—scaling and normalization effects on different algorithms, strategies for handling missing or zero-inflated numeric predictors, and approaches to detect and remediate multicollinearity for feature selection in predictive models.

Related Interview Questions

  • Predicting the Next Elevator Call Location - Amazon (medium)
  • Explain Transformer and MoE Fundamentals - Amazon (medium)
  • Explain Core ML Interview Concepts - Amazon (hard)
  • Evaluate NLP Classification Models - Amazon (easy)
  • Explain overfitting, regularization, and LLM techniques - Amazon (medium)
Amazon logo
Amazon
Jul 12, 2025, 6:59 PM
Data Scientist
Technical Screen
Machine Learning
18
0

Scenario

You are building a customer propensity model to predict the probability that a user will take a desired action (e.g., purchase, subscribe). You have mixed feature types from transactions, web/app activity, and demographics.

Task

Answer the following practical feature-engineering questions for this setting.

Questions

  1. When should we standardize or normalize variables? (Discuss the impact of scaling on different algorithms.)
  2. How would you handle numeric predictors that contain many null or zero values? (Discuss imputation versus flagging and approaches for zero-inflated features.)
  3. If several features are highly correlated, how would you decide which one(s) to keep? (Discuss multicollinearity detection and remedies.)

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Amazon•More Data Scientist•Amazon Data Scientist•Amazon Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.