PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/Machine Learning/Amazon

Choose Models for Imbalanced Data and Time-Series Forecasting

Last updated: Mar 29, 2026

Quick Overview

This question evaluates competence in selecting and tuning models for time-series forecasting with trend and seasonality and for extremely imbalanced classification, while assessing understanding of OLS assumptions, trade-offs among tree ensemble methods, imbalance handling approaches, and end-to-end forecasting workflows.

  • hard
  • Amazon
  • Machine Learning
  • Data Scientist

Choose Models for Imbalanced Data and Time-Series Forecasting

Company: Amazon

Role: Data Scientist

Category: Machine Learning

Difficulty: hard

Interview Round: Technical Screen

##### Scenario You are asked to choose and tune models for forecasting marketplace demand and detecting fraud in a highly imbalanced dataset. ##### Question Explain how ordinary least squares linear regression works and state its key assumptions. Compare gradient-boosted trees, random forests, and bagging; when would you prefer each? Your positive class is 0.2 % of the data. How would you handle this imbalance during model training and evaluation? Describe a full workflow for building a time-series forecasting model when seasonality and trend are present. ##### Hints Cover data preprocessing, feature engineering, resampling/weighting, proper metrics, and cross-validation for temporal data.

Quick Answer: This question evaluates competence in selecting and tuning models for time-series forecasting with trend and seasonality and for extremely imbalanced classification, while assessing understanding of OLS assumptions, trade-offs among tree ensemble methods, imbalance handling approaches, and end-to-end forecasting workflows.

Related Interview Questions

  • Explain Core ML Interview Concepts - Amazon (hard)
  • Evaluate NLP Classification Models - Amazon (easy)
  • Explain overfitting, regularization, and LLM techniques - Amazon (medium)
  • Explain NLP/RL concepts used in LLM agents - Amazon (hard)
  • Design and evaluate a RAG system - Amazon (easy)
Amazon logo
Amazon
Aug 4, 2025, 10:55 AM
Data Scientist
Technical Screen
Machine Learning
54
0

Scenario

You must choose and tune models for (a) forecasting marketplace demand with seasonality and trend, and (b) detecting fraud where the positive class rate is only 0.2%.

Tasks

  1. Ordinary Least Squares (OLS): Explain how OLS linear regression works and list its key assumptions.
  2. Tree Ensembles: Compare gradient-boosted trees, random forests, and bagging. When would you prefer each?
  3. Class Imbalance (0.2% positive): How would you handle this imbalance during model training and evaluation?
  4. Time-Series Forecasting Workflow: Describe a full, practical workflow for modeling a series with trend and seasonality, including preprocessing, feature engineering, appropriate metrics, and time-aware cross-validation.

Hint

Address data preprocessing, feature engineering, resampling/weighting, proper metrics for imbalance, and cross-validation suited for temporal data.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Amazon•More Data Scientist•Amazon Data Scientist•Amazon Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.