PracHub
QuestionsCoachesLearningGuidesInterview Prep
|Home/Machine Learning/Amazon

Explain key ML theory and techniques

Last updated: Jun 15, 2026

Quick Overview

This interview question evaluates core ML concepts, assumptions, math intuition, training/evaluation trade-offs, and practical failure modes in a realistic interview setting. A strong answer for Explain key ML theory and techniques states assumptions, handles edge cases, explains trade-offs, and shows how to validate the result clearly.

  • hard
  • Amazon
  • Machine Learning
  • Machine Learning Engineer

Explain key ML theory and techniques

Company: Amazon

Role: Machine Learning Engineer

Category: Machine Learning

Difficulty: hard

Interview Round: Onsite

##### Question This Amazon Machine Learning Engineer onsite covers a breadth of core ML theory and applied modeling. Be ready to go deep on each of the following: 1. **XGBoost parallel computation.** Explain how XGBoost achieves parallelism during training. Compare feature-parallel vs. data-parallel (histogram-based) split finding, describe distributed training across machines, and discuss the trade-offs in memory, speed, and accuracy. 2. **Layer normalization in Transformers.** Give the mathematical formulation, explain where it is applied (pre-norm vs. post-norm), why it stabilizes training, and its effect on gradient flow. Contrast it with batch normalization. 3. **Multimodal neural network design.** Design a network that fuses text and images. Describe early/late/cross-attention fusion strategies, how to align modalities, how to handle missing modalities, and how to choose loss functions and evaluation metrics. 4. **Collaborative filtering.** Compare user-based vs. item-based neighborhood methods and matrix factorization (including implicit feedback). Discuss regularization, cold-start mitigation, and scaling to sparse, large datasets. 5. **Multi-armed bandits.** Formulate the problem and define regret. Compare epsilon-greedy, UCB, and Thompson Sampling, address non-stationary and contextual settings, and describe offline policy evaluation and safe deployment. 6. **Logistic regression.** Derive the log-likelihood and gradients, compare L1 vs. L2 regularization, interpret coefficients as odds ratios, and handle class imbalance, calibration, and decision-threshold selection.

Quick Answer: This interview question evaluates core ML concepts, assumptions, math intuition, training/evaluation trade-offs, and practical failure modes in a realistic interview setting. A strong answer for Explain key ML theory and techniques states assumptions, handles edge cases, explains trade-offs, and shows how to validate the result clearly.

Related Interview Questions

  • LLM Fundamentals: Tokenization Design and KL-Regularized SFT - Amazon (medium)
  • Predicting the Next Elevator Call Location - Amazon (medium)
  • Explain Transformer and MoE Fundamentals - Amazon (medium)
  • Explain Core ML Interview Concepts - Amazon (hard)
  • Evaluate NLP Classification Models - Amazon (easy)
|Home/Machine Learning/Amazon

Explain key ML theory and techniques

Amazon logo
Amazon
Aug 4, 2025, 10:55 AM
hardMachine Learning EngineerOnsiteMachine Learning
27
0

Explain key ML theory and techniques

This Amazon Machine Learning Engineer onsite covers a breadth of core ML theory and applied modeling. Be ready to go deep on each of the following:

  1. XGBoost parallel computation. Explain how XGBoost achieves parallelism during training. Compare feature-parallel vs. data-parallel (histogram-based) split finding, describe distributed training across machines, and discuss the trade-offs in memory, speed, and accuracy.
  2. Layer normalization in Transformers. Give the mathematical formulation, explain where it is applied (pre-norm vs. post-norm), why it stabilizes training, and its effect on gradient flow. Contrast it with batch normalization.
  3. Multimodal neural network design. Design a network that fuses text and images. Describe early/late/cross-attention fusion strategies, how to align modalities, how to handle missing modalities, and how to choose loss functions and evaluation metrics.
  4. Collaborative filtering. Compare user-based vs. item-based neighborhood methods and matrix factorization (including implicit feedback). Discuss regularization, cold-start mitigation, and scaling to sparse, large datasets.
  5. Multi-armed bandits. Formulate the problem and define regret. Compare epsilon-greedy, UCB, and Thompson Sampling, address non-stationary and contextual settings, and describe offline policy evaluation and safe deployment.
  6. Logistic regression. Derive the log-likelihood and gradients, compare L1 vs. L2 regularization, interpret coefficients as odds ratios, and handle class imbalance, calibration, and decision-threshold selection.

Constraints & Assumptions

  • Preserve the scope, facts, inputs, and requested outputs from the prompt above.
  • If the prompt leaves a detail unspecified, state a reasonable assumption before relying on it.
  • Keep the answer interview-ready: concise enough to present, but concrete enough to implement or evaluate.

Clarifying Questions to Ask

  • Clarify the task, data shape, labels, constraints, and evaluation metric.
  • State assumptions behind the math or modeling technique you choose.
  • Connect theory to practical training, debugging, and deployment implications.

What a Strong Answer Covers

  • Correct definitions and formulas where the prompt requires them.
  • A practical explanation of how the method behaves on real data.
  • Trade-offs, failure modes, diagnostics, and mitigation strategies.
  • Evaluation choices that match the product or modeling objective.

Follow-up Questions

  • How would noisy labels, class imbalance, or distribution shift affect the answer?
  • What would you monitor after deployment?
  • Which baseline would you compare against first?
Loading comments...

Browse More Questions

More Machine Learning•More Amazon•More Machine Learning Engineer•Amazon Machine Learning Engineer•Amazon Machine Learning•Machine Learning Engineer Machine Learning

Write your answer

Your first approved answer each day earns 20 XP.

Sign in to write your answer.
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • AI Coding Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.