PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Machine Learning/Netflix

Design Real-Time Fraud Detection with XGBoost Model

Last updated: Mar 29, 2026

Quick Overview

This question evaluates competency in designing and operationalizing latency-sensitive machine learning systems, covering real-time feature engineering, training with delayed labels, class imbalance strategies, metric selection, deployment architecture, and post-deployment monitoring.

  • medium
  • Netflix
  • Machine Learning
  • Data Scientist

Design Real-Time Fraud Detection with XGBoost Model

Company: Netflix

Role: Data Scientist

Category: Machine Learning

Difficulty: medium

Interview Round: Other

##### Scenario Building a real-time fraud-detection system for subscription payments. ##### Question Outline the end-to-end workflow for training and deploying an XGBoost model to flag fraudulent transactions in real time. Which evaluation metrics would you prioritize and why? How would you handle severe class imbalance during training? Describe one strategy for monitoring model drift after deployment. ##### Hints Cover feature engineering, class weighting or sampling, precision-recall trade-offs, and online monitoring.

Quick Answer: This question evaluates competency in designing and operationalizing latency-sensitive machine learning systems, covering real-time feature engineering, training with delayed labels, class imbalance strategies, metric selection, deployment architecture, and post-deployment monitoring.

Related Interview Questions

  • Compare Losses and Explain LoRA - Netflix (medium)
  • Explain self-attention, LoRA, Adam vs SGD, ViT - Netflix (medium)
  • Design a robust conversion propensity model - Netflix (hard)
  • Explain tokenization and Transformer variants - Netflix (medium)
  • Address Fraud Detection with Imbalance and Concept Drift Solutions - Netflix (medium)
Netflix logo
Netflix
Aug 4, 2025, 10:55 AM
Data Scientist
Other
Machine Learning
8
0

Real-Time Fraud Detection with XGBoost (Subscription Payments)

Scenario

You need to build and operate a real-time system that flags potentially fraudulent subscription-payment transactions with sub-second latency. Historical labels come from chargebacks/refunds with a delay of weeks. Data includes transaction attributes, user/account metadata, device/network signals, and historical behavior.

Task

Outline the end-to-end approach, covering:

  1. End-to-end workflow
  • Data ingestion, labeling, feature engineering (batch + streaming), training/validation protocol, hyperparameter tuning, offline–online feature parity, deployment architecture, and a feedback loop.
  1. Evaluation metrics
  • Which metrics you would prioritize in an imbalanced, high-stakes setting and why.
  1. Handling severe class imbalance
  • Approaches such as class weighting, sampling, threshold tuning, and any loss/metric choices.
  1. Monitoring for model drift post-deployment
  • Describe one concrete strategy to detect and respond to drift.

Solution

Show

Submit Your Answer

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Netflix•More Data Scientist•Netflix Data Scientist•Netflix Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 8,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.