PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Machine Learning/Netflix

Address Fraud Detection with Imbalance and Concept Drift Solutions

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a data scientist's competence in designing end-to-end machine learning systems for fraud detection, emphasizing challenges such as delayed labels, severe class imbalance, and evolving data distributions (concept drift) in near-real-time scoring.

  • medium
  • Netflix
  • Machine Learning
  • Data Scientist

Address Fraud Detection with Imbalance and Concept Drift Solutions

Company: Netflix

Role: Data Scientist

Category: Machine Learning

Difficulty: medium

Interview Round: Onsite

##### Scenario You are tasked with building a fraud-detection model for an online payments product. ##### Question Outline the end-to-end ML workflow: data collection, feature engineering, model selection, validation, deployment, and monitoring. How would you handle severe class imbalance and concept drift in this context? ##### Hints Discuss resampling, cost-sensitive learning, ROC-AUC, sliding windows, and automated retraining triggers.

Quick Answer: This question evaluates a data scientist's competence in designing end-to-end machine learning systems for fraud detection, emphasizing challenges such as delayed labels, severe class imbalance, and evolving data distributions (concept drift) in near-real-time scoring.

Related Interview Questions

  • Compare Losses and Explain LoRA - Netflix (medium)
  • Explain self-attention, LoRA, Adam vs SGD, ViT - Netflix (medium)
  • Design a robust conversion propensity model - Netflix (hard)
  • Explain tokenization and Transformer variants - Netflix (medium)
  • Design Real-Time Fraud Detection with XGBoost Model - Netflix (medium)
Netflix logo
Netflix
Jul 12, 2025, 6:59 PM
Data Scientist
Onsite
Machine Learning
14
0

End-to-End ML Workflow: Online Payments Fraud Detection

Scenario

You are designing a fraud-detection system for an online payments product that must score transactions in (near) real time. Labels for fraud (e.g., chargebacks) arrive with delays, fraud is rare (severe class imbalance), and fraud patterns evolve over time (concept drift).

Task

Outline the end-to-end ML workflow, covering:

  1. Data collection and labeling
  2. Feature engineering
  3. Model selection and training
  4. Validation and offline evaluation
  5. Deployment and inference
  6. Monitoring and retraining

Additionally, explain how you would handle:

  • Severe class imbalance
  • Concept drift

Note: Discuss techniques such as resampling, cost-sensitive learning, ROC-AUC/PR-AUC, sliding windows, and automated retraining triggers.

Solution

Show

Submit Your Answer

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Netflix•More Data Scientist•Netflix Data Scientist•Netflix Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 8,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.