PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/ML System Design/PayPal

Design a traditional fraud detection system

Last updated: Apr 17, 2026

Quick Overview

This question evaluates a Machine Learning Engineer's competency in end-to-end ML system design for real-time payments fraud detection, including labeling under delayed confirmations, handling extreme class imbalance and sampling, feature engineering across behavioral, graph, device and merchant signals, model selection for latency and scale, and production scoring and monitoring architecture. It is commonly asked in the ML System Design category to assess how an engineer balances low-latency decision-making with delayed sparse labels, calibration and threshold trade-offs, operational scalability and resiliency, and drift/adversarial detection, testing both conceptual understanding and practical application.

  • hard
  • PayPal
  • ML System Design
  • Machine Learning Engineer

Design a traditional fraud detection system

Company: PayPal

Role: Machine Learning Engineer

Category: ML System Design

Difficulty: hard

Interview Round: Onsite

Design an end-to-end fraud detection system. Specify positive/negative labeling strategy given delayed and scarce fraud confirmations, sampling to address extreme class imbalance, feature sets (behavioral, graph, device, merchant), model choices and justification, real-time scoring architecture and latency constraints, thresholding and precision/recall trade-offs, evaluation metrics (PR-AUC, precision@k, cost-sensitive metrics), and monitoring for drift and adversarial adaptation.

Quick Answer: This question evaluates a Machine Learning Engineer's competency in end-to-end ML system design for real-time payments fraud detection, including labeling under delayed confirmations, handling extreme class imbalance and sampling, feature engineering across behavioral, graph, device and merchant signals, model selection for latency and scale, and production scoring and monitoring architecture. It is commonly asked in the ML System Design category to assess how an engineer balances low-latency decision-making with delayed sparse labels, calibration and threshold trade-offs, operational scalability and resiliency, and drift/adversarial detection, testing both conceptual understanding and practical application.

Related Interview Questions

  • Design RL-based spending limit policy - PayPal (hard)
  • Detect credit-card transaction fraud - PayPal (hard)
  • Design fraud detection from raw transactions - PayPal (hard)
PayPal logo
PayPal
Sep 6, 2025, 12:00 AM
Machine Learning Engineer
Onsite
ML System Design
8
0

Design an End-to-End Real-Time Payments Fraud Detection System

Context: You are designing a fraud detection system for a large-scale online payments platform. Decisions must be made synchronously at authorization time with tight latency budgets, while confirmed fraud labels (e.g., chargebacks) arrive late and are scarce.

Specify and justify the following:

  1. Labeling strategy under delayed, scarce confirmations
  • How to define positive/negative labels when chargebacks arrive weeks later.
  • Aging/observation windows, handling disputed outcomes, and avoiding target leakage.
  1. Sampling to handle extreme class imbalance
  • Offline training strategies (downsampling, weighting) and how to keep calibration.
  • Online serving considerations.
  1. Feature sets
  • Behavioral/velocity features.
  • Graph/link features across users, devices, payment instruments.
  • Device/network features.
  • Merchant/context features.
  1. Model choices and justification
  • Baseline and advanced models suitable for latency and scale.
  • Handling graphs, sequences, and semi-/weak supervision.
  1. Real-time scoring architecture and latency constraints
  • Event ingestion, online/offline feature store, streaming aggregations, model serving, and fallbacks.
  • Expected P99 latency budget and resiliency.
  1. Thresholding and precision/recall trade-offs
  • Decision policies (approve/review/decline) using cost-aware thresholds and calibration.
  1. Evaluation metrics
  • PR-AUC, precision@k, expected-cost/profit metrics, and how to evaluate with delayed labels and policy bias.
  1. Monitoring for drift and adversarial adaptation
  • Detecting data/model drift, label delay proxies, and adversarial pattern monitoring.
  • Retraining cadence, rollout, and guardrails.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More ML System Design•More PayPal•More Machine Learning Engineer•PayPal Machine Learning Engineer•PayPal ML System Design•Machine Learning Engineer ML System Design
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.