Design a traditional fraud detection system

This is an ML System Design interview question from PayPal for Machine Learning Engineer roles.

Question

Design an End-to-End Real-Time Payments Fraud Detection System

Context: You are designing a fraud detection system for a large-scale online payments platform. Decisions must be made synchronously at authorization time with tight latency budgets, while confirmed fraud labels (e.g., chargebacks) arrive late and are scarce.

Specify and justify the following:

Labeling strategy under delayed, scarce confirmations

How to define positive/negative labels when chargebacks arrive weeks later.
Aging/observation windows, handling disputed outcomes, and avoiding target leakage.
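
To make the aging-window idea concrete, here is a minimal sketch of one possible labeling rule. The 90-day maturity window, the function name `assign_label`, and its parameters are assumptions for illustration, not values given in the question. A transaction is labeled positive once a chargeback has been observed, negative only after it has aged past the window without one, and left unlabeled (excluded from training) otherwise.

```python
from datetime import datetime, timedelta
from typing import Optional

# Hypothetical observation window; a real value would come from the
# empirical chargeback-delay distribution.
MATURITY = timedelta(days=90)


def assign_label(
    txn_time: datetime,
    chargeback_time: Optional[datetime],
    as_of: datetime,
    maturity: timedelta = MATURITY,
) -> Optional[int]:
    """Return 1 (fraud), 0 (legitimate), or None (label not yet mature).

    Only information available at `as_of` is used, so a chargeback that
    arrives after the snapshot date cannot leak into the label.
    """
    if chargeback_time is not None and chargeback_time <= as_of:
        return 1  # confirmed fraud observed by the snapshot date
    if as_of - txn_time >= maturity:
        return 0  # aged past the window with no chargeback observed
    return None  # too recent to trust as a negative


as_of = datetime(2024, 6, 1)
print(assign_label(datetime(2024, 1, 1), datetime(2024, 2, 1), as_of))
print(assign_label(datetime(2024, 1, 1), None, as_of))
print(assign_label(datetime(2024, 5, 1), None, as_of))
```

Disputed or reversed chargebacks would need an extra state; this sketch deliberately ignores them.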

Sampling to handle extreme class imbalance

Offline training strategies (downsampling, weighting) and how to keep calibration.
Online serving considerations.
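
One common way to keep calibration after downsampling negatives is a prior correction on the model output. The sketch below assumes all positives are kept and negatives are kept at rate `beta`; the function name `correct_probability` is hypothetical. Under that assumption the sampled odds equal the true odds divided by `beta`, which gives the formula in the code.

```python
def correct_probability(p_sampled: float, beta: float) -> float:
    """Map a probability learned on downsampled data back to the original prior.

    Assumes every positive was kept and each negative was kept with
    probability `beta` (0 < beta <= 1).
    """
    if not 0.0 < beta <= 1.0:
        raise ValueError("beta must be in (0, 1]")
    return beta * p_sampled / (beta * p_sampled + 1.0 - p_sampled)


# With 10% of negatives kept, a sampled-data score of 0.5 corresponds
# to true odds of 0.1, i.e. a probability of 1/11.
print(correct_probability(0.5, 0.1))
```

At serving time the same correction would be applied to every score before thresholding, so that decision thresholds refer to probabilities on the original traffic.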

Feature sets

Behavioral/velocity features.
Graph/link features across users, devices, payment instruments.
Device/network features.
Merchant/context features.
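
As an illustration of a velocity feature, the following sketch counts prior events per key (for example, per card) inside a sliding time window. The class name `VelocityCounter` is hypothetical, and the code assumes timestamps arrive in non-decreasing order for each key. A production system would more likely compute this in a streaming engine backed by an online feature store.

```python
from collections import defaultdict, deque


class VelocityCounter:
    """Count events per key within the last `window_seconds`."""

    def __init__(self, window_seconds: float) -> None:
        self.window = window_seconds
        self.events = defaultdict(deque)  # key -> timestamps in arrival order

    def add_and_count(self, key: str, ts: float) -> int:
        """Return the number of earlier events in the window, then record `ts`.

        The current event is excluded from its own count so the feature
        reflects only what was known before the transaction.
        """
        q = self.events[key]
        # Evict events that fell out of the window (ts - window, ts].
        while q and q[0] <= ts - self.window:
            q.popleft()
        count = len(q)
        q.append(ts)
        return count


vc = VelocityCounter(window_seconds=60)
for t in (0, 10, 30, 65, 200):
    print(t, vc.add_and_count("card_123", t))
```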

Model choices and justification

Baseline and advanced models suitable for latency and scale.
Handling graphs, sequences, and semi-/weak supervision.

Real-time scoring architecture and latency constraints

Event ingestion, online/offline feature store, streaming aggregations, model serving, and fallbacks.
Expected P99 latency budget and resiliency.

Thresholding and precision/recall trade-offs

Decision policies (approve/review/decline) using cost-aware thresholds and calibration.
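
A cost-aware policy can be sketched as picking the action with the lowest expected cost given a calibrated fraud probability. Every number and name below is an assumption for illustration: `margin_rate` stands in for the revenue lost on a wrongly declined good transaction, `review_cost` for the cost of a manual review, and review is assumed to resolve the case perfectly.

```python
def decide(
    p_fraud: float,
    amount: float,
    margin_rate: float = 0.03,  # hypothetical margin lost on a false decline
    review_cost: float = 2.0,   # hypothetical flat cost of a manual review
) -> str:
    """Return 'approve', 'review', or 'decline' by minimum expected cost."""
    costs = {
        # Approving loses the full amount if the transaction is fraud.
        "approve": p_fraud * amount,
        # Declining loses the margin if the transaction was legitimate.
        "decline": (1.0 - p_fraud) * amount * margin_rate,
        # Review is assumed to cost a flat fee and always reach the right answer.
        "review": review_cost,
    }
    return min(costs, key=costs.get)


print(decide(0.001, 100.0))
print(decide(0.9, 100.0))
print(decide(0.3, 1000.0))
```

Because the costs scale with the transaction amount, the effective probability threshold differs per transaction, which is why calibrated scores matter here.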

Evaluation metrics

PR-AUC, precision@k, expected-cost/profit metrics, and how to evaluate with delayed labels and policy bias.
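
For reference, here is a small sketch of two of the ranking metrics named above: precision@k and average precision, a common summary of the precision-recall curve. The function names are hypothetical, and the code assumes scores have no ties and that labels are already mature.

```python
from typing import Sequence


def precision_at_k(scores: Sequence[float], labels: Sequence[int], k: int) -> float:
    """Fraction of true positives among the k highest-scored items."""
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    top = ranked[:k]
    return sum(label for _, label in top) / len(top)


def average_precision(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Mean of precision values taken at the rank of each positive."""
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits = 0
    total = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


scores = [0.9, 0.8, 0.7, 0.6]
labels = [1, 0, 1, 0]
print(precision_at_k(scores, labels, k=2))
print(average_precision(scores, labels))
```

Neither metric corrects for policy bias: declined transactions never receive a chargeback label, so an evaluation set built only from approved traffic needs separate treatment.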

Monitoring for drift and adversarial adaptation

Detecting data/model drift, label delay proxies, and adversarial pattern monitoring.
Retraining cadence, rollout, and guardrails.
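
One widely used drift signal is the Population Stability Index (PSI) between a reference distribution and recent traffic, computed over shared bins of a feature or of the model score. The sketch below is one possible implementation; the function name `psi` and the `eps` floor for empty bins are assumptions.

```python
import math
from typing import Sequence


def psi(expected_counts: Sequence[float], actual_counts: Sequence[float], eps: float = 1e-6) -> float:
    """Population Stability Index over pre-binned counts.

    Both inputs must use the same bin edges. Bin proportions are floored
    at `eps` so empty bins do not produce log(0) or division by zero.
    """
    if len(expected_counts) != len(actual_counts):
        raise ValueError("bin counts must have the same length")
    e_total = sum(expected_counts)
    a_total = sum(actual_counts)
    value = 0.0
    for e, a in zip(expected_counts, actual_counts):
        pe = max(e / e_total, eps)
        pa = max(a / a_total, eps)
        value += (pa - pe) * math.log(pa / pe)
    return value


print(psi([50, 50], [50, 50]))  # identical distributions
print(psi([50, 50], [90, 10]))  # strongly shifted distribution
```

A frequently quoted rule of thumb treats PSI above roughly 0.25 as a significant shift, but alert thresholds are a judgment call and should be tuned per feature.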
