This question evaluates applied machine learning system design and data science competencies, including discriminative feature selection, classification modeling, label bootstrapping strategies, evaluation metrics, deployment constraints, product integration, and privacy and fairness considerations.

You are designing an ML system to detect and reduce spammy friend requests on a social-network platform. The system should run near real time at the moment a request is sent and integrate with product surfaces (e.g., inbox ranking, rate limits, warnings).
Assume you can log request-level events (sender, recipient, time, device/IP, graph context) and respect privacy and fairness constraints. If needed, make minimal reasonable assumptions and call them out.
Login required