This question evaluates competencies in applied machine learning for detecting fake accounts on social networks, covering feature engineering, handling class imbalance and noisy labels, model selection, evaluation metrics, and production deployment considerations.

You are a data scientist at a large social platform. The goal is to detect and mitigate fake or abusive accounts while minimizing harm to legitimate users. Fake accounts are rare compared to legitimate ones, so class imbalance, noisy labels, and high business costs of mistakes are central concerns.
Login required