Investigate Harassment Surge and Mitigation

Q: Investigate Harassment Surge and Mitigation

This question evaluates a data scientist's skills in diagnostic analytics, causal reasoning, validation of model-driven metrics using human-reviewed labels, and designing product/ML mitigations within the Analytics & Experimentation domain.

Q: How do I approach Analytics & Experimentation interview questions?

Analytics & Experimentation questions require understanding of core concepts and practice. PracHub provides solutions with explanations to help you master analytics & experimentation interviews.

Q: What difficulty level is this interview question?

This is a medium difficulty Analytics & Experimentation question, commonly asked during Onsite rounds at Airwallex.

Q: What role is this question designed for?

This question is commonly asked for Data Scientist candidates at Airwallex during technical interviews.

Question

Using the same moderation setting, suppose your monthly violation analysis shows that Harassment increased sharply in the most recent month.

Discuss the following:

What are the most plausible explanations for the surge? Consider both real-world and measurement-related causes, such as:
- a true increase in abusive behavior
- traffic mix changes across surfaces, regions, languages, or creator cohorts
- seasonality or external events
- coordinated attacks or repeat offenders
- policy-definition changes
- model-threshold changes or model-version changes
- calibration drift in the classifier
- data-quality, logging, or backfill issues
How would you investigate whether the surge is real versus an artifact? Be specific about:
- which prevalence metrics you would use
- the denominators and time windows you would compare
- which segments you would break the data into
- what additional datasets you would request, such as human-review labels, user reports, enforcement logs, or model-version metadata
- how you would account for confounding, selection bias, and Simpson's paradox
If the surge is real, what product, policy, ranking, operational, and ML solutions would you propose? Include both short-term containment actions and longer-term fixes.
How would you evaluate those solutions? Include primary metrics, guardrail metrics, experiment or rollout design, and the main tradeoffs involving false positives, fairness, and user experience.

Your answer should distinguish a volume increase from a rate increase and should explain how to validate model-driven metrics using human-reviewed labels.

Investigate Harassment Surge and Mitigation

Quick Overview

Solution

Comments (0)