This question evaluates SQL query writing, metric definition and normalization, ranking/aggregation techniques, and analytical reasoning for identifying problematic advertisers.
You are on the analytics team at TikTok and need to analyze the presence of bad content in ads and identify problematic advertisers.
Propose a clear definition of a "top bad advertiser" and write an SQL query to identify them. Consider factors such as the number of reports an advertiser receives and appropriate normalization/ranking metrics.
There is a positive correlation between ad page visits and reports on that ad being bad. Outline how you would investigate reasons behind this correlation. What hypotheses would you test?