This question evaluates statistical measurement and machine-learning model evaluation competencies, covering prevalence estimation, sampling design, bias correction, exposure-weighted metrics, and operational constraints for misinformation measurement on an online social platform.

Policy teams need an overnight view of fake-news prevalence on the platform, but only a small number of human reviewers are available. In parallel, leadership wants a long-term, statistically sound measurement program and a plan to improve detection models. Assume you can use an existing ML model to pre-score content as likely fake or not, and you can access impression counts to estimate user exposure.
Report and reason about both:
Login required