Identify Probability of Request Originating from Bad User

Q: Identify Probability of Request Originating from Bad User

This question evaluates probabilistic reasoning and statistical inference skills, specifically application of Bayes' theorem, handling class imbalance and differing activity rates, estimation of event-origin probabilities, and use of observational features for classification within the Statistics & Math domain.

Q: How do I approach Statistics & Math interview questions?

Statistics & Math questions require understanding of core concepts and practice. PracHub provides solutions with explanations to help you master statistics & math interviews.

Question

Measuring Abuse in Friend-Requests: Bayes, Identification, and Precision

Scenario

A social-network platform wants to measure and control abuse. Five percent of users are classified as "bad" and, on average, each bad user sends 10× as many friend-requests as a good user.

Tasks

Compute the probability that a randomly selected friend-request came from a bad user.
Using only existing event logs, propose a method to identify the likely bad users.
With additional features (e.g., request timing, acceptance rate), write an expression for P(good | request features) using Bayes' theorem.
If you must shrink the confidence interval (CI) of that probability estimate to one-tenth its current width, what changes in data collection or analysis would you make?

Hints

Apply Bayes’ rule and reason about class imbalance and different activity rates.
Use unsupervised/weakly supervised signals from logs; normalize for exposure (tenure/active days).
CI width typically shrinks as 1/sqrt(n); consider variance-reduction techniques.

Identify Probability of Request Originating from Bad User

Measuring Abuse in Friend-Requests: Bayes, Identification, and Precision

Scenario

Tasks

Hints

Solution

Comments (0)