How do I approach Analytics & Experimentation interview questions?

Analytics & Experimentation questions require understanding of core concepts and practice. PracHub provides solutions with explanations to help you master analytics & experimentation interviews.

What difficulty level is this interview question?

This is a medium difficulty Analytics & Experimentation question, commonly asked during Onsite rounds at Meta.

What role is this question designed for?

This question is commonly asked for Data Scientist candidates at Meta during technical interviews.

[Analytics Reasoning] Impact of Malicious Accounts on Meta

Company: Meta

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: medium

Interview Round: Onsite

Context & Assumptions On a large social network (similar to Facebook), only 1% of accounts are malicious or "bad" Bad accounts send friend requests at a rate ten times higher than good accounts You have developed a classification model that achieves a 95% true positive rate (TPR) and a 95% true negative rate (TNR) Questions 1. Single Friend Request Probability Estimate the probability that a received friend request comes from a bad account. 2. Multiple Friend Requests Determine the likelihood that, out of five friend requests, at least one originates from a bad account. 3. Model Reliability If the model flags an account as bad, how likely is it to be truly malicious, given the stated TPR and TNR? 4. Data & Features Identify which types of data (e.g., behavior logs, friend-request patterns, reported incidents) would be most relevant to classify accounts accurately. 5. Assessing "Bad Account" Prevalence Propose methods (such as stratified or random sampling) to determine whether the bad-account issue is substantial enough to warrant further intervention. 6. Defining a "Bad User" Outline what characteristics or behaviors would qualify an account as "bad" (e.g., spammer, scammer, bot). 7. Platform Impact Discuss how the presence of malicious users could affect the platform's trust, user experience, and reputation. 8. Friend Request Implications Examine how frequent friend requests from bad accounts might impact legitimate users and overall community health. Follow-Up Considerations Threshold Tuning Discuss how adjusting the classification threshold could alter false positives and false negatives. Cost-Benefit Analysis Evaluate the resource investment needed for automated detection vs. manual review. Long-Term Effects of False Positives Consider how incorrect labeling of legitimate users might erode trust. Advanced Feature Engineering Suggest additional signals (e.g., content quality, login behavior) that could enhance detection accuracy. Scaling & Fairness Explain how to maintain model performance and fairness across billions of accounts, ensuring no disproportionate impact on certain demographics.

Quick Answer: Solve a Meta analytics case on malicious accounts and friend-request risk. Covers base-rate probability, Bayes precision, bad-account features, prevalence measurement, platform impact, and safe enforcement.

You are analyzing malicious accounts on a large social network. Assume:

1% of all accounts are malicious.
Malicious accounts send friend requests at 10 times the rate of normal accounts.
A binary bad-account classifier has 95% true positive rate and 95% true negative rate.
Unless otherwise stated, treat friend requests as independent over a short time window.

Answer the following analytical and product-safety questions.

Constraints & Assumptions

Be explicit about whether you are reasoning over all accounts or over accounts that send friend requests.
Use base rates correctly; do not assume model precision equals TPR.
State any independence assumptions for the five-request calculation.
Discuss both detection accuracy and user/platform impact.
Avoid proposing enforcement that ignores false-positive harm.

Clarifying Questions to Ask

Are we classifying all accounts, only accounts that send friend requests, or individual friend-request events?
What costs are associated with false positives and false negatives?
What actions follow a bad-account prediction: friction, review, rate limit, or removal?
Are labels from manual review, user reports, confirmed enforcement, or a sampled audit?

Part 1 - Probability and Model Reliability

Calculate:

The probability that a received friend request comes from a bad account.
The probability that at least one of five received friend requests comes from a bad account.
The probability an account is truly bad if the classifier flags it as bad, under both the all-account population and the friend-request-sender population.

What This Part Should Cover

Rate-weighted base rates.
Complement rule for at least one bad request.
Bayes' rule for precision/positive predictive value.
Explanation of why prevalence changes model reliability.

Part 2 - Data, Features, and Prevalence Measurement

Describe what data you would use to classify malicious accounts and how you would estimate the true prevalence of bad behavior.

What This Part Should Cover

Friend-request behavior, graph features, content, reports, login/device signals, and enforcement history.
Random or stratified sampling with human audit.
Confidence intervals and biased-label concerns.
Honeypots, capture-recapture, or active-learning ideas if useful.

Part 3 - Product Impact and Enforcement

Explain why malicious friend requests matter and how you would design detection or intervention safely.

What This Part Should Cover

Effects on trust, user experience, retention, graph quality, brand, and operations.
Definition of a bad account.
Threshold tuning, manual review, friction, appeal paths, and fairness monitoring.

What a Strong Answer Covers

Correct arithmetic and clear population definitions.
Awareness of the base-rate problem.
Practical feature families and data-quality risks.
A product-safety perspective that balances harm reduction with false-positive risk.
Scalable and fair enforcement strategy.

Follow-up Questions

How would you choose the classifier threshold?
How would you estimate harm prevented by detection?
How would you monitor adversarial adaptation?
What would you do if one region has worse precision than others?

[Analytics Reasoning] Impact of Malicious Accounts on Meta

Company: Meta

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: medium

Interview Round: Onsite

You are analyzing malicious accounts on a large social network. Assume:

1% of all accounts are malicious.
Malicious accounts send friend requests at 10 times the rate of normal accounts.
A binary bad-account classifier has 95% true positive rate and 95% true negative rate.
Unless otherwise stated, treat friend requests as independent over a short time window.

Answer the following analytical and product-safety questions.

Constraints & Assumptions

Be explicit about whether you are reasoning over all accounts or over accounts that send friend requests.
Use base rates correctly; do not assume model precision equals TPR.
State any independence assumptions for the five-request calculation.
Discuss both detection accuracy and user/platform impact.
Avoid proposing enforcement that ignores false-positive harm.

Clarifying Questions to Ask

Are we classifying all accounts, only accounts that send friend requests, or individual friend-request events?
What costs are associated with false positives and false negatives?
What actions follow a bad-account prediction: friction, review, rate limit, or removal?
Are labels from manual review, user reports, confirmed enforcement, or a sampled audit?

Part 1 - Probability and Model Reliability

Calculate:

The probability that a received friend request comes from a bad account.
The probability that at least one of five received friend requests comes from a bad account.
The probability an account is truly bad if the classifier flags it as bad, under both the all-account population and the friend-request-sender population.

What This Part Should Cover

Rate-weighted base rates.
Complement rule for at least one bad request.
Bayes' rule for precision/positive predictive value.
Explanation of why prevalence changes model reliability.

Part 2 - Data, Features, and Prevalence Measurement

Describe what data you would use to classify malicious accounts and how you would estimate the true prevalence of bad behavior.

What This Part Should Cover

Friend-request behavior, graph features, content, reports, login/device signals, and enforcement history.
Random or stratified sampling with human audit.
Confidence intervals and biased-label concerns.
Honeypots, capture-recapture, or active-learning ideas if useful.

Part 3 - Product Impact and Enforcement

Explain why malicious friend requests matter and how you would design detection or intervention safely.

What This Part Should Cover

Effects on trust, user experience, retention, graph quality, brand, and operations.
Definition of a bad account.
Threshold tuning, manual review, friction, appeal paths, and fairness monitoring.

What a Strong Answer Covers

Correct arithmetic and clear population definitions.
Awareness of the base-rate problem.
Practical feature families and data-quality risks.
A product-safety perspective that balances harm reduction with false-positive risk.
Scalable and fair enforcement strategy.

Follow-up Questions

How would you choose the classifier threshold?
How would you estimate harm prevented by detection?
How would you monitor adversarial adaptation?
What would you do if one region has worse precision than others?

[Analytics Reasoning] Impact of Malicious Accounts on Meta

Quick Overview

[Analytics Reasoning] Impact of Malicious Accounts on Meta

Constraints & Assumptions

Clarifying Questions to Ask

Part 1 - Probability and Model Reliability

What This Part Should Cover

Part 2 - Data, Features, and Prevalence Measurement

What This Part Should Cover

Part 3 - Product Impact and Enforcement

What This Part Should Cover

What a Strong Answer Covers

Follow-up Questions

Write your answer

[Analytics Reasoning] Impact of Malicious Accounts on Meta

Quick Overview

[Analytics Reasoning] Impact of Malicious Accounts on Meta

Constraints & Assumptions

Clarifying Questions to Ask

Part 1 - Probability and Model Reliability

What This Part Should Cover

Part 2 - Data, Features, and Prevalence Measurement

What This Part Should Cover

Part 3 - Product Impact and Enforcement

What This Part Should Cover

What a Strong Answer Covers

Follow-up Questions

Write your answer