PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Analytics & Experimentation/Meta

Diagnose sales correlations without claiming causality

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a data scientist's competency in designing correlation-focused observational analyses, including exposure-window definition, confounding control, within-group differencing, bias identification, sensitivity checks, and communication of non-causal findings.

  • hard
  • Meta
  • Analytics & Experimentation
  • Data Scientist

Diagnose sales correlations without claiming causality

Company: Meta

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: hard

Interview Round: Technical Screen

You support a sales team and are asked to find "which outreach channels correlate with higher deal win rate" without building predictive models. You have deal-level data: deal_id, account_id, rep_id, created_at, closed_at, is_won, amount_usd, product_line, region; and touch-level data: account_id, rep_id, touch_date, channel (email/call/demo/webinar), is_primary_contact. Design a correlation-focused analysis that is decision-ready but avoids causal claims: (a) Define a defensible exposure window (e.g., touches within the first 14 days after created_at) and justify how you’ll handle right-censoring for open deals and late touches; (b) Specify stratifications and/or matching you’ll use (e.g., region, segment, deal size buckets, rep tenure) to control confounding without modeling; (c) Show exactly how you’d compute within-rep, within-segment correlations to avoid between-rep composition bias (outline de-meaning or fixed-effects style differencing before correlating); (d) List bias risks (reverse causality when hot deals drive more touches, missing-not-at-random touches on lost deals, seasonality) and propose sensitivity checks (pre-registration of windows, placebo windows before deal creation, leave-one-rep-out analysis, randomization inference) to see if correlations are robust; (e) Describe two plots you’d present that can reveal Simpson’s paradox across regions or segments and how you’d detect and communicate it; (f) Write the exact decision guardrails you’ll present to sales leadership to prevent causal overreach and how you’d phrase them.

Quick Answer: This question evaluates a data scientist's competency in designing correlation-focused observational analyses, including exposure-window definition, confounding control, within-group differencing, bias identification, sensitivity checks, and communication of non-causal findings.

Related Interview Questions

  • Measure scheduled posts feature success - Meta (medium)
  • Estimate ads ranking revenue impact - Meta (medium)
  • How should you evaluate unconnected content? - Meta (medium)
  • Should WhatsApp launch group calls? - Meta (medium)
  • How would you grow Meta products? - Meta (medium)
Meta logo
Meta
Oct 13, 2025, 9:49 PM
Data Scientist
Technical Screen
Analytics & Experimentation
0
0
Loading...

Correlation-Focused Analysis: Outreach Channels vs. Deal Win Rate

You support a sales team and are asked to find which outreach channels correlate with higher deal win rate, without building predictive models. You have two datasets:

  • Deals: deal_id, account_id, rep_id, created_at, closed_at, is_won, amount_usd, product_line, region
  • Touches: account_id, rep_id, touch_date, channel (email/call/demo/webinar), is_primary_contact

Assume you have a frozen data snapshot date T0 (the last day touches and deals are observed). Design a decision-ready, correlation-focused analysis that avoids causal claims:

(a) Define a defensible exposure window (e.g., touches within the first 14 days after created_at) and justify how you’ll handle right-censoring for open deals and late touches.

(b) Specify stratifications and/or matching (e.g., region, segment, deal size buckets, rep tenure) to control confounding without modeling.

(c) Show exactly how you’d compute within-rep, within-segment correlations to avoid between-rep composition bias. Outline a de-meaning or fixed-effects-style differencing before correlating.

(d) List bias risks (reverse causality when hot deals drive more touches, missing-not-at-random touches on lost deals, seasonality) and propose sensitivity checks (pre-registration of windows, placebo windows before deal creation, leave-one-rep-out analysis, randomization inference) to assess robustness.

(e) Describe two plots that can reveal Simpson’s paradox across regions or segments and how you’d detect and communicate it.

(f) Write the exact decision guardrails you’ll present to sales leadership to prevent causal overreach and how you’d phrase them.

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More Meta•More Data Scientist•Meta Data Scientist•Meta Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.