PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Analytics & Experimentation/Meta

Design and analyze a group-calls experiment

Last updated: Mar 29, 2026

Quick Overview

This question evaluates competency in experimental design, causal inference, metric specification, interference handling, and infrastructure-aware analytics within the Analytics & Experimentation domain for a Data Scientist role.

  • Medium
  • Meta
  • Analytics & Experimentation
  • Data Scientist

Design and analyze a group-calls experiment

Company: Meta

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: Medium

Interview Round: Technical Screen

You are considering launching Group Video Calls. Answer all parts precisely; justify choices with pros/cons and formulas where relevant. 1) Clarify CTR comparison: For a new entry-point button, should the primary CTR be (a) Treatment vs Control at the same time (between-subjects), or (b) Treatment users’ Month 1 vs their own Month 6 baseline (within-subjects)? Specify when each is valid, risks of bias (seasonality, maturation), and propose a preferred estimator. Provide formulas for: (i) simple difference vs control, (ii) within-user pre/post, and (iii) difference-in-differences combining both. State the exact windows you’d use (e.g., M1 = 2025-09-01 to 2025-09-30, M6 = 2025-03-01 to 2025-03-31) and how you’d handle users without a full baseline. 2) Experiment design: Propose a full A/B test (or cluster test) for Group Calls considering interference/network effects (users call each other). Specify randomization unit (user, household, call-graph clusters, geography), allocation, stratification (e.g., country, device), and ramp plan. Define primary success metric(s) and why (e.g., incremental video-call minutes per DAU, completed-call rate), and guardrails (e.g., app crashes, latency, churn). Provide decision thresholds and a stopping plan (power, MDE, alpha, sequential corrections). 3) Interference handling: Calls involve multiple users who might land in different variants. Describe a design that avoids contamination (e.g., cluster-by-ego-network, geo switchback). Explain how you’ll attribute a group call to a variant, and how you’ll analyze spillovers; include at least one robustness check (e.g., exposure reweighting, CUPED, cluster-robust SEs). 4) Launch decision resources: Besides the experiment, list external/internal resources you’d consult before greenlighting group calls (market research, competitor benchmarks, support tickets, qualitative UX studies, capacity/SRE constraints, cost models) and explain what each would change in your threshold for launch. 5) Set participant limit: Propose a data-driven method to choose a max participants-per-call limit (e.g., 4/8/16) under infra constraints. Outline how you’d simulate expected concurrency using historical call distributions, model QoS (latency, failure rate) as a function of N, and run a multi-arm experiment to pick the limit. Include guardrails and rollback criteria. 6) Edge cases: Explicitly cover novelty effects, day-of-week effects, France-only vs global rollouts, and how you’d ensure analysis uses the correct absolute dates (e.g., yesterday = 2025-08-31) and local time handling.

Quick Answer: This question evaluates competency in experimental design, causal inference, metric specification, interference handling, and infrastructure-aware analytics within the Analytics & Experimentation domain for a Data Scientist role.

Related Interview Questions

  • Measure scheduled posts feature success - Meta (medium)
  • Estimate ads ranking revenue impact - Meta (medium)
  • How should you evaluate unconnected content? - Meta (medium)
  • Should WhatsApp launch group calls? - Meta (medium)
  • How would you grow Meta products? - Meta (medium)
Meta logo
Meta
Oct 13, 2025, 9:49 PM
Data Scientist
Technical Screen
Analytics & Experimentation
1
0

You are considering launching Group Video Calls. Answer all parts precisely; justify choices with pros/cons and formulas where relevant.

  1. Clarify CTR comparison: For a new entry-point button, should the primary CTR be (a) Treatment vs Control at the same time (between-subjects), or (b) Treatment users’ Month 1 vs their own Month 6 baseline (within-subjects)? Specify when each is valid, risks of bias (seasonality, maturation), and propose a preferred estimator. Provide formulas for: (i) simple difference vs control, (ii) within-user pre/post, and (iii) difference-in-differences combining both. State the exact windows you’d use (e.g., M1 = 2025-09-01 to 2025-09-30, M6 = 2025-03-01 to 2025-03-31) and how you’d handle users without a full baseline.
  2. Experiment design: Propose a full A/B test (or cluster test) for Group Calls considering interference/network effects (users call each other). Specify randomization unit (user, household, call-graph clusters, geography), allocation, stratification (e.g., country, device), and ramp plan. Define primary success metric(s) and why (e.g., incremental video-call minutes per DAU, completed-call rate), and guardrails (e.g., app crashes, latency, churn). Provide decision thresholds and a stopping plan (power, MDE, alpha, sequential corrections).
  3. Interference handling: Calls involve multiple users who might land in different variants. Describe a design that avoids contamination (e.g., cluster-by-ego-network, geo switchback). Explain how you’ll attribute a group call to a variant, and how you’ll analyze spillovers; include at least one robustness check (e.g., exposure reweighting, CUPED, cluster-robust SEs).
  4. Launch decision resources: Besides the experiment, list external/internal resources you’d consult before greenlighting group calls (market research, competitor benchmarks, support tickets, qualitative UX studies, capacity/SRE constraints, cost models) and explain what each would change in your threshold for launch.
  5. Set participant limit: Propose a data-driven method to choose a max participants-per-call limit (e.g., 4/8/16) under infra constraints. Outline how you’d simulate expected concurrency using historical call distributions, model QoS (latency, failure rate) as a function of N, and run a multi-arm experiment to pick the limit. Include guardrails and rollback criteria.
  6. Edge cases: Explicitly cover novelty effects, day-of-week effects, France-only vs global rollouts, and how you’d ensure analysis uses the correct absolute dates (e.g., yesterday = 2025-08-31) and local time handling.

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More Meta•More Data Scientist•Meta Data Scientist•Meta Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.