PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/Statistics & Math/Airbnb

Test conversion difference and adjust for clustering

Last updated: Mar 29, 2026

Quick Overview

This question evaluates proficiency in statistical inference for A/B testing—estimating and comparing conversion proportions, conducting two-sided hypothesis tests, adjusting for day-level clustering using ICC and design-effect corrections, and performing power and sample-size calculations; it belongs to the Statistics & Math domain for a Data Scientist role and combines conceptual understanding with practical application. It is commonly asked to assess an interviewee's ability to interpret conversion uplift under realistic experimental constraints, account for intra-cluster correlation when estimating effective sample sizes and uncertainty, and reason about experiment duration and robustness checks.

  • Medium
  • Airbnb
  • Statistics & Math
  • Data Scientist

Test conversion difference and adjust for clustering

Company: Airbnb

Role: Data Scientist

Category: Statistics & Math

Difficulty: Medium

Interview Round: Technical Screen

Using aggregated results for the 7‑day window 2025‑08‑26..2025‑09‑01, evaluate statistical significance and power for conversion uplift, accounting for day‑level clustering: Given totals: Control (C): visits n_C=10,240, bookings x_C=308; Treatment (T): visits n_T=10,180, bookings x_T=351. 1) Point estimates: compute p_C, p_T, absolute lift (p_T − p_C, in percentage points) and relative lift. 2) Significance: perform a two‑sided test for difference in proportions (unpooled standard error). Report z, p‑value, and a 95% CI for (p_T − p_C). State any continuity correction you apply. 3) Clustering: adjust for day‑level clustering with ICC=0.01 and 7 days per variant. Use design effect DE = 1 + (\bar{m} − 1)·ICC where \bar{m} = n_variant / 7. Recompute effective sample sizes n_eff = n / DE and provide an adjusted p‑value/CI. Explain assumptions and limitations of this correction. 4) Power and sample size: What total visits per variant are required to detect a 0.30 percentage‑point absolute lift from a 3.00% baseline at 80% power and alpha=0.05 using an unpooled z‑test? Show the formula and final n per variant. Then recompute with the design effect from ICC=0.01 to give a clustered n per variant and the implied experiment duration if each variant receives 2,000,000 visits/day. 5) Robustness: briefly describe how you would check day‑to‑day heterogeneity (e.g., Q‑test or interaction with weekday) and how that influences the decision to launch.

Quick Answer: This question evaluates proficiency in statistical inference for A/B testing—estimating and comparing conversion proportions, conducting two-sided hypothesis tests, adjusting for day-level clustering using ICC and design-effect corrections, and performing power and sample-size calculations; it belongs to the Statistics & Math domain for a Data Scientist role and combines conceptual understanding with practical application. It is commonly asked to assess an interviewee's ability to interpret conversion uplift under realistic experimental constraints, account for intra-cluster correlation when estimating effective sample sizes and uncertainty, and reason about experiment duration and robustness checks.

Airbnb logo
Airbnb
Oct 13, 2025, 9:49 PM
Data Scientist
Technical Screen
Statistics & Math
7
0

Using aggregated results for the 7‑day window 2025‑08‑26..2025‑09‑01, evaluate statistical significance and power for conversion uplift, accounting for day‑level clustering: Given totals: Control (C): visits n_C=10,240, bookings x_C=308; Treatment (T): visits n_T=10,180, bookings x_T=351.

  1. Point estimates: compute p_C, p_T, absolute lift (p_T − p_C, in percentage points) and relative lift.
  2. Significance: perform a two‑sided test for difference in proportions (unpooled standard error). Report z, p‑value, and a 95% CI for (p_T − p_C). State any continuity correction you apply.
  3. Clustering: adjust for day‑level clustering with ICC=0.01 and 7 days per variant. Use design effect DE = 1 + (\bar{m} − 1)·ICC where \bar{m} = n_variant / 7. Recompute effective sample sizes n_eff = n / DE and provide an adjusted p‑value/CI. Explain assumptions and limitations of this correction.
  4. Power and sample size: What total visits per variant are required to detect a 0.30 percentage‑point absolute lift from a 3.00% baseline at 80% power and alpha=0.05 using an unpooled z‑test? Show the formula and final n per variant. Then recompute with the design effect from ICC=0.01 to give a clustered n per variant and the implied experiment duration if each variant receives 2,000,000 visits/day.
  5. Robustness: briefly describe how you would check day‑to‑day heterogeneity (e.g., Q‑test or interaction with weekday) and how that influences the decision to launch.

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Statistics & Math•More Airbnb•More Data Scientist•Airbnb Data Scientist•Airbnb Statistics & Math•Data Scientist Statistics & Math
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.