Airbnb Data Scientist Interview Questions
Master your tech interview with our curated database of real questions from top companies.
Aggregate User Activity, Fit Regression, Interpret Coefficients
user_metrics +----------+------------+---------+--------+-----------+ | user_id | activity_dt| variant | clicks | purchases | +----------+-----------...
Define Success Metrics and Experiment Plan for Product Development
Scenario You are in a product-planning session and must define success criteria before development begins for a new change to the core booking funnel ...
Analyze A/B Test Results to Inform Stakeholder Decisions
A/B Test: Clean, Analyze, Visualize, and Interpret Raw Log-Level Data Scenario You receive raw, log-level event data for an A/B test on a consumer boo...
Estimate Causal Impact Using Synthetic Control Methods
Estimating Causal Impact After a 100% Rollout (No Holdout) Context A product feature was launched to 100% of traffic simultaneously, so there is no ex...
Influence Decisions Without Direct Authority: Strategies and Outcomes
Behavioral & Leadership: Influencing Without Authority Scenario Cross-functional business interview with product and engineering stakeholders for a Da...
Test conversion difference and adjust for clustering
Using aggregated results for the 7‑day window 2025‑08‑26..2025‑09‑01, evaluate statistical significance and power for conversion uplift, accounting fo...
Compute C/T metrics from bookings and visits
Given two tables, compute control vs treatment (C/T) metrics, apply 24‑hour attribution, and generate a daily plot. Treat “today” as 2025‑09‑01; use t...
Design a network-aware Wi‑Fi badge experiment
You work on a two‑sided travel search marketplace and product wants to add a “High Wi‑Fi” badge/filter in the search bar to help remote workers. Recom...
Build and evaluate an order prediction model
Predict 7-Day Order Completion from First Session You are building a binary classifier to predict whether a guest will complete an order within 7 days...
Compute browsing metrics in Python from logs
Given event logs, write idiomatic Pandas to compute segment-level metrics and a funnel. Data schema: events(event_id, ts_utc, guest_id, device in {des...
Design an A/B test with causal inference
A/B Test Design: Checkout Nudge (Guest-Level Randomization) Setup - Run dates: 2025-08-04 to 2025-08-31 (28 days). Analyze the primary metric on a mat...
Lead cross-functional decision without RCT evidence
Behavioral: Ship vs. Rollback After a Global Launch Without a Holdout Context You are a Data Scientist in a consumer marketplace. An important feature...
Design robust primary and guardrail metrics
Experiment Metric Design, Guardrails, and Power for a 14-Day A/B Test Context You are testing a newly launched, guest-facing booking feature in a glob...
Build panel in SQL; run causal regression
Assume today is 2025-09-01 (UTC). Schema and small samples: users(user_id INT, country STRING, signup_date DATE, platform STRING) Sample: user_id | co...
Analyze A/B test with rigorous diagnostics
A/B Test Analysis Live Walkthrough (Python) Context You are given a user-level randomized experiment dataset experiment.csv with columns: - user_id - ...
Estimate impact of global launch without holdout
Causal Lift Plan After a Global Launch Without a Holdout Background A new product feature was launched globally on 2025-05-10, with no control or hold...
Design and assess an A/B test
Experiment Design: New Onboarding Flow to Improve D7 Retention You are testing a new onboarding flow for a consumer marketplace app available on iOS, ...
Define product success metrics
Define Metrics and Experiment Guardrails for a New Consumer Feature Context (Assumption to Ground the Exercise) Assume you are launching a "Price Drop...