Airbnb Data Scientist Interview Questions
Airbnb Data Scientist interview questions typically probe both technical depth and product impact: expect live Python coding and SQL (joins, window functions, CTEs), experiment design and statistics, applied machine learning trade-offs, plus a take‑home or project presentation. What’s distinctive about Airbnb’s loop is its marketplace focus—interviewers often evaluate your ability to tie models and analyses to concrete business metrics (supply/demand, pricing, conversion funnels) and to reason about biases, causality, and operational constraints while collaborating across product and engineering teams. For interview preparation, prioritize three threads: technical fluency (SQL, pandas/numpy, model evaluation), experiment and product sense (A/B test design, metric definition, diagnostic thinking), and storytelling (clear presentation of a take‑home or past project, plus strong STAR behavioral examples). Practice full “data loops” end‑to‑end: frame the business question, outline the analysis or model, defend assumptions and tradeoffs, and translate findings into actionable recommendations. Familiarize yourself with Airbnb’s values and be ready to explain impact, cross‑functional collaboration, and decisions under uncertainty.
Design an A/B test with causal inference
A/B Test Design: Checkout Nudge (Guest-Level Randomization) Setup - Run dates: 2025-08-04 to 2025-08-31 (28 days). Analyze the primary metric on a mat...
Analyze A/B test with rigorous diagnostics
A/B Test Analysis Live Walkthrough (Python) Context You are given a user-level randomized experiment dataset experiment.csv with columns: - user_id - ...
Compute C/T metrics from bookings and visits
Given two tables, compute control vs treatment (C/T) metrics, apply 24‑hour attribution, and generate a daily plot. Treat “today” as 2025‑09‑01; use t...
Design a network-aware Wi‑Fi badge experiment
You work on a two‑sided travel search marketplace and product wants to add a “High Wi‑Fi” badge/filter in the search bar to help remote workers. Recom...
Compute browsing metrics in Python from logs
Given event logs, write idiomatic Pandas to compute segment-level metrics and a funnel. Data schema: events(event_id, ts_utc, guest_id, device in {des...
Estimate impact of global launch without holdout
Causal Lift Plan After a Global Launch Without a Holdout Background A new product feature was launched globally on 2025-05-10, with no control or hold...
Test conversion difference and adjust for clustering
Using aggregated results for the 7‑day window 2025‑08‑26..2025‑09‑01, evaluate statistical significance and power for conversion uplift, accounting fo...
Lead cross-functional decision without RCT evidence
Behavioral: Ship vs. Rollback After a Global Launch Without a Holdout Context You are a Data Scientist in a consumer marketplace. An important feature...
Influence Decisions Without Direct Authority: Strategies and Outcomes
Behavioral & Leadership: Influencing Without Authority Scenario Cross-functional business interview with product and engineering stakeholders for a Da...
Define Success Metrics and Experiment Plan for Product Development
Scenario You are in a product-planning session and must define success criteria before development begins for a new change to the core booking funnel ...
Design robust primary and guardrail metrics
Experiment Metric Design, Guardrails, and Power for a 14-Day A/B Test Context You are testing a newly launched, guest-facing booking feature in a glob...
Estimate Causal Impact Using Synthetic Control Methods
Estimating Causal Impact After a 100% Rollout (No Holdout) Context A product feature was launched to 100% of traffic simultaneously, so there is no ex...
Design and assess an A/B test
Experiment Design: New Onboarding Flow to Improve D7 Retention You are testing a new onboarding flow for a consumer marketplace app available on iOS, ...
Analyze A/B Test Results to Inform Stakeholder Decisions
A/B Test: Clean, Analyze, Visualize, and Interpret Raw Log-Level Data Scenario You receive raw, log-level event data for an A/B test on a consumer boo...
Build and evaluate an order prediction model
Predict 7-Day Order Completion from First Session You are building a binary classifier to predict whether a guest will complete an order within 7 days...
Build panel in SQL; run causal regression
Assume today is 2025-09-01 (UTC). Schema and small samples: users(user_id INT, country STRING, signup_date DATE, platform STRING) Sample: user_id | co...
Define product success metrics
Define Metrics and Experiment Guardrails for a New Consumer Feature Context (Assumption to Ground the Exercise) Assume you are launching a "Price Drop...
Aggregate User Activity, Fit Regression, Interpret Coefficients
user_metrics +----------+------------+---------+--------+-----------+ | user_id | activity_dt| variant | clicks | purchases | +----------+-----------...