PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Analytics & Experimentation/Capital One

Design and analyze ad A/B test

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a candidate's competency in online experimentation and statistical analysis, covering hypothesis formulation, variance reduction, clustering and stratification, multiplicity control, power/MDE calculation, and operational metric guardrails.

  • hard
  • Capital One
  • Analytics & Experimentation
  • Data Scientist

Design and analyze ad A/B test

Company: Capital One

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: hard

Interview Round: Take-home Project

You are testing a new ad-ranking algorithm (B) against the current production system (A) on an online video platform. Primary metric: mean watch_time per impression (seconds). Guardrails: (1) error rate ≤ 1%, (2) ad load (ads per session) must not increase, (3) click-through rate (CTR) must not decrease by more than 2% relative. Traffic is 50/50 A/B and randomized at user level for 14 days. Seasonality is weekly and there is a known weekday/weekend effect. Data available daily: impressions, total_watch_time_sec, clicks, sessions, errors by variant and platform (Web, Mobile). Design the experiment analysis plan: (a) State H0/HA and justify whether a one-tailed or two-tailed test is appropriate for the primary metric; (b) Specify the exact test for the primary metric (e.g., two-sample t-test on per-user means, CUPED with a covariate, or cluster-robust approach) and justify assumptions and clustering; (c) Define the variance reduction strategy you would use (e.g., CUPED using pre-experiment watch_time) and how you would compute it; (d) Show how you will check guardrails with multiplicity control (e.g., Holm-Bonferroni), and what decision rule you will use if a guardrail is violated; (e) Describe stratification/segmentation you will pre-register (e.g., by platform and weekday/weekend) and how you will combine strata (fixed vs random effects meta-analysis); (f) Provide a power/MDE calculation sketch assuming baseline mean=70s, sd=25s at user-day level, average 4 impressions/user-day, intra-user correlation 0.35, 200k users per arm over 14 days; (g) Explain how you would diagnose and mitigate traffic imbalance or novelty effects.

Quick Answer: This question evaluates a candidate's competency in online experimentation and statistical analysis, covering hypothesis formulation, variance reduction, clustering and stratification, multiplicity control, power/MDE calculation, and operational metric guardrails.

Related Interview Questions

  • Analyze Subscription, Insurance, App, and Card Cases - Capital One (medium)
  • Diagnose Flight Delays and Burger Launch - Capital One (easy)
  • How should you renew or replace a show? - Capital One (medium)
  • How would you decide to cancel a TV show? - Capital One (easy)
  • Decide Which Show to Renew - Capital One (medium)
Capital One logo
Capital One
Oct 13, 2025, 9:49 PM
Data Scientist
Take-home Project
Analytics & Experimentation
1
0

A/B Test Analysis Plan: New Ad-Ranking Algorithm (B) vs. Production (A)

Context: You are evaluating a new ad-ranking algorithm (B) against the current production system (A) on an online video platform. Randomization is 50/50 at the user level, run for 14 days. Primary metric is mean watch_time per impression (seconds). Weekly seasonality and weekday/weekend effects are known. Daily data by variant and platform (Web, Mobile): impressions, total_watch_time_sec, clicks, sessions, errors.

Guardrails:

  • Error rate ≤ 1% (absolute)
  • Ad load (ads per session) must not increase
  • CTR must not decrease by more than 2% (relative)

Tasks: (a) State H0/HA for the primary metric and justify one-tailed vs. two-tailed. (b) Specify the exact test for the primary metric and justify assumptions and clustering. (c) Define the variance reduction strategy (e.g., CUPED) and how you will compute it. (d) Show how you will check guardrails with multiplicity control and the decision rule if violated. (e) Describe stratification/segmentation to pre-register (e.g., by platform and weekday/weekend) and how to combine strata (fixed vs. random effects). (f) Provide a power/MDE sketch with: baseline mean=70s, sd=25s at user-day level, avg 4 impressions/user-day, intra-user correlation=0.35, 200k users per arm over 14 days. (g) Explain how you will diagnose and mitigate traffic imbalance or novelty effects.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More Capital One•More Data Scientist•Capital One Data Scientist•Capital One Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.