PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Analytics & Experimentation/Capital One

Design and analyze a card signup A/B test

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a data scientist's competency in A/B test design, causal inference, statistical power and sample-size calculation, risk-adjusted profit metrics, sequential testing, identity resolution, and monitoring for credit and fraud impacts.

  • hard
  • Capital One
  • Analytics & Experimentation
  • Data Scientist

Design and analyze a card signup A/B test

Company: Capital One

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: hard

Interview Round: Technical Screen

A bank is launching a co-branded gym credit card. You can show a 3‑month free gym membership offer on the application landing page (Variant B) vs no offer (Control A). Traffic ≈ 200,000 sessions/day; baseline apply→approval rate = 8%; average initial credit line = $1,000; 90‑day charge‑off rate = 1.2% with 60% loss severity; average 90‑day revenue per approved account (interest + interchange) = $120; acquisition bonus and onboarding costs per approved = $40. Risk requires: predicted default probability (PD) of approved pool must not increase by >10% relative to control. You will run a 14‑day 50/50 test. Design and analyze this experiment: 1) Define a single primary metric as risk‑adjusted profit per visitor (RAPV) and write its exact formula from the inputs above; specify at least two guardrail metrics (with thresholds) for risk/fraud/compliance. 2) Convert the 5% relative lift hypothesis on approval rate into an expected lift on RAPV; state assumptions needed to avoid Simpson’s paradox across traffic sources and day‑of‑week. 3) Compute or set up the required sample size per variant for 80% power at α=0.05 to detect the expected RAPV lift; justify variance estimation (delta method vs bootstrap) and whether you’ll use CUPED or pre‑period covariates. 4) Specify randomization unit and identity resolution to prevent contamination (cookies, logged‑in IDs, device graph) and how you’ll treat repeat applicants, bots, and duplicate identities. 5) Detail your sequential testing plan (e.g., group‑sequential or alpha spending) to allow interim safety stops without inflating Type I error; define exact stop/go/ramp criteria. 6) Show how you will monitor risk mix shift (e.g., PD by score bands) and enforce the 10% PD guardrail while avoiding conditioning on post‑treatment variables; propose a stratified analysis and a heterogeneity readout by acquisition channel and geography. 7) Outline data quality checks (event schema, missingness, timeouts), instrumentation events, and backfill/late arrival handling. 8) If Variant B increases approvals by 6% but raises PD by 9% and reduces average credit line by 5%, decide whether to ship, using a 12‑month NPV sensitivity (state discount rate and churn assumptions). 9) List two follow‑on experiments to isolate mechanism (e.g., offer placement vs wording), and one off‑policy evaluation you’d run using historical scores.

Quick Answer: This question evaluates a data scientist's competency in A/B test design, causal inference, statistical power and sample-size calculation, risk-adjusted profit metrics, sequential testing, identity resolution, and monitoring for credit and fraud impacts.

Related Interview Questions

  • Analyze Subscription, Insurance, App, and Card Cases - Capital One (medium)
  • Diagnose Flight Delays and Burger Launch - Capital One (easy)
  • How should you renew or replace a show? - Capital One (medium)
  • How would you decide to cancel a TV show? - Capital One (easy)
  • Decide Which Show to Renew - Capital One (medium)
Capital One logo
Capital One
Oct 13, 2025, 9:49 PM
Data Scientist
Technical Screen
Analytics & Experimentation
4
0

A/B Test Design: Co‑Branded Gym Credit Card Offer

Context: You will A/B test a 3‑month free gym membership offer shown on the application landing page (Variant B) against no offer (Control A). You will run a 14‑day 50/50 experiment. Traffic is ~200,000 sessions/day. Use the inputs below and design a rigorous, risk‑aware experiment and analysis.

Assumptions you may explicitly make when needed:

  • "Approval rate" refers to approvals per visitor/session unless otherwise specified.
  • Revenue/loss inputs are 90‑day averages unless you scale them.
  • The offer is paid by the partner (cost = $0) unless you introduce a placeholder term.

Inputs:

  • Baseline approvals per visitor: 8% (apply→approval rate = 8%)
  • Average initial credit line (CL): $1,000
  • 90‑day charge‑off rate (PD_90): 1.2%
  • Loss severity (LGD): 60%
  • Average 90‑day revenue per approved account: $120 (interest + interchange)
  • Acquisition bonus + onboarding costs per approved: $40
  • Risk constraint: Predicted default probability (PD) of the approved pool must not increase by >10% relative to control.
  • Test: 14 days, 50/50 split, ~200k sessions/day total.

Tasks:

  1. Define a single primary metric as risk‑adjusted profit per visitor (RAPV). Write an exact formula using the inputs above. Specify at least two guardrail metrics (with thresholds) covering risk/fraud/compliance.
  2. Convert a 5% relative lift hypothesis on approval rate into an expected lift on RAPV. State assumptions needed to avoid Simpson’s paradox across traffic sources and day‑of‑week.
  3. Compute or set up the required sample size per variant for 80% power at α = 0.05 to detect the expected RAPV lift. Justify variance estimation (delta method vs bootstrap) and whether you’ll use CUPED or pre‑period covariates.
  4. Specify randomization unit and identity resolution to prevent contamination (cookies, logged‑in IDs, device graph) and how you’ll treat repeat applicants, bots, and duplicate identities.
  5. Detail your sequential testing plan (e.g., group‑sequential or alpha spending) to allow interim safety stops without inflating Type I error; define exact stop/go/ramp criteria.
  6. Show how you will monitor risk mix shift (e.g., PD by score bands) and enforce the 10% PD guardrail while avoiding conditioning on post‑treatment variables; propose a stratified analysis and a heterogeneity readout by acquisition channel and geography.
  7. Outline data quality checks (event schema, missingness, timeouts), instrumentation events, and backfill/late arrival handling.
  8. If Variant B increases approvals by 6% but raises PD by 9% and reduces average credit line by 5%, decide whether to ship, using a 12‑month NPV sensitivity (state discount rate and churn assumptions).
  9. List two follow‑on experiments to isolate mechanism (e.g., offer placement vs wording), and one off‑policy evaluation you’d run using historical scores.

Solution

Show

Submit Your Answer

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More Capital One•More Data Scientist•Capital One Data Scientist•Capital One Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 8,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.