PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Analytics & Experimentation/Capital One

Build a causal ML pipeline end-to-end

Last updated: Mar 29, 2026

Quick Overview

This question evaluates expertise in causal inference and policy learning, covering ATE/CATE estimation, heterogeneous treatment effect methods, off‑policy evaluation, overlap and sensitivity diagnostics, and operational deployment with fairness guardrails in the Analytics & Experimentation domain.

  • hard
  • Capital One
  • Analytics & Experimentation
  • Data Scientist

Build a causal ML pipeline end-to-end

Company: Capital One

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: hard

Interview Round: Onsite

You led a causal-inference project and now need to productionize policy targeting. Tasks: 1) Formulate a DAG for treatment→outcome with key confounders and justify conditional independence assumptions. State your estimand (ATE and ITE/uplift). 2) Choose and justify a method (e.g., doubly robust learner, causal forest, uplift gradient boosting). Describe checks for overlap/positivity and how you’d handle violations. 3) Training/validation: Describe sample splitting for nuisance models, hyperparameter tuning without biasing effect estimates, and how you’d use policy risk/off-policy evaluation (IPW/DR) to compare targeting policies. 4) Diagnostics: Produce uplift curves/Qini, sensitivity to unobserved confounding (e.g., Rosenbaum bounds) and balance checks. 5) Deployment: Translate ITEs into a treatment policy with operational guardrails, fairness constraints across cohorts, and post-deployment monitoring.

Quick Answer: This question evaluates expertise in causal inference and policy learning, covering ATE/CATE estimation, heterogeneous treatment effect methods, off‑policy evaluation, overlap and sensitivity diagnostics, and operational deployment with fairness guardrails in the Analytics & Experimentation domain.

Related Interview Questions

  • Analyze Subscription, Insurance, App, and Card Cases - Capital One (medium)
  • Diagnose Flight Delays and Burger Launch - Capital One (easy)
  • How should you renew or replace a show? - Capital One (medium)
  • How would you decide to cancel a TV show? - Capital One (easy)
  • Decide Which Show to Renew - Capital One (medium)
Capital One logo
Capital One
Oct 13, 2025, 9:49 PM
Data Scientist
Onsite
Analytics & Experimentation
1
0

Policy Targeting from Causal Inference to Production

Context

You completed a causal-inference project estimating the effect of a binary marketing treatment (e.g., a targeted offer) on a business outcome (e.g., customer spend or profit) using observational and/or experimental data. You now need to productionize a targeting policy that treats customers who are most likely to benefit, subject to operational and fairness guardrails.

Assume:

  • Treatment T ∈ {0,1} delivered at time t.
  • Outcome Y measured after treatment (e.g., 90-day profit or conversion).
  • Feature vector X with potential confounders (e.g., demographics, historical spend, engagement, credit/risk, channel, time/seasonality, eligibility).

Tasks

  1. Causal graph and estimands
    • Formulate a DAG for treatment → outcome including key confounders. Justify conditional independence assumptions (e.g., ignorability, SUTVA, positivity).
    • State your estimands: ATE and ITE/uplift (CATE).
  2. Method choice and overlap
    • Choose and justify a method to estimate heterogeneous treatment effects (e.g., doubly robust learner, causal forest, uplift gradient boosting). Explain pros/cons.
    • Describe how you will check overlap/positivity and how you would handle violations.
  3. Training and validation
    • Describe sample splitting and cross-fitting for nuisance models (propensity and outcome models).
    • Explain how to tune hyperparameters without biasing effect estimates.
    • Describe how you would use policy risk/off-policy evaluation (IPW/DR) to compare targeting policies.
  4. Diagnostics
    • Produce uplift curves and Qini coefficients; discuss calibration and interpretation.
    • Describe sensitivity analysis for unobserved confounding (e.g., Rosenbaum bounds) and balance checks.
  5. Deployment
    • Translate ITEs into a treatment policy with operational guardrails (budget, eligibility, risk), fairness constraints across cohorts, and post-deployment monitoring.

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More Capital One•More Data Scientist•Capital One Data Scientist•Capital One Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.