PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Analytics & Experimentation/Airwallex

How would you measure billboard impact?

Last updated: Apr 11, 2026

Quick Overview

This question evaluates a data scientist's skills in experiment and product design, causal inference (including propensity-score approaches), metric definition and trade-off analysis, power and sample-size estimation, and handling biases such as interference, repeated exposure, and heterogeneous effects.

  • medium
  • Airwallex
  • Analytics & Experimentation
  • Data Scientist

How would you measure billboard impact?

Company: Airwallex

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: medium

Interview Round: Onsite

Pinterest is considering launching a new **billboard**, a large and prominent promotional module shown near the top of the home feed or another major surface to highlight a campaign, creator, topic, or commercial placement. As a Data Scientist, design how you would evaluate whether this billboard is effective. Your answer should address: 1. **Product design and targeting** - What user problem or business goal is the billboard solving? - Which users should see it? - How often should it be shown? 2. **Experiment design** - What should the treatment and control groups be? - What is the correct randomization unit: user, session, device, geo, or something else? - How would you account for interference, repeated exposure, or novelty effects? 3. **Metrics** - Define primary success metrics, secondary metrics, and guardrail metrics. - Consider tradeoffs across short-term engagement, long-term retention, monetization, content diversity, user satisfaction, and downstream creator or advertiser outcomes. 4. **Causal inference if randomization is not feasible** - Explain the difference between **Propensity Score Matching (PSM)** and **Propensity Score Weighting (PSW / inverse-propensity weighting)**. - What estimand does each approach usually target, such as ATE or ATT? - What is each method trying to optimize or balance? - What diagnostics would you use to assess whether the method is credible? 5. **Sample size and power** - How would you calculate the required sample size or minimum detectable effect for the main metric? - What tradeoffs matter when choosing duration, traffic allocation, and metric sensitivity? 6. **Risks and edge cases** - Discuss selection bias, heterogeneous treatment effects, spillovers, Simpson's paradox across segments, and other practical pitfalls. Provide a rigorous measurement plan, not just a list of metrics.

Quick Answer: This question evaluates a data scientist's skills in experiment and product design, causal inference (including propensity-score approaches), metric definition and trade-off analysis, power and sample-size estimation, and handling biases such as interference, repeated exposure, and heterogeneous effects.

Related Interview Questions

  • How would you test billboard effectiveness? - Airwallex (medium)
  • Investigate Harassment Surge and Mitigation - Airwallex (medium)
  • How would you test swapping two CTA buttons? - Airwallex (medium)
  • Diagnose a dip in approval/conversion rate - Airwallex (medium)
  • Diagnose conversion-rate time series and CTA swap - Airwallex (easy)
Airwallex logo
Airwallex
Oct 24, 2025, 12:00 AM
Data Scientist
Onsite
Analytics & Experimentation
2
0
Loading...

Pinterest is considering launching a new billboard, a large and prominent promotional module shown near the top of the home feed or another major surface to highlight a campaign, creator, topic, or commercial placement.

As a Data Scientist, design how you would evaluate whether this billboard is effective. Your answer should address:

  1. Product design and targeting
    • What user problem or business goal is the billboard solving?
    • Which users should see it?
    • How often should it be shown?
  2. Experiment design
    • What should the treatment and control groups be?
    • What is the correct randomization unit: user, session, device, geo, or something else?
    • How would you account for interference, repeated exposure, or novelty effects?
  3. Metrics
    • Define primary success metrics, secondary metrics, and guardrail metrics.
    • Consider tradeoffs across short-term engagement, long-term retention, monetization, content diversity, user satisfaction, and downstream creator or advertiser outcomes.
  4. Causal inference if randomization is not feasible
    • Explain the difference between Propensity Score Matching (PSM) and Propensity Score Weighting (PSW / inverse-propensity weighting) .
    • What estimand does each approach usually target, such as ATE or ATT?
    • What is each method trying to optimize or balance?
    • What diagnostics would you use to assess whether the method is credible?
  5. Sample size and power
    • How would you calculate the required sample size or minimum detectable effect for the main metric?
    • What tradeoffs matter when choosing duration, traffic allocation, and metric sensitivity?
  6. Risks and edge cases
    • Discuss selection bias, heterogeneous treatment effects, spillovers, Simpson's paradox across segments, and other practical pitfalls.

Provide a rigorous measurement plan, not just a list of metrics.

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More Airwallex•More Data Scientist•Airwallex Data Scientist•Airwallex Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.