PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Analytics & Experimentation/Upstart

Decide to ship a signup experiment

Last updated: Mar 29, 2026

Quick Overview

This question evaluates A/B testing and experimentation competencies, including data quality validation, metric selection, statistical estimation, power analysis, temporal and cohort effects, and decision-making trade-offs, and it falls under the Analytics & Experimentation domain for Data Scientist roles.

  • hard
  • Upstart
  • Analytics & Experimentation
  • Data Scientist

Decide to ship a signup experiment

Company: Upstart

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: hard

Interview Round: Technical Screen

You receive results for an A/B experiment on a redesigned user signup flow. Data file columns: variant (A/B), sessions, signups, activation_d7, p95_latency_ms, support_tickets, refund_rate, and revenue_d30. Describe exactly how you would: (1) validate data quality (SRM test, bucketing integrity, exposure logs vs analytics counts); (2) choose primary and guardrail metrics and justify them; (3) compute effects with CIs (include ratio metrics and non-parametric options if skewed), and apply variance reduction (e.g., CUPED) if appropriate; (4) check power/min detectable effect and whether the observed duration met the pre-registered stopping rule; (5) evaluate novelty and learning effects (time-sliced and cohort views); (6) make the ship/no-ship call with a concrete decision framework that balances activation gains vs increased latency/support tickets; (7) list at least three additional insights to extract beyond the ship decision (e.g., segment heterogeneity by traffic source/device, step-drop analysis, form-field sensitivity). Be precise about formulas, tests, and thresholds you would use.

Quick Answer: This question evaluates A/B testing and experimentation competencies, including data quality validation, metric selection, statistical estimation, power analysis, temporal and cohort effects, and decision-making trade-offs, and it falls under the Analytics & Experimentation domain for Data Scientist roles.

Related Interview Questions

  • Estimate impact without experiments and pick variant - Upstart (easy)
  • Evaluate channels and allocate budget - Upstart (hard)
  • Analyze aggregator lender page flows - Upstart (hard)
  • Formulate hypotheses and metrics for video-pin ramp - Upstart (hard)
  • Design Experiment to Measure Airport Surge-Pricing Impact - Upstart (hard)
Upstart logo
Upstart
Oct 13, 2025, 9:49 PM
Data Scientist
Technical Screen
Analytics & Experimentation
5
0

A/B Test Plan: Redesigned User Signup Flow

Context and Data

You are analyzing an A/B experiment for a redesigned user signup flow. The dataset includes the following columns per user/session or per aggregation unit: variant (A/B), sessions, signups, activation_d7, p95_latency_ms, support_tickets, refund_rate, revenue_d30.

Assumptions to make explicit and verify:

  • Randomization unit: user_id (sticky assignment). If only session-level logs exist, cluster by user_id.
  • Definitions:
    • sessions: count of sessions exposed to the variant.
    • signups: count of completed signups.
    • activation_d7: count of users who activated within 7 days of signup (define “activation” precisely for your product).
    • p95_latency_ms: 95th percentile request latency during signup flow (computed from request-level logs).
    • support_tickets: count of tickets attributable to the signup/onboarding experience.
    • refund_rate: refunds per activated or paying user (clarify denominator; use consistent unit across variants).
    • revenue_d30: total revenue within 30 days from users exposed (define whether revenue is per user, per activated user, or all exposed; prefer per-user for inference).

Goal: Decide whether to ship the redesign using a principled testing plan with data quality checks, metric selection, estimation, power, temporal effects, and decision criteria.

Tasks

Describe exactly how you would:

  1. Validate data quality (SRM test, bucketing integrity, exposure logs vs analytics counts).
  2. Choose primary and guardrail metrics and justify them.
  3. Compute effects with confidence intervals (including ratio metrics and non-parametric options if skewed), and apply variance reduction (e.g., CUPED) if appropriate.
  4. Check power/min detectable effect and whether the observed duration met the pre-registered stopping rule.
  5. Evaluate novelty and learning effects (time-sliced and cohort views).
  6. Make the ship/no-ship call with a concrete decision framework that balances activation gains vs increased latency/support tickets.
  7. List at least three additional insights to extract beyond the ship decision (e.g., segment heterogeneity by traffic source/device, step-drop analysis, form-field sensitivity).

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More Upstart•More Data Scientist•Upstart Data Scientist•Upstart Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.