PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/Statistics & Math/Instacart

Choose tests under non‑normal, unequal variance

Last updated: Mar 29, 2026

Quick Overview

This question evaluates understanding of statistical inference for heavy‑tailed, heteroskedastic A/B test metrics, covering CLT conditions for t‑tests, variance‑robust tests, nonparametric and resampling approaches, log transformations and back‑transformation interpretation, and handling zero inflation within the Statistics & Math domain for Data Scientist roles. It is commonly asked because real‑world experimental metrics violate parametric assumptions, so interviewers probe both conceptual understanding of asymptotics and test assumptions and practical application of resampling, transformation, and two‑part modeling choices along with appropriate robustness checks.

  • hard
  • Instacart
  • Statistics & Math
  • Data Scientist

Choose tests under non‑normal, unequal variance

Company: Instacart

Role: Data Scientist

Category: Statistics & Math

Difficulty: hard

Interview Round: Onsite

Your metric is heavy‑tailed and non‑normal with heteroskedasticity across groups. a) Under what conditions is the t‑test still valid via the CLT, and when does Welch’s t‑test materially reduce Type I error inflation? Be specific about sample size, variance ratio, and tail behavior. b) Evaluate Mann–Whitney U, permutation tests, and bootstrap CIs for mean vs median effects; when will each mislead decision‑making? c) A teammate suggests log‑transforming highly skewed AOV. Precisely state what treatment effect a log‑scale comparison estimates; show how to back‑transform and interpret a mean‑difference on log scale as a multiplicative effect on the original scale, and when the “log‑then‑t‑test” biases estimates (zeros, log‑normality violations, Duan smearing). d) With zero inflation (15% zeros), compare delta‑lognormal/Two‑part (hurdle) models vs trimmed means; justify your choice and the robustness checks you’d run.

Quick Answer: This question evaluates understanding of statistical inference for heavy‑tailed, heteroskedastic A/B test metrics, covering CLT conditions for t‑tests, variance‑robust tests, nonparametric and resampling approaches, log transformations and back‑transformation interpretation, and handling zero inflation within the Statistics & Math domain for Data Scientist roles. It is commonly asked because real‑world experimental metrics violate parametric assumptions, so interviewers probe both conceptual understanding of asymptotics and test assumptions and practical application of resampling, transformation, and two‑part modeling choices along with appropriate robustness checks.

Instacart logo
Instacart
Oct 13, 2025, 9:49 PM
Data Scientist
Onsite
Statistics & Math
5
0
Loading...

Heavy-Tailed, Heteroskedastic Metrics in A/B Tests (AOV example)

Context: You are comparing two groups in an A/B test on a spend metric (e.g., Average Order Value per user over a period). The outcome is heavy‑tailed, non‑normal, and shows heteroskedasticity across groups.

Questions

a) Validity of t-tests

  • Under what conditions is a two-sample t-test still valid via the Central Limit Theorem (CLT)?
  • When does Welch’s t-test materially reduce Type I error inflation? Be specific about sample size, variance ratio, and tail behavior.

b) Nonparametric and resampling methods

  • Evaluate Mann–Whitney U, permutation tests, and bootstrap confidence intervals (CIs) for mean vs median effects. When will each mislead decision‑making?

c) Log transformation of AOV

  • If a teammate suggests log-transforming highly skewed AOV, what treatment effect does a log-scale comparison estimate?
  • Show how to back-transform and interpret a mean difference on the log scale as a multiplicative effect on the original scale.
  • When does “log‑then‑t‑test” bias estimates (e.g., zeros, log-normality violations, Duan smearing)?

d) Zero inflation (15% zeros)

  • Compare delta‑lognormal / two‑part (hurdle) models versus trimmed means.
  • Justify your choice and describe robustness checks you would run.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Statistics & Math•More Instacart•More Data Scientist•Instacart Data Scientist•Instacart Statistics & Math•Data Scientist Statistics & Math
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.