PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Statistics & Math/PayPal

Explain p-values and interpret regressions

Last updated: Jun 15, 2026

Quick Overview

A PayPal Data Scientist onsite statistics rapid-fire covering p-values (plain-language and formal definitions, common misinterpretations, and an A/B-test decision at p=0.03), interpreting linear-regression coefficients under confounding (associational vs causal, 'controlling for' failure modes, and bias-reduction methods), and L1 vs L2 regularization with how to choose the penalty strength. It tests both statistical depth and the ability to communicate to product and technical audiences.

  • easy
  • PayPal
  • Statistics & Math
  • Data Scientist

Explain p-values and interpret regressions

Company: PayPal

Role: Data Scientist

Category: Statistics & Math

Difficulty: easy

Interview Round: Onsite

##### Question This is a statistics rapid-fire onsite for a Data Scientist role. Answer each part clearly and precisely — first as if explaining to a Product Manager, then for a technical audience. **Part A — p-values** 1. Explain **what a p-value is** in plain language (PM-friendly). 2. Give the **formal definition** of a p-value. 3. How should a p-value be **interpreted**, and what are the **common misinterpretations** to avoid? 4. If you run an A/B test and obtain **p = 0.03** for the primary metric, what decision would you make? What additional context would you request before shipping? **Part B — Linear regression with confounding** 5. You fit a linear regression with multiple features and suspect **confounding factors** exist. How do you interpret each parameter (coefficient)? 6. What does **“controlling for other variables”** actually mean, and when can that interpretation fail? 7. What is the difference between an **associational** and a **causal** interpretation of a coefficient? 8. What checks or approaches would you use to **reduce confounding bias**? **Part C — L1 vs L2 regularization** 9. What are **L1 (Lasso)** and **L2 (Ridge)** regularization, and how do they differ in effect? 10. When and why would you prefer one over the other (feature selection, multicollinearity, prediction vs interpretability)? 11. How would you **select the regularization strength** (λ), and what are the practical tradeoffs?

Quick Answer: A PayPal Data Scientist onsite statistics rapid-fire covering p-values (plain-language and formal definitions, common misinterpretations, and an A/B-test decision at p=0.03), interpreting linear-regression coefficients under confounding (associational vs causal, 'controlling for' failure modes, and bias-reduction methods), and L1 vs L2 regularization with how to choose the penalty strength. It tests both statistical depth and the ability to communicate to product and technical audiences.

Related Interview Questions

  • Why does the CLT matter? - PayPal (medium)
  • Explain and interpret p-values correctly - PayPal (easy)
  • Interpret p-values and common pitfalls - PayPal (hard)
  • Decide if a dice-betting game is favorable - PayPal (easy)
  • Explain confounding with an Uber example - PayPal (easy)
PayPal logo
PayPal
Oct 10, 2025, 12:00 AM
Data Scientist
Onsite
Statistics & Math
3
0
Question

This is a statistics rapid-fire onsite for a Data Scientist role. Answer each part clearly and precisely — first as if explaining to a Product Manager, then for a technical audience.

Part A — p-values

  1. Explain what a p-value is in plain language (PM-friendly).
  2. Give the formal definition of a p-value.
  3. How should a p-value be interpreted , and what are the common misinterpretations to avoid?
  4. If you run an A/B test and obtain p = 0.03 for the primary metric, what decision would you make? What additional context would you request before shipping?

Part B — Linear regression with confounding 5. You fit a linear regression with multiple features and suspect confounding factors exist. How do you interpret each parameter (coefficient)? 6. What does “controlling for other variables” actually mean, and when can that interpretation fail? 7. What is the difference between an associational and a causal interpretation of a coefficient? 8. What checks or approaches would you use to reduce confounding bias?

Part C — L1 vs L2 regularization 9. What are L1 (Lasso) and L2 (Ridge) regularization, and how do they differ in effect? 10. When and why would you prefer one over the other (feature selection, multicollinearity, prediction vs interpretability)? 11. How would you select the regularization strength (λ), and what are the practical tradeoffs?

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Statistics & Math•More PayPal•More Data Scientist•PayPal Data Scientist•PayPal Statistics & Math•Data Scientist Statistics & Math
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.