PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Machine Learning/Squarepoint

Solve Probability and Statistics Questions

Last updated: May 19, 2026

Quick Overview

This question evaluates proficiency in statistical modeling (ordinary least squares linear regression), probability theory (law of large numbers and central limit theorem), and combinatorial/probabilistic strategy reasoning, testing competencies in model formulation, inference assumptions, asymptotic approximation, and optimal decision-making under uncertainty. It is commonly asked in machine learning and probability/statistics interviews to verify foundational knowledge of modeling and inference, the ability to interpret assumptions and results, and the application of both conceptual understanding and practical reasoning to probabilistic scenarios.

  • medium
  • Squarepoint
  • Machine Learning
  • Data Scientist

Solve Probability and Statistics Questions

Company: Squarepoint

Role: Data Scientist

Category: Machine Learning

Difficulty: medium

Interview Round: Technical Screen

Answer the following probability, statistics, and modeling questions. ### Part 1: Linear regression and OLS Explain ordinary least squares linear regression. Include: 1. The model form. 2. The loss function minimized by OLS. 3. The closed-form estimator when it exists. 4. Key assumptions commonly used for inference. 5. How to interpret coefficients. 6. What can go wrong with multicollinearity, outliers, heteroskedasticity, or omitted variables. 7. How you would evaluate the model. ### Part 2: Law of large numbers and central limit theorem Explain the difference between the law of large numbers and the central limit theorem. Then apply them to the following situation: Let `X_1, X_2, ..., X_n` be independent and identically distributed random variables with mean `mu` and variance `sigma^2`. Define the sample mean: `X_bar = (X_1 + X_2 + ... + X_n) / n`. 1. What happens to `X_bar` as `n` becomes large? 2. What is the approximate distribution of `X_bar` for large `n`? 3. How would you approximate `P(X_bar > a)` for a given threshold `a`? ### Part 3: Three-person hat strategy Three participants are randomly assigned hats, each independently either black or white with equal probability. Each participant can see the other two participants' hats but not their own. The participants are asked simultaneously to either guess their own hat color or pass. Rules: - If at least one participant guesses correctly and nobody guesses incorrectly, the team wins. - If anyone guesses incorrectly, the team loses. - If everyone passes, the team loses. Before seeing the hats, the participants may agree on a strategy. What strategy maximizes their probability of winning, and what is that maximum probability?

Quick Answer: This question evaluates proficiency in statistical modeling (ordinary least squares linear regression), probability theory (law of large numbers and central limit theorem), and combinatorial/probabilistic strategy reasoning, testing competencies in model formulation, inference assumptions, asymptotic approximation, and optimal decision-making under uncertainty. It is commonly asked in machine learning and probability/statistics interviews to verify foundational knowledge of modeling and inference, the ability to interpret assumptions and results, and the application of both conceptual understanding and practical reasoning to probabilistic scenarios.

Related Interview Questions

  • Analyze expectations, correlations, and investment strategies - Squarepoint (hard)
Squarepoint logo
Squarepoint
May 3, 2026, 12:00 AM
Data Scientist
Technical Screen
Machine Learning
0
0

Answer the following probability, statistics, and modeling questions.

Part 1: Linear regression and OLS

Explain ordinary least squares linear regression.

Include:

  1. The model form.
  2. The loss function minimized by OLS.
  3. The closed-form estimator when it exists.
  4. Key assumptions commonly used for inference.
  5. How to interpret coefficients.
  6. What can go wrong with multicollinearity, outliers, heteroskedasticity, or omitted variables.
  7. How you would evaluate the model.

Part 2: Law of large numbers and central limit theorem

Explain the difference between the law of large numbers and the central limit theorem.

Then apply them to the following situation:

Let X_1, X_2, ..., X_n be independent and identically distributed random variables with mean mu and variance sigma^2. Define the sample mean:

X_bar = (X_1 + X_2 + ... + X_n) / n.

  1. What happens to X_bar as n becomes large?
  2. What is the approximate distribution of X_bar for large n ?
  3. How would you approximate P(X_bar > a) for a given threshold a ?

Part 3: Three-person hat strategy

Three participants are randomly assigned hats, each independently either black or white with equal probability. Each participant can see the other two participants' hats but not their own. The participants are asked simultaneously to either guess their own hat color or pass.

Rules:

  • If at least one participant guesses correctly and nobody guesses incorrectly, the team wins.
  • If anyone guesses incorrectly, the team loses.
  • If everyone passes, the team loses.

Before seeing the hats, the participants may agree on a strategy. What strategy maximizes their probability of winning, and what is that maximum probability?

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Squarepoint•More Data Scientist•Squarepoint Data Scientist•Squarepoint Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.