What difficulty level is this interview question?

This is a medium difficulty Statistics & Math question, commonly asked during Technical Screen rounds at Roblox.

What role is this question designed for?

This question is commonly asked for Data Scientist candidates at Roblox during technical interviews.

Derive variance and CTR confidence intervals

Quick Overview

This question evaluates understanding of estimation and inference for proportions, including CTR point estimation, standard errors, multiple confidence-interval methods (Wald, Wilson, Clopper–Pearson), pooled estimation across days, and the distinction between sample and population variance for day-level variability.

CTR as a Proportion: Estimation, Confidence Intervals, and Day-Level Variability

Context: Click-through rate (CTR) is a proportion metric defined as C/I where C is clicks and I is impressions. Assume impressions are independent and C | I ~ Binomial(I, p).

1) Single-day CTR: point estimate, SE, and 95% CIs

Given I = 200,000 and C = 4,200:

Estimate p̂ and its standard error.
Compute 95% confidence intervals using:
a) Normal/Wald with continuity correction
b) Wilson score interval
c) Exact Clopper–Pearson
Compare widths and coverage properties and state which you would use in production and why.
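
As a starting point for part 1, the three intervals can be sketched with the Python standard library alone. This is a minimal sketch, not a reference solution: it assumes the common 1/(2I) form of the continuity correction, and the helper names `binom_cdf` and `solve` are hypothetical, introduced only for illustration. The Clopper–Pearson bounds are found by bisection on the binomial tail probabilities rather than through a beta-quantile routine.

```python
import math
from statistics import NormalDist

I, C = 200_000, 4_200                      # impressions and clicks from the prompt
alpha = 0.05
z = NormalDist().inv_cdf(1 - alpha / 2)    # two-sided 95% normal quantile

# Point estimate and binomial standard error
p_hat = C / I
se = math.sqrt(p_hat * (1 - p_hat) / I)

# a) Wald interval with continuity correction (assumed form: +/- 1/(2I))
cc = 1 / (2 * I)
wald = (p_hat - z * se - cc, p_hat + z * se + cc)

# b) Wilson score interval
denom = 1 + z**2 / I
center = (p_hat + z**2 / (2 * I)) / denom
half = z * math.sqrt(p_hat * (1 - p_hat) / I + z**2 / (4 * I**2)) / denom
wilson = (center - half, center + half)

# c) Clopper-Pearson via bisection on exact binomial tail probabilities.
# Log binomial coefficients are precomputed to avoid overflow at large I.
log_choose = [
    math.lgamma(I + 1) - math.lgamma(k + 1) - math.lgamma(I - k + 1)
    for k in range(C + 1)
]

def binom_cdf(k_max, p):
    """P(X <= k_max) for X ~ Binomial(I, p), summed in log space."""
    log_p, log_q = math.log(p), math.log1p(-p)
    return math.fsum(
        math.exp(log_choose[k] + k * log_p + (I - k) * log_q)
        for k in range(k_max + 1)
    )

def solve(f, lo, hi, iters=50):
    """Bisection for a root of an increasing f with f(lo) < 0 < f(hi)."""
    for _ in range(iters):
        mid = (lo + hi) / 2
        if f(mid) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

# Lower bound: P(X >= C | p) = alpha/2; upper bound: P(X <= C | p) = alpha/2
cp_lo = solve(lambda p: (1 - binom_cdf(C - 1, p)) - alpha / 2, 0.0, p_hat)
cp_hi = solve(lambda p: alpha / 2 - binom_cdf(C, p), p_hat, 1.0)
clopper_pearson = (cp_lo, cp_hi)

print(f"p_hat={p_hat:.5f}  se={se:.3e}")
for name, (lo, hi) in [("Wald+CC", wald), ("Wilson", wilson),
                       ("Clopper-Pearson", clopper_pearson)]:
    print(f"{name:16s} [{lo:.6f}, {hi:.6f}]  width={hi - lo:.3e}")
```

With I this large, the three intervals differ only on the order of 1e-5, so the choice between them matters mainly for low-traffic segments, where the Wald interval's coverage tends to degrade first.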

2) Pooled CTR across multiple days

Three days with impressions [50,000, 120,000, 30,000] and CTRs [2.0%, 2.6%, 1.8%]. Compute:
a) The pooled CTR across the three days.
b) The standard error of the pooled CTR.
c) The day-to-day standard deviation of CTR treating days as the unit (unweighted STDDEV_SAMP) vs an impression-weighted day-level SD. When is each appropriate?
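
The arithmetic for part 2 can be sketched as follows. This is one reasonable reading of the prompt, under stated assumptions: the pooled SE uses the binomial formula with all impressions treated as independent draws from a single p, and the impression-weighted SD uses normalized weights with no small-sample correction (other weighting conventions exist). Variable names are illustrative.

```python
import math
from statistics import stdev

impressions = [50_000, 120_000, 30_000]
ctrs = [0.020, 0.026, 0.018]

# Recover daily click counts from the stated CTRs (rounded to whole clicks)
clicks = [round(i * r) for i, r in zip(impressions, ctrs)]

# a) Pooled CTR: total clicks over total impressions
total_impressions = sum(impressions)
pooled = sum(clicks) / total_impressions

# b) Binomial SE of the pooled CTR (assumes one common p across days)
se_pooled = math.sqrt(pooled * (1 - pooled) / total_impressions)

# c1) Unweighted day-level SD, days as the unit (matches STDDEV_SAMP, n - 1)
sd_unweighted = stdev(ctrs)

# c2) Impression-weighted day-level SD around the pooled CTR
#     (assumed convention: normalized weights, no bias correction)
weights = [i / total_impressions for i in impressions]
var_weighted = sum(w * (r - pooled) ** 2 for w, r in zip(weights, ctrs))
sd_weighted = math.sqrt(var_weighted)

print(f"clicks per day      = {clicks}")
print(f"pooled CTR          = {pooled:.4f}")
print(f"SE of pooled CTR    = {se_pooled:.3e}")
print(f"unweighted SD (n-1) = {sd_unweighted:.5f}")
print(f"weighted SD         = {sd_weighted:.5f}")
```

Both day-level SDs come out roughly ten times larger than the binomial SE of the pooled CTR, which suggests the day-to-day variation in this toy data is not explained by sampling noise alone.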

3) Sample vs population variance in practice

Explain why we divide by n−1 (sample variance) vs n (population variance). In SQL, when would you prefer STDDEV_SAMP vs STDDEV_POP for daily CTR and CPC aggregates? Provide concrete examples tied to experiment analysis.
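
The n−1 question in part 3 can be illustrated with a small exhaustive enumeration. As a toy assumption, the sketch below treats the three daily CTRs from part 2 as the entire population, enumerates every size-2 sample drawn with replacement, and averages the two variance estimators over all samples. It is a demonstration of the bias, not a derivation.

```python
from itertools import product
from statistics import fmean, pvariance, variance

# Toy assumption: these three daily CTRs are the whole population
population = [0.020, 0.026, 0.018]
sigma2 = pvariance(population)            # true population variance (divide by N)

# Every ordered sample of size 2 drawn with replacement (9 samples in total)
samples = list(product(population, repeat=2))

# Average each estimator over all equally likely samples
mean_div_n = fmean(pvariance(s) for s in samples)          # divide by n
mean_div_n_minus_1 = fmean(variance(s) for s in samples)   # divide by n - 1

print(f"true variance               = {sigma2:.3e}")
print(f"average of divide-by-n      = {mean_div_n:.3e}")
print(f"average of divide-by-(n-1)  = {mean_div_n_minus_1:.3e}")
```

Dividing by n underestimates the variance by a factor of (n−1)/n on average because deviations are measured from the sample mean rather than the true mean; dividing by n−1 removes that bias. In SQL terms, this corresponds to using STDDEV_SAMP when the observed days are a sample from a longer period and STDDEV_POP when the rows are the full set of interest.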
