PracHub

Explain variance reduction in random forests

Last updated: Mar 29, 2026

Quick Overview

This question evaluates understanding of variance reduction in ensemble methods, the impact of inter-tree correlation on averaged predictors, and the bias-variance trade-off in random forests, framed within the Machine Learning domain for Data Scientist roles.

  • medium
  • LinkedIn
  • Machine Learning
  • Data Scientist

Explain variance reduction in random forests

Company: LinkedIn

Role: Data Scientist

Category: Machine Learning

Difficulty: medium

Interview Round: Technical Screen

Consider a random forest (or bagged ensemble) that predicts at a fixed input \(x\) by averaging \(B\) tree predictions:

\[
\hat f_B(x) = \frac{1}{B}\sum_{b=1}^B T_b(x).
\]

Assume each tree prediction has the same variance \(\sigma^2\) and any pair of tree predictions has correlation \(\rho\).

  1. Derive the variance of the ensemble prediction \(\hat f_B(x)\).
  2. Explain how this connects to the variance formula for the average of correlated random variables.
  3. Interpret the result for the cases \(\rho=0\), \(\rho=1\), and \(B \to \infty\).
  4. Based on this result, explain why random forests use bagging and random feature subsampling, and discuss the bias-variance trade-off when making trees more random.

Quick Answer: With \(B\) trees of common variance \(\sigma^2\) and pairwise correlation \(\rho\), the variance of the averaged prediction is \(\operatorname{Var}(\hat f_B(x)) = \rho\sigma^2 + \frac{1-\rho}{B}\sigma^2\). Increasing \(B\) drives the second term to zero but leaves the \(\rho\sigma^2\) floor, so random forests attack \(\rho\) itself through bagging and random feature subsampling, accepting a small increase in per-tree bias in exchange for less correlated trees.
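A sketch of the standard derivation behind this answer, expanding the variance of the average and using \(\operatorname{Cov}(T_a(x), T_b(x)) = \rho\sigma^2\) for \(a \neq b\):

\[
\operatorname{Var}\big(\hat f_B(x)\big)
= \frac{1}{B^2}\left[\sum_{b=1}^{B} \operatorname{Var}\big(T_b(x)\big) + \sum_{a \neq b} \operatorname{Cov}\big(T_a(x), T_b(x)\big)\right]
= \frac{B\sigma^2 + B(B-1)\rho\sigma^2}{B^2}
= \rho\sigma^2 + \frac{1-\rho}{B}\sigma^2.
\]

At \(\rho = 0\) this reduces to \(\sigma^2/B\), the familiar rate for independent averaging; at \(\rho = 1\) it equals \(\sigma^2\), since averaging identical trees buys nothing; and as \(B \to \infty\) only the floor \(\rho\sigma^2\) remains, which is precisely the term that decorrelating the trees reduces.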

Related Interview Questions

  • Explain Logistic Regression, Backprop, and Adam - LinkedIn (medium)
  • Answer practical ML foundations questions - LinkedIn (medium)
  • Handle imbalance, sampling, and overfitting - LinkedIn (easy)
  • Handle imbalance, validate samples, and avoid overfitting - LinkedIn (easy)
  • Explain activations, losses, and Adam - LinkedIn (medium)
Posted: Feb 19, 2026

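The variance formula can also be sanity-checked numerically. The sketch below is illustrative, not a model of how real trees behave: it fabricates each of the \(B\) tree predictions as a Gaussian with a shared component, so that every prediction has variance \(\sigma^2\) and every pair has correlation \(\rho\), then compares the empirical variance of their average with \(\rho\sigma^2 + (1-\rho)\sigma^2/B\):

```python
import numpy as np

def ensemble_variance(rho, sigma, B, n_trials=200_000, seed=0):
    """Empirical variance of the average of B equicorrelated predictions.

    Each prediction is modeled as X_b = sigma * (sqrt(rho)*Z + sqrt(1-rho)*E_b),
    where Z is shared across trees and the E_b are independent, giving
    Var(X_b) = sigma^2 and Corr(X_a, X_b) = rho for a != b.
    """
    rng = np.random.default_rng(seed)
    z = rng.standard_normal((n_trials, 1))   # shared component (drives rho)
    e = rng.standard_normal((n_trials, B))   # tree-specific noise
    preds = sigma * (np.sqrt(rho) * z + np.sqrt(1 - rho) * e)
    # Average the B predictions per trial, then take the variance across trials.
    return preds.mean(axis=1).var()

rho, sigma, B = 0.3, 2.0, 50
theory = rho * sigma**2 + (1 - rho) * sigma**2 / B
print(f"theory={theory:.4f}  empirical={ensemble_variance(rho, sigma, B):.4f}")
```

With 200,000 trials the empirical value lands close to the theoretical one, and setting \(\rho = 0\) or \(\rho = 1\) reproduces the two limiting cases in part 3.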



© 2026 PracHub. All rights reserved.