PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Statistics & Math/Thumbtack

Design cross-validation; explain bias–variance

Last updated: Mar 29, 2026

Quick Overview

This question evaluates understanding of cross-validation methodologies and the bias–variance tradeoff, testing competency in robust model evaluation, experimental design for temporally ordered and class-imbalanced datasets, and quantitative interpretation of cross-validation error curves.

  • Medium
  • Thumbtack
  • Statistics & Math
  • Data Scientist

Design cross-validation; explain bias–variance

Company: Thumbtack

Role: Data Scientist

Category: Statistics & Math

Difficulty: Medium

Interview Round: Onsite

Define cross-validation rigorously and compare k-fold, stratified k-fold, leave-one-out, nested CV, and time-series rolling/blocked CV. For a dataset with temporal ordering and class imbalance, design an evaluation that avoids leakage while providing stable estimates; justify fold construction and metric choice. Explain bias–variance tradeoff quantitatively: how model complexity, training data size, and regularization shift bias and variance; how CV error curves reveal under/overfitting; and which levers (features, model class, regularization, data augmentation) you would pull when variance is high vs. when bias is high.

Quick Answer: This question evaluates understanding of cross-validation methodologies and the bias–variance tradeoff, testing competency in robust model evaluation, experimental design for temporally ordered and class-imbalanced datasets, and quantitative interpretation of cross-validation error curves.

Related Interview Questions

  • Optimize red-ball draw probability, prove optimality - Thumbtack (medium)
  • Test regional response-rate differences rigorously - Thumbtack (medium)
Thumbtack logo
Thumbtack
Oct 13, 2025, 9:49 PM
Data Scientist
Onsite
Statistics & Math
3
0

Define cross-validation rigorously and compare k-fold, stratified k-fold, leave-one-out, nested CV, and time-series rolling/blocked CV. For a dataset with temporal ordering and class imbalance, design an evaluation that avoids leakage while providing stable estimates; justify fold construction and metric choice. Explain bias–variance tradeoff quantitatively: how model complexity, training data size, and regularization shift bias and variance; how CV error curves reveal under/overfitting; and which levers (features, model class, regularization, data augmentation) you would pull when variance is high vs. when bias is high.

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Statistics & Math•More Thumbtack•More Data Scientist•Thumbtack Data Scientist•Thumbtack Statistics & Math•Data Scientist Statistics & Math
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.