PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Machine Learning/Intuit

Build a predictive model from TurboTax sample data

Last updated: Mar 29, 2026

Quick Overview

This question evaluates predictive modeling skills including target selection, label leakage prevention, feature engineering from product and historical interaction data, model selection and evaluation, and considerations for deployment and interpretability using a tax-preparation sample dataset.

  • easy
  • Intuit
  • Machine Learning
  • Data Scientist

Build a predictive model from TurboTax sample data

Company: Intuit

Role: Data Scientist

Category: Machine Learning

Difficulty: easy

Interview Round: Onsite

You receive a **TurboTax sample dataset** (user-level and/or session-level) and are asked to build a predictive model. ## Task 1. Pick a concrete prediction target (choose one and justify): - Probability a user will **file** within 14 days of starting. - Probability a user will **churn** (not file this season). - Expected **revenue** from the user this season. 2. Describe how you would: - Define labels and avoid label leakage. - Build features from product interactions and historical attributes. - Choose a baseline model and at least one stronger model. - Evaluate performance (metrics, calibration, slice performance). - Turn the model into an actionable recommendation (e.g., targeting, prioritization, interventions). ## Constraints / realism - Data may be missing or delayed. - Class imbalance is likely (e.g., churn). - The business cares about interpretability and safe deployment.

Quick Answer: This question evaluates predictive modeling skills including target selection, label leakage prevention, feature engineering from product and historical interaction data, model selection and evaluation, and considerations for deployment and interpretability using a tax-preparation sample dataset.

Related Interview Questions

  • When should products use AI? - Intuit (easy)
  • Engineer and Impute ZIP Features - Intuit (medium)
  • Engineer ZIP Features and Handle Missingness - Intuit (medium)
  • Handle missing and unavailable predictive features - Intuit (easy)
  • Decide when to model courier ETA - Intuit (hard)
Intuit logo
Intuit
Aug 1, 2025, 12:00 AM
Data Scientist
Onsite
Machine Learning
5
0
Loading...

You receive a TurboTax sample dataset (user-level and/or session-level) and are asked to build a predictive model.

Task

  1. Pick a concrete prediction target (choose one and justify):
    • Probability a user will file within 14 days of starting.
    • Probability a user will churn (not file this season).
    • Expected revenue from the user this season.
  2. Describe how you would:
    • Define labels and avoid label leakage.
    • Build features from product interactions and historical attributes.
    • Choose a baseline model and at least one stronger model.
    • Evaluate performance (metrics, calibration, slice performance).
    • Turn the model into an actionable recommendation (e.g., targeting, prioritization, interventions).

Constraints / realism

  • Data may be missing or delayed.
  • Class imbalance is likely (e.g., churn).
  • The business cares about interpretability and safe deployment.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Intuit•More Data Scientist•Intuit Data Scientist•Intuit Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.