PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Machine Learning/Intuit

Engineer and Impute ZIP Features

Last updated: Apr 8, 2026

Quick Overview

This question evaluates feature engineering, missing-data imputation, high-cardinality encoding, and considerations of fairness, privacy, and data leakage in predictive modeling.

  • medium
  • Intuit
  • Machine Learning
  • Data Scientist

Engineer and Impute ZIP Features

Company: Intuit

Role: Data Scientist

Category: Machine Learning

Difficulty: medium

Interview Round: Technical Screen

You are building a predictive model for a product team. For some users, you have address fields such as street, city, state, and ZIP code. 1. What features would you derive directly from the address, and what external public datasets could you join on ZIP code or geography to create additional model features? 2. If ZIP code is missing for some users, how would you handle those cases? In your answer, discuss: - useful geographic and socioeconomic features - high-cardinality encoding choices - when to drop vs. impute missing ZIP codes - missingness as a potentially informative signal - fairness, privacy, and leakage risks - how you would evaluate whether these features improve model performance

Quick Answer: This question evaluates feature engineering, missing-data imputation, high-cardinality encoding, and considerations of fairness, privacy, and data leakage in predictive modeling.

Related Interview Questions

  • When should products use AI? - Intuit (easy)
  • Engineer ZIP Features and Handle Missingness - Intuit (medium)
  • Handle missing and unavailable predictive features - Intuit (easy)
  • Decide when to model courier ETA - Intuit (hard)
  • Build a predictive model from TurboTax sample data - Intuit (easy)
Intuit logo
Intuit
Feb 23, 2026, 12:00 AM
Data Scientist
Technical Screen
Machine Learning
5
0

You are building a predictive model for a product team. For some users, you have address fields such as street, city, state, and ZIP code.

  1. What features would you derive directly from the address, and what external public datasets could you join on ZIP code or geography to create additional model features?
  2. If ZIP code is missing for some users, how would you handle those cases?

In your answer, discuss:

  • useful geographic and socioeconomic features
  • high-cardinality encoding choices
  • when to drop vs. impute missing ZIP codes
  • missingness as a potentially informative signal
  • fairness, privacy, and leakage risks
  • how you would evaluate whether these features improve model performance

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Intuit•More Data Scientist•Intuit Data Scientist•Intuit Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.