How do I approach Machine Learning interview questions?

Machine Learning questions require understanding of core concepts and practice. PracHub provides solutions with explanations to help you master machine learning interviews.

What difficulty level is this interview question?

This is a easy difficulty Machine Learning question, commonly asked during Onsite rounds at Capital One.

What role is this question designed for?

This question is commonly asked for Data Scientist candidates at Capital One during technical interviews.

Build House Price Model Responsibly | Capital One Interview Question

Quick Overview

This question evaluates a data scientist's competencies in end-to-end supervised learning pipeline design—covering train/validation/test strategy, target and metric selection, handling of categorical features, missing values and outliers, model benchmarking and leakage detection—alongside responsible AI considerations such as subgroup performance evaluation, calibration, ethical risks, and deployment governance. It is commonly asked in Machine Learning interviews to probe both conceptual understanding and practical application, testing technical modeling skills together with ethical and operational judgment, and thus sits in the Machine Learning domain with a level of abstraction spanning conceptual and practical.

You are asked two machine-learning questions.

Part A: House-price prediction Using a cleaned housing dataset with target sale_price, describe an end-to-end approach for building a predictive model.

Your answer should cover:

train, validation, and test splitting strategy,
target transformation and metric choice such as RMSE vs MAE vs RMSLE,
handling categorical features, missing values, and outliers,
baseline model vs stronger models,
leakage checks,
how you would explain your approach if you used an off-the-shelf modeling package during the interview.

Part B: Face-recognition ethics A company wants to deploy face recognition in a high-impact setting. What are the main ethical and ML risks, how would you evaluate subgroup performance and calibration, and what operational safeguards or governance would you require before deployment or before recommending against deployment?

Quick Overview

You are asked two machine-learning questions.

Part A: House-price prediction Using a cleaned housing dataset with target sale_price, describe an end-to-end approach for building a predictive model.

Your answer should cover:

train, validation, and test splitting strategy,
target transformation and metric choice such as RMSE vs MAE vs RMSLE,
handling categorical features, missing values, and outliers,
baseline model vs stronger models,
leakage checks,
how you would explain your approach if you used an off-the-shelf modeling package during the interview.

Build House Price Model Responsibly

Quick Overview

Solution

Submit Your Answer to Earn 20XP

Build House Price Model Responsibly

Quick Overview

Solution

Submit Your Answer to Earn 20XP