Citadel Data Scientist Machine Learning Interview Questions
Design Framework for Robust House-Price Prediction Model
Model Robustness, Diagnostics, Random Forests, and Large-Scale Regression. Context: You are building and evaluating a supervised model to predict reside...
Estimate OLS via streaming sufficient statistics
Streaming OLS and Ridge for Out-of-Core, High-Dimensional Linear Regression. You need to estimate linear regression coefficients when the dataset is to...
Design city home-price prediction system
End-to-End System Design: Predict Residential Property Sale Prices. Context: You are tasked with building a production-grade machine learning system to ...
Diagnose outliers and influence in linear regression
OLS Diagnostics: Outliers, Leverage, Influence, and Cook's Distance. Context: You are fitting an ordinary least squares (OLS) linear regression with an ...
Explain RF optimization and variable-importance pitfalls
Optimize and Regularize a Random Forest Regressor for Tabular Data. Context: You are training a Random Forest (RF) regressor on tabular data and need t...
Design regression and classification ML pipelines
Take‑Home: Two End‑to‑End ML Workflows on Tabular Data. Objective: Design and implement two complete machine learning workflows on tabular data (typical...
Build a baseline linear regression pipeline
Task: Baseline Linear Regression Pipeline (Python). Context: You are given a tabular dataset in a pandas DataFrame df. The goal is to predict a continuo...
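As a rough illustration of the "Estimate OLS via streaming sufficient statistics" question listed above, the sketch below accumulates X^T X and X^T y one chunk at a time and solves the normal equations once at the end. It is one plausible reading of the question, not the expected answer. The function names (update_stats, solve, fit_streaming) and the chunk format are assumptions made for illustration. It uses plain Python to stay dependency-free; in practice a numerical library would do the accumulation and the solve. Note that the optional ridge term here also penalizes an intercept column if one is included.

```python
def update_stats(xtx, xty, rows, targets):
    """Fold one chunk of data into the running X^T X and X^T y (in place)."""
    for x, y in zip(rows, targets):
        for i, xi in enumerate(x):
            xty[i] += xi * y
            for j, xj in enumerate(x):
                xtx[i][j] += xi * xj


def solve(a, b):
    """Solve a @ w = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [bi] for row, bi in zip(a, b)]  # augmented matrix copy
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system; consider ridge > 0")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    w = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        s = sum(m[i][j] * w[j] for j in range(i + 1, n))
        w[i] = (m[i][n] - s) / m[i][i]
    return w


def fit_streaming(chunks, n_features, ridge=0.0):
    """Estimate coefficients from an iterable of (rows, targets) chunks.

    Only the p x p matrix X^T X and the length-p vector X^T y are kept
    in memory, so the full dataset never has to be loaded at once.
    """
    xtx = [[0.0] * n_features for _ in range(n_features)]
    xty = [0.0] * n_features
    for rows, targets in chunks:
        update_stats(xtx, xty, rows, targets)
    for i in range(n_features):
        xtx[i][i] += ridge  # ridge = 0.0 gives plain OLS
    return solve(xtx, xty)


# Toy data generated from y = 1 + 2 * x, split into two chunks.
# The first column of each row is the constant 1 for the intercept.
chunks = [
    ([[1.0, 0.0], [1.0, 1.0]], [1.0, 3.0]),
    ([[1.0, 2.0], [1.0, 3.0]], [5.0, 7.0]),
]
print(fit_streaming(chunks, n_features=2))  # close to [1.0, 2.0]
```

Memory use depends only on the number of features, not the number of rows. Solving the normal equations directly can be numerically fragile for ill-conditioned designs, which is one reason the question pairs OLS with ridge.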