Data Scientist Interview Questions
Practice 2,964 real Data Scientist interview questions for 2026, drawn from actual interviews at Meta, Capital One, Amazon, Google, TikTok and similar employers, each with a detailed solution. The collection is designed to accelerate your preparation for product analytics, ML and production data roles. It emphasizes the practical skills interviewers test: SQL and data manipulation, experiment design and A/B testing, statistical reasoning, Python coding for data problems, model evaluation and feature engineering, machine-learning system tradeoffs, and metric design.

Modern data-science loops blend product thinking with reproducible ML: expect hands-on SQL tasks and funnel analysis in screens, deeper experiment-design and causality questions in mid rounds, and coding or modeling challenges plus ML-system discussions in senior loops. Interviewers evaluate problem framing, statistical rigor, and how you communicate decisions to product partners.

To prepare, prioritize daily SQL practice (CTEs, window functions), refresh hypothesis testing and power calculations, rehearse concise metric-driven narratives, and build a few end-to-end model or experiment stories you can explain clearly under time pressure.

"I got asked a hardcore MCM DP question and I saw it on PracHub as well. Solved that question in 5 minutes. Without PracHub I doubt I could solve it in 5 hours. Though somehow didn't get hired, perhaps I guess I solved it too fast? /s"

"Believe me, I'm a student here in the US. Recently interviewed for MSFT. They asked me the exact question from PracHub. I saw it the night before and ignored it, cause why waste time on random sites. I legit wanna go back and redo this whole thing if I had the chance. Not saying it will work for everyone, but there is certainly some merit to that website. And I'm gonna use it in future prep from now on, like LC tagged."

"10 years of experience but never worked at a top company. PracHub's senior-level questions helped me break into FAANG at 35. Age is just a number."

"I was skeptical about the 'real questions' claim, so I put it to the test. I searched for the exact question I got grilled on at my last Meta onsite... and it was right there. Word for word."

"Got a Google recruiter call on Monday, interview on Friday. Crammed PracHub for 4 days. Passed every round. This platform is a miracle worker."

"I've used LC, Glassdoor, and random Discords. Nothing comes close to the accuracy here. The questions are actually current — that's what got me. Felt like I had a cheat sheet during the interview."

"The solution quality is insane. It covers approach, edge cases, time complexity, follow-ups. Nothing else comes close."

"Legit the only resource you need. TC went from 180k -> 350k. Just memorize the top 50 for your target company and you're golden."

"PracHub Premium for one month cost me the price of two coffees a week. It landed me a $280K+ starting offer."

"Literally just signed a $600k offer. I only had 2 weeks to prep, so I focused entirely on the company-tagged lists here. If you're targeting L5+, don't overthink it."

"Coaches and bootcamp prep courses cost around $200-300 but PracHub Premium is actually less than a Netflix subscription. And it landed me a $178K offer."

"I honestly don't know how you guys gather so many real interview questions. It's almost scary. I walked into my Amazon loop and recognized 3 out of 4 problems from your database."

"Discovered PracHub 10 days before my interview. By day 5, I stopped being nervous. By interview day, I was actually excited to show what I knew."

"I recently cleared Uber interviews (strong hire in the design round) and all the questions were present in PracHub."

"The search is what sold me. I typed in a really niche DP problem I got asked last year and it actually came up, full breakdown and everything. These guys are clearly updating it constantly."
Find Top-5 Similar Rows
You are given two point-in-time snapshot tables generated on the same day in UTC. There is no direct key relationship between the tables; each row in ...
Analyze expectations, correlations, and investment strategies
Consider the following independent quantitative questions. --- 1. Stopping game with three outcomes You play a game consisting of independent rounds. ...
Generate Synthetic Clickstream Data with Python Function
Scenario: The analytics team needs to generate synthetic click-stream records to test a new reporting pipeline before real traffic arrives. Question: Wr...
Evaluate Joint Campaign Strategies for Credit-Card Growth
Scenario: C1 plans to grow its credit-card business by partnering with merchant RentAHome.com on a limited-time offer: 30% off when customers pay with ...
Measure Impact of Merchant Variety on Consumer Experience
Scenario: DoorDash's product team is exploring how merchant variety/selection affects consumer experience and marketplace health, and is considering ex...
Calculate Profit of 4-Month Loan at 30% APR
Credit Risk: Short-term Personal Loan Profitability. Context: You are evaluating the profit on a simple-interest personal loan. Assume no compounding, ...
Compute specialty spend share and top age band
You are given healthcare claims data split across member tables. Tables Assume the following schemas (types may be adapted to your SQL dialect): mem1 ...
What are sum expectation and variance?
Let random variables X and Y have finite means and variances. How do you compute: 1. The expectation of X + Y 2. The variance of X + Y Your answer sho...
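For orientation, the standard identities this question points to are sketched below. They hold for any X and Y with finite means and variances. The rest of the prompt is truncated above, so this is not a full answer to it.

```latex
\begin{align}
\mathbb{E}[X + Y] &= \mathbb{E}[X] + \mathbb{E}[Y] \\
\operatorname{Var}(X + Y) &= \operatorname{Var}(X) + \operatorname{Var}(Y) + 2\,\operatorname{Cov}(X, Y)
\end{align}
```

The first line (linearity of expectation) needs no independence assumption. In the second, the covariance term vanishes when X and Y are uncorrelated, which includes the independent case.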
Design metrics and experiment for stolen-post detection
You work on Stolen Post Detection for a social platform (detecting content that is copied/reposted without permission). A new detection algorithm is p...
Design and assess an A/B test
Experiment Design: New Onboarding Flow to Improve D7 Retention You are testing a new onboarding flow for a consumer marketplace app available on iOS, ...
Optimize switching puzzle solution
Consider a switching puzzle on an m×n grid of lights where toggling a cell flips its state and that of its orthogonal neighbors (Lights Out variant). ...
Implement Buffer Parsers and Generic Map Class
In C++, complete the following two independent implementation tasks. 1. Sequential buffer reader You are given a raw byte buffer and a current poin...
Compute monthly break-even subscribers
Break-even Analysis for a Subscription Streaming Service. Context: A streaming startup incurs monthly fixed costs and per-subscriber variable costs and ...
Design experiment for fake accounts impact
Experiment Design: Removing Detected Fake Accounts and Measuring Causal Impact Context: You are designing an end-to-end experiment on a large, interac...
Diagnose and validate a ratio trend change
You are shown a weekly dispute_rate time series (disputes/succeeded_payments) that rises sharply, then partially reverts. Diagnose whether the change ...
Design a hierarchical forecast for transactions
Stripe wants a country×industry daily GMV forecast for the next 90 days (2025-09-01 to 2025-11-29) using 3+ years of history. You have features: day-o...
Compute Groupon unit economics and break-even
Restaurant Coupons and Unit Economics Context: A restaurant's variable cost (VC) is 40% of pre-discount spend and fixed cost (FC) is $100/day. Assume ...
Find longest uniform substring after k replacements
Given a string s (ASCII, length up to 2e5) and integer k (0 ≤ k ≤ |s|), return the length of the longest substring that can be turned into all the sam...
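The teaser above is cut off, so the following is only a minimal sketch. It assumes the task is the classic sliding-window problem: find the longest substring that becomes a single repeated character after replacing at most k characters. The function name `longest_uniform_after_k` is hypothetical, not taken from the original question.

```python
from collections import Counter


def longest_uniform_after_k(s: str, k: int) -> int:
    """Length of the longest substring that can be made uniform
    by replacing at most k characters (assumed problem statement)."""
    counts = Counter()  # character counts inside the current window
    left = 0            # left edge of the window (inclusive)
    max_freq = 0        # highest single-character count seen in any window so far
    best = 0

    for right, ch in enumerate(s):
        counts[ch] += 1
        max_freq = max(max_freq, counts[ch])

        # Replacements needed = window length - count of its most common char.
        # max_freq may be stale after shrinking, but the answer can only
        # improve when a larger max_freq appears, so the result stays correct.
        while (right - left + 1) - max_freq > k:
            counts[s[left]] -= 1
            left += 1

        best = max(best, right - left + 1)

    return best
```

Each index enters and leaves the window at most once, so the sketch runs in O(|s|) time, which fits the stated length bound of 2e5.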
Maximize products bought under budget
Given N products and M customers, for each customer find the list of distinct products they can buy without exceeding their budget such that the numbe...
Compute sample sizes and error control
Using the Biker experiment context, compute required sample sizes and describe error control under practical constraints. Show formulas and numeric an...