Google Data Scientist Interview Questions
Build Model to Predict Customer Contract Renewal
Predicting Enterprise Customer Renewal for Google Meet. You are tasked with designing a model to predict whether an enterprise customer will renew thei...
Diagnose YouTube Usage Decline: Key Metrics and Segmentation
Scenario: YouTube observes a sudden decline in daily active users (DAU) and total watch time across the platform. Task: Design a systematic diagnosis pl...
Calculate User Deviation from Team Average Messages
usage_stats
+---------+---------+---------------+------------+
| user_id | team_id | messages_sent | date       |
+---------+---------+---------------...
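The full prompt is truncated here, so the following is only a minimal sketch of one plausible reading: for each user and date, subtract the average `messages_sent` of that user's team on the same date. The sample rows, the function name `deviation_from_team_average`, and the choice to average per team per date are all assumptions made for illustration, not part of the original question.

```python
from collections import defaultdict

# Hypothetical rows shaped like usage_stats: (user_id, team_id, messages_sent, date).
rows = [
    (1, "A", 10, "2024-01-01"),
    (2, "A", 20, "2024-01-01"),
    (3, "B", 5, "2024-01-01"),
    (4, "B", 15, "2024-01-01"),
    (5, "B", 10, "2024-01-01"),
]


def deviation_from_team_average(rows):
    """Return {(user_id, date): messages_sent minus the team's average on that date}."""
    # First pass: accumulate sum and count per (team_id, date).
    totals = defaultdict(lambda: [0, 0])
    for _user_id, team_id, sent, date in rows:
        totals[(team_id, date)][0] += sent
        totals[(team_id, date)][1] += 1

    # Second pass: compare each user with their team's average.
    result = {}
    for user_id, team_id, sent, date in rows:
        total, count = totals[(team_id, date)]
        result[(user_id, date)] = sent - total / count
    return result


deviations = deviation_from_team_average(rows)
```

In SQL, the same idea is usually expressed with a window average partitioned by `team_id` (and `date`, under the assumption above) rather than two passes.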
Compare Logistic Regression and Random Forest in Limited Data Scenarios
Model Selection for Binary Classification with Limited Data and Potential Non-Linearities. Scenario: You are designing a binary classifier with limited ...
Analyze User Flags and Review Outcomes for Moderation Prioritization
UserFlags
+---------------+--------------+----------+---------+
| User_FirstName| User_LastName| Video_ID | Flag_ID |
+---------------+--------------+...
Engineer Features to Enhance Smartphone Battery Life Prediction
Battery Life Prediction with Sparse History. Problem: You are given sparse discharge traces that record battery percentage over elapsed time for prior u...
Analyze Linear Regression Changes with Duplicated Observations
Linear Regression, p-values, and Chi-square with Large Samples. Context: You are analyzing regression and goodness-of-fit results. Consider what happens...
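As a rough illustration of the duplicated-observations case named in the title, the sketch below fits a one-predictor least-squares line twice: once on a small made-up dataset and once on the same dataset stacked on itself. The data and the helper `simple_ols` are hypothetical. Under ordinary least squares the slope is unchanged, while the reported standard error shrinks by a factor of sqrt((n - 2) / (2n - 2)), which is why p-values look more significant even though no new information was added.

```python
import math


def simple_ols(x, y):
    """Fit y = a + b*x by least squares; return (slope, standard error of slope)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    rss = sum((yi - intercept - slope * xi) ** 2 for xi, yi in zip(x, y))
    sigma2 = rss / (n - 2)  # residual variance with two fitted parameters
    return slope, math.sqrt(sigma2 / sxx)


# Made-up data, purely for illustration.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [1.2, 1.9, 3.2, 3.8, 5.1, 6.3]

slope_once, se_once = simple_ols(x, y)
slope_twice, se_twice = simple_ols(x + x, y + y)  # every observation duplicated
ratio = se_twice / se_once  # close to 1/sqrt(2) for large n
```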
Diagnose Google Meet Disconnections and Assess Business Impact
Scenario: Enterprise clients report that Google Meet calls frequently disconnect. Task: Outline an end-to-end analysis plan to diagnose why calls drop a...
Sample and Simulate Price Adjustments in R with dplyr
Products
+----+-----------+-------+
| id | product   | price |
| 1  | phone     | 500   |
| 2  | tablet    | 300   |
| 3  | laptop    | 1000  |
| 4 |...
Estimate Population Mean and Conversion Rate Accurately
Statistical Inference: Hypothesis Tests, Confidence Intervals, Sampling Design, and Truncated Normal Estimation. Context: You are evaluating a set of pr...
Design A/B Test for Subscription Price Increase Effectiveness
Scenario: A B2B SaaS company is considering: 1) Raising subscription prices and wants a two-week A/B test to evaluate the impact. 2) Improving a sign-u...
Design Scalable Database and Analyze E-commerce Data
transactions
+-----------+----------+------------+------------+
| user_id   | order_id | product_id | order_time |
+-----------+----------+-----------...
Identify and Fix Predictive Model Performance Gaps
Model Review: Month Encoding, Feature Scaling, and Imbalanced Data. Context: You are auditing an existing predictive model for operational performance. ...
Design A/B Test to Isolate Product Usage Drop Causes
Investigating a 10–11% Product Usage Drop via Experimentation. Context: You observe that product usage fell by 10% in the U.S. and 11% in Mexico over th...
Build Classifier: Evaluate with AUROC for Imbalanced Data
Detecting Dead Links: Build and Evaluate a Classifier. Scenario: You have a dataset of 1,000 URLs labeled as good (alive) or bad (dead). The classes are...
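To make the AUROC metric from the title concrete, here is a minimal sketch that computes it directly from its pairwise definition: the probability that a randomly chosen positive example gets a higher score than a randomly chosen negative one, with ties counted as half. The labels, scores, and the function name `auroc` are hypothetical, and treating dead links as the positive class is an assumption, since the rest of the prompt is truncated.

```python
def auroc(labels, scores):
    """AUROC as P(score of a random positive > score of a random negative); ties count half."""
    pos = [s for label, s in zip(labels, scores) if label == 1]
    neg = [s for label, s in zip(labels, scores) if label == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for q in neg:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical labels: 1 = dead link, 0 = good link (an assumed encoding).
labels = [0, 0, 0, 0, 0, 0, 1, 1]
scores = [0.1, 0.2, 0.3, 0.4, 0.35, 0.8, 0.7, 0.9]
value = auroc(labels, scores)
```

The pairwise loop is quadratic in the worst case, which is fine at the 1,000-URL scale mentioned in the prompt; a rank-based formulation would be the usual choice for larger datasets.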
Generate Samples from Truncated Normal Distribution
Scenario: You draw from a normal distribution but only keep observations that are greater than 1 (i.e., values below 1 are discarded). Assume the origi...
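The keep-only-values-above-1 procedure described in the scenario is rejection sampling, and can be sketched as below. The parameters of the original distribution are cut off in the prompt, so the defaults `mu=0.0` and `sigma=1.0` (a standard normal) are an assumption; the function name and the `seed` argument are also hypothetical.

```python
import random


def sample_truncated_normal(n, lower=1.0, mu=0.0, sigma=1.0, seed=None):
    """Draw n values from Normal(mu, sigma), keeping only draws greater than `lower`."""
    rng = random.Random(seed)
    kept = []
    while len(kept) < n:
        draw = rng.gauss(mu, sigma)
        if draw > lower:  # discard anything at or below the cutoff
            kept.append(draw)
    return kept


# Assumed standard normal parent distribution; the cutoff of 1 comes from the scenario.
samples = sample_truncated_normal(10_000, seed=0)
sample_mean = sum(samples) / len(samples)
```

Under the standard-normal assumption, the mean of the kept values should land near φ(1) / (1 − Φ(1)) ≈ 1.525. Rejection sampling keeps only about 16% of draws at this cutoff and becomes wasteful when the cutoff sits far in the tail, where an inverse-CDF approach is usually preferred.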
Analyze Call Drop Rates Pre- and Post-Update Implementation
Evaluate a Non-Experimental Release: Google Meet Call Drops. Scenario: Engineers shipped a new Google Meet version intended to reduce call drops. A trad...
Analyze Impact of Customer Reviews on Sales Performance
Measuring the Impact of Product Reviews on Sales. Context: You are asked to propose how to quantify and causally estimate the impact of product review m...
Implement Sampling and Minimize Loss in Numerical Coding
Scenario: Numerical coding challenges on sampling and loss minimization. Question: a) Implement functions to sample from truncated normal distributions ...
Describe Overcoming Challenges and Persuading Non-Data Colleagues
Behavioral Interview: Persuasion and Problem-Solving (Technical Phone Screen, Data Scientist). Context: You are interviewing for a Data Scientist role ...