Yahoo Data Scientist Interview Questions
Yahoo Data Scientist interview questions typically probe a mix of product analytics, experimentation, and applied machine learning with an emphasis on real-world impact. At Yahoo you can expect interviewers to evaluate SQL fluency, Python-based analysis, statistical thinking for A/B testing, and the ability to translate metrics into product decisions. What’s distinctive is the product-and-ad-driven context: interviewers often frame problems around user engagement, ad performance, and scalable pipelines, so clear communication and cross-functional storytelling are as important as raw technical chops. For interview preparation, plan for a multi-stage process that usually begins with a recruiter screen and moves into technical screens (SQL and coding), a product or analytics case, a statistics/experiment-design conversation, and behavioral rounds. Practice writing concise SQL queries, building reproducible analyses in Python, and explaining tradeoffs in experiment design. Prepare STAR-style examples that show impact and ownership, and rehearse product-facing explanations of your projects. Mock interviews that combine coding, analytics, and storytelling will align your skills with what Yahoo commonly assesses.
Explain Motivations and Cross-Functional Collaboration in Business Intelligence
Behavioral Interview Prompt: Motivation, Trajectory, and Communication Scenario You are interviewing for a Data Scientist role on a business intellige...
Solve Data Structure Challenges with Python Algorithms
Scenario You are given coding challenges in Python to manipulate simple data structures. Question Remove duplicates from a list of integers while pres...
Investigate Yahoo Mail's 10% DAU Decline Causes
Investigate a 10% DAU Drop in Yahoo Mail Scenario Yahoo Mail suddenly reports a 10% decline in daily active users (DAU). Task Walk through how you wou...
Diagnose a 10% DAU drop
On-Call Incident: Yahoo Mail DAU Down 10% on 2025-09-01 Assume all times are UTC, the product is global, and DAU is the canonical daily active user me...
Merge words by head-tail chaining
Given a list of lowercase words, merge them into a single string by repeatedly chaining words where the last character of the current string equals th...
Use OrderedDict to dedupe stably
Implement dedupe_preserve_order(seq) that removes duplicates from a list while preserving the order of first appearance, in O(n) time and O(k) space (...
Diagnose DAU drop with SQL by country
Write a single SQL query to diagnose a country-level DAU drop for an email product. Assume today is 2025-09-01. DAU is the count of distinct user_id w...
Prove Equal Probability Impossible with Relabeled Dice Faces
Uniform Sums from Two Relabeled Fair Dice Setup You have two fair six-sided dice. You may relabel each die's faces with integers; duplicates are allow...
Evaluate Widget Impact on User Engagement with A/B Testing
A/B Test Design: New In‑App Recommendation Widget Scenario A company is preparing to roll out a new in‑app recommendation widget and needs evidence th...
Calculate Expected Flips for Heads-Tails Sequence
Probability Check — Waiting Time for HT Setup - You repeatedly flip a fair coin (independent flips; P(H)=P(T)=1/2). - Stop as soon as the consecutive ...
Calculate Total Spend and Identify Key User Metrics
Transactions +----+---------+--------+------------+ | id | user_id | amount | date | +----+---------+--------+------------+ | 1 | 101 | 23....