PracHub
QuestionsCoachesLearningGuidesInterview Prep
|Home/Machine Learning/Snapchat

Optimize Churn Prediction: Feature Engineering and Model Selection

Last updated: Mar 29, 2026

Quick Overview

Optimize Churn Prediction: Feature Engineering and Model Selection evaluates core ML concepts, assumptions, math intuition, training/evaluation trade-offs, and practical failure modes in a realistic interview setting. A strong answer states assumptions, handles edge cases, explains trade-offs, and shows how to validate the result clearly.

  • hard
  • Snapchat
  • Machine Learning
  • Data Scientist

Optimize Churn Prediction: Feature Engineering and Model Selection

Company: Snapchat

Role: Data Scientist

Category: Machine Learning

Difficulty: hard

Interview Round: Onsite

##### Scenario You own a churn-prediction pipeline that trains weekly on 10M users. ##### Question Walk me through feature engineering, model selection and hyper-parameter tuning for churn prediction. Why might you favor Gradient Boosted Trees over Logistic Regression here? Describe two techniques for explaining model outputs to non-technical stakeholders. If recall suddenly drops by 15% week-over-week, outline a debugging checklist. ##### Hints Discuss imbalance handling, SHAP, feature drift, and offline/online parity checks.

Quick Answer: Optimize Churn Prediction: Feature Engineering and Model Selection evaluates core ML concepts, assumptions, math intuition, training/evaluation trade-offs, and practical failure modes in a realistic interview setting. A strong answer states assumptions, handles edge cases, explains trade-offs, and shows how to validate the result clearly.

Related Interview Questions

  • Explain Overfitting and Transformer Attention - Snapchat (medium)
  • Discuss ML Project Tradeoffs - Snapchat (medium)
  • Model an ads ranking system - Snapchat (medium)
  • Explain BatchNorm, optimizers, and L1/L2 - Snapchat (medium)
  • Explain CLIP, contrastive losses, and retrieval limits - Snapchat (medium)
|Home/Machine Learning/Snapchat

Optimize Churn Prediction: Feature Engineering and Model Selection

Snapchat logo
Snapchat
Aug 4, 2025, 10:55 AM
hardData ScientistOnsiteMachine Learning
87
0

Optimize Churn Prediction: Feature Engineering and Model Selection

Weekly Churn Prediction (10M users): Feature Engineering, Model Choice, Explainability, and Debugging

Scenario

You own a weekly churn-prediction pipeline that trains on 10 million users. The goal is to predict who will churn so the business can target retention interventions.

Tasks

  1. Feature Engineering
    • Define the label, observation/prediction windows, and leakage controls.
    • Propose key feature families and how to handle imbalance.
  2. Model Selection and Hyper-parameter Tuning
    • Describe the model development process, evaluation, and tuning strategy at this scale.
  3. Model Choice Rationale
    • Why might you favor Gradient Boosted Trees (GBTs) over Logistic Regression (LR) here?
  4. Explainability
    • Describe two techniques for explaining model outputs to non-technical stakeholders.
  5. Production Debugging
    • If recall drops by 15% week-over-week, provide a step-by-step debugging checklist.

Hints: Discuss imbalance handling, SHAP, feature drift, and offline/online parity checks.

Constraints & Assumptions

  • Preserve the scope, facts, inputs, and requested outputs from the prompt above.
  • If the prompt leaves a detail unspecified, state a reasonable assumption before relying on it.
  • Keep the answer interview-ready: concise enough to present, but concrete enough to implement or evaluate.

Clarifying Questions to Ask

  • Clarify the task, data shape, labels, constraints, and evaluation metric.
  • State assumptions behind the math or modeling technique you choose.
  • Connect theory to practical training, debugging, and deployment implications.

What a Strong Answer Covers

  • Correct definitions and formulas where the prompt requires them.
  • A practical explanation of how the method behaves on real data.
  • Trade-offs, failure modes, diagnostics, and mitigation strategies.
  • Evaluation choices that match the product or modeling objective.

Follow-up Questions

  • How would noisy labels, class imbalance, or distribution shift affect the answer?
  • What would you monitor after deployment?
  • Which baseline would you compare against first?
Loading comments...

Browse More Questions

More Machine Learning•More Snapchat•More Data Scientist•Snapchat Data Scientist•Snapchat Machine Learning•Data Scientist Machine Learning

Write your answer

Your first approved answer each day earns 20 XP.

Sign in to write your answer.
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • AI Coding Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.