PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Machine Learning/Other

Explain OS usage gap via trees

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a data scientist's competency in user-behavior modeling, causal inference, telemetry instrumentation, missing-data handling, and segment-based product analysis.

  • hard
  • Other
  • Machine Learning
  • Data Scientist

Explain OS usage gap via trees

Company: Other

Role: Data Scientist

Category: Machine Learning

Difficulty: hard

Interview Round: Onsite

Instagram usage is substantially higher among iOS than Android users. 1) Frame this as a supervised learning problem to identify discriminant variables: define target, candidate features (demographics, engagement, acquisition channel, device/app telemetry), and the modeling approach (decision tree vs. random forest). 2) Explain how to detect whether OS is a causal factor vs. a proxy for other variables; outline tests (feature ablation, conditional permutation importance, propensity stratification). 3) Specify additional telemetry you would collect (e.g., cold‑start latency, crash rate, time‑to‑first‑paint, battery drain) and how you’d compare distributions across OS versions. 4) Discuss handling missing values in consumer data: defend the '-1 sentinel' approach vs. model‑based imputation, and when each is appropriate. 5) Given two discovered segments—(a) Argentinians underperforming, (b) Indians <30 overperforming—propose concrete next steps for product/marketing and how to validate impact causally.

Quick Answer: This question evaluates a data scientist's competency in user-behavior modeling, causal inference, telemetry instrumentation, missing-data handling, and segment-based product analysis.

Related Interview Questions

  • Derive and regularize logistic regression - Other (hard)
  • Design anomaly detection and handle imbalanced logistic regression - Other (Medium)
  • Extract companies from noisy text - Other (hard)
  • Evaluate and select K in K-means - Other (medium)
  • Explain SVM kernels and complexity - Other (hard)
Other logo
Other
Oct 13, 2025, 9:49 PM
Data Scientist
Onsite
Machine Learning
2
0

iOS vs. Android Usage Gap: Modeling, Causality, Telemetry, Missing Data, and Segmented Actions

Context

You observe that Instagram usage is substantially higher among iOS users than Android users. Assume "usage" refers to per-user weekly minutes spent in app (or a similar engagement proxy). Your task is to: model the drivers of usage, assess whether OS has a causal role, design telemetry comparisons across OS versions, handle missing data appropriately, and propose actions for discovered segments.

Tasks

  1. Supervised learning framing
    • Define a clear prediction target for user-level usage.
    • List candidate features across: demographics, engagement, acquisition channel, and device/app telemetry.
    • Choose and justify a modeling approach (decision tree vs. random forest) for identifying discriminant variables.
  2. Causality check: OS as cause vs. proxy
    • Explain how you would test whether OS is causal vs. a proxy for other factors.
    • Outline procedures for feature ablation, conditional permutation importance, and propensity stratification.
  3. Telemetry collection and cross-OS comparison
    • Specify additional telemetry to collect (e.g., cold-start latency, crash rate, time-to-first-paint, battery drain).
    • Describe how to compare distributions across OS versions robustly.
  4. Missing data handling
    • Defend the "-1 sentinel" approach vs. model-based imputation.
    • Explain when each is appropriate and any guardrails.
  5. Segment actions and validation
    • Given two segments: (a) Argentinians underperforming, (b) Indians <30 overperforming, propose concrete product/marketing next steps.
    • Explain how to validate impact causally.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Other•More Data Scientist•Other Data Scientist•Other Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.