Model y from x and interpret distributions

This is a Machine Learning interview question from Reddit for Machine Learning Engineer roles.

Question

Scenario

You are given a dataset with one input feature x and a target y. The interviewer asks: “How would you model this?”
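A minimal sketch of how the first answer might be grounded, assuming a continuous target and synthetic data. The helper names (`looks_like_classification`, `fit_line`, `rmse`), the `max_classes` cutoff, and the data-generating line are all illustrative assumptions, not part of the original question. It checks the target type with a rough heuristic, then compares a predict-the-mean baseline against a one-feature least-squares line on a held-out split.

```python
import math
import random
import statistics


def looks_like_classification(y, max_classes=10):
    """Heuristic only: few distinct, integer-valued targets hint at classification."""
    distinct = set(y)
    return len(distinct) <= max_classes and all(float(v).is_integer() for v in distinct)


def fit_line(x, y):
    """Closed-form least squares for a single feature; returns (slope, intercept)."""
    mean_x, mean_y = statistics.fmean(x), statistics.fmean(y)
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def rmse(y_true, y_pred):
    """Root mean squared error."""
    return math.sqrt(statistics.fmean([(t - p) ** 2 for t, p in zip(y_true, y_pred)]))


# Synthetic data (hypothetical): y = 2x + 1 plus Gaussian noise.
rng = random.Random(0)
x = [rng.uniform(0, 10) for _ in range(200)]
y = [2 * xi + 1 + rng.gauss(0, 1) for xi in x]

# Hold out the last 50 points for evaluation.
x_train, y_train, x_test, y_test = x[:150], y[:150], x[150:], y[150:]

is_classification = looks_like_classification(y)  # continuous target here

# Baseline 0: always predict the training mean.
mean_pred = [statistics.fmean(y_train)] * len(y_test)
# Baseline 1: a straight line fitted on the training split.
slope, intercept = fit_line(x_train, y_train)
line_pred = [slope * xi + intercept for xi in x_test]

rmse_mean = rmse(y_test, mean_pred)
rmse_line = rmse(y_test, line_pred)
print(f"classification? {is_classification}")
print(f"RMSE mean baseline: {rmse_mean:.2f}, linear baseline: {rmse_line:.2f}")
```

The point of the mean baseline is to give every later model a number to beat. If the target had turned out to be categorical, the analogous starting point would be a majority-class predictor scored with a metric suited to the class balance.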

Later, you are shown a plot with two distributions (e.g., distribution of a feature for two groups/classes, or train vs. production) and asked to interpret what it implies.
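When the two distributions are train vs. production samples of the same feature, a visual impression of "shift" can be backed by a number. The sketch below is one hedged way to do that with the standard library only: a two-sample Kolmogorov-Smirnov statistic (largest gap between empirical CDFs) and a Population Stability Index over quantile bins of the reference sample. The function names, bin count, `eps` floor, and the Gaussian samples are assumptions made for illustration.

```python
import bisect
import math
import random


def ks_statistic(a, b):
    """Two-sample KS statistic: maximum gap between the two empirical CDFs."""
    a_sorted, b_sorted = sorted(a), sorted(b)
    gap = 0.0
    # Both ECDFs are step functions, so checking every pooled data point suffices.
    for v in a_sorted + b_sorted:
        cdf_a = bisect.bisect_right(a_sorted, v) / len(a_sorted)
        cdf_b = bisect.bisect_right(b_sorted, v) / len(b_sorted)
        gap = max(gap, abs(cdf_a - cdf_b))
    return gap


def psi(reference, current, n_bins=10, eps=1e-6):
    """Population Stability Index with bin edges at reference-sample quantiles."""
    ref_sorted = sorted(reference)
    edges = [ref_sorted[int(len(ref_sorted) * k / n_bins)] for k in range(1, n_bins)]

    def bin_fractions(sample):
        counts = [0] * n_bins
        for v in sample:
            counts[bisect.bisect_right(edges, v)] += 1
        # Floor empty bins at eps so the logarithm stays defined.
        return [max(c / len(sample), eps) for c in counts]

    p, q = bin_fractions(reference), bin_fractions(current)
    return sum((qi - pi) * math.log(qi / pi) for pi, qi in zip(p, q))


# Hypothetical samples: one production batch matches training, one is shifted by 1.
rng = random.Random(0)
train = [rng.gauss(0, 1) for _ in range(2000)]
prod_same = [rng.gauss(0, 1) for _ in range(2000)]
prod_shifted = [rng.gauss(1, 1) for _ in range(2000)]

ks_same, ks_shifted = ks_statistic(train, prod_same), ks_statistic(train, prod_shifted)
psi_same, psi_shifted = psi(train, prod_same), psi(train, prod_shifted)
print(f"KS  same: {ks_same:.3f}  shifted: {ks_shifted:.3f}")
print(f"PSI same: {psi_same:.3f}  shifted: {psi_shifted:.3f}")
```

Both numbers only say that the marginal distribution of the feature moved. Whether model quality suffers still has to be checked against labels or a proxy metric, and any alerting threshold is a judgment call for the specific feature.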

Finally, you are asked several cold-start questions.

Tasks

- Explain how you decide whether this is regression vs. classification, what baseline models you try first, and what evaluation metrics you use.
- Given a plot with two distributions, explain how you would:
  - Describe what you see (separation/overlap, shift, variance, multimodality)
  - Diagnose potential issues (label leakage, covariate shift, class imbalance, thresholding)
  - Decide next steps (feature engineering, calibration, sampling, monitoring)
- Describe practical cold-start strategies for:
  - New users
  - New items (videos)
  - New regions/languages
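One common cold-start idea that applies to all three cases above is to back off to a broader prior and let observed data take over as it accumulates. The sketch below illustrates this with additive smoothing of a click-through rate. It is an illustration under stated assumptions, not a prescribed answer: `smoothed_rate`, `prior_strength`, and every count and rate are hypothetical.

```python
def smoothed_rate(clicks, impressions, prior_rate, prior_strength=50):
    """Shrink an observed rate toward a prior.

    prior_strength acts like a number of pseudo-impressions at prior_rate,
    so an entity with no data simply inherits the prior.
    """
    return (clicks + prior_strength * prior_rate) / (impressions + prior_strength)


# Hypothetical numbers: a global rate, then a category rate shrunk toward it.
global_ctr = 0.05
category_ctr = smoothed_rate(clicks=90, impressions=1000, prior_rate=global_ctr)

# A brand-new video has no data, so its estimate equals the category prior.
new_video_ctr = smoothed_rate(clicks=0, impressions=0, prior_rate=category_ctr)

# A video with 2 clicks in 3 impressions is pulled strongly toward the prior
# instead of being scored at its raw rate of 2/3.
sparse_video_ctr = smoothed_rate(clicks=2, impressions=3, prior_rate=category_ctr)

print(f"category: {category_ctr:.4f}")
print(f"new video: {new_video_ctr:.4f}, sparse video: {sparse_video_ctr:.4f}")
```

The same back-off chain can be pointed at other levels, for example a new region inheriting from a language-level or global estimate. Larger values of `prior_strength` make the estimate slower to trust early data.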

Assume you care about both predictive quality and production robustness.
