This question evaluates skills in predictive modeling, feature engineering and selection, handling data leakage and multicollinearity, model interpretability, metric selection and diagnostics, and production deployment and monitoring within a marketing/growth data science role.
You are interviewing for a Data Scientist role on a marketing/growth team. The business wants lead scoring: ranking or scoring incoming leads so Sales/Marketing can prioritize outreach.
Assume you have a historical dataset of leads with:
lead_id
(string/int)
created_at
(timestamp)
converted
(boolean): whether the lead converted within a defined window
time_to_convert_days
(numeric, optional)
Be explicit about assumptions (conversion window, label definition, scoring cadence) and call out key pitfalls/edge cases.
Login required