This question evaluates proficiency in product analytics, instrumentation design, experimentation planning, and funnel-level diagnostic analysis for an app marketplace, and falls squarely in the Analytics & Experimentation domain for data scientist roles.
Shopify is launching a Shopify App Store where merchants can browse/install apps built by third-party developers (some paid, some free). You are the Data Scientist supporting the launch.
Propose a success measurement framework with:
Be explicit about whose success you’re optimizing for (merchants, developers, Shopify) and how you’d balance tradeoffs.
Specify what data you’d need and where it comes from.
Assume events arrive in near-real-time; define any time windowing (e.g., daily in UTC) and identity rules (merchant_id, app_id, developer_id, session_id).
Design at least one experiment to improve App Store outcomes (e.g., ranking algorithm, pricing surfaces, recommendation modules, onboarding prompts). Include:
You’re given a dataset with 3 years of daily metrics for the App Store. You notice:
Assume the table below (you may create derived fields like YoY, WoW, and rolling averages):
daily_app_store_metricsdate
(DATE)
sessions
(INT) — total visits to the App Store
product_views
(INT)
add_to_cart
(INT)
purchases
(INT)
revenue
(NUMERIC)
channel
(STRING) — e.g., organic, paid_search, email, affiliate, referral
device_type
(STRING) — desktop/mobile/tablet
geo
(STRING)
merchant_tier
(STRING) — e.g., trial/basic/plus
landing_page
(STRING)
app_category
(STRING)
is_bot_suspected
(BOOL) — if available
Define rates as:
ATC_rate = add_to_cart / product_views
(or justify an alternative)
CVR = purchases / sessions
Output expected: a structured investigation plan plus the key visualizations you’d generate.
Login required