PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/Analytics & Experimentation/Amazon

Profile and visualize an unfamiliar dataset

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a data scientist's competency in exploratory data analysis, data quality assessment, time-series aggregation, visualization design, and concise stakeholder communication when working with an undocumented events-plus-purchases CSV.

  • medium
  • Amazon
  • Analytics & Experimentation
  • Data Scientist

Profile and visualize an unfamiliar dataset

Company: Amazon

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: medium

Interview Round: Technical Screen

You receive an undocumented CSV combining user events and purchases with columns: order_id, user_id, event_ts (UTC), merchant_id, session_id, event_type (view/add_to_cart/purchase), amount_usd, device_type, country. In 30 minutes, outline exactly how you would understand the dataset and create executive-ready visualizations. Requirements: - Data understanding: list the first 10 checks you run (e.g., nulls, duplicates, timestamp monotonicity by session, timezone sanity, categorical cardinality, outliers, unit consistency, referential integrity between event_type='purchase' and amount_usd, weekend/weekday patterns, country/device coverage). - Visual plan: propose 3–5 specific charts (titles, axes, grain) to answer “What is happening?” and “So what?”. Justify each choice and expected insight. - Granularity: choose daily vs hourly aggregation for a launch week; defend trade-offs and how you’d switch with a parameter. - Data quality: show how one bad clock-skew day would appear in your visuals and how you’d annotate/adjust. - Deliverable: describe a one-slide dashboard wireframe (sections, KPIs, filters) and how you’d validate it with a stakeholder in a 5‑minute readout.

Quick Answer: This question evaluates a data scientist's competency in exploratory data analysis, data quality assessment, time-series aggregation, visualization design, and concise stakeholder communication when working with an undocumented events-plus-purchases CSV.

Related Interview Questions

  • Explain why CTR rises but CVR unchanged - Amazon (medium)
  • How would you test a price increase? - Amazon (medium)
  • How to evaluate adding video ads in a game - Amazon (easy)
  • How would you analyze and test a price increase? - Amazon (easy)
  • How would you evaluate adding video ads? - Amazon (medium)
Amazon logo
Amazon
Oct 13, 2025, 9:49 PM
Data Scientist
Technical Screen
Analytics & Experimentation
3
0

Task: Quickly Understand an Undocumented Events+Purchases CSV and Produce Executive-Ready Visuals

Context

You are handed a single CSV that mixes user events and purchases for a launch week. The schema is undocumented. Columns:

  • order_id, user_id, event_ts (UTC), merchant_id, session_id, event_type (view/add_to_cart/purchase), amount_usd, device_type, country

Your goal in ~30 minutes is to: sanity-check the data, outline the key analyses, and design concise, decision-oriented visuals.

Requirements

  1. Data understanding: List the first 10 checks you run (e.g., nulls, duplicates, timestamp monotonicity by session, timezone sanity, categorical cardinality, outliers, unit consistency, referential integrity between event_type = "purchase" and amount_usd, weekend/weekday patterns, country/device coverage).
  2. Visual plan: Propose 3–5 specific charts (titles, axes, grain) to answer “What is happening?” and “So what?”. Justify each choice and expected insight.
  3. Granularity: Choose daily vs hourly aggregation for a launch week; defend trade-offs and how you’d switch with a parameter.
  4. Data quality: Show how one bad clock-skew day would appear in your visuals and how you’d annotate/adjust.
  5. Deliverable: Describe a one-slide dashboard wireframe (sections, KPIs, filters) and how you’d validate it with a stakeholder in a 5‑minute readout.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More Amazon•More Data Scientist•Amazon Data Scientist•Amazon Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.