Alexa Domain-Knowledge Data Pipelines
Company: Amazon
Role: Product Manager
Category: Product / Decision Making
Difficulty: hard
Interview Round: Onsite
##### Question
Holidays: Design an end-to-end data pipeline that enables Alexa to answer holiday-related questions worldwide. Describe data sources, ingestion, normalization, storage, and query layers. How will you reconcile multiple calendar systems (Gregorian, Lunar, Federal) and keep content current?
Animals: Extend the pipeline so Alexa can answer animal-related questions. What additional data elements, taxonomies, or ML models are required? How will you detect and correct mis-routed queries such as “Peppa Pig” (actually a cartoon character) that are falsely labeled as animal questions?
Debugging: You receive error logs showing Alexa fails for specific animal queries. Propose a framework to categorize these errors, trace root causes, and prioritize fixes.
##### Hints
Calendars differ by locale—normalize date formats, offsets, and observance rules.
Implement confidence thresholds and intent-reclassification to handle ambiguous queries like Peppa Pig.
Quick Answer: Practice an Alexa knowledge-pipeline system design covering global holidays, animal facts, source reliability, ingestion, calendar normalization, knowledge graphs, answer serving, entity linking, confidence thresholds, misrouted queries, and debugging.