System Design Prompt
Design a Job Scheduler + ETL pipeline system.
The system should allow users (or internal services) to:
- Define ETL jobs (extract → transform → load)
- Schedule jobs (cron / fixed interval / one-off)
- Run jobs on a fleet of workers
- Track job/run status and logs
- Support retries and failure handling
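The capabilities above imply two core records: a job definition and a job run. A minimal sketch of those records follows; every field and type name here is a hypothetical choice for illustration, not a prescribed schema:

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Optional

class ScheduleKind(Enum):
    CRON = "cron"          # e.g. "*/5 * * * *"
    INTERVAL = "interval"  # fixed number of seconds between runs
    ONE_OFF = "one_off"    # run once at a given time

class RunStatus(Enum):
    PENDING = "pending"
    RUNNING = "running"
    SUCCEEDED = "succeeded"
    FAILED = "failed"
    RETRYING = "retrying"

@dataclass
class JobDefinition:
    job_id: str
    owner: str
    schedule_kind: ScheduleKind
    schedule_expr: str              # cron string, interval seconds, or ISO timestamp
    source: str                     # e.g. "s3://bucket/raw/"
    sink: str                       # e.g. "postgres://warehouse/events"
    artifact_uri: str               # where the job's code/artifact lives
    parameters: dict = field(default_factory=dict)
    max_retries: int = 3

@dataclass
class JobRun:
    run_id: str
    job_id: str
    scheduled_for: str              # the tick this run belongs to; part of the dedup key
    status: RunStatus = RunStatus.PENDING
    attempt: int = 0
    worker_id: Optional[str] = None
```

Keeping `(job_id, scheduled_for)` on the run record gives a natural uniqueness key for duplicate-run detection later.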
Requirements
- Job definition: store metadata (owner, schedule, source/sink, parameters, code/artifact location).
- Scheduling: trigger runs on time; avoid duplicate runs.
- Execution: dispatch runs to workers; support horizontal scaling.
- Reliability:
  - at-least-once execution with dedup where needed
  - retries with backoff
  - handle worker crashes mid-run
- Observability: job/run status, logs, metrics, alerting.
- Multi-tenancy (basic): isolate teams with quotas/limits.
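The reliability items above (at-least-once with dedup, retries with backoff) can be made concrete with a short sketch. This is one possible shape, with hypothetical names throughout; the in-process `_completed` set stands in for what would be a unique constraint or conditional write in the metadata DB:

```python
import random
import time

def backoff_delay(attempt: int, base: float = 1.0, cap: float = 60.0) -> float:
    """Exponential backoff with full jitter: a delay in [0, min(cap, base * 2**attempt)]."""
    return random.uniform(0, min(cap, base * (2 ** attempt)))

def run_with_retries(task, max_retries: int = 3, sleep=time.sleep):
    """At-least-once execution: retry on any failure, re-raise after max_retries."""
    attempt = 0
    while True:
        try:
            return task()
        except Exception:
            if attempt >= max_retries:
                raise
            sleep(backoff_delay(attempt))
            attempt += 1

_completed: set[str] = set()

def execute_once(run_key: str, task):
    """Dedup: skip a run whose key (e.g. "job_id:scheduled_for") already completed.
    In production this check lives in the metadata DB, not in process memory."""
    if run_key in _completed:
        return "skipped"
    result = task()
    _completed.add(run_key)
    return result
```

At-least-once plus a dedup key on the side-effect is usually how "effectively once" is approximated in practice; true exactly-once delivery is a tradeoff worth calling out explicitly in the discussion.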
Discussion points
Explain the key components (API, scheduler, queue, workers, metadata DB), the data model, the scaling strategy, and how you'd use and load-balance caches and queues. Call out the major tradeoffs (exactly-once vs. at-least-once semantics, latency vs. throughput).
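One discussion thread that covers both "avoid duplicate runs" and "handle worker crashes mid-run" is a lease model: a worker claims a run with a time-bounded lease and heartbeats to extend it, and an expired lease makes the run reclaimable. A toy in-memory sketch under assumed names (a real system would back this with a conditional UPDATE or a distributed lock, not a Python dict):

```python
from dataclasses import dataclass

@dataclass
class Lease:
    run_id: str
    worker_id: str
    expires_at: float  # epoch seconds; the owning worker's heartbeats extend this

class LeaseTable:
    """In-memory stand-in for a compare-and-set claim on a metadata DB row."""

    def __init__(self):
        self._leases: dict[str, Lease] = {}

    def claim(self, run_id: str, worker_id: str, now: float, ttl: float = 30.0) -> bool:
        """Claim a run. Fails if another worker holds a live lease (no duplicate
        execution); succeeds if the lease expired (crashed worker recovery)."""
        lease = self._leases.get(run_id)
        if lease is not None and lease.expires_at > now:
            return False
        self._leases[run_id] = Lease(run_id, worker_id, now + ttl)
        return True

    def heartbeat(self, run_id: str, worker_id: str, now: float, ttl: float = 30.0) -> bool:
        """Extend the lease; only the current owner may do so."""
        lease = self._leases.get(run_id)
        if lease is None or lease.worker_id != worker_id:
            return False
        lease.expires_at = now + ttl
        return True
```

The lease TTL is itself a tradeoff worth raising: a short TTL recovers crashed workers quickly but risks spurious reclaims under GC pauses or network blips; a long TTL delays recovery.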