
Design a job scheduler with SLA and logs

Last updated: Mar 29, 2026

Quick Overview

This question evaluates understanding of distributed job scheduling, SLA enforcement, observability, and fault-tolerant execution across worker fleets, assessing competencies in scalability, reliability, state tracking, and operational logging within the System Design domain.



Company: Robinhood

Role: Software Engineer

Category: System Design

Difficulty: medium

Interview Round: Technical Screen



Feb 11, 2026, 12:00 AM

Scenario

Design a distributed job scheduler that runs scheduled (time-based) jobs only (e.g., cron/interval). Each job execution has an SLA: if the run does not finish within the SLA window, the system must mark it as timed out and raise an error/alert. Users must be able to inspect execution logs.
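The SLA rule in the scenario can be sketched as a periodic check over in-flight runs. This is a minimal illustration, not a prescribed design: the `JobRun` fields and the `enforce_sla` helper are hypothetical names, and it assumes the SLA window is measured from the moment the run starts (measuring from the scheduled time is an equally valid choice to raise with the interviewer).

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass
class JobRun:
    # Hypothetical record for one execution of a scheduled job.
    run_id: str
    started_at: datetime
    sla: timedelta
    state: str = "RUNNING"


def enforce_sla(run: JobRun, now: datetime) -> bool:
    """Mark a still-running run as TIMED_OUT once its SLA window has elapsed.

    Returns True exactly when the state changes, so the caller can raise
    one alert per overrun instead of alerting on every sweep.
    """
    if run.state == "RUNNING" and now - run.started_at > run.sla:
        run.state = "TIMED_OUT"
        return True
    return False


start = datetime(2026, 1, 1, tzinfo=timezone.utc)
run = JobRun(run_id="r1", started_at=start, sla=timedelta(minutes=5))
should_alert = enforce_sla(run, start + timedelta(minutes=6))
```

In a real deployment the sweep would run in a separate watchdog process rather than on the worker, so that a crashed or partitioned worker cannot hide its own overrun.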

Core requirements

  • Create/update/delete scheduled jobs.
  • Trigger job runs at the correct times (support cron-like schedules or fixed intervals).
  • Execute jobs on a fleet of workers.
  • Track job run state: PENDING/RUNNING/SUCCEEDED/FAILED/TIMED_OUT/CANCELED.
  • SLA enforcement per run: detect overruns and surface as an error/alert.
  • Provide log viewing per job run.
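One way to make the run-state requirement concrete is an explicit transition table. The states come from the requirements above; the set of allowed transitions is an assumption for illustration (for example, it treats all four end states as terminal and does not let a PENDING run time out), and `transition` is a hypothetical helper name.

```python
from enum import Enum


class RunState(Enum):
    PENDING = "PENDING"
    RUNNING = "RUNNING"
    SUCCEEDED = "SUCCEEDED"
    FAILED = "FAILED"
    TIMED_OUT = "TIMED_OUT"
    CANCELED = "CANCELED"


# Assumed transition rules; states missing from this map are terminal.
ALLOWED = {
    RunState.PENDING: {RunState.RUNNING, RunState.CANCELED},
    RunState.RUNNING: {
        RunState.SUCCEEDED,
        RunState.FAILED,
        RunState.TIMED_OUT,
        RunState.CANCELED,
    },
}


def transition(current: RunState, target: RunState) -> RunState:
    """Return the new state, or raise if the move is not allowed."""
    if target not in ALLOWED.get(current, set()):
        raise ValueError(f"illegal transition {current.value} -> {target.value}")
    return target


state = transition(RunState.PENDING, RunState.RUNNING)
state = transition(state, RunState.TIMED_OUT)
```

Rejecting illegal transitions in one place helps when a late worker report (say, SUCCEEDED) arrives after the watchdog has already recorded TIMED_OUT: the write fails loudly instead of silently overwriting the terminal state.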

Non-functional requirements (clarify in discussion)

  • Scale: potentially many jobs and high execution rate.
  • Reliability: tolerate worker crashes, scheduler crashes, network partitions.
  • Correctness: avoid missed schedules; minimize duplicate runs (define acceptable semantics).
  • Observability: metrics, tracing, auditing.
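For the correctness bullet, a common way to minimize duplicate runs is to give each run a natural key of job plus scheduled time and let the datastore reject the second insert. The sketch below uses an in-memory SQLite database purely for illustration; the `job_runs` table, its columns, and the `create_run_once` helper are hypothetical, and a production system would likely use a replicated store.

```python
import sqlite3


def open_store() -> sqlite3.Connection:
    """Create an in-memory store with one row per (job, scheduled time)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        """
        CREATE TABLE job_runs (
            job_id        TEXT NOT NULL,
            scheduled_for TEXT NOT NULL,
            state         TEXT NOT NULL DEFAULT 'PENDING',
            PRIMARY KEY (job_id, scheduled_for)
        )
        """
    )
    return conn


def create_run_once(conn: sqlite3.Connection, job_id: str, scheduled_for: str) -> bool:
    """Insert the run if it does not exist yet.

    Returns True only for the caller whose insert created the row, so two
    scheduler replicas firing the same tick enqueue the run once.
    """
    cur = conn.execute(
        "INSERT OR IGNORE INTO job_runs (job_id, scheduled_for) VALUES (?, ?)",
        (job_id, scheduled_for),
    )
    conn.commit()
    return cur.rowcount == 1


conn = open_store()
first = create_run_once(conn, "daily-report", "2026-01-01T00:00:00Z")
second = create_run_once(conn, "daily-report", "2026-01-01T00:00:00Z")
```

This gives at-most-once creation of a run record. It does not by itself give exactly-once execution: a worker can still crash mid-run, which is why the job body should be idempotent as well.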

Follow-ups to address

  • How to handle failures at each stage (scheduler failure, worker failure, log pipeline failure, DB outages).
  • Retries/backoff, dead-lettering, and idempotency.
  • Handling long-running jobs and cancellation.
  • Handling clock skew, time zones, and daylight saving time.
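The retry, backoff, and dead-lettering follow-up can be sketched with capped exponential backoff plus jitter. The function names, the default base and cap, and the retry limit of five are all assumptions chosen for illustration, not values taken from the question.

```python
import random
from typing import Optional


def backoff_delay(
    failures: int,
    base: float = 1.0,
    cap: float = 300.0,
    rng: Optional[random.Random] = None,
) -> float:
    """Seconds to wait before the next retry.

    Uses "full jitter": a uniform draw between 0 and the capped exponential
    ceiling, which spreads retries out so failed runs do not all come back
    at the same instant.
    """
    ceiling = min(cap, base * (2 ** failures))
    source = rng if rng is not None else random
    return source.uniform(0.0, ceiling)


def next_action(failures: int, max_attempts: int = 5) -> str:
    """Retry while under the attempt limit, otherwise dead-letter the run."""
    return "RETRY" if failures < max_attempts else "DEAD_LETTER"


rng = random.Random(42)  # seeded only to make the example repeatable
delays = [backoff_delay(n, rng=rng) for n in range(1, 5)]
decision = next_action(failures=5)
```

Retries interact with the SLA: each attempt can be given its own SLA window, or all attempts can share one deadline. Either choice is defensible as long as it is stated.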
