What difficulty level is this interview question?

This is a medium difficulty System Design question, commonly asked during Onsite rounds at Dropbox.

What role is this question designed for?

This question is commonly asked for Software Engineer candidates at Dropbox during technical interviews.

Design a recursive distributed file crawler

Last updated: Apr 2, 2026

Quick Overview

This prompt evaluates expertise in distributed systems, asynchronous task orchestration, recursive job scheduling, data modeling for job and file metadata, and reliability concerns such as retries, idempotency, deduplication, and partial failure handling.

Dropbox

Jan 25, 2026, 12:00 AM

Software Engineer

Onsite

System Design

Design a distributed service that crawls a large file system starting from a root path. A client should be able to call an API to start a crawl job, and background workers should traverse directories and files asynchronously. While processing a directory, a worker may split the work into smaller tasks and enqueue additional async jobs, so the async service can recursively trigger more of its own jobs.
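The recursive fan-out described above can be sketched in a single process. This is only an illustration under stated assumptions: `queue.Queue` stands in for the message queue, a loop on one thread stands in for the worker pool, and the names `CrawlJob`, `process_directory`, and `run_crawl` are hypothetical, not part of the original prompt.

```python
import os
import queue
from dataclasses import dataclass, field


@dataclass
class CrawlJob:
    """Hypothetical in-memory job record; a real service would persist this."""
    root: str
    files: list = field(default_factory=list)
    dirs_done: int = 0


def process_directory(job, path, tasks):
    """Handle one directory task: record files, enqueue subdirectories."""
    with os.scandir(path) as entries:
        for entry in entries:
            if entry.is_dir(follow_symlinks=False):
                # Recursive step: the worker schedules more work for the pool
                # instead of descending into the subdirectory itself.
                tasks.put(entry.path)
            elif entry.is_file(follow_symlinks=False):
                job.files.append(entry.path)
    job.dirs_done += 1


def run_crawl(root):
    """Drain the queue on one thread; stands in for many async workers."""
    job = CrawlJob(root=root)
    tasks = queue.Queue()
    tasks.put(root)  # the API call that starts a job enqueues the root task
    while not tasks.empty():
        process_directory(job, tasks.get(), tasks)
    return job
```

Each task covers exactly one directory, so a very deep or very wide tree becomes many small, independently retryable units of work rather than one long-running traversal. Symlinks are skipped here so the sketch cannot loop on a cycle.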

Discuss:

APIs for creating a crawl job and checking job status
the data model for crawl jobs, crawl tasks, and discovered file metadata
how workers recursively schedule child tasks
how to handle retries, idempotency, deduplication, and partial failures
how to scale to very large directory trees
the role of the database, message queue, and async workers
how to expose progress and final results to clients
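One possible shape for the data model, idempotent task completion, and progress reporting is sketched below. It is not a reference solution: an in-memory SQLite database stands in for the service's database, the message queue is omitted, and the table, column, and function names (`tasks`, `files`, `complete_task`, `job_status`) are assumptions made for illustration.

```python
import sqlite3

SCHEMA = """
CREATE TABLE jobs  (job_id TEXT PRIMARY KEY, root TEXT NOT NULL);
CREATE TABLE tasks (
    job_id TEXT NOT NULL,
    path   TEXT NOT NULL,
    status TEXT NOT NULL DEFAULT 'pending',  -- pending | done | failed
    PRIMARY KEY (job_id, path)               -- dedup key for repeated enqueues
);
CREATE TABLE files (
    job_id TEXT NOT NULL,
    path   TEXT NOT NULL,
    size   INTEGER NOT NULL,
    PRIMARY KEY (job_id, path)               -- a retried task cannot double-count
);
"""


def open_db():
    db = sqlite3.connect(":memory:")
    db.executescript(SCHEMA)
    return db


def create_job(db, job_id, root):
    """Create the job row and its first task (the root directory)."""
    with db:
        db.execute("INSERT INTO jobs VALUES (?, ?)", (job_id, root))
        db.execute("INSERT INTO tasks (job_id, path) VALUES (?, ?)", (job_id, root))


def complete_task(db, job_id, path, child_dirs, found_files):
    """Commit one directory's results atomically; safe to call again on retry."""
    with db:  # one transaction: children, files, and the status flip together
        db.executemany(
            "INSERT OR IGNORE INTO tasks (job_id, path) VALUES (?, ?)",
            [(job_id, d) for d in child_dirs],
        )
        db.executemany(
            "INSERT OR IGNORE INTO files VALUES (?, ?, ?)",
            [(job_id, p, size) for p, size in found_files],
        )
        db.execute(
            "UPDATE tasks SET status = 'done' WHERE job_id = ? AND path = ?",
            (job_id, path),
        )


def job_status(db, job_id):
    """Derive job state and progress counters from the task rows."""
    counts = dict(db.execute(
        "SELECT status, COUNT(*) FROM tasks WHERE job_id = ? GROUP BY status",
        (job_id,),
    ).fetchall())
    files = db.execute(
        "SELECT COUNT(*) FROM files WHERE job_id = ?", (job_id,)
    ).fetchone()[0]
    pending, failed = counts.get("pending", 0), counts.get("failed", 0)
    state = "running" if pending else ("partially_failed" if failed else "completed")
    return {"state": state, "tasks_done": counts.get("done", 0),
            "tasks_pending": pending, "tasks_failed": failed, "files_found": files}
```

Because child tasks are inserted in the same transaction that marks the parent done, the pending count cannot drop to zero while undiscovered work remains, which gives a simple completion check. In a real deployment, publishing the child messages to the queue would need its own delivery guarantee (for example, a relay that reads pending rows), which this sketch leaves out.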
