Design an image crawler for unlimited URLs
Company: Atlassian
Role: Software Engineer
Category: System Design
Difficulty: medium
Interview Round: Onsite
Design a service that crawls images starting from a set of root URLs.
Requirements:
- Input: one or more root URLs.
- Crawl pages, discover links, and download image resources.
- Support an **unlimited number of root URLs** and **unlimited crawl depth**.
- Must handle failures (network errors, timeouts, crashes) and avoid re-crawling the same URL excessively.
- Discuss storage for downloaded images and metadata.
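A candidate could start from a single-process baseline before distributing it. The sketch below is illustrative only (the `fetch` callback and all names are assumptions, not part of the question): a BFS frontier, a visited set for deduplication, and bounded retries for failed fetches.

```python
import collections

def crawl(roots, fetch, max_retries=3):
    """Breadth-first crawl from root URLs.

    `fetch(url)` is assumed to return (links, image_urls) and may raise
    on network errors; a failed URL is retried up to `max_retries` total
    attempts. The visited set prevents re-crawling the same URL.
    """
    frontier = collections.deque((url, 0) for url in roots)  # (url, attempts)
    visited, images, failed = set(), set(), set()
    while frontier:
        url, attempts = frontier.popleft()
        if url in visited:
            continue  # duplicates may sit in the frontier; skip them here
        try:
            links, image_urls = fetch(url)
        except Exception:
            if attempts + 1 < max_retries:
                frontier.append((url, attempts + 1))  # re-enqueue for retry
            else:
                failed.add(url)  # give up after max_retries attempts
            continue
        visited.add(url)
        images.update(image_urls)
        for link in links:
            if link not in visited:
                frontier.append((link, 0))
    return visited, images, failed
```

In the distributed version, the deque becomes a durable queue, the visited set becomes a shared store (e.g. keyed by URL hash), and retries get backoff, but the control flow stays the same.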
Deliverables:
- High-level architecture (components, data flow).
- Queue/scheduler design and politeness (per-host rate limiting).
- Deduplication strategy.
- DB schema for crawl state and results.
- Failure/retry model and monitoring.
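For the schema deliverable, one possible minimal design tracks per-URL crawl state (for retries and crash recovery) and stores image metadata with a pointer into object storage. Sketched with SQLite for illustration; all table and column names are assumptions:

```python
import sqlite3

SCHEMA = """
CREATE TABLE crawl_state (
    url         TEXT PRIMARY KEY,
    status      TEXT NOT NULL DEFAULT 'pending',  -- pending | in_progress | done | failed
    attempts    INTEGER NOT NULL DEFAULT 0,
    last_error  TEXT,
    updated_at  TEXT
);
CREATE TABLE images (
    url_hash     TEXT PRIMARY KEY,  -- hash of image URL for URL-level dedup
    source_page  TEXT NOT NULL REFERENCES crawl_state(url),
    blob_key     TEXT NOT NULL,     -- pointer into object storage (blobs live outside the DB)
    content_hash TEXT,              -- hash of image bytes for content-level dedup
    fetched_at   TEXT
);
CREATE INDEX idx_state_status ON crawl_state(status);  -- workers poll by status
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
```

Keeping image bytes in object storage and only metadata in the database is the usual split; the `status` + `attempts` columns let a scheduler reclaim work left `in_progress` by a crashed worker.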
Quick Answer: This question evaluates a candidate's ability to design a scalable, fault-tolerant distributed web crawler, covering concurrency, queueing and scheduling, deduplication, storage architecture, and observability.
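On the politeness point, interviewers often look for per-host rate limiting in the scheduler. A minimal sketch, assuming a simple minimum-delay policy per host (the class and method names are illustrative; the injectable `clock` exists only to make the logic testable):

```python
import time

class HostPoliteness:
    """Enforce a minimum delay between requests to the same host.

    The scheduler calls wait_time() before dispatching a URL and
    record_request() after dispatching it. `clock` defaults to
    time.monotonic and is injectable for deterministic tests.
    """
    def __init__(self, min_interval=1.0, clock=time.monotonic):
        self.min_interval = min_interval
        self.clock = clock
        self.next_allowed = {}  # host -> earliest time the next request may go out

    def wait_time(self, host):
        """Seconds to wait before `host` may be fetched again (0.0 if ready)."""
        now = self.clock()
        return max(0.0, self.next_allowed.get(host, now) - now)

    def record_request(self, host):
        """Record a request to `host`; the next one is allowed after min_interval."""
        self.next_allowed[host] = self.clock() + self.min_interval
```

In a distributed crawler the same idea is usually realized by sharding the frontier so that each host maps to one worker queue, which makes the per-host delay enforceable without shared locks.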