How do I approach System Design interview questions?

System Design questions require understanding of core concepts and practice. PracHub provides solutions with explanations to help you master system design interviews.

What difficulty level is this interview question?

This is a hard difficulty System Design question, commonly asked during Technical Screen rounds at Anthropic.

What role is this question designed for?

This question is commonly asked for Software Engineer candidates at Anthropic during technical interviews.

Design a Concurrent Domain Crawler

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a candidate's understanding of concurrent system design, web crawling fundamentals, URL frontier organization, duplicate detection, concurrency control, network I/O models, fault tolerance, and politeness mechanisms such as throttling.

Anthropic

Jan 6, 2026, 12:00 AM

Software Engineer

Technical Screen

System Design

Design a crawler that starts from one seed URL and explores all reachable pages in the same domain efficiently.

Discuss:

How you would structure the frontier of URLs to visit.
How to prevent duplicate fetches across concurrent workers.
Whether you would use asynchronous I/O, multithreading, or multiprocessing, and why.
How to detect when the crawl is complete.
How to handle failures, timeouts, and back-pressure.
How to enforce politeness limits such as request throttling.

Assume the workload is primarily fetching web pages over the network.

Solution

Show

Comments (0)

Loading comments...

Browse More Questions

More System Design•More Anthropic•More Software Engineer•Anthropic Software Engineer•Anthropic System Design•Software Engineer System Design