What difficulty level is this interview question?

This is a medium difficulty System Design question, commonly asked during Onsite rounds at Anthropic.

What role is this question designed for?

This question is commonly asked for Software Engineer candidates at Anthropic during technical interviews.

Design Instagram (Feed, Photos, and Friend Recommendations)

This question evaluates a candidate's ability to design a large-scale, read-heavy social media backend, covering data modeling for follow graphs, media storage, and feed generation. It tests system design skills such as choosing fan-out strategies for feed delivery, handling highly skewed follower distributions, and reasoning about latency and availability trade-offs at scale.

Design the backend for a photo-sharing social network like Instagram. Users follow other users, upload posts (an image plus a caption), and open a home feed showing recent posts from the accounts they follow, newest first. The system must serve a very read-heavy workload at large scale and also surface "people you may want to follow" recommendations.

Constraints & Assumptions

Hundreds of millions of users; reads vastly outnumber writes (feed opens ≫ posts created), on the order of a 100:1 read/write ratio.
A post is an image (stored in object storage / served via CDN) plus a caption and metadata; feeds show reverse-chronological recent posts (treat ranking as out of scope unless asked).
The follow graph is directed (A follows B does not imply B follows A) and highly skewed: some accounts have tens of millions of followers.
Targets: home-feed load p99 well under a few hundred milliseconds; high availability; a brief delay before a new post appears in followers' feeds is acceptable.

Clarifying Questions to Ask

What is the scale (DAU, posts/day, average and maximum follower counts) and the read:write ratio?
Is the feed strictly reverse-chronological or ranked? Is eventual consistency (a few seconds before a post appears in feeds) acceptable?
How skewed is the follow graph: must we handle celebrity accounts with tens of millions of followers?
What media sizes and formats must we support, and what are the latency targets for image load vs. feed metadata?
Is the friend-recommendation requirement "good enough suggestions" or a precise quality bar with specific signals?

Part 1 — Requirements, data model, and APIs

Lay out functional and non-functional requirements, the core data model (users, follow graph, posts, media), and the main APIs (create post with image upload, follow/unfollow, get home feed). Describe how images are uploaded and served separately from post metadata.
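
One possible shape for the data model is sketched below, assuming an in-memory stand-in for the metadata database. The names `MetadataStore`, `Post`, `media_url`, and the host `cdn.example.com` are hypothetical, chosen only for illustration. The point it shows is the split the prompt asks about: post metadata carries only an object-storage key, while the image bytes live in object storage and are served through a CDN URL derived from that key.

```python
from dataclasses import dataclass, field
import itertools
import time


@dataclass(frozen=True)
class Post:
    post_id: int
    author_id: int
    caption: str
    media_key: str      # object-storage key; image bytes are never stored in this record
    created_at: float


@dataclass
class MetadataStore:
    """In-memory stand-in for the metadata database (hypothetical, for illustration)."""
    posts: dict = field(default_factory=dict)       # post_id -> Post
    followees: dict = field(default_factory=dict)   # follower_id -> set of followee ids
    followers: dict = field(default_factory=dict)   # followee_id -> set of follower ids
    _ids: itertools.count = field(default_factory=lambda: itertools.count(1))

    def follow(self, follower_id, followee_id):
        # The edge is directed. It is indexed both ways so that
        # "who do I follow" and "who follows me" are each a single lookup.
        self.followees.setdefault(follower_id, set()).add(followee_id)
        self.followers.setdefault(followee_id, set()).add(follower_id)

    def unfollow(self, follower_id, followee_id):
        self.followees.get(follower_id, set()).discard(followee_id)
        self.followers.get(followee_id, set()).discard(follower_id)

    def create_post(self, author_id, caption, media_key, now=None):
        # Assumes the client has already uploaded the image to object storage
        # and passes back the resulting key.
        created_at = time.time() if now is None else now
        post = Post(next(self._ids), author_id, caption, media_key, created_at)
        self.posts[post.post_id] = post
        return post


def media_url(media_key, cdn_host="cdn.example.com"):
    # The CDN host is a placeholder; clients fetch image bytes from here,
    # not from the metadata service.
    return f"https://{cdn_host}/{media_key}"
```

In a real deployment the upload step would typically go directly from the client to object storage, so the application servers only ever handle the small metadata write shown in `create_post`.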

Part 2 — Feed generation: fan-out on write vs. on read

Design how a user's home feed is produced. Compare fan-out on write (push a new post id into each follower's precomputed feed at post time) with fan-out on read (gather and merge followees' recent posts at read time). Recommend an approach, justify it for a read-heavy workload, and explain how it handles celebrity accounts and how it serves high read concurrency.
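
A common way to reconcile the two strategies is a hybrid: push post ids to followers' precomputed feeds for ordinary accounts, and pull celebrity posts at read time. The sketch below is a minimal in-memory illustration of that idea, not a reference answer. `FeedService`, `CELEBRITY_THRESHOLD`, and `FEED_CAP` are hypothetical names, and the threshold is set unrealistically low so the behavior is easy to see.

```python
import heapq
from collections import defaultdict

CELEBRITY_THRESHOLD = 3   # assumed cutoff; a real system would use a far larger value
FEED_CAP = 500            # assumed per-user cap on precomputed feed entries


class FeedService:
    def __init__(self):
        self.followers = defaultdict(set)    # author_id -> follower ids
        self.followees = defaultdict(set)    # user_id -> followed author ids
        self.timelines = defaultdict(list)   # author_id -> [(ts, post_id)], oldest first
        self.feeds = defaultdict(list)       # user_id -> [(ts, post_id)], oldest first

    def follow(self, follower, followee):
        self.followers[followee].add(follower)
        self.followees[follower].add(followee)

    def is_celebrity(self, author):
        return len(self.followers[author]) >= CELEBRITY_THRESHOLD

    def publish(self, author, post_id, ts):
        # Every post lands in the author's own timeline.
        self.timelines[author].append((ts, post_id))
        if self.is_celebrity(author):
            return  # fan-out on read: skip the per-follower writes
        # Fan-out on write: one small write per follower.
        for follower in self.followers[author]:
            feed = self.feeds[follower]
            feed.append((ts, post_id))
            del feed[:-FEED_CAP]  # drop the oldest entries beyond the cap

    def home_feed(self, user, limit=20):
        # Start from the precomputed feed, then merge in celebrity timelines.
        # A set removes duplicates if an author crossed the threshold
        # after some of their posts had already been pushed.
        candidates = set(self.feeds[user])
        for author in self.followees[user]:
            if self.is_celebrity(author):
                candidates.update(self.timelines[author][-limit:])
        newest_first = heapq.nlargest(limit, candidates)
        return [post_id for _, post_id in newest_first]
```

With this split, a post from an ordinary account costs one write per follower but makes reads a single lookup, while a celebrity post costs one write in total and adds a bounded merge to each follower's read.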

Part 3 — Scaling: partitioning, media delivery, and recommendations

Scale the system. Cover how the data and feed stores are partitioned across many nodes (and why consistent hashing is used to add/remove nodes with minimal reshuffling), how images are stored and delivered globally, replication for read scale and availability, and a first-cut design for the friend/follow recommendation feature.
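
To make the consistent-hashing argument concrete, here is a small sketch of a hash ring with virtual nodes. `HashRing`, the `vnodes` count, and the node names used with it are assumptions for illustration; MD5 is used only as a convenient, stable hash, not for security. The property it demonstrates is that adding a node moves only the keys that the new node takes over, rather than reshuffling everything as `hash(key) % N` would.

```python
import bisect
import hashlib


def _hash(key: str) -> int:
    # Stable 64-bit position on the ring, identical across processes.
    return int.from_bytes(hashlib.md5(key.encode()).digest()[:8], "big")


class HashRing:
    def __init__(self, nodes=(), vnodes=100):
        self.vnodes = vnodes    # virtual nodes per physical node, to smooth the load
        self._points = []       # sorted ring positions
        self._owner = {}        # ring position -> node name
        for node in nodes:
            self.add_node(node)

    def add_node(self, node):
        for i in range(self.vnodes):
            point = _hash(f"{node}#{i}")
            self._owner[point] = node
            bisect.insort(self._points, point)

    def remove_node(self, node):
        self._points = [p for p in self._points if self._owner[p] != node]
        self._owner = {p: n for p, n in self._owner.items() if n != node}

    def node_for(self, key):
        if not self._points:
            raise LookupError("ring is empty")
        # The first ring position clockwise from the key's hash owns the key.
        i = bisect.bisect_right(self._points, _hash(str(key)))
        return self._owner[self._points[i % len(self._points)]]
```

A quick way to see the effect is to map a few thousand user ids onto a four-node ring, add a fifth node, and compare assignments: every key that changes owner moves to the new node, and the rest stay where they were.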

Follow-up Questions

A user with 5,000 followees opens their feed — how do you keep that read fast under fan-out-on-write, and what's the cost of fan-out-on-read for them?
When a user unfollows someone, how do the already-pushed posts get removed (or do they), and what consistency guarantee do you offer?
How would you evolve the strictly chronological feed into a ranked feed without rebuilding the whole pipeline?
How do you keep the precomputed feed store from growing without bound (per-user feed length caps, TTLs, recomputation on read for inactive users)?
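
As one possible starting point for the unfollow and feed-growth questions, the sketch below assumes each precomputed feed entry is a `(ts, post_id, author_id)` tuple. The function names and the cap and TTL values are hypothetical. It shows two simple tactics: trimming a stored feed by age and length, and filtering at read time against the current followee set so that already-pushed posts from an unfollowed author disappear without a fan-out delete.

```python
FEED_CAP = 500                      # assumed maximum entries kept per user
FEED_TTL_SECONDS = 7 * 24 * 3600    # assumed maximum age of a stored entry


def trim_feed(entries, now, cap=FEED_CAP, ttl=FEED_TTL_SECONDS):
    """Return the entries still worth storing, newest first."""
    fresh = [e for e in entries if now - e[0] <= ttl]
    fresh.sort(reverse=True)
    return fresh[:cap]


def visible_feed(entries, current_followees, limit=20):
    """Return post ids to show, newest first, hiding authors no longer followed."""
    visible = []
    for ts, post_id, author_id in sorted(entries, reverse=True):
        if author_id in current_followees:
            visible.append(post_id)
            if len(visible) == limit:
                break
    return visible
```

Read-time filtering trades a small amount of extra work per read for not having to rewrite feeds on every unfollow; the stale entries then age out through the cap or TTL.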
