PracHub

Design a Google Photos–like service

Last updated: Mar 29, 2026

Quick Overview

This question evaluates proficiency in designing large-scale distributed media systems, testing competencies in distributed storage and caching, ingest pipelines, CDN-backed delivery, indexing/search, sync/conflict resolution, access control/privacy compliance, and cost/operational planning.


Company: Pinterest

Role: Software Engineer

Category: System Design

Difficulty: hard

Interview Round: Onsite

Design a Google Photos–like service for billions of photos and videos. Cover client upload workflows (mobile/desktop), media ingestion, metadata extraction, deduplication and content hashing, storage architecture (hot/cold tiers), CDN delivery, sharing and permissions, search and indexing (by time, location, EXIF), thumbnail/transform pipelines, sync across devices, privacy/compliance, disaster recovery, and cost estimation. Provide high-level APIs, data models, partitioning/sharding strategies, and choices for consistency and availability.

Sep 6, 2025, 12:00 AM

System Design: Google Photos–like Service (Web + Mobile)

Context

Design a large-scale consumer media service that ingests, stores, indexes, and serves billions of photos and videos across mobile and desktop. Users expect near-instant uploads, fast viewing, powerful search, easy sharing, and seamless sync across devices with strong privacy.

Assume a modern cloud environment with globally distributed users.

Scale Assumptions (to ground trade-offs)

  • Users: 100M MAU, 25M DAU.
  • Uploads per DAU: ~5 photos/day, 0.1 videos/day → ~125M photos/day, 2.5M videos/day.
  • Average sizes: photo 4 MB, video 50 MB.
  • New data/day: ~0.5 PB photos (125M × 4 MB) + ~125 TB videos (2.5M × 50 MB) ≈ 0.625 PB/day.
  • Peak ingest: 3× daily average during regional evenings.
  • Read-heavy: 10× more reads than writes; p95 photo load < 200 ms from cache.

You may adjust numbers slightly if needed; justify trade-offs.
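
The arithmetic behind these assumptions can be checked with a short script. This is a minimal sketch, not part of the original question: the function name `estimate_ingest` and its parameter names are made up for illustration, decimal units are assumed (1 TB = 1,000,000 MB), and the 3× peak factor is applied uniformly to both request rate and bandwidth.

```python
# Back-of-envelope ingest estimate from the stated scale assumptions.
# Assumption: decimal units (1 TB = 1_000_000 MB); all names are illustrative.

SECONDS_PER_DAY = 86_400
MB_PER_TB = 1_000_000


def estimate_ingest(dau=25_000_000, photos_per_dau=5, videos_per_dau=0.1,
                    photo_mb=4, video_mb=50, peak_factor=3):
    """Return daily volume and average/peak ingest rates as a dict."""
    photos_per_day = dau * photos_per_dau
    videos_per_day = dau * videos_per_dau

    photo_tb = photos_per_day * photo_mb / MB_PER_TB
    video_tb = videos_per_day * video_mb / MB_PER_TB
    total_tb = photo_tb + video_tb

    # Average request rate and bandwidth, then scaled by the peak factor.
    avg_uploads_per_s = (photos_per_day + videos_per_day) / SECONDS_PER_DAY
    avg_gbps = total_tb * 1e12 * 8 / SECONDS_PER_DAY / 1e9

    return {
        "photos_per_day": photos_per_day,
        "videos_per_day": videos_per_day,
        "photo_tb_per_day": photo_tb,
        "video_tb_per_day": video_tb,
        "total_tb_per_day": total_tb,
        "avg_uploads_per_s": avg_uploads_per_s,
        "peak_uploads_per_s": avg_uploads_per_s * peak_factor,
        "avg_gbps": avg_gbps,
        "peak_gbps": avg_gbps * peak_factor,
    }


if __name__ == "__main__":
    for key, value in estimate_ingest().items():
        print(f"{key}: {value:,.1f}")
```

With the defaults this gives about 625 TB of new data per day, roughly 1,500 uploads/s on average (about 4,400/s at the 3× peak), and about 58 Gbps of average ingest bandwidth before any replication or variant generation.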

Requirements

  1. Client upload workflows
    • Mobile: background/resumable uploads, low battery/data usage, offline queue, idempotency.
    • Desktop: bulk sync, folder watch, conflict resolution.
  2. Media ingestion pipeline
    • Edge upload, resumable sessions, virus/abuse scanning, metadata extraction (EXIF), deduplication via content hashes.
  3. Storage architecture
    • Object storage for originals and variants; hot/cold tiers; lifecycle policies; encryption.
  4. Delivery
    • CDN-backed delivery for images/thumbnails/video streaming; signed URLs; cache strategies.
  5. Sharing & permissions
    • Private-by-default; per-item/album ACLs; shareable links; link expiration; collaborative albums.
  6. Search & indexing
    • Filter by time/location/EXIF; text tags; face/object recognition; similarity search; privacy-preserving per-user indexes.
  7. Thumbnail/transforms
    • Multi-size thumbnails; server-side video transcoding (HLS/DASH); on-demand vs precompute trade-offs.
  8. Sync across devices
    • Delta sync; read-your-writes; notifications; conflict resolution.
  9. Privacy & compliance
    • E2E transport security; encryption at rest; data residency; GDPR/CCPA deletion; audit logging; safe content handling.
  10. Disaster recovery
    • Multi-region durability; RPO/RTO targets; backups; runbooks.
  11. Cost awareness
    • High-level capacity planning and monthly cost estimation; cost-reduction levers.
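
Requirement 2 asks for deduplication via content hashes. One possible shape for that step is sketched below, assuming SHA-256 over the raw bytes as the content hash and an in-memory dictionary standing in for the object store. The names `content_hash` and `DedupStore` are hypothetical; a real design would stream to object storage and keep the reference table in a database.

```python
import hashlib
import io

CHUNK_SIZE = 1024 * 1024  # read 1 MiB at a time so large videos are never fully buffered


def content_hash(stream, chunk_size=CHUNK_SIZE):
    """Return the hex SHA-256 digest of a binary stream, read in chunks."""
    hasher = hashlib.sha256()
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        hasher.update(chunk)
    return hasher.hexdigest()


class DedupStore:
    """Toy content-addressed store: one blob per hash, many references to it."""

    def __init__(self):
        self.blobs = {}  # content hash -> bytes (stand-in for object storage)
        self.refs = {}   # content hash -> set of (user_id, media_id)

    def put(self, user_id, media_id, data):
        """Store data once per distinct content; return (hash, stored_new_blob)."""
        digest = content_hash(io.BytesIO(data))
        stored_new_blob = digest not in self.blobs
        if stored_new_blob:
            self.blobs[digest] = data
        self.refs.setdefault(digest, set()).add((user_id, media_id))
        return digest, stored_new_blob


if __name__ == "__main__":
    store = DedupStore()
    print(store.put("alice", "m1", b"same bytes"))
    print(store.put("bob", "m7", b"same bytes"))  # second element is False: blob reused
```

Whether to deduplicate across users or only within one account is a design choice worth stating in the interview: cross-user deduplication saves more storage, but the "already stored" signal can reveal that someone else holds the same file, so it should not be exposed to clients.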

Deliverables

  • High-level architecture and key components.
  • API design (upload, list, get, search, share), idempotency, auth.
  • Data model (logical schema) for users, media, albums, ACLs, indexes.
  • Partitioning/sharding strategies.
  • Consistency vs availability choices per workflow.
  • Reasonable cost estimate and key optimizations.
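
As a starting point for the data model and partitioning deliverables, the sketch below shows one common option: shard media metadata by owner so that a user's timeline lives on a single shard, and sort within the shard by capture time. Everything here is an assumption for illustration, including the record fields, the shard count of 1024, and the names `MediaRecord` and `shard_for_user`; it is one candidate answer, not the expected one.

```python
import hashlib
from dataclasses import dataclass

NUM_SHARDS = 1024  # assumption: fixed logical shard count, mapped to nodes elsewhere


def shard_for_user(user_id, num_shards=NUM_SHARDS):
    """Map a user ID to a logical shard using a stable hash.

    Python's built-in hash() is randomized per process for strings,
    so a cryptographic digest is used to keep placement deterministic.
    """
    digest = hashlib.sha256(user_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards


@dataclass(frozen=True)
class MediaRecord:
    """Logical metadata row for one photo or video (fields are illustrative)."""
    user_id: str
    media_id: str
    content_hash: str    # points at the deduplicated blob in object storage
    captured_at_ms: int  # capture time from EXIF, or upload time as a fallback
    kind: str            # "photo" or "video"

    def partition_key(self):
        # All of a user's media lands on one shard: timeline reads hit one place.
        return shard_for_user(self.user_id)

    def sort_key(self):
        # Within the shard, order by capture time; media_id breaks ties.
        return (self.captured_at_ms, self.media_id)


if __name__ == "__main__":
    rec = MediaRecord("user-42", "m1", "ab12", 1_700_000_000_000, "photo")
    print(rec.partition_key(), rec.sort_key())
```

Sharding by user keeps listing and per-user search local to one shard, at the cost of hot spots for very heavy users and cross-shard fan-out for shared albums; those trade-offs are part of what the partitioning deliverable asks candidates to discuss.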
