Design an image copyright-violation detection system
Company: Meta
Role: Machine Learning Engineer
Category: ML System Design
Difficulty: medium
Interview Round: Onsite
Design an ML system that detects whether a **user-uploaded image** violates copyright.
### Requirements
- Input: an image uploaded by a user (optionally with text/caption and user/account metadata).
- Output: a decision such as `{allow, block, human_review}` plus an explanation signal (e.g., matched copyrighted work ID, similarity score, region of match).
- Must scale to large numbers of uploads and a large database of copyrighted/reference images.
### Follow-up questions to address
1. **How would you fine-tune** a pretrained vision(-language) model used for embeddings or classification?
2. If an uploaded image is a **3×3 grid (collage)** and only one tile is infringing, how do you detect that?
3. How do you handle **adversarial text overlays** or other adversarial manipulations intended to evade detection?
4. If the original work is copyrighted, does a **photo of the work** (e.g., taken by a camera from a screen/poster) count as infringement, and how should the system treat such near-duplicates?
Assume you can combine retrieval, classification, and human review, and you must justify metrics, thresholds, and latency/cost tradeoffs.
Quick Answer: This question evaluates competency in designing scalable machine learning systems for image copyright detection, testing knowledge across computer vision, multimodal retrieval, similarity matching, robustness to adversarial edits, and handling partial matches like collages.