PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/System Design/Rippling

Design a news feed aggregator

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a candidate's competence in designing large-scale distributed systems, including real-time ingestion, personalized ranking, data modeling and indexing, search, scalability, reliability, security, and operational practices.

  • hard
  • Rippling
  • System Design
  • Software Engineer

Design a news feed aggregator

Company: Rippling

Role: Software Engineer

Category: System Design

Difficulty: hard

Interview Round: Take-home Project

Design a large-scale news aggregator and personalized feed system. Requirements: - Ingest articles from thousands of publishers via RSS/webhooks/APIs; handle rate limits, retries, failures, and deduplication. - Support near-real-time updates (<5 seconds end-to-end) and backfill for missed content. - Provide a personalized ranked feed per user using signals (followed sources, topics, recency, engagement) and experimentation support (A/B, holdouts). - Implement search and topic/tag pages; support geo and language filters. - Ensure idempotent ingestion, content normalization, media handling, and spam/NSFW detection. - Design storage (hot/cold), indexing, and caching layers; include data models and schemas. - Describe feed generation strategy (pull vs. push; fan-out-on-write/read) and trade-offs. - Plan for multi-region availability, eventual consistency, and disaster recovery. - Estimate capacity (QPS, throughput, data volume), scaling strategy, and cost controls. - Define external/internal APIs, rate limiting, authentication/authorization, and user privacy/compliance (GDPR/CCPA). - Include monitoring, alerting, logging, and SLOs; describe rollback and incident mitigation. - Provide a phased rollout and end-to-end testing plan.

Quick Answer: This question evaluates a candidate's competence in designing large-scale distributed systems, including real-time ingestion, personalized ranking, data modeling and indexing, search, scalability, reliability, security, and operational practices.

Related Interview Questions

  • Design a personalized news aggregator - Rippling (medium)
  • Design a Scalable News Feed - Rippling (medium)
  • Design Scalable Expense Violation Processing - Rippling (hard)
  • Design a news aggregator like Google News - Rippling (medium)
  • Design several large-scale systems - Rippling (hard)
Rippling logo
Rippling
Jul 26, 2025, 12:00 AM
Software Engineer
Take-home Project
System Design
16
0

System Design Take‑Home: Large-Scale News Aggregator and Personalized Feed

Context

Design a production-ready news aggregation and personalized feed platform that ingests articles from thousands of publishers, updates users’ feeds in near real time, and supports search, topics, and experimentation at scale.

Functional Requirements

  • Ingest articles from thousands of publishers via RSS, webhooks, and partner APIs.
    • Handle publisher rate limits, retries, transient failures, and deduplication.
    • Ensure idempotent ingestion and content normalization.
    • Support media handling (images/video) and spam/NSFW detection.
  • Near–real-time updates: end-to-end under 5 seconds from publisher to user feed.
    • Provide backfill for missed content/outages.
  • Personalized, ranked feed per user using signals: followed sources, topics, recency, user engagement.
    • Support experimentation (A/B tests, holdouts) for ranking variants and UI.
  • Implement keyword search and topic/tag pages, including geo and language filters.

Data, Storage, and Indexing

  • Design hot vs. cold storage, indexing, and caching layers.
  • Provide core data models and schemas for publishers, articles, users, follows, topics, engagements, experiments, and feeds.

Feed Generation Strategy

  • Describe pull vs. push and fan‑out‑on‑write vs. fan‑out‑on‑read trade‑offs.
  • Choose a strategy (or hybrid) and justify it for scale and latency targets.

Reliability and Availability

  • Multi-region availability with eventual consistency.
  • Disaster recovery plan and RPO/RTO targets.

Capacity and Cost

  • Estimate capacity: QPS, throughput, data volumes.
  • Propose scaling strategies and cost controls.

APIs, Security, and Privacy

  • Define external and internal APIs, rate limiting, authentication/authorization.
  • Address user privacy and compliance (GDPR/CCPA): consent, data deletion/export.

Operations

  • Monitoring, alerting, logging, tracing, and SLOs.
  • Rollback and incident mitigation strategies.

Rollout and Testing

  • Phased rollout plan and end‑to‑end testing strategy (including load, chaos, and experiment guardrails).

Solution

Show

Submit Your Answer

Sign in to leave a comment

Loading comments...

Browse More Questions

More System Design•More Rippling•More Software Engineer•Rippling Software Engineer•Rippling System Design•Software Engineer System Design
PracHub

Master your tech interviews with 8,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.