PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/ML System Design/Reddit

Design comment-likelihood prediction platform

Last updated: Apr 28, 2026

Quick Overview

This question evaluates a candidate's competency in ML system design and infrastructure, covering feature pipelines, feature store strategy, training/serving consistency, online inference at high QPS, and production monitoring.

  • medium
  • Reddit
  • ML System Design
  • Software Engineer

Design comment-likelihood prediction platform

Company: Reddit

Role: Software Engineer

Category: ML System Design

Difficulty: medium

Interview Round: Onsite

## Scenario You’re building an ML platform component that serves a model to predict **the likelihood that a user will comment on a given post**. The interviewer says you can treat the model as a **black box** (you don’t need to pick a specific architecture); focus on **ML infrastructure**: feature pipelines, feature store, training/serving, inference, and production concerns. ## Goals Design an end-to-end system that supports: - Offline training data generation and model training - Online inference (real-time scoring) for product surfaces (e.g., feed ranking, notification candidate scoring) - A feature store strategy (offline + online) - Monitoring, logging, and iteration ## Requirements & constraints (you may make reasonable assumptions) - High QPS online scoring (potentially tens of thousands/sec) - P95 latency budget for scoring: e.g., 50–150 ms end-to-end (state your assumption) - Freshness: some features need near-real-time updates (seconds to minutes) - Avoid training/serving feature skew - Handle cold start (new users/posts) - Support A/B testing and safe rollout ## Deliverables Explain: 1. Data sources and event logging 2. Feature engineering and feature store design 3. Training pipeline and dataset versioning 4. Online inference architecture (batch vs real-time, caching, fallbacks) 5. Monitoring (data + model), retraining triggers, and reliability considerations

Quick Answer: This question evaluates a candidate's competency in ML system design and infrastructure, covering feature pipelines, feature store strategy, training/serving consistency, online inference at high QPS, and production monitoring.

Related Interview Questions

  • Design comment ranking - Reddit (hard)
  • Design a video recommendation system - Reddit (medium)
  • Design a feature store with CI/CD and reliability - Reddit (hard)
Reddit logo
Reddit
Nov 10, 2025, 12:00 AM
Software Engineer
Onsite
ML System Design
7
0

Scenario

You’re building an ML platform component that serves a model to predict the likelihood that a user will comment on a given post.

The interviewer says you can treat the model as a black box (you don’t need to pick a specific architecture); focus on ML infrastructure: feature pipelines, feature store, training/serving, inference, and production concerns.

Goals

Design an end-to-end system that supports:

  • Offline training data generation and model training
  • Online inference (real-time scoring) for product surfaces (e.g., feed ranking, notification candidate scoring)
  • A feature store strategy (offline + online)
  • Monitoring, logging, and iteration

Requirements & constraints (you may make reasonable assumptions)

  • High QPS online scoring (potentially tens of thousands/sec)
  • P95 latency budget for scoring: e.g., 50–150 ms end-to-end (state your assumption)
  • Freshness: some features need near-real-time updates (seconds to minutes)
  • Avoid training/serving feature skew
  • Handle cold start (new users/posts)
  • Support A/B testing and safe rollout

Deliverables

Explain:

  1. Data sources and event logging
  2. Feature engineering and feature store design
  3. Training pipeline and dataset versioning
  4. Online inference architecture (batch vs real-time, caching, fallbacks)
  5. Monitoring (data + model), retraining triggers, and reliability considerations

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More ML System Design•More Reddit•More Software Engineer•Reddit Software Engineer•Reddit ML System Design•Software Engineer ML System Design
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.