PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/System Design/HubSpot

Design a near-real-time weather API

Last updated: Jun 15, 2026

Quick Overview

A HubSpot onsite system-design question: build a weather data platform that ingests hourly dumps for ~2,000 U.S. locations and serves the current temperature with an at-most-10-minute freshness SLA. It tests ingestion pipelines, idempotent retries and backfill, caching and invalidation, storage modeling, rate-limit handling, scaling, and observability on freshness SLOs.

  • hard
  • HubSpot
  • System Design
  • Software Engineer

Design a near-real-time weather API

Company: HubSpot

Role: Software Engineer

Category: System Design

Difficulty: hard

Interview Round: Onsite

##### Question Design a weather data platform that retrieves best-effort hourly dumps from an external Weather Service API for approximately 2,000 locations across the U.S. and exposes an API to return the **current temperature** for a requested location. **Hard requirement:** data served must be at most **10 minutes stale** relative to the provider's hourly publish time. The system should: 1. Define a **high-level architecture**: ingestion, storage, cache, read API, and scheduler/orchestrator. 2. Design the **API contract**: e.g. `GET /temperature?locationId=...` (and optionally `lat,lon`), the response schema, error cases, freshness metadata, and versioning. Support lookup by internal `id` and by nearest covered location for `lat,lon`. 3. Choose a **data model and storage** (including the location schema, TTL/retention, indexing, and partitioning strategy). 4. Specify the **ingestion pipeline**: detecting a new provider publish, fan-out, retries with backoff, idempotent ingestion, deduplication, and backfill of missed hours, while respecting provider **rate limits**. 5. Lay out the **caching and invalidation plan** (TTLs, write-through, event-driven invalidation, optional CDN) that meets the 10-minute freshness SLA, plus the API-side freshness gate. 6. Cover **scaling and partitioning** for traffic spikes, and provide rough **QPS and storage estimates**. 7. Describe **deployment, cost, and security** considerations. 8. Describe the **test and observability** strategy: monitoring/alerting on freshness SLOs, failure handling for upstream slowness / partial failures / outages, and graceful degradation. 9. Discuss **multi-region availability and disaster recovery** (RPO/RTO).

Quick Answer: A HubSpot onsite system-design question: build a weather data platform that ingests hourly dumps for ~2,000 U.S. locations and serves the current temperature with an at-most-10-minute freshness SLA. It tests ingestion pipelines, idempotent retries and backfill, caching and invalidation, storage modeling, rate-limit handling, scaling, and observability on freshness SLOs.

Related Interview Questions

  • Design a video streaming platform like Netflix/YouTube - HubSpot (medium)
  • Design an hourly weather crawling service - HubSpot (easy)
  • Design a Netflix-like video platform - HubSpot (easy)
  • Design a streaming video platform MVP (Netflix-like) - HubSpot (hard)
  • Schedule and execute future payments - HubSpot (medium)
HubSpot logo
HubSpot
Sep 6, 2025, 12:00 AM
Software Engineer
Onsite
System Design
12
0
Question

Design a weather data platform that retrieves best-effort hourly dumps from an external Weather Service API for approximately 2,000 locations across the U.S. and exposes an API to return the current temperature for a requested location.

Hard requirement: data served must be at most 10 minutes stale relative to the provider's hourly publish time.

The system should:

  1. Define a high-level architecture : ingestion, storage, cache, read API, and scheduler/orchestrator.
  2. Design the API contract : e.g. GET /temperature?locationId=... (and optionally lat,lon ), the response schema, error cases, freshness metadata, and versioning. Support lookup by internal id and by nearest covered location for lat,lon .
  3. Choose a data model and storage (including the location schema, TTL/retention, indexing, and partitioning strategy).
  4. Specify the ingestion pipeline : detecting a new provider publish, fan-out, retries with backoff, idempotent ingestion, deduplication, and backfill of missed hours, while respecting provider rate limits .
  5. Lay out the caching and invalidation plan (TTLs, write-through, event-driven invalidation, optional CDN) that meets the 10-minute freshness SLA, plus the API-side freshness gate.
  6. Cover scaling and partitioning for traffic spikes, and provide rough QPS and storage estimates .
  7. Describe deployment, cost, and security considerations.
  8. Describe the test and observability strategy: monitoring/alerting on freshness SLOs, failure handling for upstream slowness / partial failures / outages, and graceful degradation.
  9. Discuss multi-region availability and disaster recovery (RPO/RTO).

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More System Design•More HubSpot•More Software Engineer•HubSpot Software Engineer•HubSpot System Design•Software Engineer System Design
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.