PracHub

Investigate High Memory Usage

Last updated: Mar 29, 2026

Quick Overview

This question evaluates incident-response and operational debugging skills, focusing on diagnosing high memory usage in a distributed microservice architecture that includes caching and third-party integrations.

Company: DoorDash

Role: Software Engineer

Category: Software Engineering Fundamentals

Difficulty: medium

Interview Round: Onsite

Jan 31, 2026, 12:00 AM

You are the on-call engineer for a delivery platform.

System context

  • Couriers use a mobile app to accept and complete deliveries.
  • The mobile app calls a public gateway service (Dasher Service), which then calls a Payment Card Integration Service.
  • For some merchants, the courier must pay in person using a prepaid debit card.
  • That card is funded programmatically during checkout through a third-party payment card provider.
  • The integration service also relies on Redis for card and account information caching.
  • The company is in the middle of migrating from a monolith to microservices.

High-level flow: Courier App -> Dasher Service -> Payment Card Integration Service -> Third-Party Card Provider

Payment Card Integration Service <-> Redis cache
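
The flow above can be modeled in a few lines to make the moving parts concrete. This is a minimal sketch under stated assumptions, not the real system: the class and method names (FakeCache, FakeCardProvider, fund_card, checkout), the "card:<courier_id>" key format, and the cache-aside read pattern are all hypothetical, since the question only says that the integration service relies on Redis for card and account information caching.

```python
from dataclasses import dataclass, field


@dataclass
class FakeCache:
    """In-memory stand-in for Redis (hypothetical; no TTL or eviction)."""
    store: dict = field(default_factory=dict)

    def get(self, key):
        return self.store.get(key)

    def set(self, key, value):
        self.store[key] = value


@dataclass
class FakeCardProvider:
    """Stand-in for the third-party card provider; counts calls it receives."""
    lookups: int = 0
    fundings: int = 0

    def lookup_card(self, courier_id):
        self.lookups += 1
        return {"courier_id": courier_id, "card_id": f"card-{courier_id}"}

    def fund(self, card_id, amount_cents):
        self.fundings += 1
        return {"card_id": card_id, "funded_cents": amount_cents}


class PaymentCardIntegrationService:
    """Assumed cache-aside read: check the cache, fall back to the provider."""

    def __init__(self, cache, provider):
        self.cache = cache
        self.provider = provider

    def fund_card(self, courier_id, amount_cents):
        key = f"card:{courier_id}"  # hypothetical key format
        card = self.cache.get(key)
        if card is None:
            # Cache miss: fetch card info from the provider and cache it.
            card = self.provider.lookup_card(courier_id)
            self.cache.set(key, card)
        return self.provider.fund(card["card_id"], amount_cents)


class DasherService:
    """Public gateway that forwards checkout requests from the courier app."""

    def __init__(self, integration):
        self.integration = integration

    def checkout(self, courier_id, amount_cents):
        return self.integration.fund_card(courier_id, amount_cents)


provider = FakeCardProvider()
dasher = DasherService(PaymentCardIntegrationService(FakeCache(), provider))
dasher.checkout("courier-1", 2500)
dasher.checkout("courier-1", 1200)  # second call reuses the cached card info
```

Even in this toy version, every funding request depends on both the cache and the provider, so slowness in either one keeps requests (and the objects they hold) alive longer inside the integration service.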

Incident

It is 4:30 PM Pacific, during a busy period, and you are paged because the Payment Card Integration Service is showing much higher than expected memory utilization.

Explain how you would handle this on-call investigation. Your answer should cover:

  1. How you would assess severity and business impact.
  2. What metrics, dashboards, and logs you would check first.
  3. The most likely causes of high memory usage in this architecture.
  4. How you would determine whether the issue is caused by traffic, a recent deploy, Redis behavior, retries, or the third-party provider.
  5. Immediate mitigation steps you would consider.
  6. How you would communicate during the incident.
  7. What long-term fixes or follow-up actions you would propose after recovery.
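
As an illustration of point 4, the comparison of candidate causes can be sketched as a small rule-based check over a snapshot of signals. Everything here is an assumption made for the example: the Signals fields, the suspects function, and every threshold are hypothetical and would need to be replaced by the team's real dashboards and baselines. It is one possible way to structure the reasoning, not a prescribed answer.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Signals:
    """Snapshot of hypothetical signals, each relative to a normal baseline."""
    request_rate_ratio: float      # current request rate / baseline for this hour
    deploy_minutes_before_rise: Optional[float]  # None if no recent deploy
    redis_latency_ratio: float     # current Redis latency / baseline
    retry_ratio: float             # retried outbound calls / total outbound calls
    provider_latency_ratio: float  # current provider latency / baseline


def suspects(s: Signals) -> List[str]:
    """Return candidate causes whose signal crosses an illustrative threshold."""
    found = []
    # Memory began rising within an hour after a deploy: check the change first.
    if s.deploy_minutes_before_rise is not None and 0 <= s.deploy_minutes_before_rise <= 60:
        found.append("recent deploy")
    # A slow provider holds requests (and their buffers) in flight longer.
    if s.provider_latency_ratio >= 2.0:
        found.append("third-party provider")
    # Heavy retrying multiplies in-flight work for the same inbound traffic.
    if s.retry_ratio >= 0.2:
        found.append("retries")
    # Slow Redis calls also extend request lifetimes.
    if s.redis_latency_ratio >= 2.0:
        found.append("redis")
    # Memory that merely tracks request volume points at load.
    if s.request_rate_ratio >= 1.5:
        found.append("traffic")
    return found


snapshot = Signals(
    request_rate_ratio=1.1,
    deploy_minutes_before_rise=None,
    redis_latency_ratio=1.0,
    retry_ratio=0.35,
    provider_latency_ratio=3.0,
)
print(suspects(snapshot))
```

In this example the snapshot shows normal traffic and no deploy, but elevated provider latency and retries, so those two are the hypotheses to confirm first.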
