This question evaluates system design proficiency in distributed resource accounting, real-time quota enforcement, billing models, idempotency, and observability for multi-tenant GPU platforms, and falls under the System Design category.
You are designing a multi-tenant platform where organizations run GPU jobs. Each organization receives monthly GPU credits that are consumed as jobs run. The system must expose APIs to issue, transfer, and spend credits; enforce budgets and rate limits in real time; and integrate with a job scheduler to admit or reject workloads based on available credits.
Assume credits are the unit of spend (e.g., 1 credit = $0.01 of GPU time) and that different GPU types have different prices (credits per GPU-minute). Jobs may be submitted by users to projects within an org. The platform should support prepaid and postpaid billing models.
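To make the pricing assumption above concrete, here is a minimal sketch of the cost arithmetic. The GPU type names and their per-minute prices are hypothetical placeholders, not values given in the prompt; only the conversion of 1 credit = $0.01 comes from the example above.

```python
from decimal import Decimal

# Hypothetical price table: credits charged per GPU-minute, by GPU type.
# These numbers are illustrative assumptions only.
CREDITS_PER_GPU_MINUTE = {
    "A100": 400,
    "T4": 50,
}

# From the prompt's example: 1 credit = $0.01 of GPU time.
USD_PER_CREDIT = Decimal("0.01")


def job_cost_credits(gpu_type: str, gpu_count: int, minutes: int) -> int:
    """Return the credit cost of running gpu_count GPUs for the given minutes."""
    if gpu_count <= 0 or minutes < 0:
        raise ValueError("gpu_count must be positive and minutes non-negative")
    rate = CREDITS_PER_GPU_MINUTE[gpu_type]  # KeyError for unknown GPU types
    return rate * gpu_count * minutes


def credits_to_usd(credits: int) -> Decimal:
    """Convert integer credits to dollars without floating-point rounding."""
    return credits * USD_PER_CREDIT
```

Keeping credits as integers and converting to dollars with `Decimal` only at the edge avoids accumulating floating-point error in balances, which is one reasonable choice among several.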
Design a service that:
Make minimal reasonable assumptions where unspecified, and call them out.
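As a starting point for discussion, the issue, transfer, and spend operations and the scheduler's admit-or-reject check could be sketched as a single-process, in-memory ledger with idempotency keys. Every name here (`CreditLedger`, `can_admit`, the key strings) is a hypothetical illustration; it models prepaid balances only and ignores persistence, concurrency, rate limits, and postpaid billing, all of which a real design would need to address.

```python
from dataclasses import dataclass, field


class InsufficientCredits(Exception):
    """Raised when a prepaid balance cannot cover a debit."""


@dataclass
class CreditLedger:
    balances: dict = field(default_factory=dict)  # org id -> integer credits
    seen_keys: set = field(default_factory=set)   # idempotency keys already applied

    def issue(self, org: str, amount: int, key: str) -> None:
        """Grant credits to an org; replaying the same key is a no-op."""
        if key in self.seen_keys:
            return
        if amount <= 0:
            raise ValueError("amount must be positive")
        self.balances[org] = self.balances.get(org, 0) + amount
        self.seen_keys.add(key)

    def spend(self, org: str, amount: int, key: str) -> None:
        """Debit credits for job usage; rejects overdrafts (prepaid model)."""
        if key in self.seen_keys:
            return
        if amount <= 0:
            raise ValueError("amount must be positive")
        if self.balances.get(org, 0) < amount:
            raise InsufficientCredits(org)
        self.balances[org] = self.balances.get(org, 0) - amount
        self.seen_keys.add(key)

    def transfer(self, src: str, dst: str, amount: int, key: str) -> None:
        """Move credits between orgs as one step; replay-safe via the key."""
        if key in self.seen_keys:
            return
        if amount <= 0:
            raise ValueError("amount must be positive")
        if self.balances.get(src, 0) < amount:
            raise InsufficientCredits(src)
        self.balances[src] = self.balances.get(src, 0) - amount
        self.balances[dst] = self.balances.get(dst, 0) + amount
        self.seen_keys.add(key)

    def can_admit(self, org: str, estimated_cost: int) -> bool:
        """Scheduler hook: admit a job only if the balance covers its estimate."""
        return self.balances.get(org, 0) >= estimated_cost
```

For example, calling `issue("org-a", 1000, "issue-1")` twice leaves the balance at 1000, because the second call is recognized by its key and skipped. The key is recorded only after the operation succeeds, so a failed spend can be retried with the same key once credits are available.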