Design GPU inference request batching | Anthropic Interview Question