Design a high-concurrency LLM inference service | Anthropic Interview Question