Design efficient Transformer inference with KV cache | Startups.Com Interview Question