This question evaluates a candidate's ability to design LLM-driven arithmetic solving pipelines, covering decision policies for direct answers versus formula application versus code execution, orchestration and routing logic, tool integration and validation, prompting strategy, safety guardrails, evaluation metrics, and experiment tracking.
You are building an LLM-driven service that answers arithmetic questions ranging from simple expressions (e.g., 4 + 5) to queries like compute the sum from 1 to 100. The system must choose among answering directly, applying closed-form formulas, or executing code in a sandbox.
Provide concrete examples using 4 + 5 and sum from 1 to 100.
Login required