You have an oracle-style math reasoning solver. On each independent run, the time to produce a correct answer is a random variable T with known distribution. If you stop and restart the solver, all previous work is discarded and a fresh independent run begins. This is analogous to a Las Vegas algorithm with a random stopping time.
Answer the following:
-
Explain when restarting can reduce expected completion time compared with simply waiting for one run to finish.
-
For a fixed cutoff
tau
, analyze the policy that restarts whenever a run has not finished by time
tau
.
-
Derive the expected runtime of this single-threshold restart policy and explain how to choose the best threshold.
-
Propose a strategy that can achieve better worst-case robustness than using only one fixed threshold, especially when the runtime distribution has a long tail.