This question evaluates operational troubleshooting, root-cause analysis, and resilience design skills for a single-node web server, testing a candidate's competence in diagnostics, incident response, and architectural mitigation.
You own a single-machine web server (one host running the web service). Suddenly users report that a specific page (or the whole website) is down.
Assume you have standard production access (logs/metrics, SSH, ability to roll back/deploy, etc.), but start from a single-node baseline.