This question evaluates understanding of gradient-based optimization and numerical methods for locating extrema, assessing competency in interpreting derivatives/gradients, step-size effects, and convergence behavior within the Machine Learning domain.
Consider a real-valued, differentiable function f(x) defined on R (or more generally on R^n).
You have access to an oracle that, for any input x, can return both the function value f(x) and its derivative (for n = 1) or gradient (for n > 1), denoted by ∇f(x).
Answer the following:
Focus on the high-level algorithm and reasoning; you do not need to provide code.
Login required