This question evaluates understanding of reinforcement learning fundamentals, specifically the components of Markov decision processes, typical agent-side elements, and their temporal interaction in the agent-environment loop.
In reinforcement learning, we model an agent that interacts with an environment over time. The agent observes the state of the environment, takes actions, and receives rewards.
Describe the standard components of a reinforcement learning formulation:
Give clear definitions and briefly explain the role of each component.
Login required