Which principle is essential to reinforcement learning?

O A programmer has specified the best action for the agent in a given state. If the agent follows this recommendation, it is rewarded.
O The agent receives instructions on how to behave in a particular state.
O The agent receives a reward or punishment at certain points in time and thus learns to assess the value of an action in a given state.
O Reward and punishment are balanced in their frequency of occurrence.

Question 4 (3.0 Pts)

What influence does a reward typically have on the learning process in reinforcement learning?

O All actions that did not contribute to obtaining the reward will be performed less frequently in the future.
O None, as long as the agent has not also experienced a punishment.
O All actions that contributed to the reward will be performed more frequently in the future.
O Only the action that directly led to the reward will be performed more frequently in the future.

