Passive Reinforcement Learning
CSC 261 - Weinman


Answer the following questions. Record your answers in your Reading Journal.
  1. Select the sentence from today's reading that you feel highlights the most important consideration for direct utility estimation. Briefly (3-5 sentences) explain your selection.
  2. Select the sentence from today's reading that you feel best distinguishes between direct utility estimation and adaptive dynamic programming. Briefly (3-5 sentences) explain your selection.
  3. How would you explain the TD update equation (21.3) to a fellow computer science student who has only completed the review of intelligent agent architectures in Chapter 2?
  4. Identify (and briefly elaborate on) the sentence, section or concept from the current or previous reading that remains the most confusing to you.