Passive Reinforcement Learning
CSC 261 - Weinman
Answer the following questions. Record your answers in your Reading
Journal.
- Select the sentence from today's reading that you feel highlights
the most important consideration for direct utility
estimation. Briefly (3-5 sentences) explain your selection.
- Select the sentence from today's reading that you feel best
distinguishes between direct utility estimation and adaptive
dynamic programming. Briefly (3-5 sentences) explain your selection.
- How would you explain the TD update equation (21.3) to a fellow computer
science student who has only completed the review of intelligent agent
architectures in Chapter 2?
- Identify (and briefly elaborate on) the sentence, section or concept
from the current or previous reading that remains the most confusing
to you.