^-^
人工智能、计算机、机器学习、linux、程序员
OpenAI Spinning Up Part 1: Key Concepts in RL
最近更新:2020-06-14   |   字数总计:3.9k   |   阅读估时:15分钟   |   阅读量:
  1. Key Concepts and Terminology
    1. States and Observations
    2. Action Spaces
    3. Policies
      1. Deterministic Policies
      2. Stochastic Policies
    4. Trajectories
    5. Reward and Return
    6. The RL Problem
    7. Value Functions
    8. The Optimal Q-Function and the Optimal Action
    9. Bellman Equations
    10. Advantage Functions
  2. (Optional) Formalism