오늘부터 공부해 보겠음. 도전!
- 23년 1월 21일
Introduction to RL problems & OpenAI Gym
Dynamic Programming: Model-Based RL, Policy Iteration and Value Iteration
Monte Carlo Model-Free Prediction & Control
Temporal Difference Model-Free Prediction & Control
Deep Q Learning (WIP)
Policy Gradient Methods (WIP)
'Reinforcement Learning' 카테고리의 다른 글
MDP (3) | 2023.01.21 |
---|---|
Introduction (2) | 2023.01.21 |