[Sung Kim] Lecture 4: Q-learning (table) exploit&exploration and discounted reward

Reinforcement Learning with TensorFlow&OpenAI Gym lecture - Course web page/slides: hunkim.github.io/ml/ - Inflearn: https://www.inflearn.com/course/reinf... - Community (questions): https://www.facebook.com/groups/Tenso...
