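Book Information
· Category: Foreign Books > Technology & Engineering > Technology & Engineering > General Engineering
· ISBN: 9783030411879
· Pages: 206
· Publication date: 2021-01-03
Table of Contents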
Prediction Error and Actor-Critic Hypotheses in the Brain.- Reviewing on-policy / off-policy critic learning in the context of Temporal Differences and Residual Learning.- Reward Function Design in Reinforcement Learning.- Exploration Methods In Sparse Reward Environments.- A Survey on Constraining Policy Updates Using the KL Divergence.- Fisher Information Approximations in Policy Gradient Methods.- Benchmarking the Natural Gradient in Policy Gradient Methods and Evolution Strategies.- Information-Loss-Bounded Policy Optimization.- Persistent Homology for Dimensionality Reduction.- Model-free Deep Reinforcement Learning - Algorithms and Applications.- Actor vs Critic.- Bring Color to Deep Q-Networks.- Distributed Methods for Reinforcement Learning.- Model-Based Reinforcement Learning.- Challenges of Model Predictive Control in a Black Box Environment.- Control as Inference?