logo
logo
x
바코드검색
BOOKPRICE.co.kr
책, 도서 가격비교 사이트
바코드검색

인기 검색어

일간
|
주간
|
월간

실시간 검색어

검색가능 서점

도서목록 제공

Statistical Reinforcement Learning: Modern Machine Learning Approaches

Statistical Reinforcement Learning: Modern Machine Learning Approaches (Hardcover)

Masashi Sugiyama, Hirotaka Hachiya (지은이)
Chapman & Hall
200,780원

일반도서

검색중
서점 할인가 할인률 배송비 혜택/추가 실질최저가 구매하기
164,630원 -18% 0원
8,240원
156,390원 >
yes24 로딩중
교보문고 로딩중
notice_icon 검색 결과 내에 다른 책이 포함되어 있을 수 있습니다.

중고도서

검색중
서점 유형 등록개수 최저가 구매하기
로딩중

eBook

검색중
서점 정가 할인가 마일리지 실질최저가 구매하기
로딩중

책 이미지

Statistical Reinforcement Learning: Modern Machine Learning Approaches
eBook 미리보기

책 정보

· 제목 : Statistical Reinforcement Learning: Modern Machine Learning Approaches (Hardcover) 
· 분류 : 외국도서 > 경제경영 > 통계
· ISBN : 9781439856895
· 쪽수 : 206쪽
· 출판일 : 2015-04-15

목차

Introduction to Reinforcement Learning
Reinforcement Learning
Mathematical Formulation
Structure of the Book
     Model-Free Policy Iteration
     Model-Free Policy Search
     Model-Based Reinforcement Learning

MODEL-FREE POLICY ITERATION

Policy Iteration with Value Function Approximation
Value Functions
     State Value Functions
     State-Action Value Functions
Least-Squares Policy Iteration
      Immediate-Reward Regression
     Algorithm
     Regularization
     Model Selection
Remarks

Basis Design for Value Function Approximation
Gaussian Kernels on Graphs
     MDP-Induced Graph
     Ordinary Gaussian Kernels
     Geodesic Gaussian Kernels
     Extension to Continuous State Spaces
Illustration
     Setup
     Geodesic Gaussian Kernels
     Ordinary Gaussian Kernels
     Graph-Laplacian Eigenbases
     Diffusion Wavelets
Numerical Examples
     Robot-Arm Control
     Robot-Agent Navigation
Remarks

Sample Reuse in Policy Iteration
Formulation
Off-Policy Value Function Approximation
     Episodic Importance Weighting
     Per-Decision Importance Weighting
     Adaptive Per-Decision Importance Weighting
     Illustration
Automatic Selection of Flattening Parameter
     Importance-Weighted Cross-Validation
     Illustration
Sample-Reuse Policy Iteration
     Algorithm
     Illustration
Numerical Examples
     Inverted Pendulum
     Mountain Car
Remarks

Active Learning in Policy Iteration
Efficient Exploration with Active Learning
     Problem Setup
     Decomposition of Generalization Error
     Estimation of Generalization Error
     Designing Sampling Policies
     Illustration
Active Policy Iteration
     Sample-Reuse Policy Iteration with Active Learning
     Illustration
Numerical Examples
Remarks

Robust Policy Iteration
Robustness and Reliability in Policy Iteration
     Robustness
     Reliability
Least Absolute Policy Iteration
     Algorithm
     Illustration
     Properties
Numerical Examples
Possible Extensions
     Huber Loss
     Pinball Loss
     Deadzone-Linear Loss
     Chebyshev Approximation
     Conditional Value-At-Risk
Remarks

MODEL-FREE POLICY SEARCH

Direct Policy Search by Gradient Ascent
Formulation
Gradient Approach
     Gradient Ascent
     Baseline Subtraction for Variance Reduction
     Variance Analysis of Gradient Estimators
Natural Gradient Approach 
     Natural Gradient Ascent
     Illustration
Application in Computer Graphics: Artist Agent
     Sumie Paining 
     Design of States, Actions, and Immediate Rewards
     Experimental Results
Remarks

Direct Policy Search by Expectation-Maximization
Expectation-Maximization Approach
Sample Reuse
     Episodic Importance Weighting
     Per-Decision Importance Weight
     Adaptive Per-Decision Importance Weighting
     Automatic Selection of Flattening Parameter
     Reward-Weighted Regression with Sample Reuse
Numerical Examples
Remarks

Policy-Prior Search
Formulation
Policy Gradients with Parameter-Based Exploration 
     Policy-Prior Gradient Ascent
     Baseline Subtraction for Variance Reduction
     Variance Analysis of Gradient Estimators
     Numerical Examples
Sample Reuse in Policy-Prior Search 
     Importance Weighting
     Variance Reduction by Baseline Subtraction
     Numerical Examples
Remarks

MODEL-BASED REINFORCEMENT LEARNING

Transition Model Estimation
Conditional Density Estimation
     Regression-Based Approach
     Q-Neighbor Kernel Density Estimation
     Least-Squares Conditional Density Estimation
Model-Based Reinforcement Learning
Numerical Examples
     Continuous Chain Walk
     Humanoid Robot Control
Remarks

Dimensionality Reduction for Transition Model Estimation
Sufficient Dimensionality Reduction
Squared-Loss Conditional Entropy
     Conditional Independence
     Dimensionality Reduction with SCE
     Relation to Squared-Loss Mutual Information
Numerical Examples
     Artificial and Benchmark Datasets 
     Humanoid Robot
Remarks

References
Index

이 포스팅은 쿠팡 파트너스 활동의 일환으로,
이에 따른 일정액의 수수료를 제공받습니다.
이 포스팅은 제휴마케팅이 포함된 광고로 커미션을 지급 받습니다.
도서 DB 제공 : 알라딘 서점(www.aladin.co.kr)
최근 본 책