logo
logo
x
바코드검색
BOOKPRICE.co.kr
책, 도서 가격비교 사이트
바코드검색

인기 검색어

실시간 검색어

검색가능 서점

도서목록 제공

Reinforcement Learning and Dynamic Programming Using Function Approximators

Reinforcement Learning and Dynamic Programming Using Function Approximators (Hardcover)

(Control in Continuous State Spaces)

Robert Babuska, Lucian Busoniu (지은이)
CRC Pr I Llc
255,750원

일반도서

검색중
서점 할인가 할인률 배송비 혜택/추가 실질최저가 구매하기
209,710원 -18% 0원
10,490원
199,220원 >
yes24 로딩중
교보문고 로딩중
notice_icon 검색 결과 내에 다른 책이 포함되어 있을 수 있습니다.

중고도서

검색중
서점 유형 등록개수 최저가 구매하기
로딩중

eBook

검색중
서점 정가 할인가 마일리지 실질최저가 구매하기
로딩중

책 이미지

Reinforcement Learning and Dynamic Programming Using Function Approximators
eBook 미리보기

책 정보

· 제목 : Reinforcement Learning and Dynamic Programming Using Function Approximators (Hardcover) (Control in Continuous State Spaces)
· 분류 : 외국도서 > 컴퓨터 > 기계이론
· ISBN : 9781439821084
· 쪽수 : 280쪽
· 출판일 : 2010-04-29

목차

1 Introduction
The dynamic programming and reinforcement learning problem
Approximation in dynamic programming and reinforcement learning
About this book
2 An introduction to dynamic programming and reinforcement learning
Introduction
Markov decision processes
Value iteration
Policy iteration
Policy search
Summary and discussion
3 Dynamic programming and reinforcement learning in large and continuous
spaces
Introduction
The need for approximation in large and continuous spaces
Approximation architectures
Approximate value iteration
Approximate policy iteration
Finding value function approximators automatically
Approximate policy search
Comparison of approximate value iteration, policy iteration, and policy search
Summary and discussion
4 Approximate value iteration with a fuzzy representation
Introduction
Fuzzy Q-iteration
Analysis of fuzzy Q-iteration
Optimizing the membership functions
Experimental study
Summary and discussion
5 Approximate policy iteration for online learning and continuous-action control
Introduction
A recapitulation of least-squares policy iteration
Online least-squares policy iteration
Online LSPI with prior knowledge
LSPI with continuous-action, polynomial approximation
Experimental study
Summary and discussion
6 Approximate policy search with cross-entropy optimization of basis functions
Introduction
Cross-entropy optimization
Cross-entropy policy search
Experimental study
Summary and discussion
Appendix A Extremely randomized trees
Structure of the approximator
Building and using a tree
Appendix B The cross-entropy method
Rare-event simulation using the cross-entropy method
Cross-entropy optimization
Symbols and abbreviations
Bibliography
List of algorithms
Index

저자소개

Lucian Busoniu (지은이)    정보 더보기
펼치기
이 포스팅은 쿠팡 파트너스 활동의 일환으로,
이에 따른 일정액의 수수료를 제공받습니다.
이 포스팅은 제휴마케팅이 포함된 광고로 커미션을 지급 받습니다.
도서 DB 제공 : 알라딘 서점(www.aladin.co.kr)
최근 본 책