0
Your cart

Your cart is empty

Browse All Departments
  • All Departments
Price
  • R250 - R500 (1)
  • R500 - R1,000 (2)
  • R1,000 - R2,500 (1)
  • -
Status
Brand

Showing 1 - 4 of 4 matches in All Departments

A Reckoning on the Drum (Paperback): Thomas J. Walsh A Reckoning on the Drum (Paperback)
Thomas J. Walsh
R514 R437 Discovery Miles 4 370 Save R77 (15%) Ships in 10 - 15 working days
Youth Again - Letters to an Overweight Friend (Paperback): Thomas J. Walsh Youth Again - Letters to an Overweight Friend (Paperback)
Thomas J. Walsh
R650 Discovery Miles 6 500 Ships in 10 - 15 working days

A Simple, Non-Technical Exposition Of The Principles Underlying Weight Control And Correct Eating.

Youth Again - Letters to an Overweight Friend (Hardcover): Thomas J. Walsh Youth Again - Letters to an Overweight Friend (Hardcover)
Thomas J. Walsh
R938 Discovery Miles 9 380 Ships in 10 - 15 working days

A Simple, Non-Technical Exposition Of The Principles Underlying Weight Control And Correct Eating.

A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning (Paperback): Alborz Geramifard,... A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning (Paperback)
Alborz Geramifard, Thomas J. Walsh, Tellex Stefanie, Girish Chowdhary, Nicholas Roy, …
R1,581 Discovery Miles 15 810 Ships in 10 - 15 working days

A Markov Decision Process (MDP) is a natural framework for formulating sequential decision-making problems under uncertainty. In recent years, researchers have greatly advanced algorithms for learning and acting in MDPs. This book reviews such algorithms, beginning with well-known dynamic programming methods for solving MDPs such as policy iteration and value iteration, then describes approximate dynamic programming methods such as trajectory based value iteration, and finally moves to reinforcement learning methods such as Q-Learning, SARSA, and least-squares policy iteration. It describes algorithms in a unified framework, giving pseudocode together with memory and iteration complexity analysis for each. Empirical evaluations of these techniques, with four representations across four domains, provide insight into how these algorithms perform with various feature sets in terms of running time and performance. This tutorial provides practical guidance for researchers seeking to extend DP and RL techniques to larger domains through linear value function approximation. The practical algorithms and empirical successes outlined also form a guide for practitioners trying to weigh computational costs, accuracy requirements, and representational concerns. Decision making in large domains will always be challenging, but with the tools presented here this challenge is not insurmountable.

Free Delivery
Pinterest Twitter Facebook Google+
You may like...
Thrown Upon the World - A True Story
George Kolber, Charles Kolber Hardcover R627 R561 Discovery Miles 5 610
The Umbrella That Changed the World
Bern Clay Paperback R224 R188 Discovery Miles 1 880
The Lifegiving Home - Creating A Place…
Sally Clarkson Paperback R477 R388 Discovery Miles 3 880
The Works of Flavius Josephus - to Which…
Flavius Josephus Paperback R707 Discovery Miles 7 070
Genocide, the World Wars and the…
Donald Bloxham Paperback R609 Discovery Miles 6 090
Maverick Africans - The Shaping Of The…
Hermann Giliomee Paperback  (1)
R420 R361 Discovery Miles 3 610
Khwezi - The Remarkable Story Of…
Redi Tlhabi Paperback  (7)
R637 Discovery Miles 6 370
Fatima Meer - Memories Of Love And…
Fatima Meer Paperback  (1)
R365 R314 Discovery Miles 3 140
Blood On Her Hands - South Africa's Most…
Tanya Farber Paperback R300 R240 Discovery Miles 2 400
Englishman in Auschwitz
Leon Greenman Paperback R452 Discovery Miles 4 520

 

Partners