69.90 €
incl. VAT
Free shipping*
Ships in over 4 weeks
  • Paperback

Product Description
A very general framework for modeling uncertainty in learning environments is given by Partially Observable Markov Decision Processes (POMDPs). In a POMDP setting, the learning agent infers a policy for acting optimally in all possible states of the environment while receiving only observations of these states. The basic idea for coping with partial observability is to incorporate memory into the representation of the policy. Perfect memory is provided by the belief space, i.e., the space of probability distributions over environmental states. However, computing policies defined on the belief space requires a considerable amount of prior knowledge about the learning problem and is expensive in terms of computation time.

The author Stephan Timmer presents a reinforcement learning algorithm for solving POMDPs based on short-term memory. In contrast to belief states, short-term memory cannot represent optimal policies, but it is far more practical and requires no prior knowledge about the learning problem. It can be shown that the algorithm can also be used to solve large Markov Decision Processes (MDPs) with continuous, multi-dimensional state spaces.
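To make the belief-space idea concrete, the following is a minimal sketch of a POMDP belief update (a Bayes filter), not material from the book: the function name belief_update, the matrices T and O, and all numbers are illustrative assumptions.

import numpy as np

# Hypothetical two-state POMDP model: T[a] is the state-transition
# matrix for action a, O[a] the observation-probability matrix.

def belief_update(belief, action, observation, T, O):
    """Posterior over hidden states after taking `action` and seeing `observation`."""
    predicted = belief @ T[action]                     # predict step: transition model
    weighted = predicted * O[action][:, observation]   # correct step: observation likelihood
    return weighted / weighted.sum()                   # renormalize to a distribution

# Tiny example with made-up probabilities.
T = {0: np.array([[0.9, 0.1], [0.2, 0.8]])}
O = {0: np.array([[0.7, 0.3], [0.1, 0.9]])}
b = np.array([0.5, 0.5])                               # uniform initial belief
b = belief_update(b, action=0, observation=1, T=T, O=O)
print(b)  # ~[0.289, 0.711]: the observation shifts belief toward state 1

A short-term-memory approach, by contrast, would condition the policy on a fixed window of the most recent observations instead of maintaining this full distribution over states.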
About the Author
Stephan Timmer, Dr. rer. nat.: studied computer science at the Universität Dortmund. After completing his Diplom thesis, he worked for several years as a research assistant at the Universität Osnabrück, focusing on machine learning and artificial intelligence. He received his doctorate in 2009.