Optimizing Large-Scale Systems with Reinforcement Learning

Sayak Ray Chowdhury

Paperback

Free shipping!

Not available

Reinforcement learning (RL) is concerned with learning, by trial and error, to take actions that maximize rewards in environments that can evolve in response to those actions. A Markov decision process (MDP) [6] is a popular framework for modeling decision making in RL environments. In an MDP, starting from an initial observed state, an agent repeatedly (a) takes an action, (b) receives a reward, and (c) observes the next state of the MDP. The traditional objective in RL is a search goal: find a policy (a rule for selecting an action in each state) with high total reward using as few interactions with the ...
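
The interaction loop described in the blurb (take an action, receive a reward, observe the next state) can be illustrated with a minimal Python sketch. It is not taken from the book: the two-state MDP, its transition probabilities and rewards, and the names `TRANSITIONS`, `step`, and `run_episode` are all hypothetical and chosen only to make the loop concrete.

```python
import random

# Hypothetical two-state MDP, invented purely for illustration.
# TRANSITIONS[state][action] -> list of (probability, next_state, reward)
TRANSITIONS = {
    "low": {
        "wait": [(1.0, "low", 0.0)],
        "work": [(0.8, "high", 1.0), (0.2, "low", 0.0)],
    },
    "high": {
        "wait": [(1.0, "high", 2.0)],
        "work": [(0.5, "high", 3.0), (0.5, "low", 0.0)],
    },
}


def step(state, action, rng):
    """Sample (next_state, reward) for taking `action` in `state`."""
    outcomes = TRANSITIONS[state][action]
    weights = [prob for prob, _, _ in outcomes]
    _, next_state, reward = rng.choices(outcomes, weights=weights, k=1)[0]
    return next_state, reward


def run_episode(policy, initial_state="low", horizon=10, seed=0):
    """Follow `policy` (a dict: state -> action) for `horizon` steps.

    Returns the total reward and the list of
    (state, action, reward, next_state) transitions.
    """
    rng = random.Random(seed)
    state = initial_state
    total_reward = 0.0
    trajectory = []
    for _ in range(horizon):
        action = policy[state]                         # (a) take an action
        next_state, reward = step(state, action, rng)  # (b) receive a reward
        trajectory.append((state, action, reward, next_state))
        total_reward += reward
        state = next_state                             # (c) observe the next state
    return total_reward, trajectory
```

A policy here is simply a mapping from states to actions, for example `{"low": "work", "high": "wait"}`. Calling `run_episode` with different policies and comparing the returned totals is the naive form of the search goal the blurb mentions; each call consumes `horizon` interactions with the environment.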

Other customers were also interested in

Ketan Ramakrishnan, …
U.S. Tort Liability for Large-Scale …

Paperback

27,99 €
Tom Hope, Yehezkel S …
Learning TensorFlow

Paperback

54,99 €
Andrzej Cichocki, …
Tensor Networks for Dimensionality …

Paperback

119,99 €
Chong Li, Meikang …
Reinforcement Learning for …

Paperback

56,99 €
Large-scale 3D Data Integration

Paperback

84,99 €
Matthew A Russell
21 Recipes for Mining Twitter

Paperback

29,99 €
Pete Warden
Big Data Glossary

Paperback

21,99 €
Kristina Chodorow
Scaling MongoDB

Paperback

29,99 €
Data Science and Big Data Analytics in …

Book

92,99 €
William A. Martin, …
Optimizing Binary Trees Grown With a …

Paperback

16,99 €