TensorFlow: Powerful Predictive Analytics with TensorFlow

by Md. Rezaul Karim
March 2018
Content level: Intermediate to advanced
164 pages
3h 39m
English
Packt Publishing

Developing a Multiarmed Bandit's Predictive Model

One of the simplest RL problems is the n-armed bandit. There are n slot machines, each with a different fixed payout probability. The goal is to maximize profit by always choosing the machine with the best payout.
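The setup described above can be sketched as a small simulation. This is a minimal illustration in plain Python, not the book's code: the `Bandit` class, the `average_reward` helper, the payout probabilities, and the 0/1 reward scheme are all assumptions chosen for the example.

```python
import random


class Bandit:
    """Hypothetical n-armed bandit: each arm pays out with a fixed probability."""

    def __init__(self, payout_probs, seed=0):
        self.payout_probs = list(payout_probs)
        self.rng = random.Random(seed)  # seeded so runs are repeatable

    def pull(self, arm):
        # Assumed reward scheme: 1 on a payout, 0 otherwise.
        return 1 if self.rng.random() < self.payout_probs[arm] else 0


def average_reward(bandit, arm, pulls=10000):
    """Estimate an arm's payout probability by pulling it many times."""
    return sum(bandit.pull(arm) for _ in range(pulls)) / pulls


# Illustrative payout probabilities (assumed, not from the book).
payout_probs = [0.2, 0.5, 0.8]
bandit = Bandit(payout_probs)

# Estimate each arm's payout rate, then pick the arm that looks best.
estimates = [average_reward(bandit, arm) for arm in range(len(payout_probs))]
best_arm = max(range(len(estimates)), key=estimates.__getitem__)
print("estimated payout rates:", estimates)
print("best arm:", best_arm)
```

Sampling every arm thousands of times is wasteful in practice, because each pull of a poor arm costs profit. That cost is what motivates learning which arm to prefer while playing.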

As mentioned earlier, we will also see how to use a policy gradient that produces explicit outputs. For our multiarmed bandit, these outputs do not need to be conditioned on any particular state. To keep things simple, we can design the network as just a set of weights, one for each arm that can be pulled. Each weight then represents how good the agent thinks it is to pull that arm to make maximum profit. A naive way ...
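The idea of a network that is just one weight per arm can be sketched without TensorFlow. The example below is an assumption-laden illustration rather than the book's implementation: it uses a softmax policy over the weights with a REINFORCE-style update in plain Python, a +1/-1 reward scheme, and hypothetical names (`softmax`, `train_bandit_agent`) and hyperparameters.

```python
import math
import random


def softmax(weights):
    """Turn per-arm weights into a probability distribution over arms."""
    m = max(weights)  # subtract the max for numerical stability
    exps = [math.exp(w - m) for w in weights]
    total = sum(exps)
    return [e / total for e in exps]


def train_bandit_agent(payout_probs, steps=2000, learning_rate=0.1, seed=0):
    """Learn one weight per arm with a simple policy-gradient update."""
    rng = random.Random(seed)
    weights = [0.0] * len(payout_probs)  # stateless policy: weights only
    for _ in range(steps):
        policy = softmax(weights)
        # Sample an arm from the current policy.
        arm = rng.choices(range(len(weights)), weights=policy)[0]
        # Assumed reward scheme: +1 on a payout, -1 otherwise.
        reward = 1.0 if rng.random() < payout_probs[arm] else -1.0
        # Gradient of log pi(arm) w.r.t. weight k is 1[k == arm] - pi(k).
        for k in range(len(weights)):
            grad_log_prob = (1.0 if k == arm else 0.0) - policy[k]
            weights[k] += learning_rate * reward * grad_log_prob
    return weights


# Illustrative payout probabilities (assumed, not from the book).
payout_probs = [0.1, 0.3, 0.9]
weights = train_bandit_agent(payout_probs)
policy = softmax(weights)
preferred_arm = max(range(len(policy)), key=policy.__getitem__)
print("learned weights:", weights)
print("preferred arm:", preferred_arm)
```

Because there is no state, the whole policy is the weight vector itself. Rewarded pulls raise the chosen arm's weight and penalized pulls lower it, so the policy gradually concentrates on the arm with the highest payout probability.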



ISBN: 9781789136913