book

Reinforcement Learning for Finance

by Yves Hilpisch

October 2024

Intermediate to advanced

214 pages

5h 4m

English

O'Reilly Media, Inc.

Audio summary available

Read now

Unlock full access

Includes

Includes Quizzes

Target AudienceOverview of the BookAbout the Code in This BookConventions Used in This BookUsing Code ExamplesO’Reilly Online LearningHow to Contact UsAcknowledgments
Bayesian LearningTossing a Biased CoinRolling a Biased DieBayesian UpdatingReinforcement LearningMajor BreakthroughsMajor Building BlocksDeep Q-LearningConclusionsReferences
Decision ProblemsDynamic ProgrammingQ-LearningCartPole as an ExampleThe Game EnvironmentA Random AgentThe DQL AgentQ-Learning Versus Supervised LearningConclusionsReferences
Finance EnvironmentDQL AgentWhere the Analogy FailsLimited DataNo ImpactConclusionsReferences
Noisy Time Series DataSimulated Time Series DataConclusionsReferencesDQLAgent Python Class
Simple ExampleFinancial ExampleKolmogorov-Smirnov TestConclusionsReferences
Prediction Game RevisitedTrading EnvironmentTrading AgentConclusionsReferencesFinance EnvironmentDQLAgent ClassSimulation Environment

Delta HedgingHedging EnvironmentHedging AgentConclusionsReferencesBSM (1973) Formula
Two-Fund SeparationTwo-Asset CaseThree-Asset CaseEqually Weighted PortfolioConclusionsReferencesThree-Asset Code
The ModelModel ImplementationExecution EnvironmentRandom AgentExecution AgentConclusionsReferences
References

Content preview from Reinforcement Learning for Finance

Chapter 1. Learning Through Interaction

The idea that we learn by interacting with our environment is probably the first to occur to us when we think about the nature of learning.

Sutton and Barto (2018)

For human beings and animals alike, learning is almost as fundamental as breathing. It is something that happens continuously and most often unconsciously. There are different forms of learning. The one most important to the topics covered in this book is based on interacting with an environment.

Interaction with an environment provides the learner—or agent henceforth—with feedback that can be used to update their knowledge or to refine a skill. In this book, we are mostly interested in learning quantifiable facts about an environment, such as the odds of winning a bet or the reward that an action yields.

The next section discusses Bayesian learning as an example of learning through interaction. “Reinforcement Learning” presents breakthroughs in AI that were made possible through RL. It also describes the major building blocks of RL. “Deep Q-Learning” explains the two major characteristics of DQL, which is the most important algorithm in the remainder of the book.

Bayesian Learning

Two examples illustrate learning by interacting with an environment: tossing a biased coin and rolling a biased die. The examples are based on the idea that an agent betting repeatedly on the outcome of a biased gamble (and remembering all outcomes) can learn bet-by-bet about a gamble’s bias and ...