book

Reinforcement Learning for Finance

October 2024

Intermediate to advanced

214 pages

5h 4m

English

Read now

Unlock full access

Includes Quizzes

Target AudienceOverview of the BookAbout the Code in This BookConventions Used in This BookUsing Code ExamplesO’Reilly Online LearningHow to Contact UsAcknowledgments
Bayesian LearningTossing a Biased CoinRolling a Biased DieBayesian UpdatingReinforcement LearningMajor BreakthroughsMajor Building BlocksDeep Q-LearningConclusionsReferences
Decision ProblemsDynamic ProgrammingQ-LearningCartPole as an ExampleThe Game EnvironmentA Random AgentThe DQL AgentQ-Learning Versus Supervised LearningConclusionsReferences
Finance EnvironmentDQL AgentWhere the Analogy FailsLimited DataNo ImpactConclusionsReferences
Noisy Time Series DataSimulated Time Series DataConclusionsReferencesDQLAgent Python Class
Simple ExampleFinancial ExampleKolmogorov-Smirnov TestConclusionsReferences
Prediction Game RevisitedTrading EnvironmentTrading AgentConclusionsReferencesFinance EnvironmentDQLAgent ClassSimulation Environment

Delta HedgingHedging EnvironmentHedging AgentConclusionsReferencesBSM (1973) Formula
Two-Fund SeparationTwo-Asset CaseThree-Asset CaseEqually Weighted PortfolioConclusionsReferencesThree-Asset Code
The ModelModel ImplementationExecution EnvironmentRandom AgentExecution AgentConclusionsReferences
References

Part I. The Basics

The first part of the book covers the basics of reinforcement learning and provides background information. It consists of three chapters:

Chapter 1 focuses on learning through interaction with four major examples: probability matching, Bayesian updating, reinforcement learning (RL), and deep Q-learning (DQL).
Chapter 2 introduces concepts from dynamic programming (DP) and discusses DQL as an approach to approximate solutions to DP problems. The major theme is the derivation of optimal policies to maximize a given objective function through taking a sequence of actions and updating the optimal policy iteratively. DQL is illustrated based on the CartPole game from the Gymnasium Python package.
Chapter 3 develops a first Finance environment that allows the DQL agent from Chapter 2 to learn a financial prediction game. Although the environment formally replicates the API of the CartPole, it misses some important characteristics that are needed to apply RL successfully.