The idea is to take the trained model and construct the best-policy JSON output for the maximum-reward case. PlayML is a web app that exposes the options trading Q-learning algorithm through a compute API endpoint: it takes the input dataset and some options, calculates the Q-values, and returns them in JSON format to be modeled in the frontend.
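To make the "best policy as JSON" idea concrete, here is a minimal sketch. It assumes the Q-values are already available as a table indexed by state and action. The object name `BestPolicySketch`, the helpers `bestPolicy` and `toJson`, and the JSON field names are all hypothetical and are not taken from the PlayML code. The JSON is built by hand only to keep the example free of dependencies; a real Play app would more likely use a JSON library.

```scala
// Illustrative sketch only: these names are assumptions, not PlayML's actual API.
object BestPolicySketch {
  // Q-table: outer index = state, inner index = action, value = Q-value.
  type QTable = Vector[Vector[Double]]

  /** Greedy policy: for each state, pick the action with the highest Q-value.
    * States with no actions are skipped; ties resolve to the lowest action index.
    */
  def bestPolicy(q: QTable): Vector[(Int, Int, Double)] =
    q.zipWithIndex.collect {
      case (row, state) if row.nonEmpty =>
        val action = row.indices.reduceLeft { (best, a) =>
          if (row(a) > row(best)) a else best
        }
        (state, action, row(action))
    }

  /** Render the policy as a JSON array of objects.
    * Assumes finite Q-values (NaN or Infinity would not be valid JSON).
    */
  def toJson(policy: Vector[(Int, Int, Double)]): String =
    policy.map { case (s, a, v) =>
      s"""{"state":$s,"action":$a,"qValue":$v}"""
    }.mkString("[", ",", "]")
}
```

For example, `BestPolicySketch.toJson(BestPolicySketch.bestPolicy(Vector(Vector(0.1, 0.9), Vector(0.7, 0.2))))` yields `[{"state":0,"action":1,"qValue":0.9},{"state":1,"action":0,"qValue":0.7}]`, which a frontend could consume directly.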
The wrapped-up Scala web ML app has the following directory structure:
In the preceding structure, the app folder has both the original QLearning implementation (see the ml folder) and some additional ...