The genome encoding scheme
The deep RL neural network that we use as the controller of the game agent has about 4 million trainable parameters. Each trainable parameter is the weight of a connection between two nodes of the neural network. Traditionally, training neural networks is about finding the appropriate values of all the connection weights, allowing the neural network to approximate a function that describes the specifics of the modeled process.
The conventional way to estimate these trainable parameters is to use some form of error backpropagation based on the gradient descent of the loss value, which is very computationally intensive. On the other hand, the neuroevolution algorithm allows us to train ANNs using a nature-inspired ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access