There are several further advancements of AC algorithms, and there are many tips and tricks to keep in mind, while designing such algorithms:
- Architectural design: In our implementation, we implemented two distinct neural networks, one for the critic, and one for the actor. It's also possible to design a neural network that shares the main hidden layers, while keeping the heads distinct. This architecture can be more difficult to tune, but overall, it increases the efficiency of the algorithms.
- Parallel environments: A widely adopted technique to decrease the variance is to collect experience from multiple environments in parallel. The A3C (Asynchronous Advantage Actor-Critic) algorithm updates the global ...