In the last few years, RL has been successfully used in a variety of tasks, especially in game-playing and robotics. Let's acquaint ourselves with some success stories of RL before learning its algorithms:
- AlphaGo Zero: Developed by Google's DeepMind team, the AlphaGo Zero Mastering the game of Go without any human knowledge, starts from an absolutely blank slate (tabula rasa). The AlphaGo Zero uses one neural network to approximate both the move probabilities and value. This neural network takes as input the raw board representation. It uses a Monte Carlo Tree search guided by the neural network to select the moves. The reinforcement learning algorithm incorporates look-ahead search inside the training loop. ...