

  1. Neural networks made easy (Part 29): Advantage Actor-Critic algorithm

    by , 12-02-2022 at 12:12 AM
    [Image: ACtestTable.png]

    We continue to explore reinforcement learning methods. In previous articles we discussed two approaches: Q-learning, which approximates the reward function, and policy gradient, which learns the policy function directly. Each method has its own advantages and disadvantages. Ideally, we would combine the strengths of both when building and training models. When looking for ways to minimize the shortcomings of the algorithms we use, we often try to build certain conglomerates