
mql5

Neural networks made easy (Part 51): Behavior-Guided Actor-Critic (BAC)

12-27-2023 at 06:48 AM (222 Views)
      
   
The previous two articles covered the Soft Actor-Critic algorithm. As you may remember, it is used to train stochastic models in a continuous action space. Its main feature is the introduction of an entropy component into the reward function, which allows us to adjust the balance between exploring the environment and exploiting what the model has already learned. At the same time, this approach imposes some restrictions on the trained models. Using entropy requires an estimate of the probability of each action taken, which is quite difficult to calculate directly in a continuous action space.
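To make the entropy component concrete, here is a minimal sketch of a Soft Actor-Critic style reward in Python (not the MQL5 code from the article series). It assumes a one-dimensional Gaussian policy, and the function names `gaussian_log_prob` and `entropy_augmented_reward`, as well as the temperature value `alpha`, are hypothetical and chosen only for illustration. The point is that the bonus term needs the log-probability of the sampled action, which is exactly the quantity that is hard to obtain for an arbitrary continuous policy.

```python
import math


def gaussian_log_prob(action, mean, std):
    """Log-density of a 1-D Gaussian policy at the sampled action.

    Assumption: the policy is Gaussian, so the density has a closed form.
    """
    var = std * std
    return -0.5 * ((action - mean) ** 2 / var + math.log(2.0 * math.pi * var))


def entropy_augmented_reward(reward, log_prob, alpha):
    """Environment reward plus an entropy bonus: r - alpha * log pi(a|s).

    -log pi(a|s) is a single-sample estimate of the policy entropy;
    alpha (the temperature) sets how strongly exploration is rewarded.
    """
    return reward - alpha * log_prob


# Illustrative values only, not taken from the article.
log_p = gaussian_log_prob(action=0.3, mean=0.0, std=0.5)
soft_r = entropy_augmented_reward(reward=1.0, log_prob=log_p, alpha=0.2)
print(soft_r)
```

With `alpha` set to 0 the bonus disappears and the plain environment reward is returned; larger values push the policy toward less probable actions, that is, toward exploration.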

