View RSS Feed

mql5

Neural networks made easy (Part 58): Decision Transformer (DT)

Rate this Entry
by , 09-05-2024 at 09:48 AM (106 Views)
      
   
In this series, we have already examined a fairly wide range of different reinforcement learning algorithms. They all use the basic approach:

  1. The agent analyzes the current state of the environment.
  2. Takes the optimal action (within the framework of the learned Policy - behavior strategy).
  3. Moves into a new state of the environment.
  4. Receives a reward from the environment for a complete transition to a new state.

The sequence is based on the principles of the Markov process. It is assumed that the starting point is the current state of the environment. There is only one optimal way out of this state and it does not depend on the previous path.
more...

Submit "Neural networks made easy (Part 58): Decision Transformer (DT)" to Google Submit "Neural networks made easy (Part 58): Decision Transformer (DT)" to del.icio.us Submit "Neural networks made easy (Part 58): Decision Transformer (DT)" to Digg Submit "Neural networks made easy (Part 58): Decision Transformer (DT)" to reddit

Tags: None Add / Edit Tags
Categories
Uncategorized

Comments