View RSS Feed

newdigital

MQL5 Wizard Techniques you should know (Part 49): Reinforcement Learning with Proximal Policy Optimization

Rate this Entry
by , Today at 10:56 AM (3 Views)
      
   
We continue our series on the MQL5 wizard, where lately we are alternating between simple patterns from common indicators and reinforcement learning algorithms. Having considered indicator patterns (Bill Williams’ Alligator) in the last article, we now return to reinforcement learning, where this time the algorithm we are looking at is Proximal Policy Optimization (PPO). It is reported that this algorithm, that was first published 7 years ago, is the reinforcement-learning algorithm of choice for ChatGPT. So, clearly there is some hype surrounding this approach to reinforcement learning. The PPO algorithm is intent on optimizing the policy (the function defining the actor’s actions) in a way that improves overall performance by preventing drastic changes that could make the learning process unstable.
more...

Submit "MQL5 Wizard Techniques you should know (Part 49): Reinforcement Learning with Proximal Policy Optimization" to Google Submit "MQL5 Wizard Techniques you should know (Part 49): Reinforcement Learning with Proximal Policy Optimization" to del.icio.us Submit "MQL5 Wizard Techniques you should know (Part 49): Reinforcement Learning with Proximal Policy Optimization" to Digg Submit "MQL5 Wizard Techniques you should know (Part 49): Reinforcement Learning with Proximal Policy Optimization" to reddit

Tags: None Add / Edit Tags
Categories
Uncategorized

Comments