newdigital

Neural networks made easy (Part 38): Self-Supervised Exploration via Disagreement

Rate this Entry

0 Comments

by

, 03-14-2024 at 05:18 AM (558 Views)

This algorithm is based on a self-learning method, where the agent uses information obtained during interaction with the environment to generate "intrinsic" rewards and update its strategy. The algorithm is based on the use of several agent models that interact with the environment and generate various predictions. If the models disagree, it is considered an "interesting" event and the agent is incentivized to explore that space of the environment. In this way, the algorithm incentivizes the agent to explore new areas of the environment and allows it to make more accurate predictions about future rewards.

more...

Share
- Share this post on
- Digg
- Del.icio.us
- Technorati
- Twitter

Tags: metatrader 5, mql5, mt5

Add / Edit Tags

Categories: Uncategorized

Email Blog Entry

« Prev Main Next »

Comments

+ Create Blog

Recent Comments
Recent Blog Posts
- Neural Networks in Trading: A Multi-Agent Self-Adaptive Model (Final Part)
  09-03-2025 06:12 AM
- Next Week News - The Channel To Subscribe
  07-31-2025 05:02 AM
- MQL5 Wizard Techniques you should know (Part 69): Using Patterns of SAR and the RVI
  06-12-2025 04:20 PM
- Price Action Analysis Toolkit Development (Part 24): Price Action Quantification Analysis Tool
  05-24-2025 06:30 AM
- Trading with the MQL5 Economic Calendar (Part 7): Preparing for Strategy Testing with Resource-Based News Event Analysis
  04-19-2025 04:59 PM
Recent Visitors
- Aspart,
- cnmltd,
- Eddieatoth,
- mahdi_mirzaie,
- vidalista

Archive

All times are GMT. The time now is 02:27 AM.

Powered by vBulletin® Version 4.2.0
Copyright © 2025 vBulletin Solutions, Inc. All rights reserved.
Content Relevant URLs by vBSEO

Image resizer by SevenSkins