View RSS Feed

Uncategorized

Entries with no category

  1. Neural networks made easy (Part 71): Goal-Conditioned Predictive Coding GCPC)

    by , 08-22-2024 at 08:30 AM
    Goal-Conditioned Behavior Cloning (BC) is a promising approach for solving various offline reinforcement learning problems. Instead of assessing the value of states and actions, BC directly trains the Agent behavior policy, building dependencies between the set goal, the analyzed environment state and the Agent's action. This is achieved using supervised learning methods on pre-collected offline trajectories. The familiar Decision Transformer method and its derivative algorithms have demonstrated
    ...
    Categories
    Uncategorized
  2. Neural networks made easy (Part 70): Closed-Form Policy Improvement Operators (CFPI)

    by , 08-08-2024 at 08:30 AM
    The approach to optimizing the Agent policy with constraints on its behavior turned out to be promising in solving offline reinforcement learning problems. By exploiting historical transitions, the Agent policy is trained to maximize a learned value function.

    Behavior constrained policy can help to avoid a significant distribution shift in relation to Agent actions, which provides sufficient confidence in the assessment of the action costs. In the previous article we got acquainted
    ...
    Categories
    Uncategorized
  3. Neural networks made easy (Part 69): Density-based support constraint for the behavioral policy (SPOT)

    by , 07-31-2024 at 08:30 AM
    Offline reinforcement learning allows the training of models based on data collected from interactions with the environment. This allows a significant reduction of the process of interacting with the environment. Moreover, given the complexity of environmental modeling, we can collect real-time data from multiple research agents and then train the model using this data.

    At the same time, using a static training dataset significantly reduces the environment information available to us.
    ...
    Categories
    Uncategorized
  4. Color buffers in multi-symbol multi-period indicators

    by , 07-05-2024 at 04:55 PM
    We continue the development of multi-symbol, multi-period indicators which we started in the previous article.

    A single-color indicator buffer is a regular double array, which is filled with data when calculating the indicator. We can obtain data from this array and display it on a chart using the CopyBuffer() function provided that the receiving array will be a double array set as an indicator's plotting buffer (SetIndexBuffer()). When copying data from the buffer of the calculated
    ...
    Categories
    Uncategorized
  5. Indicator of historical positions on the chart as their profit/loss diagram

Page 2 of 352 FirstFirst 1 2 3 4 12 52 102 ... LastLast