Is it possible to change some parameters during a RL training?

Question

0 votes

Hello everyone,

For those who don't know, I am currently working on a supervised learning/RL learning training session with a 3DOF model of a jet aircraft.

My idea is to change, during a training, some parameters such as the number of allowed interventions of the external controller as well as their duration. Another whish is to change accordingly these parametes as the agent makes progress, e.g. its episode reward will increase during training. Is it possible to do this during a training? That is, is it possible to implement some for loops that exploits the trainStats object while the training is runnig? Thank you

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

Sign in to answer this question.

Follow Question

Answer 1

Subhajyoti on 5 Aug 2024

1 vote

Hi Leonardo,

Custom training loops and callbacks can adjust the parameters dynamically during the training of supervised learning or reinforcement learning models in the Reinforcement Learning Toolbox.

You can implement a custom training loop by defining your environment and agent, then loop through a predefined number of episodes. Within each episode, interact with the environment by selecting actions, applying them, and updating the agent based on the received rewards and observations. Log the total reward for each episode to monitor performance.

Monitor metrics like total reward per episode and adjust parameters such as the number of allowed interventions and their duration if specific criteria are met (e.g., every 50 episodes or when the average reward exceeds a threshold).

Use the ‘trainStats’ object to log and monitor performance metrics during training. Store the metrics like total reward for each episode, which can then be used to calculate statistics for dynamic parameter adjustment.

By combining a custom training loop, dynamic parameter adjustment, and the ‘trainStats’ object, an adaptive training process can be created that enhances the learning efficiency of the RL agent.

You may go through the following MathWorks documentation links to know more about Training RL Agents options and ‘trainStats’.

Custom Training: https://www.mathworks.com/help/reinforcement-learning/ug/train-reinforcement-learning-policy-using-custom-training.html
‘train’ function to train reinforcement learning agents: https://in.mathworks.com/help/reinforcement-learning/ref/rl.agent.rlqagent.train.html
Training Using Existing Training Data: https://www.mathworks.com/help/reinforcement-learning/ref/rl.agent.rlqagent.train.html
‘rlMultiAgentTrainingOptions‘: https://www.mathworks.com/help/reinforcement-learning/ref/rlmultiagenttrainingoptions.html

I hope this helps.

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

Answer 2

Yash Sharma on 5 Aug 2024

1 vote

Hi Leonardo,

If you want to change some parameters while the training is running, you will need to implement a custom training loop. Below is the documentation that can help you achieve this.

Train Reinforcement Learning Policy Using Custom Training Loop: https://www.mathworks.com/help/reinforcement-learning/ug/train-reinforcement-learning-policy-using-custom-training.html

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

Is it possible to change some parameters during a RL training?

0 Comments
Show -2 older comments Hide -2 older comments

Accepted Answer

0 Comments
Show -2 older comments Hide -2 older comments

More Answers (1)

0 Comments
Show -2 older comments Hide -2 older comments

Categories

Products

Release

Tags

Community Treasure Hunt

Is it possible to change some parameters during a RL training?

0 Comments Show -2 older comments Hide -2 older comments

Accepted Answer

0 Comments Show -2 older comments Hide -2 older comments

More Answers (1)

0 Comments Show -2 older comments Hide -2 older comments

Categories

Products

Release

Tags

See Also

Community Treasure Hunt

0 Comments
Show -2 older comments Hide -2 older comments

0 Comments
Show -2 older comments Hide -2 older comments

0 Comments
Show -2 older comments Hide -2 older comments