Reinforcement learning agent does not perform any action in simulation after training

Data Store Memory blocks were added to prevent algebraic loop errors:
2019-10-06_00-01-42.png
During training, the scope shows an active action signal (yellow):
2019-10-06_00-08-01.png
But after training, when I simulate or run the Simulink model, the action from the RL agent stays idle. It is nonzero, but constant:
2019-10-06_00-12-32.png
What could be the cause?
How can I enable actions when I run the Simulink model?

1 Comment

It turns out that after 2400 episodes, training converges to precisely this result.
2019-10-08_07-24-56.jpg

Answers (1)

Solved by multiplying the reward by 2 and the action signal by 10.
There are no hidden settings.
The cause is somewhere in the reward algorithm.
Next time, I advise experimenting with the scaling of the action and reward signals so that they fill the whole available range during training.
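
The scaling idea above can be sketched outside Simulink. This is only an illustration, not the model from the question: the x10 and x2 gains are the ones reported in this answer, while the actuator limits of ±10, the synthetic raw action signal, and the helper names (`scale_action`, `scale_reward`, `range_usage`) are assumptions made up for the example. It measures how much of the allowed action range a signal covers before and after the gain is applied.

```python
import math

ACTION_GAIN = 10.0  # x10 action gain reported in the answer
REWARD_GAIN = 2.0   # x2 reward gain reported in the answer

# Hypothetical actuator limits; the real model's limits are not given.
LOWER, UPPER = -10.0, 10.0


def scale_action(raw_action, gain=ACTION_GAIN, lower=LOWER, upper=UPPER):
    """Apply the gain to the agent output, then saturate at the limits."""
    return max(lower, min(upper, gain * raw_action))


def scale_reward(raw_reward, gain=REWARD_GAIN):
    """Apply a constant gain to the reward signal."""
    return gain * raw_reward


def range_usage(actions, lower=LOWER, upper=UPPER):
    """Fraction of the allowed range spanned by the observed actions."""
    return (max(actions) - min(actions)) / (upper - lower)


# Synthetic stand-in for a logged agent output (assumption, not real data).
raw_actions = [0.9 * math.sin(0.1 * k) for k in range(200)]
scaled_actions = [scale_action(a) for a in raw_actions]

usage_raw = range_usage(raw_actions)
usage_scaled = range_usage(scaled_actions)
print(f"range used without gain: {usage_raw:.2f}")
print(f"range used with gain:    {usage_scaled:.2f}")
```

If the unscaled signal covers only a small fraction of the range, the plant barely responds to the agent, so a constant action can look as good as any other during training. A check like this on logged signals is one way to choose the gains.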

1 Comment

I can totally relate, Alexey. Reward shaping plays a big role in the design of an RL agent. It would be great if you could share the reward function you came up with to solve the task and how you iterated to improve it.

Asked: 5 Oct 2019

Commented: 21 Feb 2020
