The reward is minimized using DQN agent

ZHAO Zhonghao

28 Sep 2021

0 Answers

Updated 30 Nov 2023

5 Views (30 days)

Follow Question

Show older comments

0 votes

I made a DQN reinforcement learning agent to solve my own problem. Specifically, my task is to determine the location of some electric vehicle charging stations in a transportation network, and I defined a nagative reward for each step. However, it seems that agent tries to find the worst solution. I have used the RL toolbox many times and I never met a problem like this. If I change the reward signal to a positive value, the agent will maxmize the eposide reward instead, which still gives the worst solution.

Thank you for your help!

1 Comment
Show -1 older comments Hide -1 older comments

Alessandro Fasiello on 30 Nov 2023

I'm having the same issue with the PPO agent, did you understood the cause of the problem?

Follow Question

Answers (0)

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

The reward is minimized using DQN agent

1 Comment
Show -1 older comments Hide -1 older comments

Answers (0)

Categories

Tags

Community Treasure Hunt

The reward is minimized using DQN agent

1 Comment Show -1 older comments Hide -1 older comments

Answers (0)

Categories

Tags

See Also

Community Treasure Hunt

1 Comment
Show -1 older comments Hide -1 older comments