agent doesn't take different actions to different states

Hello everyone,
I have two issues:
I wasn’t able to set up the environment so that the agent takes 24 different actions over the course of a day, meaning the agent takes one action every hour. As a workaround, I decided to train agents by the hour.
The second issue, which is the reason for my question, arises after training the agent. When I test the efficiency of its decision-making and run the simulation part of the RL Toolbox, I notice that the agent always takes the same action regardless of the state of the environment. This leads me to believe that the training process determines the best action for a set of states, which is not what I want. I want the agent to take the correct action for different states. I’ve been analyzing my environment code but can’t figure out why the agent behaves this way.
Thank you in advance.
BryanHello everyone,
I have two issues:
I wasn’t able to set up the environment so that the agent takes 24 different actions over the course of a day, meaning the agent takes one action every hour. As a workaround, I decided to train agents by the hour.
The second issue, which is the reason for my question, arises after training the agent. When I test the efficiency of its decision-making and run the simulation part of the RL Toolbox, I notice that the agent always takes the same action regardless of the state of the environment. This leads me to believe that the training process determines the best action for a set of states, which is not what I want. I want the agent to take the correct action for different states. I’ve been analyzing my environment code but can’t figure out why the agent behaves this way.
Thank you in advance.
Bryan Hello everyone,
I have two issues:
I wasn’t able to set up the environment so that the agent takes 24 different actions over the course of a day, meaning the agent takes one action every hour. As a workaround, I decided to train agents by the hour.
The second issue, which is the reason for my question, arises after training the agent. When I test the efficiency of its decision-making and run the simulation part of the RL Toolbox, I notice that the agent always takes the same action regardless of the state of the environment. This leads me to believe that the training process determines the best action for a set of states, which is not what I want. I want the agent to take the correct action for different states. I’ve been analyzing my environment code but can’t figure out why the agent behaves this way.
Thank you in advance.
Bryan agent drl, action agent, rl toobox MATLAB Answers — New Questions

Cart

Cart

agent doesn’t take different actions to different states

Related posts

How can I modify the lengh of the lines in a LEGEND?

Vertical 3D to Horizontal

download 2019b update 3

Leave a Reply Cancel reply

Information

Contact Us

All Categories

Search

Cart

All Categories

Search

Cart

agent doesn’t take different actions to different states

Share this!

Related posts

How can I modify the lengh of the lines in a LEGEND?

Vertical 3D to Horizontal

download 2019b update 3

Leave a Reply Cancel reply

Sign Up For Newsletters

Information

Contact Us