I have some questions about the MiniBatchSize property of PPO+LSTM
Hello everyone
I found this sentence while searching the official MATLAB documentation for PPO+LSTM: "For a PPO agent, the trajectory length is the MiniBatchSize property of its options object." I have some doubts about this statement. When using PPO+LSTM, does the agent no longer need to sample mini-batches from the current experience sequence? How should I understand the statement that the trajectory length is the MiniBatchSize property of the options object?
I hope to get your answer.