How should I assess the training of my agent using PPO and Q-learning?

Urgent !!!!
Hello everybody
I am working on my project to implement a reinforcement learning agent to evaluate the security level of a waf with sql injection.

I started by training two PPO and Q-learning algorithms. I would like you to help me analyze the convergent curve of my models and also if there are parameters to adjust in order to find the right learning rate value.
you will find my code attached and on the image the rewards by episodesUrgent !!!!
Hello everybody
I am working on my project to implement a reinforcement learning agent to evaluate the security level of a waf with sql injection.

I started by training two PPO and Q-learning algorithms. I would like you to help me analyze the convergent curve of my models and also if there are parameters to adjust in order to find the right learning rate value.
you will find my code attached and on the image the rewards by episodes Urgent !!!!
Hello everybody
I am working on my project to implement a reinforcement learning agent to evaluate the security level of a waf with sql injection.

Cart

Cart