Update of DQN

Hello community, I need your help.

I’m using a customized DQN network with four separate output layers, A, B, C, and D, each of which outputs an action. Layer A has 3 discrete outputs, and depending on which one it picks, the action chosen by either B, C, or D is executed (only one of the three is executed per step). With this setup, I don’t know whether my update function is correct.
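One possible way to structure the update for a setup like this is sketched below in NumPy. It is only a sketch under assumptions, not the original poster's function: the name `masked_td_loss`, the dict layout of the Q-values, and the mapping "output `i` of A selects the `i`-th of B, C, D" are all hypothetical. It also assumes each layer bootstraps from its own maximum next-state value, which is a design choice rather than the only option. The main idea is that layer A is trained on every transition, while B, C, and D are each trained only on the transitions in which their action was actually executed.

```python
import numpy as np

# Assumption: output i of layer A selects SUB_HEADS[i].
SUB_HEADS = ("B", "C", "D")


def masked_td_loss(q, q_next, a_sel, a_sub, reward, done, gamma=0.99):
    """Masked TD loss for a four-head DQN (hypothetical helper).

    q, q_next: dicts mapping "A", "B", "C", "D" to arrays of shape
               (batch, n_actions_of_that_head); q comes from the online
               network at s, q_next from the target network at s'.
    a_sel:     (batch,) int array, action taken by layer A (0, 1 or 2).
    a_sub:     (batch,) int array, action executed in the selected layer.
    reward:    (batch,) float array.
    done:      (batch,) float array, 1.0 where the episode ended.
    """
    rows = np.arange(len(a_sel))
    not_done = 1.0 - done

    # Layer A: ordinary DQN target, applied to every transition.
    target_a = reward + gamma * not_done * q_next["A"].max(axis=1)
    loss = np.mean((q["A"][rows, a_sel] - target_a) ** 2)

    # B, C, D: a transition contributes only to the layer that was executed.
    for i, name in enumerate(SUB_HEADS):
        mask = (a_sel == i).astype(float)
        if mask.sum() == 0:
            continue  # this layer was never executed in the batch
        target = reward + gamma * not_done * q_next[name].max(axis=1)
        # Use index 0 for masked-out rows so the lookup stays in range.
        idx = np.where(a_sel == i, a_sub, 0)
        td_err = (q[name][rows, idx] - target) ** 2
        loss += (mask * td_err).sum() / mask.sum()
    return loss
```

In an autograd framework the same masking keeps gradients from flowing into the layers that were not executed, so their Q-values are not pulled toward rewards they did not produce.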



submitted by /u/GuavaAgreeable208
