You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The final later of the network should not have an activation of softmax, ideally it should be "linear" activation as we will be measuring the action values, and not the probability of taking an action.
I am evaluating the code and trying to tweak a couple of things. Would it possible for you to add me as a contributor to your repo. I will create a develop branch and work in that branch.
The final later of the network should not have an activation of softmax, ideally it should be "linear" activation as we will be measuring the action values, and not the probability of taking an action.
model.add(layers.Dense(self.num_outputs,activation='softmax'))
The text was updated successfully, but these errors were encountered: