NeurIPS 2020

Softmax Deep Double Deterministic Policy Gradients

Meta Review

The reviewers appreciate the simple idea presented in the paper, the experiments designed to understand its effect, and the theoretical justification. Some reviewers expressed concerns regarding the significance of the theoretical results, and these concerns remain after the rebuttal. Please incorporate this feedback in your final draft.