Sun Dec 8th through Sat the 14th, 2019 at Vancouver Convention Center
The authors introduce a new game-style benchmark for physical reasoning, PHYRE, which contains a set of puzzles in a 2D physical environment using a set of parameterized task templates and variations on each template. The paper also presents baseline agents based on a non-parametric memorization strategy, DQN, and online learning variants of these agents. Reviewers are concerned that there is not enough visual complexity (shapes, textures, etc.), that the domain of physical reasoning is quite limited, and that the evaluations can be improved with more rigorous baselines. Although two reviewers see the work as marginally below threshold, all reviewers think an "accept" is reasonable.