Reviews: A Bayesian Theory of Conformity in Collective Decision Making

This paper introduces a Bayesian formulation of Theory of Mind with a focus on conformity. The model is able to represent multi-level reasoning processes in multi round games. The experimental results confirmed that the model provides a better fit to data compared to previous works. The manuscript is very well written and it was a pleasure read. The ideas are well described and motivated (except one section which is mentioned below) and the experiments are well conducted. I have two concerns about the paper. My first concern is about the extent of contributions. It seems to me that the only novel part of the paper is section 2.3 and the rest is very similar to reference [23] cited in the paper. Based on this, it would be important to make this point clear in the paper that the formulation is based on reference [23] and clarify the extend of the contribution in this respect. My other concern is about the implications of the work. The paper shows that the framework can better predict choices of subjects – which is interesting – but the psychological implications of this is not clear in the paper. One potential implication could be inferring the level of reasoning in these kinds of games, but in the last paragraph of the discussion the authors point out that it is not possible to determine the level of reasoning from fit to data. It then would be important to highlight the implications of the model beyond action prediction. Having said this, overall, I think this paper would be of interest to the Neurips audience and I support its acceptance. Some other comments: - In line 154-156 it is a bit unclear why the policy of *others* is assumed to be deterministic (unlike the policy of the agent). Under the non-deterministic policy, the expected next state will be (\lambda \alpha + (N -1)\pi_{k-1,t}(a1) + 1, \lambda \beta (N -1)\pi_{k-1,t}(a2), t+1), which is different from when the policy of others is assumed to be deterministic. It would be good to clarity this part. A similar question is with respect to the reward function defined in equation (5), i.e., instead of using either zero or N-1 for m, the expected number of a1 actions could be used in equation 5. - I was not able to follow lines 162 to 173. It would be great to add further explanations to this part. -In line 69, Bernoulli -> Binomial - Lines 134-137: Please add information about the method used to supplementary materials. - I might have missed this, but please define \pi^* (optimal policy) if it is not defined. ======================================== After author response: Thank you for your response to my comments. My understanding of the author response is that the framework in Ref 23 is indeed similar to the framework in this paper and has used a similar POMDP structure. Given this high similarity, I find it inappropriate to cite this closely related paper (Ref 23) only after page 7 and not to discuss its relations to the current work in terms of the mathematical framework. It should be made a lot clearer that the framework is built upon ref 23 (in sections 2.1, 2,2, intro). From the author response it does not seem that the paper will be revised accordingly in the final version, hence I've changed my score.

Reviewer 2

The authors present a new Bayesian framework to model collective decision processes incorporating both conformity bias and theory of mind processes. They start with a formal model of conformity which they then extend to model the impact a player’s actions may have on the decisions of other players in a collective decision process thereby extending it to allow for inference on the cognitive states of others or theory of mind processes with different depths. The authors also present empirical results of applying their framework as well as a model-free RL model in two different samples and show that there model predicts participants actual choices more accurately. In the following sections, I provide specific feedback with respect to originality, quality, clarity and significance. I hope you will find my comments helpful and constructive. Originality The authors provide an accessible, yet innovative framework to link conformity and theory of mind processes in collective decision making quite elegantly. The model extends past research as it is not restricted to a certain number of players (e.g., two players as in other frameworks) and can be easily scaled to larger action spaces. I believe this work is well written and will have significant impact on understanding collective decision making. Clarity & Quality The paper is very clearly written, well-referenced and still covers both theory and empirical application with respect to an intriguing question. They also include a control model and assess face validity (by assessing, whether models predict the end of the round and the theoretical relationship to the Nash equilibrium in one of the tasks) which I especially appreciated. I only have a few minor questions/suggestions for improvement: - In L. 105 you state that inferring on other players’ mental processes would be especially valuable, when the decay rate is high. My first question is, do you refer to a high decay rate in the sense that only most current results matter or do you refer to a decay rate close to 1, which, as you state in the section above equally implies that all past information are weighted equally. That has not been entirely clear from your description. Also, could you elaborate on why the ‘memory’ of past decisions would change the potential utility of explicitly inferring on other peoples’ mental states, is it because you would know more about other people and thus predict their behavior more accurately knowing the past? - Why did you decide to assess statistical difference between model performances using a t-test. Would a McNemar test not be better suited? - Did you run permutation tests to assess significance of the model performance compared to chance performance (although I believe it is quite clear, it would be good to formally show this as well)? - What is the imbalance of decisions made by the participants, if there is an imbalance, did you take this in your accuracy computation into account (e.g. by computing balanced accuracy), as a high accuracy could result from the prevalence of one particular choice. Significance As I already alluded to, I believe that this work constitutes a theoretical and empirical contribution of great significance. COMMENTS TO THE AUTHORS' RESPONSE: First of all, I wanted to thank you very much for your clarification regarding the significance assessment of the prediction performance. After discussing with my colleagues and reading your response, I still support an acceptance. However, as my colleagues rightfully pointed out, I would strongly urge you to clarify the relationship to ref 23 in the introduction. In your last submission you were giving the false impression that the entire approach is new, which is misleading. Please, reference this work early in your draft and adequately. Despite this hopefully unintentional negligence, I still support acceptance, but I will lower my score, since, while I still find that the paper is very well-written and addresses an important question, this of course reduces the contribution and is, in my view, a failure to aknowledge previous work adequately.

Reviewer 3

Update after author response: I would like to thank the authors for the detailed response which addresses most of my concerns, specifically those relating to the previously unjustified assumptions. In light of that, I am updating my score from 6 to 7. ------- In this paper, the authors propose a probabilistic model of human decision making in a collective scenario. The main proposal considers a simple binary decision-making task, and is based on updating the beta prior of each individual depending on the binomial observation likelihood of collective outcome. The modeling of the task as a POMDP follows naturally by considering the belief states. The authors also talk about higher level theory of mind models, and show that the results from the two experiments are better explained by the proposed model as compared to model-free reinforcement learning. The paper considers a very interesting problem of collective decision making that has bearings on multi-agent models, cognitive science, game theory, etc. The initial development is straightforward and the results seem encouraging. However I had a few concerns and clarification questions about the model: 1. Equations 1 and 2 are only valid if all the other agents decisions are IID (Independent and Identically Distributed) with success-rate parameter \theta. This seems like a far fetched assumption since later we acknowledge that there are individual differences between the agents. How can we justify this IID assumption? 2. Another unsubstantiated assumption (related to #1) is that of every agent starting with the exact same prior. People will come into the task with different experiences and expectations and can’t be assumed to share those parameters. 3. The introduction of the decay rate \lambda is quite ad hoc. Maybe a similar effect can be induced in a more principled way by considering that the other agents adhere to the inferred strategy with probability p and revert back to the prior with probability (1-p). That way events in the past get discounted because it’s more likely that a catastrophic forgetting/reset happened. Is there a mathematical justification for discounting the pseudo counts in the beta distribution otherwise? 4. I was left very confused by the hierarchical model in section 2.3. For example, does line 155 assume all the other (level k-1) agents make the same decisions (a1 or a2)? I didn’t understand the decision making policies of the agents with different levels of theory of mind. It would have been really useful if the authors spent more time explaining this contribution, maybe saving space by being more concise when explaining the well understood beta-binomial updates in 2.1. 5. Line 218 is again a little unprincipled. If the priors are supposed to be beta, then we should adhere to that. A simple change would be assuming Beta (1,1) which is uniform and that will get "overwritten" by the observed data in a few iterations. 6. I am not convinced how easily the results can be reproduced since some of the details seem to have been left unstated. For example, how were the parameter optimized? Jointly using some Bayesian technique or grid-search? If latter, was it joint or sequentially for one parameter at a time? (Edit: I see that some of these are answered in the supplementary material. The authors should add that reference in the main text and can safely ignore my comment.) Overall, I like the central idea of the paper but the proposal seems weakly motivated. There are several assumptions that I found unreasonable but that are not discussed. The central contribution of the paper is confusing to me and there is good scope for improving the presentation. All that said, the results, both experimental and model-fitting, are interesting and intuitive. Furthermore, overlooking the assumptions, the development of theory is reasonable. Based on these observations, I am mildly inclined for the paper to be accepted while acknowledging that there were parts of the manuscript that are still opaque to me.

Paper ID:	5129
Title:	A Bayesian Theory of Conformity in Collective Decision Making

Reviewer 1

Reviewer 2

Reviewer 3