|
Submitted by Assigned_Reviewer_1
Q1: Comments to author(s). First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. (For detailed reviewing guidelines, see http://nips.cc/PaperInformation/ReviewerInstructions)
The paper describes a novel way to parameterize stimuli for generating space-time texture fields modeled after real-world deformations. The intent is to provide a more rigid framework by which scientists may psychophysically test motion perception. The model allows one to generate causal time-structured "Motion Clouds" in real-time. Motion Clouds are textured stimuli, which could be gabor-like or described by other functions. The model includes independent variables to describe motion, position, and warping. Warping describes texture zooming and rotations. They further suggest parameter distributions and stimulus properties that match well with existing psychophysical data. They use their model to generate stimuli for a speed discrimination psychophysical task and demonstrate their ability to assess how the stimulus spatial frequency affects speed discrimination. I interpret their contribution as a GPU ready method for generating real-time stimuli that is parameterized in a meaningful way for studying texture-motion related psychophysical phenomena.
A further advantage of this formalism is that it allows for statements of the model within a Bayesian framework. This sets the stage for psychophysical experiments. The main result of these (although the dataset is small) is that spatial frequency has a positive effect on perceived speed: stimuli of lower frequency with respect to the reference are perceived as going slower. The Bayesian model was able to explain these systematic biases for spatial frequency as shifts in priors on speed during the speed judgements. Overall I find the data part of this paper to be less satisfying because there is insufficient data for detailed analysis within the Bayesian frame work and the authors are unable to interpret some of their results.
Other comments:
The paper has some formatting problems (figure Legends not sufficiently different from text)
that make it difficult to read.
Note: The discussion section is cut off in the second sentence and doesn't continue. It seems as though they lost some text somewhere...
Q2: Please summarize your review in 1-2 sentences
An interesting method for generating motion stimuli for psychophysical experiments.
Submitted by Assigned_Reviewer_2
Q1: Comments to author(s). First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. (For detailed reviewing guidelines, see http://nips.cc/PaperInformation/ReviewerInstructions)
The paper notes (line 269) that stationary dynamic Gaussian textures
can be produced using existing methods such as Fourier filtering of noise,
and says that the proposed method has the advantage of being able to produce stimuli in real-time.
However why is this necessary? Most psychological stimuli
are prepared in advance, one can imagine ways of easily handing any need
for on-the-fly generation.
As well, Gaussian textures are a very limited class of images.
Classic work by Julesz in the 1960s-70s explored the relation between texture discriminability and second-order statistics. The results seemed to indicate simply that the idea of nth-order statistics does not correspond in any simple way to human perception. But as well, Julesz was using the full second-order distribution, which provides much more modeling power than
is available with the Gaussian restriction used here.
Line 94, why is [3] nonparametric? Line 270, missing reference
Q2: Please summarize your review in 1-2 sentences
This looks like careful and sophisticated work, however I am not able to understand the motivations and relevance for NIPS. It may be better suited for a forum on psychological methods.
Submitted by Assigned_Reviewer_3
Q1: Comments to author(s). First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. (For detailed reviewing guidelines, see http://nips.cc/PaperInformation/ReviewerInstructions)
Weak review.
Discussion section on page 7 appears to be cut-off mid-sentence. A good discussion section would be helpful for clarifying the importance and contributions of this work.
The impact of this work would be increased by providing the source code for the stimuli generation for use by other psychophysics researchers (as in the original motion cloud work).
p 6 line 308. "assess" is misspelled. p 8 line 421. "dynamic" is misspelled. p 8 line 428. "though" should be "through"
Q2: Please summarize your review in 1-2 sentences
Paper presents a generative model of dynamic textures, shows that this model can be formulated as a stochastic PDE, and uses it in a psychophysics task to show textures with higher spatial frequencies are perceived to have higher speeds. The model and psychophysics result are important, however, the paper is difficult to follow in places and would benefit from careful editing.
Submitted by Assigned_Reviewer_4
Q1: Comments to author(s). First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. (For detailed reviewing guidelines, see http://nips.cc/PaperInformation/ReviewerInstructions)
This paper
provides a generative model for motion estimation based on small perturbations of observer's position by rotations, scaling and translation. Axiomatic definition of motion cloud stimuli from Ref. 8 was provide together with fast synthesis of naturalistic textures that can efficiently probe motion perception. The psychophysical results on judging the relative speed of moving
dynamical textures were inconclusive with the authors concluding that larger datasets are needed.
The paper appear to be hastily written with substantial number of typos (e.g. lines 101, 108, 270, 264, 366). In some cases the sentences were incomprehesible (e.g. lines 366-370).
Q2: Please summarize your review in 1-2 sentences
This paper considers the generation of dynamic motion cloud stimuli. These stimuli were proposed before but here are derived in a more rigorous manner. Psychophysical results were not conclusive.
Submitted by Assigned_Reviewer_5
Q1: Comments to author(s). First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. (For detailed reviewing guidelines, see http://nips.cc/PaperInformation/ReviewerInstructions)
[Summary] The paper presents a a generative model for motion perception, which can be fitted into a Bayesian inference framework. This is derived from an axiomatic and biologically-driven model, and then is shown to be a generalization of the well-known luminance conservation equation.
[Originality and Significance] The contributions of this paper need to be confirmed by other reviewers with expertise.
[Clarity] The paper is mostly well written. But I have some difficulty in understanding some details, since I do not have a solid background in this area.
[Questions & comments] I did not notice major defect in the paper to my knowledge. But it would be more interesting if there could be any discussion of relationship between the proposed model and the dynamic textures by [3] (e.g, how the definition and/or properties are equivalent/relevant). This will provide a broader perspective to understand the framework.
@L94, it is said that "the most prominent method is the non-parametric Gaussian auto-regressive (AR) framework of [3]", the model presented in [3], however, is a parametric one (actually, a linear dynamic system). Is there any typo or misinterpretation here?
[Mesc.] Reference at L270 is not properly referred.
Q2: Please summarize your review in 1-2 sentences
The paper presents a a biologically-inspired generative model for motion perception, which can be fitted into a Bayesian inference framework. I cannot confidently evaluate the significance here though I do not find any major defects.
Q1:Author
rebuttal: Please respond to any concerns raised in the reviews. There are
no constraints on how you want to argue your case, except for the fact
that your text should be limited to a maximum of 5000 characters. Note
however, that reviewers and area chairs are busy and may not read long
vague rebuttals. It is in your own interest to be concise and to the
point.
We are glad that all reviewers appreciated the
significant technical contributions of our manuscript. There is a
consensus in the range of comments that the manuscript should more clearly
communicate its main contribution on the synthesis of textures optimized
for psychophysics and based on a rigorous mathematical model of natural
image transformations. In this response, we tackle those key points
seeking to clearly demonstrate why we believe our manuscript makes an
important contribution to NIPS. In addition, we will correct all errors
and omissions pointed out in the original submission, some of which we
acknowledge hindered the comprehension of the manuscript.
#Reviewer
1 Q: However I am not able to understand the motivations and relevance
for NIPS. A: We strongly believe that this contribution is directly
relevant to NIPS. Indeed, this work is at the interface between
mathematical modeling and psychophysics. NIPS has been a great avenue for
previous similar work which includes several important works of Weiss and
Simoncelli, two authors whose research groups have made seminal
contributions to Bayesian modelling of perception.
Q: However why
is this necessary? Most psychological stimuli are prepared in advance, one
can imagine ways of easily handing any need for on-the-fly
generation. A: For experimental purposes, the number of generated video
frames can be enormous. This framework is used in practice in different
labs for psychophysics & neurophysiology and we found it to be
essential for the acquisition of a substantial number of conditions and
trials. In particular, there is a genuine difficulty when very large
dynamic stimulus presentations are used in high resolution or over longer
durations (stimulation for several minutes). We have faced these
limitations in the past and so have many colleagues. The code, which will
be openly available, will make the greatest contribution in these
contexts
Q: As well, Gaussian textures are a very limited class of
images. [...] The results seemed to indicate simply that the idea of
nth-order statistics does not correspond in any simple way to human
perception. A: We agree that Gaussian textures are the simplest
statistical models, and can only account for a limited range of perceptual
phenomena. From a computer graphics perspective, these models have been
shown to be surprisingly effective at capturing micro-scale details, and
have recently been very popular at SIGGRAPH ("Gabor noise"). Moreover,
these stimuli are designed to be used in perceptual experiments. Gaussian
textures are "worst case scenario" textures, as they are composed of the
densest mixture of random textons. Using sparser texture most often makes
the job "easier" for the neural system; hence their usefulness to assess
its efficiency e.g. for speed discrimination. Finally, related to the
point above, our contribution is a first step toward more generic models,
e.g. non-linear s-PDEs that include higher order correlations. We will add
a short discussion about these issues in the conclusion, which was indeed
lacking.
Q: Line 94, why is [3] nonparametric? A: This is a
typo, which will be corrected.
#Reviewer 2
Q: The paper
appears to be hastily written with substantial number of typos [...] A:
There were indeed some errors in the previous submission. We will do a
full proof checking for the final version.
#Reviewer 3
Q:
[...] any discussion of relationship between the proposed model and the
dynamic textures by [3] A: This is indeed a good suggestion. [3] makes
use of an AR-1 dynamical system, so it can only capture 1st order sPDE,
while we make use of 2nd order PDE's. We will add this remark to the final
version of the paper. We found this difference to be absolutely crucial
for visual experiments to capture the correct correlation in time. A
detailed discussion about AR textures can be found in
[17].
#Reviewer 4
Q: The impact of this work would be
increased by providing the source code for the stimuli generation for use
by other psychophysics researchers. A: We strongly agree. We will
open-source the code and direct readers to the online github repository
within the revised text.
#Reviewer 6
Q: I find the data part
of this paper to be less satisfying because there is insufficient
data... A: The psychophysics part was meant as a proof of concept, to
bridge the gap between propositions of the generative model and Bayesian
modelling of speed perception. While more data would indeed extend the
possible interpretations, we have importantly tested the formalised
relationship between likelihoods and the motion cloud parameterization
with our dataset by asking a novel question about frequency related
adjustments to perceived speed.
#Reviewers 4 & 6
Q:
Discussion section on page 7 appears to be cut-off mid-sentence. A: In
fact, a poor typesetting resulted in a part of the discussion to appear
after Figure 3's caption. We apologise for the
inconvenience. |
|