Changes of mind in decision-making.

Arbora Resulaj, Roozbeh Kiani, Daniel M Wolpert, Michael N Shadlen.

Abstract

A decision is a commitment to a proposition or plan of action based on evidence and the expected costs and benefits associated with the outcome. Progress in a variety of fields has led to a quantitative understanding of the mechanisms that evaluate evidence and reach a decision. Several formalisms propose that a representation of noisy evidence is evaluated against a criterion to produce a decision. Without additional evidence, however, these formalisms fail to explain why a decision-maker would change their mind. Here we extend a model, developed to account for both the timing and the accuracy of the initial decision, to explain subsequent changes of mind. Subjects made decisions about a noisy visual stimulus, which they indicated by moving a handle. Although they received no additional information after initiating their movement, their hand trajectories betrayed a change of mind in some trials. We propose that noisy evidence is accumulated over time until it reaches a criterion level, or bound, which determines the initial decision, and that the brain exploits information that is in the processing pipeline when the initial decision is made to subsequently either reverse or reaffirm the initial decision. The model explains both the frequency of changes of mind as well as their dependence on both task difficulty and whether the initial decision was accurate or erroneous. The theoretical and experimental findings advance the understanding of decision-making to the highly flexible and cognitive acts of vacillation and self-correction.

Year: 2009 PMID: 19693010 PMCID: PMC2875179 DOI: 10.1038/nature08275

Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 49.962

Decision-making spans a vast range of types and complexity, from choosing a partner, to deciding whether to dive left or right to save a goal, to simply deciding when to lift a finger. Studies of simple perceptual decisions have shed light on the neurobiological mechanisms responsible for decision-making in both monkeys and humans (for reviews, see [1-3,10]). These studies often require a binary choice between two possible stimulus categories, such as leftward or rightward motion. Psychophysical and neural data [1] support a model termed drift diffusion [6], random walk [5,7] or race [8], in which a decision is made when the accumulated noisy evidence (the decision variable) reaches a criterion level, termed a decision bound. Such an accumulation process explains both the choice and accuracy of decisions over a range of difficulty levels as well as the time required to make the decision [9]. These models are naturally viewed as an extension of signal detection theory and Bayesian inference to streams of data over time [4,11]. One important limitation of these models is that they fail to explain why a decision-maker might change their mind after an initial decision has been taken. In some instances, such changes can lead to the correction of an initial error [12,13]. Here we develop a task in which we can monitor changes of mind. We then extend the bounded diffusion framework to explain both the frequency and pattern of changes of mind.
Three naïve participants observed a moving random dot stimulus and made decisions about the direction of motion (leftward or rightward), which they indicated by moving a handle to a left or right target (Fig. 1a). Critically, the moving dots were extinguished as soon as the subjects initiated their movement (Fig. 1b), and hence subjects could not acquire new evidence during their movement.
The choice at initiation (initial hand trajectory) and reaction times as a function of task difficulty (coherence of dot motion) were explained by a bounded drift-diffusion model (Fig. 2, black curves), consistent with previous studies in humans and monkeys [1,9,14]. According to this model, evidence is accumulated until it reaches one of two bounds (corresponding to leftward and rightward decisions), which determines the choice and decision time.
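The bounded accumulation account described above can be illustrated with a short simulation. The following is a minimal sketch, not the authors' fitting code: it assumes a drift proportional to coherence, unit-variance noise and symmetric bounds, and the function names, the scaling constant `k`, the bound height and the coherence values are all hypothetical choices made for illustration. The two analytic helpers use the standard closed-form results for a symmetric bounded diffusion. Reaction time in the experiment would be the simulated decision time plus the non-decision time, which is omitted here.

```python
import math
import random


def simulate_decision(coherence, k=10.0, bound=1.0, dt=0.002, rng=random):
    """Simulate one bounded-accumulation trial.

    coherence: motion strength as a fraction (0 to 1); positive drift
               points toward the bound for the correct direction.
    k, bound:  hypothetical parameters, not values from the paper.
    Returns (correct, decision_time_in_seconds).
    """
    drift = k * coherence        # assumed linear scaling of drift with coherence
    noise_sd = math.sqrt(dt)     # unit-variance diffusion per second
    x, t = 0.0, 0.0
    while abs(x) < bound:        # accumulate until either bound is reached
        x += drift * dt + rng.gauss(0.0, noise_sd)
        t += dt
    return x > 0, t


def analytic_p_correct(coherence, k=10.0, bound=1.0):
    """Closed-form probability of reaching the correct bound first."""
    return 1.0 / (1.0 + math.exp(-2.0 * k * coherence * bound))


def analytic_mean_decision_time(coherence, k=10.0, bound=1.0):
    """Closed-form mean decision time (excludes non-decision time)."""
    drift = k * coherence
    if drift == 0.0:
        return bound ** 2        # limit of (B / drift) * tanh(drift * B)
    return (bound / drift) * math.tanh(drift * bound)


if __name__ == "__main__":
    rng = random.Random(1)
    for c in (0.0, 0.032, 0.128, 0.512):   # illustrative coherence levels
        trials = [simulate_decision(c, rng=rng) for _ in range(300)]
        acc = sum(ok for ok, _ in trials) / len(trials)
        mean_t = sum(t for _, t in trials) / len(trials)
        print(f"coh={c:.3f} acc={acc:.2f} (theory {analytic_p_correct(c):.2f}) "
              f"time={mean_t:.2f}s (theory {analytic_mean_decision_time(c):.2f}s)")
```

In this sketch, weaker motion yields both lower accuracy and longer decision times, which is the qualitative pattern the model is used to explain in Fig. 2.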

Figure 1

Experimental paradigm. a, Schematic of the visual display (rectangle). Subjects held the handle of a robotic interface (shown here in the home position, circle) and moved to either a left or right circular target depending on the perceived motion direction of a central random-dot display. A mirror system prevented subjects from seeing their arm. b, The time course of events that make up a trial. Each trial started when the subject’s hand was in the home position. After a random delay, the dots became visible and the subject could view the moving dot stimulus as long as they needed (up to 2 sec). Subjects indicated the direction of dot motion by moving to the left or right target. As soon as the subjects moved out of the home position, the motion stimulus vanished. The trial ended when the subject reached one of the two targets. c, Sample hand trajectories from one subject. Most trajectories are directly from the home position (bottom circle) to one of the choice targets. On a fraction of trials, the trajectories change course during the movement demonstrating a change of mind.

Figure 2

Accuracy improves through “changes of mind”. Data are from three subjects. The top row shows that the probability of a correct decision at initiation (black) is lower than at termination (red) for almost all motion strengths. The bottom row shows that initiation times are longer for weaker motion strengths. Solid curves are fits to the data of the bounded evidence accumulation model (R² of fits for subjects S, A & E: initial decision 0.96, 0.95 & 0.98; final decision 0.98, 0.96 & 0.99; reaction times 0.92, 0.74 & 0.87). In this model, processing after initial commitment leads to an improvement in performance during the post-initiation phase. Error bars are s.e.m.

Although no further visual information was available after movement initiation, the hand trajectories (Fig. 1c) gave a clear indication that on some trials observers changed their mind. That is, subjects generated a curved hand path that was initially on course to reach one target but changed during the movement to finish on the other target. Although some changes of mind resulted in errors, the majority corrected an initial error. Changes of mind reliably improved accuracy (Fig. 2, top row, black and red circles for the initial and final choice, respectively) for all three subjects by improving sensitivity to motion (p<0.006 for each subject).
The observation is seemingly paradoxical. If there is information available to make a better decision, why does it fail to influence the initial decision? Every normative, ‘ideal observer’ based theory of decision-making would posit the decision as an inference on the available evidence. The paradox is resolved if the decision-maker does not use all of the available evidence to make the initial choice but can tap into further information in the period between commitment to the initial response and termination of the movement. Although the stimulus vanishes upon movement initiation, there is information in the processing pipeline that is potentially available to the decision-maker after movement initiation. Sensory and motor processing latencies ensure that not all of the information available from stimulus onset to movement initiation contributes to the decision. The sum of these latencies, termed the non-decision time (t), is estimated to be 300-400 ms in our experiments (Supplementary Table 1; Methods). Single-unit recordings from the lateral intraparietal area (LIP) of the macaque in eye-movement versions of this task suggest that the non-decision time includes sensory and motor delays of around 220 ms and 80 ms, respectively [15,16].
We hypothesized that the unused information could be processed after the brain has committed to an initial choice, thereby requiring an extension of the bounded diffusion mechanism that includes post-initiation processing. An analysis of the motion evidence leading to the subjects’ choices supports this hypothesis. Each stimulus is a noisy sequence of random dots, which leads to rapid fluctuations in the motion evidence, as quantified by motion energy [16,17] favoring left or right. For each trial, we removed the average motion energy associated with that motion strength and direction, leaving only the moment-to-moment fluctuations about the mean. We then averaged these residuals to look for evidence in the stimulus in support of the subjects’ initial choice. The stimulus fluctuations immediately after stimulus onset supported the initial choice (Fig. 3a, left blue curve; average over first 150 ms is positive, p<0.0001), whereas the fluctuations in the final few hundred ms had little bearing on the choice. For each subject, we identified the time point when the average came within 1 s.e. of zero (arrows), thus providing an empirical estimate of non-decision time. Notice that the motion energy filtering induces a delay of 50-150 ms (Fig. 3a, inset). Taking this into account, the initial choices depend on the earliest information in the stimulus, but ignore an epoch on the order of t.
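The residual analysis described above (subtract the condition mean, sign the remainder by the initial choice, then average across trials) can be sketched as follows. This is an illustrative outline only: the trial dictionary layout, the key names and the function name are assumptions, the motion energy values are taken as already computed, and the filtering step that produces motion energy from the dot sequence is not shown.

```python
from collections import defaultdict


def choice_aligned_residuals(trials):
    """Average motion-energy fluctuations signed by the initial choice.

    trials: list of dicts (hypothetical layout) with keys
        'condition': label for motion strength and direction,
        'choice':    +1 for an initial rightward choice, -1 for leftward,
        'energy':    list of motion-energy samples, positive = rightward.
    Returns one value per time bin; positive values mean the stimulus
    fluctuations supported the initial choice at that time.
    """
    # Mean motion energy per condition, computed bin by bin.
    by_condition = defaultdict(list)
    for trial in trials:
        by_condition[trial["condition"]].append(trial["energy"])
    condition_mean = {
        cond: [sum(col) / len(col) for col in zip(*rows)]
        for cond, rows in by_condition.items()
    }

    # Subtract the condition mean, flip sign by choice, then average.
    n_bins = len(trials[0]["energy"])
    totals = [0.0] * n_bins
    for trial in trials:
        mean = condition_mean[trial["condition"]]
        for i, value in enumerate(trial["energy"]):
            totals[i] += trial["choice"] * (value - mean[i])
    return [total / len(trials) for total in totals]


if __name__ == "__main__":
    demo = [
        {"condition": "right-weak", "choice": +1, "energy": [2.0, 1.0]},
        {"condition": "right-weak", "choice": -1, "energy": [0.0, 1.0]},
    ]
    print(choice_aligned_residuals(demo))
```

In the toy data, only the first time bin differs between the two trials, so only that bin carries a residual aligned with the choices; the second bin averages to zero, mirroring the case in which late stimulus fluctuations have no bearing on the initial choice.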

Figure 3

A bounded accumulation model of decision-making with post-initiation processing explains changes of mind. a, Influence of motion energy fluctuations on initial and final decisions. Data are shown for all the trials (blue) and the subset of trials with a change of mind (red) aligned at stimulus onset (left) and movement onset (right). Motion energy fluctuations were obtained by applying a filter to the sequence of random dots shown on each trial and subtracting off the mean for all trials sharing the same motion strength and direction (see Methods). The residual fluctuations are designated positive if they support the direction of the initial decision. Shading indicates s.e.m. Arrows indicate the time preceding movement initiation at which the average motion energy fluctuations for each subject fall to within 1 s.e. of zero. The inset shows the impulse response for the filter used to calculate motion energy. b, The model explains the probability of changes of mind from incorrect to correct choices (model, red curves; data, red symbols) and changes of mind from correct to incorrect choices (black curves; black symbols) as a function of stimulus coherence. Error bars are s.e.m. c, Information flow diagram showing visual stimulus and neural events leading to a decision and a possible change of mind. The example illustrates a rightward motion stimulus which gives rise to an initial incorrect leftward choice with reaction time around 500 ms. The visual stimulus gives rise to a decision variable (blue trace) that is the accumulation of noisy evidence. This governs the initial choice and decision time. Data from neural recordings [15,16] suggest that the delay from motion onset to the beginning of this accumulation (t) is around 200 ms. The initial decision is complete when a ‘Right’ or ‘Left’ bound is crossed (i.e., ±B of evidence has accumulated). The example shows an initial decision for left. The time of the termination is around the mean decision time for the three subjects. Further accumulation takes place on the evidence still in the processing pipeline; if the accumulated evidence reaches the opposite “change of mind” bound, the decision is reversed (red), otherwise it is confirmed when the deadline is reached (green).

The pattern is different for the subset of trials in which there is a change of mind. The early information from the stimulus provided weaker support for the initial choice (left red trace) and exhibited a negative trend near the time of initiation (right red trace), in support of the final, changed decision. The motion energy in this later epoch was significantly more negative compared to the motion energy on the remaining trials (p<0.0001). The observation provides evidence against two main alternatives to post-initiation processing: (1) change of decision based on recall and/or reconsideration of evidence acquired before initiation [18], and (2) a correction of an initial motor error, perhaps owing to confusion about the stimulus-response mapping [12]. The analysis instead supports a non-decision time in which information from the stimulus arrives too late to affect an initial decision but is present to refine it after the brain has committed to a particular response and action.
We next consider how this extended processing could explain the pattern of changes of mind in the data. In particular, we wished to explain the proportion of changes to correct and to erroneous choices as a function of motion strength (Fig. 3b, red and black symbols, respectively). Consider a seemingly optimal solution to the problem. Suppose the subject wishes to use changes of mind to maximize the percentage of correct final choices. Then the subject ought to continue to accumulate evidence about direction until there is no more to be had (i.e., t) and to decide in favor of the more likely direction. This formulation holds regardless of the tradeoff between speed and accuracy underlying initial choice. This idea fails to explain our findings: it predicts too many changes and it would defer them to the end of the evidence stream, which is clearly not the case (Fig. 1c).
Because the subject must complete a hand movement, the optimal solution is likely to incorporate motor costs (energy) associated with larger corrections nearer the end of the movement. This idea can be captured by incorporating new bounds in the post-decision period to change or reaffirm an initial decision based on some criterion, thereby allowing changes to occur earlier in the movement. We considered a variety of models (see Methods). The most parsimonious of these is illustrated in Fig. 3c. In this model, once the initial bound has been reached and a decision made, evidence continues to accumulate until it either reaches a new “change of mind” bound or a time-deadline terminates post-initiation processing. The decision rule is to change only if the accumulated evidence reaches the change bound and to reaffirm otherwise. The offset of the new bound and the deadline (two parameters) were fit to account for the changes of mind as a function of coherence (curves in Fig. 3b). For all three subjects, the model fits imply that upon termination of the initial decision, the subjects set a new bound at a level that would necessitate a reversal of the sign of the accumulated evidence. The amount of evidence required for a subject to change their mind (Table 1, BΔ) differed by ~30% across subjects, which explains the variation in the pattern of their changes. In all cases, the existence of this change-bound led to a significant improvement in the fits, compared to using all the available information (i.e., no bound and choice based on sign of decision variable after t; p<0.003 for all subjects, likelihood ratio test). The deadline produced by the fit suggests that subjects avail themselves of most of the information in the processing pipeline. The model captures the complex dependence of post-initiation changes on both the motion strength and the initial decision (R² = 0.63-0.85 and 0.76-0.99 for changes to correct and incorrect, respectively).
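The decision rule described above (continue accumulating after the initial bound is reached, change only if the change-of-mind bound is hit before the deadline, and reaffirm otherwise) can be sketched as a simulation. This is a hedged illustration, not the fitted model: the function name and the values chosen for the drift, the initial bound, the offset of the change bound and the deadline are hypothetical, and unit-variance noise is assumed.

```python
import math
import random


def post_initiation_choice(initial_choice, drift, bound=1.0,
                           change_offset=1.3, deadline=0.3,
                           dt=0.002, rng=random):
    """Return the final choice after post-initiation processing.

    initial_choice: +1 or -1, the bound reached by the initial decision.
    drift:          signed evidence rate; positive favors +1.
    change_offset:  evidence that must accumulate against the initial
                    choice to reverse it (hypothetical value; an offset
                    larger than `bound` requires a sign reversal of the
                    accumulated evidence).
    deadline:       duration of post-initiation processing in seconds
                    (hypothetical value).
    """
    x = initial_choice * bound                       # start at the initial bound
    change_level = x - initial_choice * change_offset
    noise_sd = math.sqrt(dt)
    for _ in range(int(round(deadline / dt))):
        x += drift * dt + rng.gauss(0.0, noise_sd)
        # The change bound lies on the side opposite to the initial choice.
        if (x - change_level) * initial_choice <= 0:
            return -initial_choice                   # change of mind
    return initial_choice                            # deadline reached: reaffirm


if __name__ == "__main__":
    rng = random.Random(2)
    n = 500
    drift = 3.0  # evidence favors +1, so an initial -1 choice is an error
    from_error = sum(post_initiation_choice(-1, drift, rng=rng) == 1
                     for _ in range(n)) / n
    from_correct = sum(post_initiation_choice(+1, drift, rng=rng) == -1
                       for _ in range(n)) / n
    print(f"changes after an initial error:   {from_error:.2f}")
    print(f"changes after a correct choice:   {from_correct:.2f}")
```

Under these assumed parameters, reversals occur far more often after an initial error than after a correct initial choice, because the drift carries the accumulated evidence toward the change bound only in the former case.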
Notice that changes of mind are most frequent at intermediate motion strengths when the initial choice was erroneous. The model offers an intuitive explanation. Viewed as a decision process beginning at the initial decision bound, there is a higher probability of reaffirming the initial choice, because the accumulated evidence is far from the change of mind bound. A change of mind therefore requires strong evidence in the short time available for post-initiation processing to move the accumulated evidence to the change-of-mind bound. Such strong evidence ought to arrive when the initial choice is an error and when the motion is strong. However, if the motion is very strong, initial errors are rare. Our central finding is that the same data stream may be sampled at different moments to support different decisions, hence a change of mind. As a further test of this idea, we placed the timing of the initial decision under experimental control. This allowed us to isolate changes of mind from the strategies governing the tradeoff of speed versus accuracy of initial decisions in the RT experiment. Instead of responding when ready, subjects were trained to time the initiation of their movement so that it coincided with an expected auditory beep. The stimulus motion began at a random time 200-2000 ms (mean 440 ms) before the beep and ended at the beep or at movement initiation, whichever occurred first (see Methods). This experiment therefore tested whether our suggested framework generalizes to a situation in which the time of the initial choice is determined by an exogenous cue. The results of this experiment, which are summarized in Supplementary Figures 1-3, confirm the finding that subjects base their initial choice on early evidence but can avail themselves of additional evidence in the processing pipeline to revise this choice. 
These data also conform to a variant of the bounded accumulation mechanism with post-initiation processing (see Methods and Supplementary Figures 2 & 3). We expect the change-of-mind mechanism to apply under a wide variety of conditions if there is time pressure to respond. When two of our subjects were instructed to perform the reaction time experiment more slowly, their initial decisions were more accurate and there were fewer changes of mind (data not shown). The pattern was explained by the same model with higher initiation bounds [9]. Also, because in our study the subject must complete an arm movement, the optimal solution is likely to trade off accuracy against motor costs (energy) associated with larger corrections nearer the end of the movement. Determining the optimal bounds for such a trade-off will require coupling concepts derived from theories of optimal feedback control [19] and decision-making models. We suspect that more complex situations, for example in which movements must be timed more precisely or when a correction is more costly, might necessitate both a reaffirmation bound and bounds whose height varies over time. Our proposed mechanism cannot explain all changes of mind. For example, it would not explain corrections of initial errors that arise from confusion about stimulus-response associations [12]. Further, a change that depends on retrieval of information from memory or incorporation of a new decision policy (e.g., values) would require elaboration of the model. Presumably these types of vacillations could be based on more complex processes that involve memory retrieval or application of a new criterion on a stored decision variable. Advances in understanding the neurobiology of decision-making have benefited from simple perceptual tasks [18,20,21], but the same principles appear to underlie decisions related to foraging [2], gambling [22], social selection [23], and probabilistic reasoning [24].
The common principle is that the representation of information bearing on choice is imperfect, thus inviting the application of some criterion against which to judge the evidence. Such criteria balance the expected loss associated with two types of errors, owing to either a lax or conservative criterion. The class of bounded diffusion models [5-7,25,26] extends this theory of signal classification [4] to data streams and thus incorporates time costs as well [27,28]. An unexpected virtue of such models demonstrated by our experiment is that a part of the data stream that is not used to make the decision can nonetheless support revision after a response is initiated. This formalism provides a new view of decision-making in which subjects can exploit the expectation that late arriving information may or may not be useful to refine a decision or action. We suspect that when a change of decision is costly, energetically or otherwise, subjects would naturally tend to shun this strategy and opt for longer initial decision times. It is precluded when an action is ballistic, for instance when a subject makes an eye movement to a choice target [9,15]. In these instances, change-of-mind can only lead to a post-decision regret [29] or possibly a learning signal even in the absence of overt feedback. On the other hand, a variety of complex motor sequences might benefit from early initiation premised on the expectation of additional information that is in the pipeline. It is well known that the initiation and final specification of a movement can be dissociated in time [30]. What we have shown here is that when these processes act on the same data stream, they can lead to a change in a decision. We speculate that a common neural mechanism explains refinement of a movement after initiation and what we experience cognitively as a change of mind about a proposition.

Methods Summary

Three naïve subjects performed the main experiment. The local ethics committee approved the protocol. Subjects moved a handle in the horizontal plane. A mirror/CRT system overlaid virtual images into the plane of the movement. The hand position was displayed as a small blue circle. After a random delay, a dynamic random dot stimulus appeared (Fig. 1). On each trial the direction of motion was randomly chosen left or right. Task difficulty was varied randomly by controlling the fraction of coherently moving dots. The subjects were instructed to judge the net direction of motion as fast and as accurately as they could, and to reach a left or right choice target. The motion stimulus was extinguished when the movement was initiated. The trial ended when the subject reached one of the targets. Subjects performed an initial training session of at least 500 trials followed by 1500 test trials.
We recorded the hand trajectories at 1000 Hz. For each trial, we measured the reaction time and the final target selection. Normally hand movements for easy trials (high coherence) were straight to the target. A change of mind was reflected in a trajectory that initially headed for one target but ended at the other. We calculated the area between the hand path and the line from the starting position to the midpoint between the two targets. A change of mind was detected if the area carved out by the hand on the side opposite to the final chosen target exceeded 0.1 cm². In a control experiment with 100% coherent motion, this criterion detected no change-of-mind trials, suggesting that our detection strategy is conservative. We were, therefore, able to determine for each trial the choice at both initiation and termination of the movement.
Supplementary Information is linked to the online version of the paper at www.nature.com/nature.
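The area criterion for detecting a change of mind, described in the Methods Summary above, can be sketched as follows. The geometry is an assumption made for illustration: the start position is placed at the origin, the line to the midpoint between the targets is taken as the y-axis, the final choice is read from the sign of the last x coordinate, and positions are in cm. The function names are hypothetical; only the 0.1 cm² threshold comes from the text.

```python
def opposite_side_area(path):
    """Area (cm^2) between the hand path and the midline, counted only on
    the side opposite to the finally chosen target.

    path: list of (x, y) hand positions in cm, with the midline assumed
          to be the y-axis and the final target given by the sign of the
          last x coordinate.
    """
    final_side = 1.0 if path[-1][0] > 0 else -1.0
    s = -final_side                        # sign of the opposite side
    area = 0.0
    for (x0, y0), (x1, y1) in zip(path, path[1:]):
        u0, u1 = s * x0, s * x1            # distance into the opposite side
        dy = abs(y1 - y0)
        if u0 >= 0 and u1 >= 0:
            area += 0.5 * (u0 + u1) * dy   # trapezoid fully on opposite side
        elif u0 > 0 or u1 > 0:
            # Segment crosses the midline: keep only the triangle that
            # lies on the opposite side.
            u_pos = max(u0, u1)
            u_neg = -min(u0, u1)
            area += 0.5 * u_pos * dy * u_pos / (u_pos + u_neg)
    return area


def is_change_of_mind(path, threshold_cm2=0.1):
    """Apply the 0.1 cm^2 criterion to a single trajectory."""
    return opposite_side_area(path) > threshold_cm2


if __name__ == "__main__":
    straight = [(0.0, 0.0), (5.0, 10.0)]
    curved = [(0.0, 0.0), (-2.0, 4.0), (0.0, 8.0), (5.0, 10.0)]
    print(is_change_of_mind(straight), is_change_of_mind(curved))
```

This sketch integrates along the forward (y) direction and treats any backward motion as contributing positive area, a simplification that may differ from the authors' implementation.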

24 in total

1. Performance monitoring by the supplementary eye field.

Authors: V Stuphorn; T L Taylor; J D Schall
Journal: Nature Date: 2000-12-14 Impact factor: 49.962

2. The time course of perceptual choice: the leaky, competing accumulator model.

Authors: M Usher; J L McClelland
Journal: Psychol Rev Date: 2001-07 Impact factor: 8.934

Review 3. Banburismus and the brain: decoding the relationship between sensory stimuli, decisions, and reward.

Authors: Joshua I Gold; Michael N Shadlen
Journal: Neuron Date: 2002-10-10 Impact factor: 17.173

4. Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task.

Authors: Jamie D Roitman; Michael N Shadlen
Journal: J Neurosci Date: 2002-11-01 Impact factor: 6.167

5. Neuronal correlates of decision-making in secondary somatosensory cortex.

Authors: Ranulfo Romo; Adrián Hernández; Antonio Zainos; Luis Lemus; Carlos D Brody
Journal: Nat Neurosci Date: 2002-11 Impact factor: 24.884

6. A recurrent network mechanism of time integration in perceptual decisions.

Authors: Kong-Fatt Wong; Xiao-Jing Wang
Journal: J Neurosci Date: 2006-01-25 Impact factor: 6.167

7. The physics of optimal decision making: a formal analysis of models of performance in two-alternative forced-choice tasks.

Authors: Rafal Bogacz; Eric Brown; Jeff Moehlis; Philip Holmes; Jonathan D Cohen
Journal: Psychol Rev Date: 2006-10 Impact factor: 8.934

8. Probabilistic reasoning by neurons.

Authors: Tianming Yang; Michael N Shadlen
Journal: Nature Date: 2007-06-03 Impact factor: 49.962

9. Cortical substrates for exploratory decisions in humans.

Authors: Nathaniel D Daw; John P O'Doherty; Peter Dayan; Ben Seymour; Raymond J Dolan
Journal: Nature Date: 2006-06-15 Impact factor: 49.962

10. A modular planar robotic manipulandum with end-point torque control.

Authors: Ian S Howard; James N Ingram; Daniel M Wolpert
Journal: J Neurosci Methods Date: 2009-05-18 Impact factor: 2.390
