Abstract:Throughout our day, we continually make decisions to behave effectively in our environment. Imagine you want to cross a busy road. To ultimately decide when to cross, you accumulate sensory evidence about the vehicles on the road (e.g., "What is that object, where is it, and how is it moving?"). Only when sufficient evidence is accumulated to ensure safe passage is the decision to cross the road made (Fig. 1). Figure 1. Sampling from external and internal sensations while deciding when to cross a busy road. To ultimately decide whether it is safe to cross, one can sample from external (i.e., from perception) or internal (i.e., memory) sensations. For example, the features, position, motion, and speed of the blue car are gathered from visual perception (on the left). In contrast, these properties of the red car are sampled from working memory (on the right). External and internal sensations are sampled to gradually accumulate evidence until a decision threshold is reached, and the decision is made that it is safe to cross the road. Sequential-sampling models state that decisions are made in an accumulation-to-bound fashion: evidence gradually accumulates until a decision threshold is reached (Ratcliff and McKoon, 2008). When this threshold is reached, a decision is formed (i.e., "I can safely cross now") and behavior follows (i.e., crossing the road). Given that decision-making is omnipresent in everyday behavior, there is a strong motivation to identify neural markers that sensitively index evidence accumulation. In order to truly reflect evidence accumulation, such a neural decision variable must not correlate with other ongoing processes such as the preparation of motor responses or sensory processing (O'Connell et al., 2012). Neural decision variables were first identified in nonhuman primates, where single-unit recordings in the lateral intraparietal cortex sensitively ... Correspondence should be addressed to Damian Koevoet at d.koevoet{at}uu.nl.

Sequential sampling without comparison to boundary through model-free reinforcement learning

Coherent noise enables probabilistic sequence replay in spiking neuronal networks

Embodied sequential sampling models and dynamic neural fields for decision-making: Why hesitate between two when a continuum is the answer

Probabilistic vs. non-probabilistic approaches to the neurobiology of perceptual decision-making

High-accuracy model-based reinforcement learning, a survey

Active Sensing as Bayes-Optimal Sequential Decision Making

Modeling sensory-motor decisions in natural behavior

Breaking the sample complexity barrier to regret-optimal model-free reinforcement learning

Better Safe than Sorry: Evidence Accumulation Allows for Safe Reinforcement Learning

Bridging Neural and Computational Viewpoints on Perceptual Decision-Making

One-shot learning and behavioral eligibility traces in sequential decision making

Sampling from Internal and External Sensations Guides Decision-Making

Adaptive Integration of Perceptual and Reward Information in an Uncertain World

Sequential sampling models with variable boundaries and non-normal noise: A comparison of six models

Posterior Sampling for Deep Reinforcement Learning

A decision-theoretic model of multistability: perceptual switches as internal actions

Learning Non-Markovian Decision-Making from State-only Sequences

Efficient Exploration in Continuous-time Model-based Reinforcement Learning

Reinforcement Learning Model With Dynamic State Space Tested on Target Search Tasks for Monkeys: Self-Determination of Previous States Based on Experience Saturation and Decision Uniqueness

Simulating how animals learn: a new modelling framework applied to the process of optimal foraging

Integrated Perceptual Decisions Rely on Parallel Evidence Accumulation