The Psychophysics of Human Three-Dimensional Active Visuospatial Problem-Solving

Markus D. Solbach,John K. Tsotsos
2023-06-20
Abstract:Our understanding of how visual systems detect, analyze and interpret visual stimuli has advanced greatly. However, the visual systems of all animals do much more; they enable visual behaviours. How well the visual system performs while interacting with the visual environment and how vision is used in the real world have not been well studied, especially in humans. It has been suggested that comparison is the most primitive of psychophysical tasks. Thus, as a probe into these active visual behaviours, we use a same-different task: are two physical 3D objects visually the same? This task seems to be a fundamental cognitive ability. We pose this question to human subjects who are free to move about and examine two real objects in an actual 3D space. Past work has dealt solely with a 2D static version of this problem. We have collected detailed, first-of-its-kind data of humans performing a visuospatial task in hundreds of trials. Strikingly, humans are remarkably good at this task without any training, with a mean accuracy of 93.82%. No learning effect was observed on accuracy after many trials, but some effect was seen for response time, number of fixations and extent of head movement. Subjects demonstrated a variety of complex strategies involving a range of movement and eye fixation changes, suggesting that solutions were developed dynamically and tailored to the specific task.
Neurons and Cognition,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to explore how humans solve visuospatial problems in three-dimensional space, particularly how they compare 3D objects when freely moving and exploring them. The study achieves this goal by developing a 3D version of the "same-different task" experiment and designing an experimental setup that allows participants to solve problems naturally while accurately recording their visual behavior. Specifically, the study focuses on the following aspects: 1. **Accuracy**: The task completion accuracy of participants is very high, averaging 93.83%, indicating that humans can perform this task well even without training. 2. **Number of Fixations**: The number of fixations required to complete the task is related to the complexity of the objects. Simple objects require fewer fixations, while complex objects require more. 3. **Reaction Time**: Reaction time increases with object complexity, and the reaction time for identical object pairs is longer than for different object pairs. 4. **Head Movement**: The amount of head movement is also affected by object complexity; the higher the complexity, the greater the distance participants move their heads. 5. **Gaze Patterns**: The study also analyzes participants' gaze patterns, including fixation ratios and fixation combinations, finding that these patterns are related to object complexity and directional differences. In summary, the paper attempts to understand how humans use their visual system to effectively recognize and compare 3D objects in real environments and explores the cognitive strategies and behavioral patterns involved.