Bridging Cognitive Programs and Machine Learning

Amir Rosenfeld, John K. Tsotsos
DOI: https://doi.org/10.48550/arXiv.1802.06091
2018-02-17
Abstract: While great advances are made in pattern recognition and machine learning, the successes of such fields remain restricted to narrow applications and seem to break down when training data is scarce, a shift in domain occurs, or when intelligent reasoning is required for rapid adaptation to new environments. In this work, we list several of the shortcomings of modern machine-learning solutions, specifically in the contexts of computer vision and in reinforcement learning and suggest directions to explore in order to try to ameliorate these weaknesses.
Machine Learning, Artificial Intelligence, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses the limitations of modern machine-learning methods on visual tasks. The authors point out several deficiencies of current machine-learning solutions, especially in computer vision and reinforcement learning:

1. **Data Scarcity Problem**: When training data is limited, model performance declines significantly.
2. **Domain Transfer Problem**: When the test environment differs from the training environment, the model generalizes poorly.
3. **Lack of Ability to Adapt to New Environments Quickly**: Existing models have difficulty adjusting quickly and reasoning intelligently when they face new environments.

To address these deficiencies, the authors propose combining Cognitive Programs (CP) with modern machine-learning tools. The specific goals are:

- **Improve the STAR Model**: Introduce modern machine-learning techniques to improve the performance of each component of the STAR model.
- **Utilize Biological Visual Constraints**: Constrain machine-learning methods with what is known about biological visual mechanisms, improving model performance and possibly yielding new insights.
- **Develop a General Visual Task Execution Framework**: Build a system that generates cognitive programs according to task requirements, so that a variety of visual tasks can be controlled and executed effectively.

### Main Predictions

The authors make two main predictions:

1. **Many components of the STAR model can benefit from modern machine-learning tools and techniques.** For example, deep learning could be used to tune the visual hierarchy so that it performs better across different tasks.
2. **Using knowledge of biological vision to constrain machine-learning methods** can improve model performance and help further the understanding of how the visual system operates.
### Specific Problems

Specific problems raised in the article include:

1. How should the structure and parameters of the Visual Hierarchy (VH) be designed so that it can handle a wide range of visual inputs under appropriate task biases or pre-activations?
2. How can the adjustment of the VH for each given task be learned?
3. How can a Visual Task Executive be created that selects an appropriate set of methods to complete a visual task?

These problems all concern effective control and planning given a set of tools, and fitting a model to complex data such as images. To this end, the authors suggest drawing on recent progress in machine learning, especially in deep learning and reinforcement learning.
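The idea of an executive that selects and sequences methods under a task bias can be illustrated with a toy sketch. Nothing in it comes from the paper: the method names, the task table `PROGRAMS`, the dictionary-based state, and the multiplicative bias are all hypothetical choices made for illustration, and a learned system would replace the hand-written table.

```python
from typing import Callable, Dict, List

State = Dict[str, object]

# Hypothetical methods: each takes the current state and returns a new one.
def apply_task_bias(state: State) -> State:
    # Toy task bias: scale each feature response by a task-specific weight
    # (features without a weight are left unchanged).
    bias = state["task_bias"]
    biased = {k: v * bias.get(k, 1.0) for k, v in state["responses"].items()}
    return {**state, "responses": biased}

def select_strongest(state: State) -> State:
    # Attend to the feature with the largest (possibly biased) response.
    responses = state["responses"]
    return {**state, "attended": max(responses, key=responses.get)}

def report(state: State) -> State:
    return {**state, "answer": state["attended"]}

METHODS: Dict[str, Callable[[State], State]] = {
    "bias": apply_task_bias,
    "select": select_strongest,
    "report": report,
}

# Hypothetical task table: each task maps to a fixed sequence of methods,
# standing in for a cognitive program.
PROGRAMS: Dict[str, List[str]] = {
    "find_target": ["bias", "select", "report"],
    "free_view": ["select", "report"],
}

def execute(task: str, state: State) -> State:
    # Toy executive: look up the program for the task and run it in order.
    for name in PROGRAMS[task]:
        state = METHODS[name](state)
    return state

scene = {"responses": {"red": 0.4, "green": 0.6}, "task_bias": {"red": 2.0}}
print(execute("find_target", scene)["answer"])
print(execute("free_view", scene)["answer"])
```

With the same input, the two tasks attend to different features because only `find_target` applies the bias. This is the sense in which a task can modulate a fixed hierarchy without retraining it.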