Accelerating the Development of Multimodal, Integrative-AI Systems with Platform for Situated Intelligence

Sean Andrist,Dan Bohus
DOI: https://doi.org/10.48550/arXiv.2010.06084
2020-10-13
Abstract:We describe Platform for Situated Intelligence, an open-source framework for multimodal, integrative-AI systems. The framework provides infrastructure, tools, and components that enable and accelerate the development of applications that process multimodal streams of data and in which timing is critical. The framework is particularly well-suited for developing physically situated interactive systems that perceive and reason about their surroundings in order to better interact with people, such as social robots, virtual assistants, smart meeting rooms, etc. In this paper, we provide a brief, high-level overview of the framework and its main affordances, and discuss its implications for HRI.
Artificial Intelligence,Robotics
What problem does this paper attempt to address?
This paper aims to solve the high - engineering - threshold problem faced when constructing multi - modal and integrated artificial intelligence systems (such as socially interactive autonomous robots). Despite the rapid progress and significant breakthroughs in individual AI technology fields such as natural language processing, computer vision, and robot control algorithms, integrating these technologies into end - to - end intelligent systems is still very challenging and time - consuming. Specifically, the paper points out the following main problems: 1. **Complex engineering tasks**: Before conducting this type of research, researchers often need to first overcome the arduous engineering tasks of constructing prototype systems, which include data - flow processing, real - time coordination of asynchronously executing components, etc. 2. **Lack of time - awareness support**: Existing programming languages and development tools lack important primitives and features for handling time and latency, resulting in a large amount of development time being spent on debugging low - level problems rather than solving high - level tasks. 3. **Insufficient debugging and visualization tools**: Multi - modal and integrated AI applications have specific debugging, visualization, and analysis requirements, and existing development tools do not fully meet these requirements. For example, standard breakpoint debugging or "print debugging" methods are not sufficient to meet the debugging requirements of these complex systems. 4. **Poor component reusability**: Researchers often need to "reinvent the wheel" and build their own infrastructures to represent and reason about important time constructs. These custom infrastructures are usually only applicable to specific applications and are difficult to generalize and reuse. To solve these problems, the paper introduces an open - source framework named "Platform for Situated Intelligence" (PSI for short). By providing runtime infrastructure, debugging and visualization tools, and an open component ecosystem, PSI aims to simplify the development, debugging, analysis, maintenance, and continuous evolution processes of multi - modal and integrated AI systems, thereby accelerating research and development in this field.