A modular architecture for creating multimodal agents

Thomas Baier,Selene Baez Santamaria,Piek Vossen
DOI: https://doi.org/10.48550/arXiv.2206.00636
2022-06-02
Abstract:The paper describes a flexible and modular platform to create multimodal interactive agents. The platform operates through an event-bus on which signals and interpretations are posted in a sequence in time. Different sensors and interpretation components can be integrated by defining their input and output as topics, which results in a logical workflow for further interpretations. We explain a broad range of components that have been developed so far and integrated into a range of interactive agents. We also explain how the actual interaction is recorded as multimodal data as well as in a so-called episodic Knowledge Graph. By analysing the recorded interaction, we can analyse and compare different agents and agent components.
Artificial Intelligence,Human-Computer Interaction,Robotics
What problem does this paper attempt to address?