Special issue on multimodal processing and robotics for dialogue systems (Part II)

David Traum,Gabriel Skantze,Hiromitsu Nishizaki,Ryuichiro Higashinaka,Takashi Minato,Takayuki Nagai
DOI: https://doi.org/10.1080/01691864.2024.2319945
IF: 2.057
2024-03-20
Advanced Robotics
Abstract:The proliferation of devices like Google Home and Amazon Alexa is ongoing, yet the realization of human-like interactions remains unachieved. In particular, dialogue robots and multimodal dialogue systems, which must consider multimodal information and the surrounding environment, require further technological advancements. Towards such advancements, dialogue robot competitions and dialogue system live competitions have been conducted in recent years. Unlike previous dialogue systems that rely solely on verbal language, systems that express through facial expressions and gestures, grounded in the real world, hold the potential to significantly transform society in the future. This special issue is devoted to the progress of such dialogue robots and multimodal dialogue systems.
robotics
What problem does this paper attempt to address?
This special issue aims to address the application of multimodal processing and dialogue systems in robotics. Specifically, it focuses on the following aspects: 1. **Achieving Human-like Interaction**: Although devices like Google Home and Amazon Alexa have become widespread, true human-like interaction has not yet been achieved. In particular, dialogue robots and multimodal dialogue systems require further technological advancements to handle multimodal information and the surrounding environment. 2. **Competitions Driving Technological Development**: In recent years, dialogue robot competitions and dialogue system live competitions have driven the development of related technologies. These competitions not only rely on language but also involve multimodal expressions such as facial expressions and gestures, with the potential to significantly change society in the future. 3. **Specific Research Directions**: - **User Personality Adaptation and Dialogue Strategy**: The first paper proposes a method to estimate the user's personality traits through multimodal information and adjust the dialogue strategy accordingly to provide satisfactory customer service. - **Enhancing User Engagement**: The second paper studies how to increase user engagement in dialogue by adding content of interest to the user and multimodal response requests. - **Service-oriented Robot Apology Strategy**: The third paper explores how to implement appropriate apology strategies based on user practicality and relationship orientation, and to implement multimodal expressions consistent with the user relationship. - **User Reactions to Dialogue Interruptions and Personality Traits**: The fourth paper analyzes the impact of user personality traits on their reactions to dialogue interruptions in multimodal dialogue systems. - **Dialogue Robot Role Expression Adapted to User Personality**: The fifth paper identifies the system role traits preferred by users through dialogue data analysis and verifies their effectiveness in laboratory-guided systems and chat systems. - **Emotion-driven and Topic-aware Dialogue Framework**: The sixth paper proposes a trainable framework for modeling context-aware human-computer dialogue and applies it to live applications, verifying the effectiveness of emotion recognition and topic awareness in dialogue. In summary, this special issue brings together several cutting-edge studies on multimodal processing and dialogue systems, aiming to advance dialogue robot technology and achieve more natural and efficient human-like interactions.