Abstract:Robot navigation is an important research field with applications in various domains. However, traditional approaches often prioritize efficiency and obstacle avoidance, neglecting a nuanced understanding of human behavior or intent in shared spaces. With the rise of service robots, there's an increasing emphasis on endowing robots with the capability to navigate and interact in complex real-world environments. Socially aware navigation has recently become a key research area. However, existing work either predicts pedestrian movements or simply emits alert signals to pedestrians, falling short of facilitating genuine interactions between humans and robots. In this paper, we introduce the Hybrid Soft Actor-Critic with Large Language Model (HSAC-LLM), an innovative model designed for socially-aware navigation in robots. This model seamlessly integrates deep reinforcement learning with large language models, enabling it to predict both continuous and discrete actions for navigation. Notably, HSAC-LLM facilitates bidirectional interaction based on natural language with pedestrian models. When a potential collision with pedestrians is detected, the robot can initiate or respond to communications with pedestrians, obtaining and executing subsequent avoidance strategies. Experimental results in 2D simulation, the Gazebo environment, and the real-world environment demonstrate that HSAC-LLM not only efficiently enables interaction with humans but also exhibits superior performance in navigation and obstacle avoidance compared to state-of-the-art DRL algorithms. We believe this innovative paradigm opens up new avenues for effective and socially aware human-robot interactions in dynamic environments. Videos are available at <a class="link-external link-https" href="https://hsacllm.github.io/" rel="external noopener nofollow">this https URL</a>.

Extracting Dynamic Navigation Goal from Natural Language Dialogue

ChatNav: Leveraging LLM to Zero-shot Semantic Reasoning in Object Navigation

GSON: A Group-based Social Navigation Framework with Large Multimodal Model

Automatic Object Searching and Behavior Learning for Mobile Robots in Unstructured Environment by Deep Belief Networks.

Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation

Real-Time Navigation In Dynamic Human Environments Using Optimal Reciprocal Collision Avoidance

Social Navigation Planning Based on People's Awareness of Robots

Language-guided Robust Navigation for Mobile Robots in Dynamically-changing Environments

Enhancing Socially-Aware Robot Navigation through Bidirectional Natural Language Conversation

Language and Sketching: An LLM-driven Interactive Multimodal Multitask Robot Navigation Framework

Resolving Positional Ambiguity in Dialogues by Vision-Language Models for Robot Navigation

Semantic Grounding for Long-Term Autonomy of Mobile Robots Towards Dynamic Object Search in Home Environments

A social planning and navigation for tour-guide robot in human environment

Refining Object Localization from Dialogues

Efficient Collaborative Navigation Through Perception Fusion for Multi-Robots in Unknown Environments

Enabling Socially Competent navigation through incorporating HRI

Social navigation framework for assistive robots in human inhabited unknown environments

Grounding Implicit Goal Description for Robot Indoor Navigation Via Recursive Belief Update

Toward Human-Like Social Robot Navigation: A Large-Scale, Multi-Modal, Social Human Navigation Dataset

Learning World Transition Model for Socially Aware Robot Navigation

OpenObject-NAV: Open-Vocabulary Object-Oriented Navigation Based on Dynamic Carrier-Relationship Scene Graph