Abstract:With the development of machine perception and multimodal information decision-making techniques, autonomous driving technology has become a crucial area of advancement in the transportation industry. The optimization of vehicle navigation, path planning, and obstacle avoidance tasks is of paramount importance. In this study, we explore the use of attention mechanisms in a end-to-end architecture for optimizing obstacle avoidance and path planning in autonomous driving vehicles. We position our research within the broader context of robotics, emphasizing the fusion of information and decision-making capabilities. The introduction of attention mechanisms enables vehicles to perceive the environment more accurately by focusing on important information and making informed decisions in complex scenarios. By inputting multimodal information, such as images and LiDAR data, into the attention mechanism module, the system can automatically learn and weigh crucial environmental features, thereby placing greater emphasis on key information during obstacle avoidance decisions. Additionally, we leverage the end-to-end architecture and draw from classical theories and algorithms in the field of robotics to enhance the perception and decision-making abilities of autonomous driving vehicles. Furthermore, we address the optimization of path planning using attention mechanisms. We transform the vehicle's navigation task into a sequential decision-making problem and employ LSTM (Long Short-Term Memory) models to handle dynamic navigation in varying environments. By applying attention mechanisms to weigh key points along the navigation path, the vehicle can flexibly select the optimal route and dynamically adjust it based on real-time conditions. Finally, we conducted extensive experimental evaluations and software experiments on the proposed end-to-end architecture on real road datasets. The method effectively avoids obstacles, adheres to traffic rules, and achieves stable, safe, and efficient autonomous driving in diverse road scenarios. This research provides an effective solution for optimizing obstacle avoidance and path planning in the field of autonomous driving. Moreover, it contributes to the advancement and practical applications of multimodal information fusion in navigation, localization, and human-robot interaction.

Multimodal Perception and Decision-Making Systems for Complex Roads Based on Foundation Models

A Fusion Method Aiming at Environmental Perception of Autonomous Vehicle Based on Visual Scheme

Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives

Parameterized Decision-Making with Multi-Modality Perception for Autonomous Driving

Parallel Driving with Big Models and Foundation Intelligence in Cyber-Physical-Social Spaces

Parameterized Decision-making with Multi-modal Perception for Autonomous Driving

Prospective Role of Foundation Models in Advancing Autonomous Vehicles

Integrated Road Information Perception Framework for Road Type Recognition and Adaptive Evenness Assessment

A hierarchical perception decision-making framework for autonomous driving

A Real-Time Complex Road AI Perception Based on 5G-V2X for Smart City Security

Brain-Inspired Modelling and Decision-making for Human-Like Autonomous Driving in Mixed Traffic Environment

Research on obstacle avoidance optimization and path planning of autonomous vehicles based on attention mechanism combined with multimodal information decision-making thoughts of robots

Adaptive Optimization Strategy and Evaluation of Vehicle-Road Collaborative Perception Algorithm in Real-Time Settings

Humanlike Driving: Empirical Decision-Making System for Autonomous Vehicles

Brain-Inspired Modeling and Decision-Making for Human-Like Autonomous Driving in Mixed Traffic Environment

Human-vehicle Cooperative Visual Perception for Autonomous Driving under Complex Road and Traffic Scenarios

Human–machine cooperative decision-making and planning for automated vehicles using spatial projection of hand gestures

Brain Inspired Cognitive Model with Attention for Self-Driving Cars

A Survey for Foundation Models in Autonomous Driving

Applications of Large Scale Foundation Models for Autonomous Driving

Enhanced Scene Understanding and Situation Awareness for Autonomous Vehicles Based on Semantic Segmentation