Abstract:With the development of machine perception and multimodal information decision-making techniques, autonomous driving technology has become a crucial area of advancement in the transportation industry. The optimization of vehicle navigation, path planning, and obstacle avoidance tasks is of paramount importance. In this study, we explore the use of attention mechanisms in a end-to-end architecture for optimizing obstacle avoidance and path planning in autonomous driving vehicles. We position our research within the broader context of robotics, emphasizing the fusion of information and decision-making capabilities. The introduction of attention mechanisms enables vehicles to perceive the environment more accurately by focusing on important information and making informed decisions in complex scenarios. By inputting multimodal information, such as images and LiDAR data, into the attention mechanism module, the system can automatically learn and weigh crucial environmental features, thereby placing greater emphasis on key information during obstacle avoidance decisions. Additionally, we leverage the end-to-end architecture and draw from classical theories and algorithms in the field of robotics to enhance the perception and decision-making abilities of autonomous driving vehicles. Furthermore, we address the optimization of path planning using attention mechanisms. We transform the vehicle's navigation task into a sequential decision-making problem and employ LSTM (Long Short-Term Memory) models to handle dynamic navigation in varying environments. By applying attention mechanisms to weigh key points along the navigation path, the vehicle can flexibly select the optimal route and dynamically adjust it based on real-time conditions. Finally, we conducted extensive experimental evaluations and software experiments on the proposed end-to-end architecture on real road datasets. The method effectively avoids obstacles, adheres to traffic rules, and achieves stable, safe, and efficient autonomous driving in diverse road scenarios. This research provides an effective solution for optimizing obstacle avoidance and path planning in the field of autonomous driving. Moreover, it contributes to the advancement and practical applications of multimodal information fusion in navigation, localization, and human-robot interaction.

Multi-Modal Neural Feature Fusion for Automatic Driving Through Perception-Aware Path Planning

A Fusion Method Aiming at Environmental Perception of Autonomous Vehicle Based on Visual Scheme

Unifying Terrain Awareness Through Real-Time Semantic Segmentation

Multi-Modal Sensor Fusion-Based Deep Neural Network for End-to-End Autonomous Driving With Scene Understanding

Radar and Camera Fusion for Multi-Task Sensing in Autonomous Driving

FusionAD: Multi-modality Fusion for Prediction and Planning Tasks of Autonomous Driving

Probabilistic End-to-End Vehicle Navigation in Complex Dynamic Environments with Multimodal Sensor Fusion

MMFN: Multi-Modal-Fusion-Net for End-to-End Driving

Research on obstacle avoidance optimization and path planning of autonomous vehicles based on attention mechanism combined with multimodal information decision-making thoughts of robots

Perception Helps Planning: Facilitating Multi-Stage Lane-Level Integration via Double-Edge Structures

Parallel Planning:A New Motion Planning Framework for Autonomous Driving

Enhance Planning with Physics-informed Safety Controller for End-to-end Autonomous Driving

A Method to Plan the Path of a Robot Utilizing Deep Reinforcement Learning and Multi-Sensory Information Fusion

Multi-Model-Based Local Path Planning Methodology for Autonomous Driving: An Integrated Framework

Real-time path planning for autonomous vehicle off-road driving

Map Construction and Path Planning Method for a Mobile Robot Based on Multi-Sensor Information Fusion

Graph-Based Multi-Modal Sensor Fusion for Autonomous Driving

Enhancing 3D object detection through multi-modal fusion for cooperative perception

Trajectory Planning of Autonomous Driving Vehicles Based on Road-Vehicle Fusion

TLCFuse: Temporal Multi-Modality Fusion Towards Occlusion-Aware Semantic Segmentation-Aided Motion Planning