DroneVis: Versatile Computer Vision Library for Drones

Ahmed Heakl,Fatma Youssef,Victor Parque,Walid Gomaa
2024-06-01
Abstract:This paper introduces DroneVis, a novel library designed to automate computer vision algorithms on Parrot drones. DroneVis offers a versatile set of features and provides a diverse range of computer vision tasks along with a variety of models to choose from. Implemented in Python, the library adheres to high-quality code standards, facilitating effortless customization and feature expansion according to user requirements. In addition, comprehensive documentation is provided, encompassing usage guidelines and illustrative use cases. Our documentation, code, and examples are available in <a class="link-external link-https" href="https://github.com/ahmedheakl/drone-vis" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition,Artificial Intelligence,Computers and Society,Machine Learning,Robotics
What problem does this paper attempt to address?
The paper introduces DroneVis, a novel computer vision library designed for Parrot drones. This library provides a range of functionalities to perform various computer vision tasks, and users can customize and extend the library by choosing different models based on their needs. DroneVis is implemented in Python, following high-quality coding standards, and comes with detailed documentation including a user guide and examples. The supported tasks include object detection, tracking, segmentation, crowd counting, depth estimation, etc., making it applicable to multiple domains such as agriculture, environmental monitoring, disaster response, and surveillance. Key features of DroneVis include: 1. Latest algorithm models for various computer vision tasks, such as Faster R-CNN, YOLOv8, etc. 2. Comprehensive documentation for better code understanding and usage. 3. Dedicated testing demos to ensure real-time performance and accuracy. 4. High code coverage for library reliability. 5. Multiple interactive modes including a graphical user interface (GUI), command line interface (CLI), gesture control, etc. to cater to different user needs. 6. User-friendly API to simplify the development process of custom computer vision applications. The paper demonstrates the practical application of DroneVis in real-world scenarios, such as object detection, tracking, and pose estimation using high-definition video data captured by drones. The experimental results evaluate the performance of different models in terms of detection speed and accuracy. Additionally, the paper discusses the design of drone control and user interface, as well as establishing a connection between the drone and a laptop.