Efficient video summarization through MobileNetSSD: a robust deep learning-based framework for efficient video summarization focused on objects of interest
Manasa Yarrarapu,Narkedamilly Leelavathy,Dasari Haritha
DOI: https://doi.org/10.1007/s11042-024-20372-y
IF: 2.577
2024-10-25
Multimedia Tools and Applications
Abstract:Now-a-days, the generation of videos has increased dramatically due to the quick growth of multimedia and the internet. The need for effective ways to store, manage, and index the massive numbers of videos has become imperative due to this expansion. As a result, a method needs to be proposed that collects only the necessary data from the original recording. In computer vision, Video summarization is a significant task, and its primary goal is to give a quick summary of the video by removing irrelevant information and capturing key frames from the video. Many approaches have developed over the last several decades, using the most recent deep neural network architectures that represent the current state-of-the-art. Our method involves extracting vital key frames from the input video using the MobileNetSSD model, which is well-known for its efficient recognition and localization of objects of interest. These highlighted frames are essential in creating a detailed video summary. Furthermore, a method of temporal analysis is applied to guarantee that the summary accurately reflects the relevant events in the order in which they occurred, contributing to a coherent and meaningful representation of the information. We evaluated the proposed approach on TV Sum and SUM me video datasets, comparing the results against cutting-edge video summarization techniques. Our approach works effectively to produce clear and meaningful video summaries.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering