Florida International University - University of Miami TRECVID 2020 DSDI Track.

Maria E. Presa Reyes,Yudong Tao,Shu-Ching Chen,Mei-Ling Shyu
2020-01-01
Abstract:This paper presents the framework and results from the team “Florida International University-University of Miami (FIU-UM)” in the TRECVID 2020 Disaster Scene Description and Indexing (DSDI) task. We submitted four runs, each applying the same framework but using different score aggregation methods to rank each video shot. The score aggregation methods used in these runs are summarized as follows. • run1: the sum of the feature scores obtained from the video frames to rank the shots; • run2: the average of the feature scores obtained from the video frames to rank the shots; • run3: the maximum of the feature scores obtained from the video frames to rank the shots; • run4: the average of the top three features’ scores obtained from the video frames to rank the shots. Our framework includes the following processing steps: (1) pre-processing imagery from the provided LADI (Low Altitude Disaster Imagery) dataset; (2) generating soft labels for imagery in the LADI dataset through the fusion of annotations from both human and machine annotators as well as image time/location-based concept lookups of open datasets; (3) categorizing the frames in the LADI imagery by five Convolutional Neural Network (CNN) models (i.e., damage, environment, infrastructure, vehicles, and water), each focused on a subset of the 32 features; and (4) aggregating the predictive scores of the frame-level to the shot-level through sum, avg, max, and top. To improve the performance of the CNN models, we adopt various training strategies, including (1) the model pre-trained on ImageNet; (2) propagating the labels during training, following the sequence nature of the LADI dataset; and (3) retrieving more relevant data using an image crawler to enhance the training data. This year, the FIU-UM team achieved the first place among all the submitted runs, regardless of the training type. Among a total of four prioritized submitted runs with different relevancy sorting techniques, three of our runs ranked the top 3. The submission details are listed as follows. • Training type: LADI + Others (O) • Team ID: FIU-UM (Florida International University University of Miami) • Year: 2020
What problem does this paper attempt to address?