Large-Scale Video Classification with Elastic Streaming Sequential Data Processing System

Yao Peng,Hao Ye,Yining Lin,Yixin Bao,Zhijian Zhao,Haonan Qiu,Yao Lu,Li Wang,Yingbin Zheng
DOI: https://doi.org/10.1145/3134263.3134264
2017-01-01
Abstract:Videos are dominant on the Internet. Current systems to process large-scale videos are suboptimal due to the following reasons: (1) machine learning modules such as feature extractors and classifiers generate huge intermediate data and place heavy burden to the storage and network, and (2) task scheduling is explicit; manually configuring the machine learning modules on the cluster is tedious and inefficient. In this work, we propose Elastic Streaming Sequential data Processing system (ESSP) that supports automatic task scheduling; multiple machine learning components are automatically parallelized. Further, our system prevents extensive disc I/O by applying the in-memory dataflow scheme. Evaluation on real-world video classification datasets shows many-fold improvements.
What problem does this paper attempt to address?