TJUT-TJU@TRECVID 2011: Surveillance Event Detection.

Zan Gao,An-An Liu,Yu-Ting Su,Zhong Ji,Zhao-Xuan Yang
2011-01-01
Abstract:This year, we especially put our focus on analyzing motions in videos and the construction of hierarchical model. Firstly, we adopted a spatio-temporal interest point detector, which explicitly encodes appearance features together with motion information, to extract robust point features in a sliding window. And then the bag-of-word (BoW) approach is employed. After that, the hierarchical models are trained for each event and each camera. At the same time, we also discuss how to fuse results from different hierarchical models. Experiments show that the spatio-temporal feature is effective, and the hierarchical models are robust and stable, which are very helpful for improving our system's performance.
What problem does this paper attempt to address?