Introduction To A Large-Scale General Purpose Ground Truth Database: Methodology, Annotation Tool And Benchmarks

Benjamin Yao,Xiong Yang,Song-Chun Zhu
DOI: https://doi.org/10.1007/978-3-540-74198-5_14
2007-01-01
Abstract:This paper presents a large scale general purpose image database with human annotated ground truth. Firstly, an all-in-all labeling framework is proposed to group visual knowledge of three levels: scene level (global geometric description), object level (segmentation, sketch representation, hierarchical decomposition), and low-mid level (2.1D layered representation, object boundary attributes, curve completion, etc.). Much of this data has not appeared in previous databases. In addition, And-Or Graph is used to organize visual elements to facilitate top-down labeling. An annotation tool is developed to realize and integrate all tasks. With this tool, we've been able to create a database consisting of more than 636,748 annotated images and video frames. Lastly, the data is organized into 13 common subsets to serve as benchmarks for diverse evaluation endeavors.
What problem does this paper attempt to address?