Automatic Naming of Speakers in Video via Name-Face Mapping.

Zhixin Liu,Cheng Jin,Yuejie Zhang,Tao Zhang
DOI: https://doi.org/10.1007/978-3-319-47674-2_35
2016-01-01
Abstract:The problem of automatically labelling the appearances of characters in video with their names is challenging due to the huge variation in the appearance of each character and the weakness and ambiguity of available annotations. We can achieve high precision by combining multiple sources of information, both visual and textual. The principal novelties that we introduce in this paper are: (i) extracting face features in video by neural network; (ii) strengthening the mapping between names and faces by analyzing the co-occurrence of names and faces; (iii) automatically and efficiently labelling appearances of main characters with their names.
What problem does this paper attempt to address?