Know2Look: Commonsense Knowledge for Visual Search

Sreyasi Nag Chowdhury, Niket Tandon, Gerhard Weikum
DOI: https://doi.org/10.48550/arXiv.1909.00749
2019-09-02
Information Retrieval
Abstract: With the rise in popularity of social media, images accompanied by contextual text form a huge section of the web. However, search and retrieval of documents still depend largely on textual cues alone. Although visual cues have started to gain focus, imperfections in object/scene detection mean that they do not yet lead to significantly improved results. We hypothesize that background commonsense knowledge about query terms can significantly aid the retrieval of documents with associated images. To this end, we deploy three different modalities - text, visual cues, and commonsense knowledge pertaining to the query - as a recipe for efficient search and retrieval.