Learning Common Sense Through Visual Abstraction Supplementary Material

Ramakrishna Vedantam, Xiao Lin, Tanmay Batra, C Lawrence Zitnick, Devi Parikh
Abstract:We first detail the webpages with qualitative results in Section 1. We then discuss our procedure for extracting commonsense tuples from sentences (Section 2). We give examples of the data we collect about relations via Abstract Scenes which includes the illustrations drawn by users and the tuples provided by them (Section 3). We then show our interface for collecting ground truth on plausibility of TEST/VAL relations (Section 4). These illustrations serve as training data for our vision based similarity model. We show some qualitative examples of cases where vision based similarity helps (Section 5).
What problem does this paper attempt to address?