More cat than cute? Interpretable Prediction of Adjective-Noun Pairs

Delia Fernandez, Alejandro Woodward, Victor Campos, Xavier Giro-i-Nieto, Brendan Jou, Shih-Fu Chang
DOI: https://doi.org/10.1145/3132515.3132520
2017-08-21
Abstract: The increasing availability of affect-rich multimedia resources has bolstered interest in understanding sentiment and emotions in and from visual content. Adjective-noun pairs (ANPs) are a popular mid-level semantic construct for capturing affect via visually detectable concepts such as "cute dog" or "beautiful landscape". Current state-of-the-art methods approach ANP prediction by treating each of these compound concepts as an individual token, ignoring the underlying relationships within ANPs. This work aims at disentangling the contributions of the "adjectives" and "nouns" in the visual prediction of ANPs. Two specialised classifiers, one trained for detecting adjectives and another for nouns, are fused to predict 553 different ANPs. The resulting ANP prediction model is more interpretable, as it allows us to study the contributions of the adjective and noun components. Source code and models are available at https://imatge-upc.github.io/affective-2017-musa2/.
Computer Vision and Pattern Recognition, Artificial Intelligence, Multimedia
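
To make the idea in the abstract concrete, the following is a minimal sketch of how the outputs of an adjective classifier and a noun classifier might be fused into ANP scores while keeping each component's contribution visible. It is not the authors' model: the fusion rule (summing log-probabilities, i.e. treating the two predictions as independent), the function names `softmax` and `fuse_anp`, and all numbers are assumptions chosen for illustration only.

```python
import math


def softmax(logits):
    """Turn a dict of raw scores into a dict of probabilities."""
    peak = max(logits.values())
    exps = {k: math.exp(v - peak) for k, v in logits.items()}
    total = sum(exps.values())
    return {k: e / total for k, e in exps.items()}


def fuse_anp(adj_probs, noun_probs, anps):
    """Score each (adjective, noun) pair by summing log-probabilities.

    Assumption: a naive independence-based fusion, used here only because
    it splits the ANP score cleanly into an adjective and a noun term.
    """
    scores = {}
    for adj, noun in anps:
        log_adj = math.log(adj_probs[adj])
        log_noun = math.log(noun_probs[noun])
        scores[(adj, noun)] = {
            "score": log_adj + log_noun,
            "adjective": log_adj,
            "noun": log_noun,
        }
    return scores


# Hypothetical classifier outputs for a single image (made-up values).
adj_probs = softmax({"cute": 2.0, "scary": 0.0})
noun_probs = softmax({"cat": 3.0, "dog": 0.5})
anps = [("cute", "cat"), ("cute", "dog"), ("scary", "dog")]

scores = fuse_anp(adj_probs, noun_probs, anps)
ranked = sorted(scores, key=lambda pair: scores[pair]["score"], reverse=True)
best = ranked[0]
print(best, scores[best])
```

In this toy setting the top-ranked pair can be inspected term by term: comparing the `adjective` and `noun` entries shows which component contributed more to the prediction, which is the kind of question ("more cat than cute?") the title alludes to.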