Abstract: With the increasing prevalence of smartphones and websites, Image Aesthetic Assessment (IAA) has become increasingly important. While the significance of attributes in IAA is widely recognized, many attribute-based methods give little consideration to how aesthetic attributes are selected and used. We first acquire aesthetic attributes from both intra- and inter-perspectives. In the intra-perspective, we extract the direct visual attributes of an image, which constitute the absolute attribute. In the inter-perspective, we model the relative score relationships between images within the same sequence, which form the relative attribute. Then, to better utilize image attributes in aesthetic assessment, we propose the Unified Multi-attribute Aesthetic Assessment Framework (UMAAF) to model both the absolute and relative attributes of images. For absolute attributes, we leverage multiple absolute-attribute perception modules and an absolute-attribute interacting network. The absolute-attribute perception modules are first pre-trained on several absolute-attribute learning tasks and then used to extract the corresponding absolute-attribute features. The absolute-attribute interacting network adaptively learns the weights of the diverse absolute-attribute features, integrates them with generic aesthetic features from various absolute-attribute perspectives, and generates the aesthetic prediction. To model the relative attribute of images, we consider the relative ranking and relative distance relationships between images in a Relative-Relation Loss function, which boosts the robustness of UMAAF. UMAAF achieves state-of-the-art performance on the TAD66K and AVA datasets, and multiple experiments demonstrate the effectiveness of each module and the model's alignment with human preference.
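
The abstract names a Relative-Relation Loss that accounts for both relative ranking and relative distance between images in the same sequence, but it does not give the formula. The following is only an illustrative sketch of how such a loss could be assembled, not the authors' definition: the function name, the hinge-style ranking term, the squared-gap distance term, and the parameters `margin`, `w_rank`, and `w_dist` are all assumptions introduced here.

```python
from itertools import combinations


def relative_relation_loss(pred, target, margin=0.0, w_rank=1.0, w_dist=1.0):
    """Hypothetical pairwise loss over one sequence of images.

    pred, target: equal-length sequences of predicted and ground-truth
    aesthetic scores. All names and the exact form are assumptions.
    """
    pairs = list(combinations(range(len(pred)), 2))
    if not pairs:
        return 0.0  # fewer than two images: no relative relation to model

    rank_total = 0.0
    dist_total = 0.0
    for i, j in pairs:
        d_pred = pred[i] - pred[j]      # predicted score gap
        d_true = target[i] - target[j]  # ground-truth score gap
        sign = (d_true > 0) - (d_true < 0)  # +1, 0, or -1
        # Ranking term: hinge penalty when the predicted order of the pair
        # disagrees with the ground-truth order (ties contribute `margin`).
        rank_total += max(0.0, margin - sign * d_pred)
        # Distance term: squared mismatch between the two score gaps.
        dist_total += (d_pred - d_true) ** 2

    n = len(pairs)
    return w_rank * rank_total / n + w_dist * dist_total / n


scores_true = [1.0, 2.0, 3.0]
# Same order and same gaps as the ground truth: both terms vanish.
print(relative_relation_loss([1.0, 2.0, 3.0], scores_true))
# Reversed order: both the ranking and the distance terms are penalized.
print(relative_relation_loss([3.0, 2.0, 1.0], scores_true))
```

In a sketch like this, the ranking term only cares whether each pair is ordered correctly, while the distance term also penalizes predictions that preserve the order but compress or stretch the score gaps.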

Text-guided Multi-Task Image Aesthetic Quality Assessment

Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning

Technical Quality-Assisted Image Aesthetics Quality Assessment

Semantics-Aware Image Aesthetics Assessment Using Tag Matching and Contrastive Ranking

Theme-Aware Semi-Supervised Image Aesthetic Quality Assessment

AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception

Attribute-Driven Multimodal Hierarchical Prompts for Image Aesthetic Quality Assessment

AesCLIP: Multi-Attribute Contrastive Learning for Image Aesthetics Assessment

Multimodal Image Aesthetic Prediction with Missing Modality

Aesthetic Visual Question Answering of Photographs

Towards Explainable Image Aesthetics Assessment with Attribute-oriented Critiques Generation

Multi-modal Learnable Queries for Image Aesthetics Assessment

Multitask Attentive Network for Text Effects Quality Assessment

UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment

UMAAF: Unveiling Aesthetics via Multifarious Attributes of Images

Social-sensed Image Aesthetics Assessment

Attribute-assisted Multimodal Network for Image Aesthetics Assessment

A Multi-dimensional Aesthetic Quality Assessment Model for Mobile Game Images

Image Aesthetics Assessment With Attribute-Assisted Multimodal Memory Network

Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment

Textual Aesthetics in Large Language Models