Identifying Professional Photographers Through Image Quality and Aesthetics in Flickr

Sofia Strukova, Rubén Gaspar Marco, José A. Ruipérez-Valiente, Félix Gómez Mármol
2023-07-04
Abstract: In our generation, there is an undoubted rise in the use of social media and specifically photo and video sharing platforms. These sites have proved their ability to yield rich data sets through the users' interaction which can be used to perform a data-driven evaluation of capabilities. Nevertheless, this study reveals the lack of suitable data sets in photo and video sharing platforms and evaluation processes across them. In this way, our first contribution is the creation of one of the largest labelled data sets in Flickr with the multimodal data which has been open sourced as part of this contribution. Predicated on these data, we explored machine learning models and concluded that it is feasible to properly predict whether a user is a professional photographer or not based on self-reported occupation labels and several feature representations out of the user, photo and crowdsourced sets. We also examined the relationship between the aesthetics and technical quality of a picture and the social activity of that picture. Finally, we depicted which characteristics differentiate professional photographers from non-professionals. As far as we know, the results presented in this work represent an important novelty for the users' expertise identification which researchers from various domains can use for different applications.
Computers and Society
### What problem does this paper attempt to address?

This paper addresses four main issues:

1. **Creating a large dataset**: The authors found that photo and video sharing platforms lack suitable datasets for evaluating users' professional skills. They therefore built a large labelled dataset from Flickr containing multimodal data (user information, image quality ratings, etc.) and released it as open source.
2. **Predicting professional photographers**: Using this dataset, the authors explored machine learning models that predict whether a user is a professional photographer. The models were trained on self-reported occupation labels and on features drawn from the user, photo, and crowdsourced sets.
3. **Analyzing the relationship between image quality and social activity**: The authors examined how the aesthetic and technical quality of a picture relates to the social activity that picture receives.
4. **Distinguishing professional from non-professional photographers**: Finally, the authors identified the characteristics that differentiate professional photographers from non-professionals.

Together, these contributions offer a new approach to identifying user expertise and a data foundation for future research.
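The third point, relating picture quality to social activity, can be illustrated with a rank correlation. The following is a minimal sketch, not the authors' method: the source does not say which statistic the paper uses, and Spearman's coefficient is chosen here only because engagement counts tend to be heavily skewed. The names `aesthetic_scores` and `favorite_counts` and their values are hypothetical toy data, not figures from the Flickr dataset.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        # Extend the run while the next sorted value equals the current one.
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1  # mean of positions start+1 .. end+1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def spearman(xs, ys):
    """Spearman rank correlation: Pearson correlation computed on the ranks."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    rx, ry = average_ranks(xs), average_ranks(ys)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (var_x * var_y) ** 0.5


# Hypothetical toy data: one aesthetic score and one favorite count per photo.
aesthetic_scores = [4.1, 5.3, 6.0, 6.8, 7.5]
favorite_counts = [2, 10, 8, 40, 120]

rho = spearman(aesthetic_scores, favorite_counts)
print(f"Spearman correlation: {rho:.2f}")
```

A value near 1 would suggest that higher-scored pictures tend to attract more activity, while a value near 0 would suggest little monotonic relationship. With real data, the quality scores would come from whatever aesthetic and technical quality measures the dataset provides.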