Abstract: Computational photo quality evaluation is a useful technique in many computer vision and graphics tasks, for example, photo retargeting, 3-D rendering, and fashion recommendation. Conventional photo quality models characterize pictures from all communities (e.g., “architecture” and “colorful”) indiscriminately, so community-specific features are not exploited explicitly. In this article, we develop a new community-aware photo quality evaluation framework. It uncovers the latent community-specific topics with a regularized latent topic model (LTM) and captures human visual quality perception by exploring multiple attributes. More specifically, given massive-scale online photographs from multiple communities, a novel ranking algorithm is proposed to measure the visual/semantic attractiveness of regions inside each photograph. Three attributes, namely: 1) photo quality scores; 2) weak semantic tags; and 3) inter-region correlations, are seamlessly and collaboratively incorporated during ranking. Subsequently, we construct the gaze shifting path (GSP) for each photograph by sequentially linking its top-ranking regions, and an aggregation-based CNN calculates the deep representation for each GSP. Based on this, an LTM is proposed to model the GSP distribution from multiple communities in the latent space. To mitigate the overfitting caused by communities with very few photographs, a regularizer is incorporated into our LTM. Finally, given a test photograph, we obtain its deep GSP representation, and its quality score is determined by the posterior probability of the regularized LTM. Comparative studies on four image sets have shown the competitiveness of our method. In addition, eye-tracking experiments have demonstrated that our ranking-based GSPs are highly consistent with real human gaze movements.
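
The GSP construction step described in the abstract (sequentially linking the top-ranking regions of a photograph) can be illustrated with a minimal sketch. It assumes the per-region ranking scores are already available; the ranking algorithm itself, which combines quality scores, weak semantic tags, and inter-region correlations, is not reproduced here. The names `build_gsp` and `gsp_length`, the representation of a region by its center point, and the cutoff `k` are hypothetical choices made only for illustration, not details taken from the article.

```python
import math
from typing import List, Tuple

# Hypothetical representation: each region is reduced to its (x, y) center.
Point = Tuple[float, float]


def build_gsp(scores: List[float], k: int = 5) -> List[int]:
    """Return the indices of the k highest-scoring regions, best first.

    The returned order is the order in which the regions are linked,
    i.e. an illustrative gaze shifting path (GSP).
    """
    if k < 0:
        raise ValueError("k must be non-negative")
    # Sort region indices by descending score; ties keep their original order.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order[:k]


def gsp_length(centers: List[Point], path: List[int]) -> float:
    """Total Euclidean length of the path linking consecutive region centers."""
    total = 0.0
    for a, b in zip(path, path[1:]):
        (xa, ya), (xb, yb) = centers[a], centers[b]
        total += math.hypot(xb - xa, yb - ya)
    return total


if __name__ == "__main__":
    centers = [(0.0, 0.0), (3.0, 4.0), (6.0, 8.0)]
    scores = [0.9, 0.5, 0.2]  # assumed to come from some ranking step
    path = build_gsp(scores, k=3)
    print(path, gsp_length(centers, path))
```

In the framework described above, each such path would then be passed to the aggregation-based CNN to obtain its deep representation; that stage is outside the scope of this sketch.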

One net to rule them all: efficient recognition and retrieval of POI from geo-tagged photos

DeepCamera

The Knowing Camera: Recognizing Places-Of-Interest In Smartphone Photos

The Knowing Camera 2: Recognizing And Annotating Places-Of-Interest In Smartphone Photos

KISS: Knowing Camera Prototype System for Recognizing and Annotating Places-of-Interest

Community-Aware Photo Quality Evaluation by Deeply Encoding Human Perception

DualNet-PoiD: A Hybrid Neural Network for Highly Accurate Recognition of POIs on Road Networks in Complex Areas with Urban Terrain

PIC-Net: Point Cloud and Image Collaboration Network for Large-Scale Place Recognition

Deep Spatial Attention Hashing Network for Image Retrieval

Attend and Guide (AG-Net): A Keypoints-driven Attention-based Deep Network for Image Recognition

PoCo: Point Context Cluster for RGBD Indoor Place Recognition

Towards Effective Next POI Prediction: Spatial and Semantic Augmentation with Remote Sensing Data

Deep Neural Network for Point Sets Based on Local Feature Integration

ModaLink: Unifying Modalities for Efficient Image-to-PointCloud Place Recognition

Depth Image Hashing Algorithm Based on Local Global Feature Fusion

Multi-scale pyramidal hash learning for traditional building facade image retrieval

Point-Cloud-Based Place Recognition Using CNN Feature Extraction

I2P-Rec: Recognizing Images on Large-scale Point Cloud Maps through Bird's Eye View Projections

Deep Captioning Hashing Network for Complex Scene Image Retrieval

Image Retrieval by Geological Proximity Using Deep Neural Network