An Automatic and Scalable Application Crawler for Large-Scale Mobile Internet Content Retrieval.

Mingyi Huang, Yongqiang Lyu, Hao Yin
DOI: https://doi.org/10.3837/tiis.2018.10.013
2018-01-01
KSII Transactions on Internet and Information Systems
Abstract: The mobile internet has grown ubiquitous across the globe with the widespread use of smart devices. However, the designs of modern mobile operating systems and their applications limit content retrieval from mobile applications. The mobile internet is not as accessible as the traditional web: it has more man-made restrictions and lacks a unified approach for crawling and content retrieval. In this study, we propose an automatic and scalable mobile application content crawler that recognizes the interaction paths of mobile applications, represents them as interaction graphs, and automatically collects content according to those graphs in parallel. The crawler was verified by retrieving content from 50 non-game applications from the Google Play Store on the Android platform. The experiment showed the efficiency and scalability potential of our crawler for large-scale mobile internet content retrieval.
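To make the idea of an interaction graph concrete, the following is a minimal sketch and not the authors' implementation. It assumes a graph in which nodes are application screens and edges are user actions, finds one shortest action path to each content-bearing screen with a breadth-first search, and then processes those paths in parallel. The screen names, action labels, and the functions `interaction_paths`, `collect`, and `crawl` are all hypothetical; a real crawler would replay each path on a device or emulator instead of returning a string.

```python
from collections import deque
from concurrent.futures import ThreadPoolExecutor

# Hypothetical interaction graph: screen -> list of (action, next_screen).
GRAPH = {
    "home": [("tap:news", "news_list"), ("tap:profile", "profile")],
    "news_list": [("tap:item", "article")],
    "profile": [],
    "article": [],
}
# Hypothetical set of screens that hold content worth collecting.
CONTENT_SCREENS = {"article", "profile"}


def interaction_paths(graph, start, targets):
    """Breadth-first search: one shortest action path per reachable target."""
    paths = {}
    seen = {start}
    queue = deque([(start, [])])
    while queue:
        screen, actions = queue.popleft()
        if screen in targets:
            paths[screen] = actions
        for action, nxt in graph.get(screen, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, actions + [action]))
    return paths


def collect(item):
    """Stand-in for replaying a path on a device and scraping the screen."""
    screen, actions = item
    return screen, " > ".join(actions)


def crawl(graph, start, targets, workers=4):
    """Collect every target screen, one worker task per interaction path."""
    paths = interaction_paths(graph, start, targets)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(pool.map(collect, sorted(paths.items())))
```

Calling `crawl(GRAPH, "home", CONTENT_SCREENS)` maps each content screen to the action sequence that reaches it. Because each path is independent of the others, the collection step can be spread across worker threads, or across multiple devices in a larger setup.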