Web Scraping - State of Art, Techniques and Approaches

Tsvetana Halacheva, Gabriel Kanev, Tsvetelina Mladenova, Irena Valova
DOI: https://doi.org/10.1109/TELECOM59629.2023.10409723
2023-11-16
Telecom
Abstract: Web scraping has emerged as a crucial technique for extracting valuable information from the vast and ever-growing expanse of the Internet. This paper provides a comprehensive overview of the current state of web scraping. The study surveys prominent applications across diverse domains, highlighting the pivotal role played by web scraping in modern data-driven decision-making processes. The paper meticulously explores various techniques employed in web scraping, including desktop applications, plugins and browser extensions, web-based applications, cloud applications, and custom applications. To showcase the versatility of web scraping, the paper elucidates its applications in various domains, including e-commerce, finance, research, and social media analysis.
Computer Science