AI - Based Solution for Web Crawling

Prashanth Kumar HM Subramanya
DOI: https://doi.org/10.21275/sr23331154330
2023-04-05
International Journal of Science and Research (IJSR)
Abstract:: Web crawling, also known as web scraping or spidering, is the process of automatically gathering data from the internet. It involves using automated software tools using AI to visit websites, download data like web pages, pdf, videos, metadata, or images. Then store it in a structured format for later use. Web crawlers, also called spiders or bots, follow links from one webpage to another with AI validation. The information gathered by web crawlers can be used for a variety of purposes, including data mining, content aggregation, search engine indexing, market research or Plagiarism detection. Here our crawling is only for plagiarism detection, and our new AI based algorithms help us to do the fastest and most accurate data downloading.
Computer Science
What problem does this paper attempt to address?