Web Data Mining with Organized Contents Using Naive Bayes Algorithm
V. S. Prabhu,Dr. P. Asha,Bathini Ravinder,Senthil Kumar,S. P. Maniraj,C. Srinivasan
DOI: https://doi.org/10.1109/IC457434.2024.10486403
2024-02-08
Abstract:Data mining on the web has developed into a simple and crucial tool for finding relevant information. When it comes to file transfers, the World Wide Web is the user’s first choice. Finding useful information and trends in the ever-expanding amount of data available online is becoming ever more challenging and time-consuming. When dealing with massive amounts of textual data often seen online, there are a number of benefits to using the Naive Bayes method for web data mining. Web data mining using the Naive Bayes algorithm seeks to mine massive amounts of textual material on the web for useful patterns, insights, and knowledge. Text classification and categorization tasks are the most common uses of the Naive Bayes method in online data mining. Expert and user-requested data may be difficult to mine from the sea of unorganized and contradictory material that is the World Wide Web. Relevant data (hyperlinks, contents, web use records) is extracted from the web using a variety of mining methods. Internet-centric data mining is a subfield of data science. Structure mining, content mining, and use mining are the three main categories of online data mining. Each of these categories employs a unique set of methods, instruments, strategies, and Naïve Bayes algorithm to mine the web’s vast data stores for useful insights. The results show the density and velocity of web mining using Naïve Bayes algorithm.
Computer Science