A Simple Semantic Web Crawler for Intelligent Information Retrieval from Academic Websites

Dr V. Kiran Kumar*, Mrs Ramya
DOI: https://doi.org/10.35940/ijitee.d2085.029420
2020-02-10
International Journal of Innovative Technology and Exploring Engineering
Abstract: In various applications, data is shared and reused through a common framework such as the Semantic Web. As the web keeps expanding, huge quantities of web content are created and made available for both humans and machines to interpret and apply. This paper develops a "scutter", otherwise known as a semantic crawler, which crawls semantic content and collects and stores the information in a centrally located database. The proposed scutter is based on the Jena 3.0 framework, a freely downloadable Java framework available at https://jena.apache.org/download/. An RDF file is taken as the seed input, after which the scutter reaches other RDF documents by following the rdfs:seeAlso property, thereby automatically extracting semantic information from various websites. Certain privacy-related issues, especially in FOAF metadata, are also discussed.
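The crawl described in the abstract (start from a seed RDF document, follow rdfs:seeAlso links, store what is found) can be illustrated with a minimal sketch. This is not the authors' Jena-based implementation: it is written in Python, uses only the standard library, and represents triples as plain tuples. The names `crawl`, `fetch_triples`, `fetch_from_memory`, and `DOCS`, as well as the example.org URLs, are hypothetical and exist only for illustration; in a real scutter the fetch step would download and parse each RDF document.

```python
from collections import deque

RDFS_SEE_ALSO = "http://www.w3.org/2000/01/rdf-schema#seeAlso"
FOAF_KNOWS = "http://xmlns.com/foaf/0.1/knows"


def crawl(seed_url, fetch_triples, max_documents=100):
    """Breadth-first crawl over rdfs:seeAlso links, starting at seed_url.

    fetch_triples(url) is a caller-supplied (hypothetical) function that
    returns an iterable of (subject, predicate, object) tuples for the RDF
    document at url, or raises an exception if it cannot be read.
    """
    store = []                  # stands in for the central database
    visited = set()             # every URL the crawler has attempted
    queue = deque([seed_url])
    while queue and len(visited) < max_documents:
        url = queue.popleft()
        if url in visited:
            continue            # avoid loops between mutually linked files
        visited.add(url)
        try:
            triples = list(fetch_triples(url))
        except Exception:
            continue            # skip unreachable or malformed documents
        for s, p, o in triples:
            store.append((s, p, o))
            if p == RDFS_SEE_ALSO and o not in visited:
                queue.append(o)  # schedule the linked RDF document
    return store, visited


# Hypothetical in-memory "web" of two RDF documents that link to each other.
DOCS = {
    "http://example.org/a.rdf": [
        ("http://example.org/a#me", FOAF_KNOWS, "http://example.org/b#me"),
        ("http://example.org/a#me", RDFS_SEE_ALSO, "http://example.org/b.rdf"),
    ],
    "http://example.org/b.rdf": [
        ("http://example.org/b#me", RDFS_SEE_ALSO, "http://example.org/a.rdf"),
        ("http://example.org/b#me", RDFS_SEE_ALSO, "http://example.org/missing.rdf"),
    ],
}


def fetch_from_memory(url):
    # Raises KeyError for unknown URLs, which crawl() treats as a failed fetch.
    return DOCS[url]


store, visited = crawl("http://example.org/a.rdf", fetch_from_memory)
```

The visited set keeps the crawler from looping when two documents point at each other, and the `max_documents` cap is an added safeguard, not something the abstract specifies.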