Research and Design of Agricultural Information Vertical Search Engine Based on Heritrix+Solr

GUO Cheng-kun,CHEN Guo-song,RUAN Huai-jun,CHEN Ying-yi,TU Xing-yue
DOI: https://doi.org/10.3969/j.issn.1004-874x.2015.05.027
2015-01-01
Abstract:The agricultural information blooms rapidly with the development of agriculture in information and intelligence, therefore, a convenient and effective agricultural information search method and search engine for agricultural researchers, producers and managers is in need. A search engine framework based on Heritrix and Solr was put forward, in which Hidden Markvo Model based web information extraction and mmseg4 j agricultural dictionary based Chinese word segmentation were involved, moreover, the page ranking algorithm was improved according to the characteristics of agricultural information search. Finally, this paper provided suggestions for improving the user experience and efficiency of agricultural vertical search engine.
What problem does this paper attempt to address?