Research and Development of Medical Search Engine Based on Nutch

En-Ge YUAN,Xiang-Qian WU,Wen-zhong YANG
DOI: https://doi.org/10.3969/j.issn.1000-2839.2014.02.018
2014-01-01
Abstract:As the demands of public access to medical information with the help of network is growing, and when people use general search engines get professional information accuracy is poor and ineffcient. This paper designs a medical vertical search engine based on nutch components. The system realized the function of Chinese word segmentation.It also obtained Term Library by training texts. Using of SVM, the engine calculated the correlation between web page and medical domain.It realized the function of web page filtering. Finally,this system joined the theme relevant factors in the sorting algorithm.Test results show that,comparing with the general search engine,this system has a higher accuracy in terms of access to health information. It can reduce the interference of irrelevant information,to make finding and positioning medical information more accurate.So this system can provide the public with more targeted services.
What problem does this paper attempt to address?