A Content-Based Distributed FTP Search Engine:Design and Implementation
Xu Jun,Wang Chaokun,Li Rui,Wang Jianmin,Liu Zhang
2011-01-01
Journal of Computer Research and Development
Abstract:With the development of the Internet,FTP is regarded as the main pattern of sharing files.FTP and HTTP have many different features,like compartmentalization and obturation.It means that it is more difficult for FTP sites to do the indexing work.At present,the indexing work on FTP sites is spread out around filename and it makes users hard to obtain the required information.We put forward a system called iSearch,which is a distributed FTP search engine that based on content.It can index document contents and has many characteristics including incremental indexing,plug-in configuration,loading-balance,inquiring distribution and so on.At the same time,it provides a more stable and accurate inquiring service by reducing the cost of network transmission in the process,indexing more file information,making full use of the user feedback and support a more stable query service.