When searching for useful information across the vast information network, information service organizations often rely on vertical search systems to meet the specific needs of medical information research and information services. This paper uses the open-source software Nutch and Lucene to design and implement a vertical search engine for biomedical information. Key techniques, including web page crawling and processing, content indexing, and searching, are explained and discussed. By adding Chinese word segmentation and re-crawl modules, the system improves the recognition rate of Chinese keywords and shortens the information update cycle. The system has been tested online and returns more accurate and timely search results.