Description: The resources in the internet are abundant, but it is a difficult job to search some useful information. So a search engine is the best method to solve this problem. This article fist introduces the system structure of search engine based on the internet in detail, then gives a minute explanation form Spider search, engine and web server. In order to understand the technology more deeply, I have programmed a news search engine by myself.
The news search engine is explained and searched according to hyperlink from a appointed web page, then indexs every searched information and adds it to the index database. Then after receiving the customers requests from the web server, it soon searchs the right news form the index engine,
In the chapter of introducing search engine, it is not only elaborate the core technology, but also combine with the modern code,pictures included, easy to understand.
To Search:
File list (Check if you may need any files):
搜索引擎的研究与实现(Java)(含源码)
..................................\bot.jar
..................................\News
..................................\....\bak
..................................\....\...\news
..................................\....\...\....\HTMLParse.java~11~
..................................\....\...\....\HTMLParse.java~12~
..................................\....\...\....\HTMLParse.java~13~
..................................\....\...\....\HTMLParse.java~14~
..................................\....\...\....\HTMLParse.java~15~
..................................\....\...\....\HTMLParse.java~16~
..................................\....\...\....\HTMLParse.java~17~
..................................\....\...\....\HTMLParse.java~18~
..................................\....\...\....\HTMLParse.java~19~
..................................\....\...\....\HTMLParse.java~20~
..................................\....\...\....\Index.java~32~
..................................\....\...\....\Index.java~33~
..................................\....\...\....\Index.java~34~
..................................\....\...\....\Index.java~35~
..................................\....\...\....\Index.java~36~
..................................\....\...\....\Index.java~37~
..................................\....\...\....\Index.java~38~
..................................\....\...\....\Index.java~39~
..................................\....\...\....\Index.java~40~
..................................\....\...\....\Index.java~41~
..................................\....\...\....\QueryNews.java~1~
..................................\....\...\....\QueryNews.java~2~
..................................\....\...\....\QueryNews.java~3~
..................................\....\...\....\QueryNews.java~4~
..................................\....\...\....\QueryNews.java~5~
..................................\....\...\....\QueryNews.java~6~
..................................\....\...\....\Searcher.java~1~
..................................\....\...\....\Searcher.java~2~
..................................\....\...\....\Searcher.java~3~
..................................\....\...\....\Searcher.java~4~
..................................\....\...\....\Searcher.java~5~
..................................\....\...\....\Searcher.java~6~
..................................\....\...\....\Searcher.java~7~
..................................\....\...\....\Searcher.java~8~
..................................\....\...\....\Searcher.java~9~
..................................\....\classes
..................................\....\.......\news
..................................\....\.......\....\HTMLParse.class
..................................\....\.......\....\Index.class
..................................\....\.......\....\Searcher.class
..................................\....\.......\package cache
..................................\....\.......\.............\news.dep2
..................................\....\News.jpx
..................................\....\News.jpx.local
..................................\....\News.jpx.local~
..................................\....\News.jpx~
..................................\....\src
..................................\....\...\news
..................................\....\...\....\HTMLParse.java
..................................\....\...\....\Index.java
..................................\....\...\....\Searcher.java
..................................\NewsServer
..................................\..........\bak
..................................\..........\...\defaultroot
..................................\..........\...\...........\WEB-INF
..................................\..........\...\...........\.......\web.xml~69~
..................................\..........\...\...........\.......\web.xml~70~
..................................\..........\...\...........\.......\web.xml~71~
..................................\..........\...\...........\.......\web.xml~72~
..................................\..........\...\...........\.......\web.xml~73~
....