Description: A simple web crawler, support multithreading, crawl depth under control
To Search:
File list (Check if you may need any files):
topicCrawler\.project
............\src\com\parser\HTTP.java
............\...\...\......\HtmlParser.java
............\...\...\......\Parser.java
............\...\...\segment\Segment.java
............\...\...\manner\MannerGather.java
............\...\...\relative\Relative.java
............\...\...\crawler\CrawlerUI.java
............\bin\com\segment\Segment.class
............\...\...\relative\Relative.class
............\...\...\parser\Parser.class
............\...\...\......\HtmlParser.class
............\...\...\......\HTTP.class
............\...\...\manner\MannerGather.class
............\...\...\crawler\CrawlerUI$Crawler.class
............\...\...\.......\CrawlerUI.class
............\...\...\.......\CrawlerUI$1.class
............\...\...\.......\CrawlerUI$3.class
............\...\...\.......\CrawlerUI$4.class
............\...\...\.......\CrawlerUI$2.class
............\.classpath
............\topic\topic.txt
............\doc\实验报告.docx
............\startUrl\startUrl.txt
............\lib\lucene-core-2.3.2.jar
............\...\IKAnalyzer1.4.jar
............\MANIFEST.MF
............\jar可执行程序\topicCrawler.jar
............\.............\说明.txt
............\exe可执行程序\topicCrawler.exe
............\.............\说明.txt
............\src\com\parser
............\...\...\segment
............\...\...\manner
............\...\...\relative
............\...\...\crawler
............\bin\com\segment
............\...\...\relative
............\...\...\parser
............\...\...\manner
............\...\...\crawler
............\src\com
............\bin\com
............\log\RelativeUrl
............\...\relativePage
............\src
............\bin
............\topic
............\log
............\doc
............\startUrl
............\lib
............\jar可执行程序
............\exe可执行程序
topicCrawler