Description: Java+ mysql web crawler.On a single web crawlers WordPress site
Use of open source libraries are as follows:
Apache HttpComponents 4.3
2.0 HTML Parser
The MySQL Connector/J 5.1.27
Use utf-8 to record label in Chinese
Using XAMPP MySQL default port localhost: 3306
Need local XAMPP environment
To Search:
File list (Check if you may need any files):
WPCrawler-master
................\.classpath
................\.project
................\.settings
................\.........\org.eclipse.jdt.core.prefs
................\README.md
................\bin
................\...\net
................\...\...\johnhany
................\...\...\........\wpcrawler
................\...\...\........\.........\crawler.class
................\...\...\........\.........\httpGet$1.class
................\...\...\........\.........\httpGet.class
................\...\...\........\.........\parsePage.class
................\lib
................\...\commons-logging-1.1.3.jar
................\...\htmllexer.jar
................\...\htmlparser.jar
................\...\httpclient-4.3.1.jar
................\...\httpcore-4.3.jar
................\...\mysql-connector-java-5.1.27-bin.jar
................\result-2013-11-29.txt
................\src
................\...\net
................\...\...\johnhany
................\...\...\........\wpcrawler
................\...\...\........\.........\crawler.java
................\...\...\........\.........\httpGet.java
................\...\...\........\.........\parsePage.java