Description: Light in the box crawling process. The use of HttpClient, regular expression analysis. xml data stored xpp3. Multi-threading support, the use of session, support for proxy servers list. Because crawling is foreign websites, so the speed is relatively slow, a little change that can be a relatively easy tool.
File list (Check if you may need any files):