Description: A web crawler (spider) project for the Linux platform. It is divided into a main control module, a download module, a URL extraction module, and a persistence module. It uses Linux I/O multiplexing (the epoll model), sockets, multithreading, regular expressions, a daemon process, and Linux dynamic (shared) library development techniques.
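The description says the URL extraction module relies on regular expressions. As a rough illustration of that idea only, and not the project's actual code, a minimal sketch could look like the following. The function name `extract_links` and the pattern are assumptions made for this example; a real crawler would also need to resolve relative URLs and handle malformed HTML.

```cpp
#include <regex>
#include <string>
#include <vector>

// Hypothetical helper (not taken from the project): collect the values of
// href attributes found in an HTML page, in document order.
std::vector<std::string> extract_links(const std::string& html) {
    // Matches href="..." or href='...' case-insensitively.
    // Group 1 is the opening quote, group 2 is the URL text, and \1
    // requires the closing quote to be the same character as the opening one.
    static const std::regex href_re(R"re(href\s*=\s*(["'])(.*?)\1)re",
                                    std::regex::icase);

    std::vector<std::string> links;
    for (std::sregex_iterator it(html.begin(), html.end(), href_re), end;
         it != end; ++it) {
        links.push_back((*it)[2].str());
    }
    return links;
}
```

Calling `extract_links` on a downloaded page body returns the raw link strings, which the main control module could then deduplicate and queue for download.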
File list:
spider项目代码\Makefile
..............\modules\domainlimit.cpp
..............\.......\headerfilter.cpp
..............\.......\Makefile
..............\.......\maxdepth.cpp
..............\.......\savehtml.cpp
..............\.......\saveimage.cpp
..............\spider.conf
..............\.rc\bloomfilter.cpp
..............\...\bloomfilter.h
..............\...\confparser.cpp
..............\...\confparser.h
..............\...\crc32.cpp
..............\...\crc32.h
..............\...\dso.cpp
..............\...\dso.h
..............\...\hashs.cpp
..............\...\hashs.h
..............\...\Makefile
..............\...\md5.cpp
..............\...\md5.h
..............\...\qstring.cpp
..............\...\qstring.h
..............\...\sha1.cpp
..............\...\sha1.h
..............\...\socket.cpp
..............\...\socket.h
..............\...\spider.cpp
..............\...\spider.h
..............\...\threads.cpp
..............\...\threads.h
..............\...\url.cpp
..............\...\url.h
..............\modules
..............\src
spider项目代码