Description: Crawler. This is a simple crawler of web search engine. It crawls 500 links from very beginning. -Crawler of web search engine Platform: |
Size: 1024 |
Author:sun |
Hits:
Description: 1.Hyper Estraier是一个用C语言开发的全文检索引擎,他是由一位日本人开发的.工程注册在sourceforge.net(http://hyperestraier.sourceforge.net).
2.Hyper的特性:
高速度,高稳定性,高可扩展性…(这可都是有原因的,不是瞎吹)
P2P架构(可译为端到端的,不是咱们下大片用的p2p)
自带Web Crawler
文档权重排序
良好的多字节支持(想一想,它是由日本人开发的….)
简单实用的API(我看了一遍,真是个个都实用,我能看懂的,也就算简单了)
短语,正则表达式搜索(这个有点过了,不带这个,不是好的Full text Search Engine?)
结构化文档搜索能力(大概就是指可以自行给文档加上一堆属性并搜索这些属性吧?这个我没有实验)-1 a Hyper Estraier with C language development fulltext retrieval engine, he is by a Japanese development. Engineering registered in sourceforge.net (http://hyperestraier.sourceforge.net).
The characteristics: Hyper 2.
High speed, high stability, high expansibility. (this is a reason, not come)
The P2P software architecture (for end-to-end, not let down by the P2P) vast
Bringing Web Crawler
Document weighted order
Good multibyte support (think, it is the development of Japanese...).
Simple and practical API (I see again, is all practical, I can read, and even simple)
Phrases, regular expressions Search (this was a bit much, do not take the Full text, not good search.com)?
Structured document search ability (probably means to give document with a pile of attributes and search for these attributes? I didn t experiment), Platform: |
Size: 1154048 |
Author:maozhucai |
Hits:
Description: 自己写一个简单的网络爬虫,能够从网上自动爬会一些东西,实现了深度爬-To write a simple Web crawler that can crawl from the Internet will automatically something to climb to achieve the depth of Platform: |
Size: 18432 |
Author:oldwolf |
Hits:
Description: 简易多线程网络爬虫基于C#语言socket编程-Simple multi-threaded web crawler socket programming language based on C# Platform: |
Size: 455680 |
Author:亿龙 |
Hits:
Description: JAVA开发的简单网络爬虫 对指定站点新闻内容的获取
-JAVA development of a simple Web crawler on a specified site to access news content Platform: |
Size: 2670592 |
Author:殷威 |
Hits:
Description: 简易的网络爬虫,可以从特定的网站分析抓取及下载-Simple web crawler that can crawl from the analysis of specific sites and download the Platform: |
Size: 3072 |
Author:奋斗 |
Hits:
Description: 一个JAVA开发的简单网络爬虫 可以实现对指定站点新闻内容的获取
软件大小:2.6MB
运行环境:JSP+MSSQL -JAVA development of a simple Web crawler can be achieved on a specified site to access news content
software size: 2.6MB
operating environment: JSP+ MSSQL Platform: |
Size: 2669568 |
Author:huojy |
Hits:
Description: 一个简易的仿真网络爬虫,如果你是一个新手,请不要错过-The simulation of a simple web crawler, and if you are a novice, do not miss Platform: |
Size: 64512 |
Author:张星亮 |
Hits:
Description: 抓取豆瓣电影链接、电影简介的简单网络爬虫,自己写的-Crawl Douban movie link, the film profiles a simple web crawler, to write their own Platform: |
Size: 3072 |
Author:霉星星 |
Hits:
Description: 该源码是用python写的一个简单的网络爬虫,用来爬取百度百科上面的人物的网页,并能够提取出网页中的人物的照片-The source code is written in a simple python web crawler, Baidu Encyclopedia is used to crawl the page above figures, and be able to extract the characters in the picture page Platform: |
Size: 204800 |
Author:孙朔 |
Hits:
Description: 用java编写的简单的网络爬虫程序,对于想进行搜索引擎的初学者很有帮助。也可扩展成更强大的爬虫。-Using java prepared by the simple web crawler program, for those who want to search engines for beginners. Can also be extended into a more powerful reptiles. Platform: |
Size: 10240 |
Author:王国栋 |
Hits:
Description: 简单的网络爬虫,只能实现博客内容的截取,希望对大家有帮助-Simple web crawler, can realize the blog content of interception Platform: |
Size: 1886208 |
Author:denny |
Hits:
Description: 基于Python的Beautifulsoup4框架的爬虫,主要爬取出种子文件下载地址,由简单的GUI界面显示。(Based on Beautifulsoup4 frame in Python, the web crawler can grab RARBG torrent download address and displayed by simple GUI.) Platform: |
Size: 1024 |
Author:JamesChan |
Hits: