Description: A web crawler (also known as a web spider or web robot) is a program or automated script
which browses the in a methodical, automated manner. Other less frequently used names for
web crawlers are ants, automatic indexers, bots, and worms (Kobayashi and Takeda, 2000).来源。-A web crawler (also known as a web spider or web robot) is a program or automated scriptwhich browses the in a methodical, automated manner. Other less frequently used names forweb crawlers are ants, automatic indexers, bots, and worms (Kobayashi and Takeda , 2000). source. Platform: |
Size: 218112 |
Author:sun |
Hits:
Description: Lucene.Net is a high performance Information Retrieval (IR) library, also known as a search engine library. Lucene.Net contains powerful APIs for creating full text indexes and implementing advanced and precise search technologies into your programs. Some people may confuse Lucene.net with a ready to use application like a web search/crawler, or a file search application, but Lucene.Net is not such an application, it s a framework library. Lucene.Net provides a framework for implementing these difficult technologies yourself. Lucene.Net makes no discriminations on what you can index and search, which gives you a lot more power compared to other full text indexing/searching implications you can index anything that can be represented as text. There are also ways to get Lucene.Net to index HTML, Office documents, PDF files, and much more.-Lucene.Net is a high performance Information Retrieval (IR) library, also known as a search engine library. Lucene.Net contains powerful APIs for creating full text indexes and implementing advanced and precise search technologies into your programs. Some people may confuse Lucene.net with a ready to use application like a web search/crawler, or a file search application, but Lucene.Net is not such an application, it s a framework library. Lucene.Net provides a framework for implementing these difficult technologies yourself. Lucene.Net makes no discriminations on what you can index and search, which gives you a lot more power compared to other full text indexing/searching implications you can index anything that can be represented as text. There are also ways to get Lucene.Net to index HTML, Office documents, PDF files, and much more.
Platform: |
Size: 320512 |
Author:Yu-Chieh Wu |
Hits:
Description: 网页爬虫(也被称做蚂蚁或者蜘蛛)是一个自动抓取万维网中网页数据的程序.网页爬虫一般都是用于抓取大量的网页,为日后搜索引擎处理服务的.抓取的网页由一些专门的程序来建立索引(如:Lucene,DotLucene),加快搜索的速度.爬虫也可以作为链接检查器或者HTML代码校验器来提供一些服务.比较新的一种用法是用来检查E-mail地址,用来防止Trackback spam.-A web crawler (also known as a web spider or ant) is a program, which browses the World Wide Web in a methodical, automated manner. Web crawlers are mainly used to create a copy of all the visited pages for later processing by a search engine, that will index the downloaded pages to provide fast searches. Crawlers can also be used for automating maintenance tasks on a web site, such as checking links, or validating HTML code. Also, crawlers can be used to gather specific types of information from Web pages, such as harvesting e-mail addresses (usually for spam). Platform: |
Size: 55296 |
Author:lisi |
Hits:
Description: 网页抓取,可以实现网页的下载,并过滤出想要的内容。很实用-Web crawling, Web page downloads can be achieved, and to filter out unwanted content. Very practical Platform: |
Size: 366592 |
Author:ny |
Hits:
Description: 简易多线程网络爬虫基于C#语言socket编程-Simple multi-threaded web crawler socket programming language based on C# Platform: |
Size: 455680 |
Author:亿龙 |
Hits:
Description: 一个web爬虫 CSharp开发的,很小很不错,是个开放源代码的项目-CSharp developed a web crawler, very small and very good open source projects is Platform: |
Size: 197632 |
Author:zf |
Hits:
Description: C# 编写的网络爬虫,比较基础 适合初学者入门学习,含代码,可运行-Web crawler written in C#, more suitable for beginner to learn basic, containing the code, run Platform: |
Size: 112640 |
Author:李乐 |
Hits:
Description: csharp 编写:网络爬虫、网络蜘蛛、抓取网页,可分析网页里面的内容-csharp write: web crawler, spider, crawl the web, which can analyze the content of the page Platform: |
Size: 124928 |
Author:TYH |
Hits: