Location:
Search - stopwords
Search list
Description: ASPSeek是一个C++编写的互联网搜索引擎,并使用了STL库。它主要包括一个检索机器人,一个搜索守护程序,和一个搜索前端(CGI或者是Apache模块)。它大概可以检索几百万个URLs,来查找给定的短语和单词,并使用通配符,进行布尔搜索。搜索结果可以限定在给定的时间或站点,站点空间,并按照相关性或者时间进行排序(这里面使用了一些非常酷的技术)。ASPSeek可以应用于很多语言和编码中(甚至包括多字节语言如中文)。它为多个站点做了优化。(多线程检索,同步DNS查询, 按站点将结果分组, Web集合等),同时它也可以用于单个站点的搜索。其他特性包括支持stopwords和ispell, 字符集和语言的预测, 搜索结果的HTML模板,引用和查询词高亮度显示。并且它有详细的文档可以利用。-ASPSeek C is prepared in an Internet search engine and the use of the STL library. It mainly includes a retrieval robot, a guardian search procedures and a search front end (CGI or Apache module). It probably can search millions of URLs to the search for phrases and words, and the use of wildcards for Boolean search. Search results can be limited to the time or site, site space in accordance with the relevant time or rank (which is used by some very cool technology). ASPSeek can be used in many languages and coding (including multi-byte languages such as Chinese). It has done a number of site optimization. (Multi-threaded searching, synchronous DNS inquiries, according to results of a site, Web pools, etc.) It also can be used to search a single site. Other features include support stopwor
Platform: |
Size: 1157208 |
Author: qiu |
Hits:
Description: 中英文中的常用的停用词,对文本分析有帮助的!
Platform: |
Size: 2040 |
Author: zj |
Hits:
Description: ASPSeek是一个C++编写的互联网搜索引擎,并使用了STL库。它主要包括一个检索机器人,一个搜索守护程序,和一个搜索前端(CGI或者是Apache模块)。它大概可以检索几百万个URLs,来查找给定的短语和单词,并使用通配符,进行布尔搜索。搜索结果可以限定在给定的时间或站点,站点空间,并按照相关性或者时间进行排序(这里面使用了一些非常酷的技术)。ASPSeek可以应用于很多语言和编码中(甚至包括多字节语言如中文)。它为多个站点做了优化。(多线程检索,同步DNS查询, 按站点将结果分组, Web集合等),同时它也可以用于单个站点的搜索。其他特性包括支持stopwords和ispell, 字符集和语言的预测, 搜索结果的HTML模板,引用和查询词高亮度显示。并且它有详细的文档可以利用。-ASPSeek C is prepared in an Internet search engine and the use of the STL library. It mainly includes a retrieval robot, a guardian search procedures and a search front end (CGI or Apache module). It probably can search millions of URLs to the search for phrases and words, and the use of wildcards for Boolean search. Search results can be limited to the time or site, site space in accordance with the relevant time or rank (which is used by some very cool technology). ASPSeek can be used in many languages and coding (including multi-byte languages such as Chinese). It has done a number of site optimization. (Multi-threaded searching, synchronous DNS inquiries, according to results of a site, Web pools, etc.) It also can be used to search a single site. Other features include support stopwor
Platform: |
Size: 1157120 |
Author: qiu |
Hits:
Description: 中英文中的常用的停用词,对文本分析有帮助的!-Chinese and English in common stop words, the text analysis help!
Platform: |
Size: 2048 |
Author: zj |
Hits:
Description:
Platform: |
Size: 2048 |
Author: 浩 |
Hits:
Description: A English Stop word class. It helps to check whether your word is stopword or not.
Platform: |
Size: 2048 |
Author: hopebt |
Hits:
Description: ASPSeek是一个C++编写的互联网搜索引擎,并使用了STL库。它主要包括一个检索机器人,一个搜索守护程序,和一个搜索前端(CGI或者是 Apache模块)。它大概可以检索几百万个URLs,来查找给定的短语和单词,并使用通配符,进行布尔搜索。搜索结果可以限定在给定的时间或站点,站点空间,并按照相关性或者时间进行排序(这里面使用了一些非常酷的技术)。ASPSeek可以应用于很多语言和编码中(甚至包括多字节语言如中文)。它为多个站点做了优化。(多线程检索,同步DNS查询, 按站点将结果分组, Web集合等),同时它也可以用于单个站点的搜索。其他特性包括支持stopwords和ispell, 字符集和语言的预测, 搜索结果的HTML模板,引用和查询词高亮度显示。并且它有详细的文档可以利用-ASPSeek is a C++ written in the Internet search engine, and uses STL library. It includes a search robot, a search daemon, and a search front-end (CGI or Apache module). It probably can be retrieved millions of URLs, to find a given phrase and the words, and use wildcards, to Boolean search. Search results can be limited to a given time or site, site space, and in accordance with the relevant sort of sexual or time (which is inside the use of some very cool technology). ASPSeek can be used in many languages and encoding (even including multi-byte languages such as Chinese). It is optimized for multiple sites. (Multi-threaded retrieval, synchronous DNS query, the results grouped by site, Web sets, etc.), but it also can be used for a single site search. Other supported features include stopwords and ispell, character set and language of the predicted results of the HTML templates, reference and query words highlighted. Detailed documentation and it can be used
Platform: |
Size: 28391424 |
Author: 必扬 |
Hits:
Description: Information retriever based on cosine similarity with TFIDF weights and stopwords
Platform: |
Size: 137216 |
Author: frankdrevin |
Hits:
Description: This application removes all stop words from the given text document and performs stemming operation.
Platform: |
Size: 25600 |
Author: madhu |
Hits:
Description: a program to remove stopwords from the text file for faster data processing
Platform: |
Size: 31744 |
Author: preethi |
Hits:
Description: The system will find the cluster where the target student belongs to based on student number.
Inside the cluster, the system will compute the most similar students to the target student using
Platform: |
Size: 48128 |
Author: fffff |
Hits:
Description: 语义网中,文本分析、信息检索常用的停用词!-The Semantic Web, text analysis, information retrieval used stop words!
Platform: |
Size: 11264 |
Author: 陈芳 |
Hits:
Description: DynaCloud是一个jQuery插件,生成标记或关键字云从web页面上的文字,突出关键字匹配部件一旦点击。
几个方面的DynaCloud可以定制。
Stopwords
限制数量的标签
排序标签
自动生成标签云
-DynaCloud is a jQuery plugin that generates tag or keyword clouds from text on web pages and highlights matching parts once a keyword is clicked.
Several aspects of DynaCloud can be customized.
Stopwords
Limiting the number of tags
Sorting tags
Automatic generation of tag clouds
Platform: |
Size: 2048 |
Author: sadfas |
Hits:
Description: 英文文本词根还原+去停用词小工具 本小程序用以对指定目录下的英文文本文档执行批量还原处理,能够识别单词与单词之间的标点或连字符等,保持原文格式。比较强大的是能把整个文件夹包括小文件夹的都给处理了-This small program used to perform volume reduction treatment, able to identify between the word and the word punctuation or hyphens, and keep the original format of the English text of the document in the specified directory.
Platform: |
Size: 7443456 |
Author: hongwei |
Hits:
Description: 1、文件转换为字符串
2、文本文件分词后转换为ArrayList
3、从文件读取停用词用转换为ArrayList
4、从ArrayList中剔除停用词
5、利用正则表达式将文本文件中的数字、字母剔除-delete stopwords from texts
Platform: |
Size: 2347008 |
Author: Ariera |
Hits:
Description: 中文停用词表,比较全面,有1208个,通用词就是的,是,呢,了这样的词-Chinese stop word table, more comprehensive, 1208, is a generic term, is that it, such a word
Platform: |
Size: 6144 |
Author: 距离 |
Hits:
Description: 中文和英文的停用词词库,在信息检索方面能用到-this is the English and Chines Stop-words,you can use this in Information Searching program
Platform: |
Size: 37888 |
Author: 阿吉 |
Hits:
Description: In this file you can use English stop words.
The usage of this words may can helpful in analyzing content and deleting irrelevant content.
Platform: |
Size: 2048 |
Author: muhammad |
Hits:
Description: c++ stopwords removal
Platform: |
Size: 2048 |
Author: sugar cheng |
Hits:
Description: 文本处理,自然语言处理,包含中文和英文停用词(text processing,including chinese and english stopwords)
Platform: |
Size: 3072 |
Author: hugo123 |
Hits: