Location:
Search - stop word
Search list
Description: 通过JavaCC构建Lucene标准分析器、过滤器。其中有“停止词”分析器,“空格”分析器,及其他分析器-Standards through JavaCC to build Lucene analyzer filter. Among them, stop word parser, spaces, parser, and other analyzers
Platform: |
Size: 4096 |
Author: yuguojia |
Hits:
Description: ...将该字符串变量与停用词表中的所有单词进行比较,若果该词在停用词表中出现过则不对其进行统计,否则在对该词进行词干抽取。
经过以上停用词、词干处理后得到的将是实际进行统计的“单词”(此时的“单词”实际上已经是所有具有相同词干的原是单词的统一代表)...
注:jar包中含有完整的java源程序代码,仅供学习参考之用,传播时请保持本软件包的完整性 ---ZHG工作室 2008.4 E-mail:wudazhg@163.com All Rights Reserved-... The string variable with the stop words in the table to compare all the words, if the term as used in the stop word table, there were no statistics of their, otherwise, in the words extracted stem. After more than stop words, stem after treatment will be the actual statistics of the word (At this point the word actually stems all have the same word used for the unification of representative) ... Note: jar package containing the complete java source code, only to learn reference, dissemination, please maintain the integrity of the package--- ZHG Studio 2008.4 E-mail: wudazhg@163.com All Rights Reserved
Platform: |
Size: 71680 |
Author: zhg |
Hits:
Description: 停用词表,可以和词表结合用于分词,适用于任何开发环境。-Stop word table, and vocabulary can be combined for sub-word applies to any development environment.
Platform: |
Size: 2048 |
Author: 秋水长天 |
Hits:
Description:
根据一个停用词表,输入一个词语。然后来
判断一个词语是否为停用词
-Stop words based on a table, enter a word. And then to determine whether a stop word terms
Platform: |
Size: 616448 |
Author: 彩云 |
Hits:
Description: 程序中包含regEx正则表达式,并通过replaceALL替换为" ",可以删除文本中的常用停词-RegEx procedures included in regular expressions, and through replaceALL replace " " can be used to delete text the word stop
Platform: |
Size: 12288 |
Author: zx |
Hits:
Description: A English Stop word class. It helps to check whether your word is stopword or not.
Platform: |
Size: 2048 |
Author: hopebt |
Hits:
Description: 停用词表扩展,里面有所有的常用的停用词,在信息检索时需要进行去高频词的操作,就需要停用词表,需要的下载-Disable vocabulary expansion, which have all the common stop words, in the information retrieval to the high-frequency words when the need for the operation, you need to stop word table, need to download
Platform: |
Size: 2048 |
Author: 张杰 |
Hits:
Description: 在搜索中的无效词等,包括中文,英文两个文档。基本包含了见的所有无效词-Invalid words in the search, including the English and Chinese documents. See all basically contains invalid word
Platform: |
Size: 4096 |
Author: iantle |
Hits:
Description: 是有关文本处理停用词的小程序,使用常用的停词列表,去掉停用词-Stop words in the text processing is a small program, using the common stop word list, remove stop words
Platform: |
Size: 4096 |
Author: sklz |
Hits:
Description: Stop Word Remover for text files
Platform: |
Size: 1024 |
Author: s |
Hits:
Description: 可以在gcc3.4-gcc4.5下编译,同时支持32和64位平台,apache模块模式支持apache1.3x,增加了中文停词表,扩充了中文分词表。-Gcc3.4-gcc4.5 can compile the same time, 32 and 64-bit platform support, apache module mode support apache1.3x, an increase of the Chinese stop word list, the expansion of the Chinese word table.
Platform: |
Size: 2925568 |
Author: syn |
Hits:
Description: 支持向量机和EM最大熵文本分类算法,压缩包中包括了测试文本词典,停用词表等-Support vector machines and EM maximum entropy text classification algorithm, compressed package includes a test text dictionary, stop word table
Platform: |
Size: 2379776 |
Author: 毛龙 |
Hits:
Description: 在文本进行分类聚类之前,必须对文本进行预处理。预处理的第一步是分词,这中间需要去除停用词。这个文件就是停用词列表-Must preprocess the text before the text classification clustering. The first step in preprocessing is the word, the middle need to remove the stop words. This file is the stop word list
Platform: |
Size: 2048 |
Author: 吴志媛 |
Hits:
Description: 自己实现的中文分词器、贝叶斯文本分类器
附分词词典、中文停用词表
用于数据挖掘学习、交流
Visual Studio 2010 开发-Realize his Chinese word segmentation, Bayesian text classifier the attached word dictionary, the Chinese stop word table is used for data mining learning, exchange of the Visual Studio 2010 development
Platform: |
Size: 10167296 |
Author: rock |
Hits:
Description: arabic stop word list
Platform: |
Size: 256000 |
Author: Abdelkader |
Hits:
Description: 中文停用词表,比较全面,有1208个,通用词就是的,是,呢,了这样的词-Chinese stop word table, more comprehensive, 1208, is a generic term, is that it, such a word
Platform: |
Size: 6144 |
Author: 距离 |
Hits:
Description: 停用词表扩展,里面有所有的常用的停用词,在信息检索时需要进行去高频词的操作,就需要停用词表,需要的下载-Disable vocabulary expansion, which have all the common stop words, in the information retrieval to the high-frequency words when the need for the operation, you need to stop word table, need to download
Platform: |
Size: 2048 |
Author: uneThe |
Hits:
Description: Text Clustering, Kmeans Cluster Stop word Handler TermVector TFIDFMeasure Tokeniser
Platform: |
Size: 7168 |
Author: Rajesh |
Hits:
Description: 用于对文件的停用词删除,可以对文件中出现频率过高,没有用的字、词进行剔除-Stop words for file deletion,Can occur too frequently, no use of the word, the word of the document be removed
Platform: |
Size: 1024 |
Author: 朱凯健 |
Hits:
Description: R语言做文本挖掘的例子,附文本库和停用词,可直接运行;
另外代码中有词云展示功能!-R language to text mining example, with a stop word text libraries and can be run directly there is another code word cloud showing features!
Platform: |
Size: 1218560 |
Author: 扛 |
Hits: