Location:
Search - stop words
Search list
Description: 用于文本分类的一个程序,包括文本的预处理,过滤,分词,去停用词等步骤。编译后可运行。-For a text classification procedures, including the text of the pre-processing, filtering, segmentation, to stop words such as steps. Compiler can be run.
Platform: |
Size: 28390400 |
Author: 鲁西西 |
Hits:
Description: ...将该字符串变量与停用词表中的所有单词进行比较,若果该词在停用词表中出现过则不对其进行统计,否则在对该词进行词干抽取。
经过以上停用词、词干处理后得到的将是实际进行统计的“单词”(此时的“单词”实际上已经是所有具有相同词干的原是单词的统一代表)...
注:jar包中含有完整的java源程序代码,仅供学习参考之用,传播时请保持本软件包的完整性 ---ZHG工作室 2008.4 E-mail:wudazhg@163.com All Rights Reserved-... The string variable with the stop words in the table to compare all the words, if the term as used in the stop word table, there were no statistics of their, otherwise, in the words extracted stem. After more than stop words, stem after treatment will be the actual statistics of the word (At this point the word actually stems all have the same word used for the unification of representative) ... Note: jar package containing the complete java source code, only to learn reference, dissemination, please maintain the integrity of the package--- ZHG Studio 2008.4 E-mail: wudazhg@163.com All Rights Reserved
Platform: |
Size: 71680 |
Author: zhg |
Hits:
Description: 中英文中的常用的停用词,对文本分析有帮助的!-Chinese and English in common stop words, the text analysis help!
Platform: |
Size: 2048 |
Author: zj |
Hits:
Description: Create stop list hashmap using stoplist file for removing stop words
Platform: |
Size: 1024 |
Author: Manoj |
Hits:
Description: 用来去除英文文档中的停用词,将一些高频词从文档中删除-English documents used to remove the stop words, some high-frequency words will be deleted from the document
Platform: |
Size: 39936 |
Author: 范晓莉 |
Hits:
Description:
根据一个停用词表,输入一个词语。然后来
判断一个词语是否为停用词
-Stop words based on a table, enter a word. And then to determine whether a stop word terms
Platform: |
Size: 616448 |
Author: 彩云 |
Hits:
Description: 给一篇文章,然后根据停用词表,去除该文章的内的次用词,然后存入一个文件中。-To an article, and then form the basis of stop words to remove the article, the second term, and then into a file.
Platform: |
Size: 598016 |
Author: 张国 |
Hits:
Description: WordCloud is a visual depiction of how many times a word is used, or its frequency if you will, within a given set of words. It does this by: reading in plain text, filtering out "stop words", counting how many times a word is used, and displaying results in a Squarified Treemap.
Platform: |
Size: 63488 |
Author: wang |
Hits:
Description: 连接数据库 分词 去除停用词 计算权重值-Connect to the database to remove stop words word weighted value
Platform: |
Size: 32768 |
Author: 眭亚键 |
Hits:
Description: 停用词表扩展,里面有所有的常用的停用词,在信息检索时需要进行去高频词的操作,就需要停用词表,需要的下载-Disable vocabulary expansion, which have all the common stop words, in the information retrieval to the high-frequency words when the need for the operation, you need to stop word table, need to download
Platform: |
Size: 2048 |
Author: 张杰 |
Hits:
Description: 英文文本处理,去掉停用词,提取词干,提取文本特征向量-English text processing, removing stop words, stem extract, extract text feature vectors
Platform: |
Size: 1024 |
Author: 李沛 |
Hits:
Description: 在搜索中的无效词等,包括中文,英文两个文档。基本包含了见的所有无效词-Invalid words in the search, including the English and Chinese documents. See all basically contains invalid word
Platform: |
Size: 4096 |
Author: iantle |
Hits:
Description: 是有关文本处理停用词的小程序,使用常用的停词列表,去掉停用词-Stop words in the text processing is a small program, using the common stop word list, remove stop words
Platform: |
Size: 4096 |
Author: sklz |
Hits:
Description: 将网页中的文本提出,然后对文本分词,去停用词等处理,计算其词频-Make the page text, then the text word, to stop words such as processing, computing the word frequency
Platform: |
Size: 2048 |
Author: 吴华勤 |
Hits:
Description: This application removes all stop words from the given text document and performs stemming operation.
Platform: |
Size: 25600 |
Author: madhu |
Hits:
Description: 我实现的功能很简单,只是单个文件的检索,给出一个英文文本文件,预先准本好停用词文本,再建立一个索引表,就能实现实现文件的简单检索,检索的结果是某个单词在文本中的位置,如多次出现。就输出多个位置。
我把停用词文件记为fiel1.txt,另要检索的文件记为fiel2.txt.-I realize the function is very simple, just search a single file, given an English text file, stop words in advance to prepare for a good text, then create an index table, we can achieve a simple implementation file search, search results are a the position of the word in the text, such as the number appears. To output multiple locations. I remember as a word file to disable fiel1.txt, the other to retrieve the file denoted fiel2.txt.
Platform: |
Size: 3072 |
Author: 断剑 |
Hits:
Description: this program is a parser and removes stop words
Platform: |
Size: 1024 |
Author: Stitchh |
Hits:
Description: 英文文档去除停用词remove stop words-remove stop words for english documents
Platform: |
Size: 176128 |
Author: huangONE |
Hits:
Description: 用于对文件的停用词删除,可以对文件中出现频率过高,没有用的字、词进行剔除-Stop words for file deletion,Can occur too frequently, no use of the word, the word of the document be removed
Platform: |
Size: 1024 |
Author: 朱凯健 |
Hits:
Description: 在自然语言处理任务中常用的停用词表,可以去除中文停词(Frequently used stop lists in natural language processing tasks, Chinese stop words can be removed)
Platform: |
Size: 8192 |
Author: Levi- |
Hits: