Location:
Search - Word clustering
Search list
Description: 模糊聚类分析算法fuzzy_k_means,主要用于数据挖掘领域.-fuzzy clustering algorithm fuzzy_k_means, mainly for data mining areas.
Platform: |
Size: 3221 |
Author: 许朝 |
Hits:
Description: 模糊聚类分析算法fuzzy_k_means,主要用于数据挖掘领域.-fuzzy clustering algorithm fuzzy_k_means, mainly for data mining areas.
Platform: |
Size: 3072 |
Author: 许朝 |
Hits:
Description: [VC界面一字棋.rar] - 用人工智能的αβ剪枝算法实现,界面整洁漂亮,人机各为一方,三子连成一线即赢.. [FHC.rar] - 一个简单的聚类界面 是FHC和FCM的聚类算法比较 (FHC...
-[VC interface word game. Rar]- αβ pruning used artificial intelligence algorithm, the interface clean and beautiful, man-machine each side, three sons and even a line that is to win .. [FHC.rar]- a simple clustering interface is the FHC and the FCM clustering algorithm (FHC. ..
Platform: |
Size: 27648 |
Author: 11 |
Hits:
Description: 提出了一种快速准确车辆牌照的分割方法。首先利用形态学算子获取车牌的候选区域,剔除较小的和较大的区域;对保留的候选区域利用Trajkovic算法获取角点;最后对检测后的结果聚类,从而分割出包含车牌区域的子图像。-A fast and accurate method of vehicle license division. First of all, the use of morphological operators to obtain license plate candidate regions, excluding the smaller and larger areas to retain the use of the candidate region to obtain Corner Trajkovic algorithm Finally, after testing the results of clustering, which contains the license plate segmentation region sub-image.
Platform: |
Size: 198656 |
Author: jiangkai |
Hits:
Description: 此程序实现了如何在TXT或WORD文档中进行数据挖掘,在文本中提取有用信息-The realization of this procedure how to TXT or WORD document to carry out data mining, in the text to extract useful information
Platform: |
Size: 384000 |
Author: sam |
Hits:
Description: 一个自然语言处理的Java开源工具包。LingPipe目前已有很丰富的功能,包括主题分类(Top Classification)、命名实体识别(Named Entity Recognition)、词性标注(Part-of Speech Tagging)、句题检测(Sentence Detection)、查询拼写检查(Query Spell Checking)、兴趣短语检测(Interseting Phrase Detection)、聚类(Clustering)、字符语言建模(Character Language Modeling)、医学文献下载/解析/索引(MEDLINE Download, Parsing and Indexing)、数据库文本挖掘(Database Text Mining)、中文分词(Chinese Word Segmentation)、情感分析(Sentiment Analysis)、语言辨别(Language Identification)等API。-A natural language processing of the Java open-source toolkit. LingPipe currently have a lot of useful features, including Subject Classification (Top Classification), Named Entity Recognition (Named Entity Recognition), part of speech tagging (Part-of Speech Tagging), sentence detection problem (Sentence Detection), spell-checking query (Query Spell Checking), interest in the phrase detection (Interseting Phrase Detection), Cluster (Clustering), Character Modeling Language (Character Language Modeling), medical literature to download/analysis/index (MEDLINE Download, Parsing and Indexing), text mining database (Database Text Mining), Chinese word segmentation (Chinese Word Segmentation), emotional analysis (Sentiment Analysis), language identification (Language Identification), such as API.
Platform: |
Size: 4669440 |
Author: 张国栋 |
Hits:
Description: state of art language modeling methods:
An Empirical Study of Smoothing Techniques for Language Modeling.pdf
BLEU, a Method for Automatic Evaluation of Machine Translation.pdf
Class-based n-gram models of natural language.pdf
Distributed Language Modeling for N-best List Re-ranking.pdf
Distributed Word Clustering for Large Scale Class-Based Language Modeling in.pdf
-state of art language modeling methods: An Empirical Study of Smoothing Techniques for Language Modeling.pdfBLEU, a Method for Automatic Evaluation of Machine Translation.pdfClass-based n-gram models of natural language.pdfDistributed Language Modeling for N-best List Re-ranking . pdfDistributed Word Clustering for Large Scale Class-Based Language Modeling in.pdf
Platform: |
Size: 2016256 |
Author: wen6860 |
Hits:
Description: 聚类的资料 word版 包含很多算法源代码-Clustering data word version of the source code contains many algorithms
Platform: |
Size: 16384 |
Author: 刘瑶 |
Hits:
Description: 关于聚类的几个算法,在做方字识别时用到过-With regard to clustering of several algorithms, in word recognition, when used to do that too
Platform: |
Size: 50176 |
Author: 胡刚 |
Hits:
Description: java文本聚类程序代码文件,实现文本聚类功能,分词。-text clustering java code files to achieve text clustering features, sub-word.
Platform: |
Size: 9216 |
Author: wang |
Hits:
Description: 基于关键词的Web文档自动分类算法研究,文档关键词,语义相似度,聚类算法,知网,拓扑网络图,中文分词-Keyword-based Web Document Classification Algorithm, document keywords, semantic similarity, clustering algorithm, HowNet, topological network diagrams, Chinese word segmentation
Platform: |
Size: 2123776 |
Author: 王三 |
Hits:
Description: MS WORD document describing & comparing k means clustering and affinity propagation
Platform: |
Size: 524288 |
Author: Thams |
Hits:
Description: 用bayes实现的聚类算法,分词采用的是SharpICTCLAS分词系统 1.0-Achieved using bayes clustering algorithm, word segmentation is used SharpICTCLAS System 1.0
Platform: |
Size: 13979648 |
Author: Fu |
Hits:
Description: 这是关于谱聚类在汉字聚类领域应用的文章,谱聚类表现出比k-means更好的聚类效果。-This is the article on the spectral clustering in the field of Chinese characters clustering, spectral clustering performance better than the k-means clustering effect.
Platform: |
Size: 281600 |
Author: flint |
Hits:
Description: 用c#方法描述了话题识别(话题跟踪与检测)的过程,主要是提取特征词、特征词词频计算、权重计算(tfidf方法),进行相似度计算,最后聚类-C# method describes the process of topic identification (topic tracking and detection), the word feature extraction, feature words word frequency calculation, weight to calculate methods (tfidf), similarity calculation, the final clustering
Platform: |
Size: 3072 |
Author: dai |
Hits:
Description: 文本聚类的java算法,包括文本的预处理、特征词提取、词频统、权重计算、文本聚类等-Java text clustering algorithms, including preprocessing of text characteristic word extraction, word frequency system, the weight calculation, text clustering
Platform: |
Size: 10240 |
Author: dai |
Hits:
Description: 在文本进行分类聚类之前,必须对文本进行预处理。预处理的第一步是分词,这中间需要去除停用词。这个文件就是停用词列表-Must preprocess the text before the text classification clustering. The first step in preprocessing is the word, the middle need to remove the stop words. This file is the stop word list
Platform: |
Size: 2048 |
Author: 吴志媛 |
Hits:
Description: LBG分类算法
用初始室心随机法和扰动因子分裂法两种方法,比较不同方法不同参数设置时的分类性能。
-LBG classification algorithm vector quantization: vector normalization within a certain range for a particular type, consists of two steps: first generate a codebook, which is the speech feature vector space by the first process- also known as clustering speech parameter sequence as a vector, the reference code for classified- also known as quantization. Clustering algorithm: it is relatively simple and commonly used K-means clustering algorithm. LBG is a clustering algorithm, which is generally assumed that the codebook size is fixed, and for a power of 2. Codebook is small, then expanding until it reaches the requirements. It is often an existing classification split into two subclasses, and initial value with the new code word to each subclass. LBG algorithm on random data and a certain regularity (and meet certain Gaussian distribution) data classification, and look at the performance of the LBG algorithm, the initial chamber heart random disturbance factor-secession law are two
Platform: |
Size: 86016 |
Author: zzc |
Hits:
Description: 实现中文分词并聚类输出,分词算法是自己写的以空格分词,如果有需要高级的分词算法可自己下载相关算法-Realization of the Chinese word segmentation and clustering output
Platform: |
Size: 28672 |
Author: wangke |
Hits:
Description: word2vec:谷歌的开源项目,实现从词语到向量的转换(word to vector),Linux系统下运行,需要较大规模的语料资源用作训练才能体现出很好的效果(中英文均可),并且可以实现测量两个词语之间的距离(cos值表示),词语聚类等。-word2vec: Google' s open-source projects, a word-to-vector conversion (word to vector) running under Linux system, requires a large-scale corpus resource for training in order to reflect the good results (available in English), and can measure the distance between two words (cos values indicate), word clustering.
Platform: |
Size: 113664 |
Author: sherlydunn |
Hits: