Description: gmeans: clustering with first variation and splitting. Gmeans is a text clustering algorithm that supports three similarity measures: cosine, Euclidean, and KL. The text data is stored in sparse matrix form.
Size: 71680
Author: 修宇
Hits:
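For readers unfamiliar with the KL measure named above, here is a minimal sketch of Kullback-Leibler divergence between two discrete distributions. It is not taken from the gmeans package; the function name `kl_divergence` and the assumption that both inputs are already normalized probability lists (with `q[i] > 0` wherever `p[i] > 0`) are illustrative only.

```python
import math

def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions given as equal-length lists.

    Assumes both lists sum to 1 and that q[i] > 0 wherever p[i] > 0.
    Terms with p[i] == 0 contribute nothing, by the usual convention.
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Two word distributions over the same three-term vocabulary (made-up values).
doc_a = [0.5, 0.25, 0.25]
doc_b = [0.25, 0.5, 0.25]
print(kl_divergence(doc_a, doc_b))
```

Note that KL divergence is not symmetric, so `kl_divergence(p, q)` and `kl_divergence(q, p)` generally differ.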
Description: A simple program for comparing the similarity of two sentences. Usage is straightforward: write the sentences into the main function.
Size: 1506304
Author: chen
Hits:
Description: A Java implementation of text clustering. It uses TF/IDF weights, computes text similarity from the cosine of the angle between document vectors, and clusters the data with k-means, drawing on the related mathematical and statistical knowledge.
Size: 1024
Author: 优优
Hits:
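The TF/IDF weighting and cosine comparison described in the entry above can be sketched as follows. This is an illustrative Python sketch, not the Java code from the package; the names `tf_idf_vectors` and `cosine`, the sample documents, and the particular TF and IDF formulas (relative term frequency times `log(N / df)`) are assumptions chosen for brevity.

```python
import math
from collections import Counter

def tf_idf_vectors(docs):
    """Return one sparse {term: weight} dict per tokenized document."""
    n = len(docs)
    # Document frequency: how many documents contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Relative term frequency times log inverse document frequency.
        vectors.append({t: (c / len(doc)) * math.log(n / df[t])
                        for t, c in tf.items()})
    return vectors

def cosine(u, v):
    """Cosine of the angle between two sparse vectors; 0.0 if either is all zeros."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

docs = [
    "the cat sat on the mat".split(),
    "the cat lay on the rug".split(),
    "stock prices fell sharply today".split(),
]
vecs = tf_idf_vectors(docs)
print(cosine(vecs[0], vecs[1]), cosine(vecs[0], vecs[2]))
```

The resulting similarities (or the weighted vectors themselves) are what a k-means step would then operate on.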
Description: A program that computes the similarity between texts for use in text clustering. It works from the already-known feature vector of each text and computes the similarity as a cosine value.
Size: 1024
Author: effi
Hits:
Description: Cosine similarity code for computing text similarity, intended for use in search engines.
Size: 5120
Author: li xiaowen
Hits:
Description: C# source code for the TF-IDF algorithm for computing text vectors, together with source code for computing text similarity using cosine distance.
Size: 28672
Author: alan
Hits:
Description: A cosine similarity function that receives a video frame object and calculates the cosine similarity. Please enjoy.
Size: 1024
Author: Adam
Hits:
Description: Counts the frequency of each of the 26 letters in any English document to produce a frequency matrix. Second, it fuses the features of the first and second singular value components to obtain complex feature vectors that reflect not only the statistical frequency but also the sequential structure of the letters. Finally, the cosine similarity of texts is used to measure the similarity between the query and the documents.
Size: 1024
Author: gaoshilong
Hits:
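The letter-frequency step in the entry above can be sketched as below. This covers only the counting stage; the singular value fusion mentioned in the description is not reproduced here. The function names `letter_frequencies` and `frequency_matrix` are hypothetical, and normalizing counts to relative frequencies is an assumption.

```python
def letter_frequencies(text):
    """Return a 26-element list of relative frequencies for the letters a to z."""
    counts = [0] * 26
    for ch in text.lower():
        if "a" <= ch <= "z":  # ignore digits, punctuation, and non-ASCII letters
            counts[ord(ch) - ord("a")] += 1
    total = sum(counts)
    return [c / total if total else 0.0 for c in counts]

def frequency_matrix(texts):
    """One row per document, one column per letter."""
    return [letter_frequencies(t) for t in texts]

matrix = frequency_matrix(["abba", "Hello!"])
print(matrix[0][:2])
```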
Description: The test text; the well-known Gettysburg English text is used as the test text. Second, it fuses the features of the first and second singular value components to obtain complex feature vectors that reflect not only the statistical frequency but also the sequential structure of the letters. Finally, the cosine similarity of texts is used to measure the similarity between the query and the documents.
Size: 2048
Author: gaoshilong
Hits:
Description: The final test program, which reports the precision and recall of text retrieval. Finally, the cosine similarity of texts is used to measure the similarity between the query and the documents. The data comparison indicates that the algorithm gives good experimental results. Moreover, it outperforms the classic LSA retrieval algorithm in precision and operational efficiency.
Size: 1024
Author: gaoshilong
Hits:
Description: My implementation of the K-means method, which uses Euclidean distance, the cosine of the angle, and a metric function, respectively, to represent the similarity between two points.
Size: 55296
Author: bing
Hits:
Description: C# source code for cosine similarity calculation, using classic improved tf_idf feature values.
Size: 17408
Author: hanpu
Hits:
Description: Data mining code for calculating correlation, Euclidean distance, and cosine similarity between objects and attributes.
Size: 183296
Author: Aliyu
Hits:
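The three measures named in the entry above can be compared on plain numeric vectors as in the sketch below. It is illustrative only and not taken from the package: the function names are hypothetical, "correlation" is assumed to mean Pearson correlation, and returning 0.0 for a zero-length vector is a convention chosen here.

```python
import math

def euclidean_distance(x, y):
    """Straight-line distance between two equal-length numeric vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def cosine_similarity(x, y):
    """Cosine of the angle between x and y; 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    if norm_x == 0.0 or norm_y == 0.0:
        return 0.0
    return dot / (norm_x * norm_y)

def pearson_correlation(x, y):
    """Pearson correlation, computed as the cosine of the mean-centered vectors."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    return cosine_similarity([a - mean_x for a in x], [b - mean_y for b in y])

x, y = [1.0, 2.0, 3.0], [2.0, 4.0, 6.0]
print(euclidean_distance(x, y), cosine_similarity(x, y), pearson_correlation(x, y))
```

The example vectors point in the same direction, so cosine similarity and correlation are both at their maximum even though the Euclidean distance is nonzero, which is the practical difference between the measures.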
Description: Cosine similarity code for computing text similarity, intended for use in search engines.
Size: 5120
Author: rterwill
Hits: