Description: 简单处理两个句子中的相似度对比问题,具体用法很简单,在main函数中写入句子即可-Simple treatment of the similarity of two sentences merely a matter of comparison, the specific usage is very simple, in the main function can write sentences Platform: |
Size: 1506304 |
Author:chen |
Hits:
Description: JAVA实现文本聚类,用到TF/IDF权重,用余弦夹角计算文本相似度,用k-means进行数据聚类等数学和统计 知识。-JAVA realization of text clustering, using TF/IDF weight, calculated using cosine angle between the text of similarity, using k-means clustering for data such as mathematical and statistical knowledge. Platform: |
Size: 1024 |
Author:优优 |
Hits:
Description: 文本相似度计算余弦相似度代码,计算文本相似度,用于搜索引擎-Cosine similarity of text similarity computation code, the text of the similarity calculation for the search engine Platform: |
Size: 5120 |
Author:li xiaowen |
Hits:
Description: 我做的用Kmeans方法,分别采用欧式距离。夹角余弦,和度量函数的方法来表示两点的相似度-I do use Kmeans methods were used Euclidean distance. Angle cosine, and methods of measurement functions to represent the similarity of two Platform: |
Size: 55296 |
Author:bing |
Hits:
Description: java实现的文本相似度系统,使用向量空间模型以及余弦相似度距离公式,实测可以实现2篇文本的相似度计算且有一定的效果。-Java text similarity system, using the vector space model and the cosine similarity distance formula, the measured results can be achieved two similarity of text and have some effect. Platform: |
Size: 48626688 |
Author:王一大 |
Hits:
Description: 算法思想:提取文档的TF/IDF权重,然后用余弦定理计算两个多维向量的距离来计算两篇文档的相似度,用标准的k-means算法就可以实现文本聚类。源码为java实现(Algorithm idea: extract the TF/IDF weight of the document, then calculate the distance between two multidimensional vectors by cosine theorem, calculate the similarity of the two documents, and achieve the text clustering with the standard k-means algorithm. Source code for Java implementation) Platform: |
Size: 15360 |
Author:startrek
|
Hits: