Description: An important corpus, well suited for use as the database file of your word segmentation program.
To Search:
- [Qiyi] - greatest probability- term data structur
- [quanwenjiansuo] - text retrieval procedures, the longest m
- [SVM.Rar] - SVM text classifier source, English inte
- [HanZiFreq] - words character frequency statistics, so
- [SplitCNWord] - a Chinese word achieve and demonstrate t
- [DBSCAN_JAVA] - DBSCAN algorithm JAVA, the D : \ text.tx
- [SegAndPosTools] - achieve Corpus segmentation, and eigenva
- [WordSeg] - This is a Chinese word segmentation proc
- [VSM] - Vector space model algorithm, given a se
- [chinese-text] - Text classification corpus, edited manua
File list (Check if you may need any files):