- Category:
- AI-NN-PR
- Tags:
- [Java] [Source code]
- File Size:
- 13.91mb
- Update:
- 2013-01-06
- Downloads:
- 0 Times
- Uploaded by:
- 张忠辉
Description: A Chinese word segmentation program, including source code, a dictionary, and a training set. The algorithm is simple, efficient, and accurate. The package includes a new segmentation method that combines a corpus with dictionaries. With the corpus split 2:1 into a training set and a test set, plus an external dictionary, accuracy can reach 95%. Suitable for beginners to learn from, and also for simple word segmentation applications.
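The description and file names such as UnigramSeg and WordGraph suggest a frequency-based segmenter, but the archive's actual code is not shown here. The following is only a minimal sketch of one common approach, maximum-probability unigram segmentation by dynamic programming, assuming word counts come from the training corpus and the external dictionary. The class name UnigramSketch, its methods, and the add-one smoothing are illustrative assumptions, not the author's implementation.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch; not the UnigramSeg class from the archive. */
public class UnigramSketch {
    private final Map<String, Integer> freq = new HashMap<>();
    private long total = 0;   // sum of all word counts
    private int maxLen = 1;   // longest known word, bounds the inner loop

    /** Adds a word with a count, e.g. from a segmented corpus or a dictionary. */
    public void add(String word, int count) {
        freq.merge(word, count, Integer::sum);
        total += count;
        maxLen = Math.max(maxLen, word.length());
    }

    /** Log-probability with add-one smoothing (an assumption of this sketch). */
    private double logProb(String word) {
        int c = freq.getOrDefault(word, 0);
        return Math.log((c + 1.0) / (total + freq.size() + 1.0));
    }

    /** Returns the split of s with the highest product of word probabilities. */
    public List<String> segment(String s) {
        int n = s.length();
        double[] best = new double[n + 1]; // best[i]: best score for s[0..i)
        int[] prev = new int[n + 1];       // prev[i]: start of the last word
        for (int i = 1; i <= n; i++) {
            best[i] = Double.NEGATIVE_INFINITY;
            for (int j = Math.max(0, i - maxLen); j < i; j++) {
                String w = s.substring(j, i);
                // Candidates are known words plus any single character.
                if (i - j > 1 && !freq.containsKey(w)) continue;
                double score = best[j] + logProb(w);
                if (score > best[i]) {
                    best[i] = score;
                    prev[i] = j;
                }
            }
        }
        // Walk the back-pointers from the end, then restore left-to-right order.
        List<String> out = new ArrayList<>();
        for (int i = n; i > 0; i = prev[i]) {
            out.add(s.substring(prev[i], i));
        }
        Collections.reverse(out);
        return out;
    }
}
```

With made-up counts in which 研究, 生命, and 起源 are frequent and 研究生 and 命 are rare, segment("研究生命起源") picks 研究 / 生命 / 起源 over 研究生 / 命 / 起源, because the first split has the higher total probability. A bigram variant would score each word conditioned on the word before it rather than independently.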
File list (check whether you need any of these files):
ChineseSegment\.classpath
..............\.fatjar
..............\.project
..............\.settings\org.eclipse.core.resources.prefs
..............\bin\org\tseg\seg\199801q.txt
..............\...\...\....\...\BigramSeg.class
..............\...\...\....\...\BiWordGraph.class
..............\...\...\....\...\biWordRate.out
..............\...\...\....\...\Count.class
..............\...\...\....\...\CountBiGram.class
..............\...\...\....\...\MergeNamedEntity.class
..............\...\...\....\...\SegModel.class
..............\...\...\....\...\SplitSentenceTest.class
..............\...\...\....\...\SplitSentence_seg.class
..............\...\...\....\...\UnigramSeg.class
..............\...\...\....\...\UnigramSegTest.class
..............\...\...\....\...\wordFrequence.out
..............\...\...\....\...\WordGraph.class
..............\...\...\....\...\wordRate.out
..............\...\...\....\...\词典.txt
..............\src\org\tseg\seg\199801q.txt
..............\...\...\....\...\BigramSeg.java
..............\...\...\....\...\BiWordGraph.java
..............\...\...\....\...\biWordRate.out
..............\...\...\....\...\Count.java
..............\...\...\....\...\CountBiGram.java
..............\...\...\....\...\MergeNamedEntity.java
..............\...\...\....\...\SegModel.java
..............\...\...\....\...\SplitSentenceTest.java
..............\...\...\....\...\SplitSentence_seg.java
..............\...\...\....\...\UnigramSeg.java
..............\...\...\....\...\UnigramSegTest.java
..............\...\...\....\...\wordFrequence.out
..............\...\...\....\...\WordGraph.java
..............\...\...\....\...\wordRate.out
..............\...\...\....\...\词典.txt
..............\bin\org\tseg\seg
..............\src\org\tseg\seg
..............\bin\org\tseg
..............\src\org\tseg
..............\bin\org
..............\src\org
..............\.settings
..............\bin
..............\lib
..............\src
ChineseSegment