CodeBus
www.codebus.net
Search
Sign in
Sign up
Hot Search :
Source
embeded
web
remote control
p2p
game
More...
Location :
Home
Search - Chinese segmentation
Main Category
SourceCode
Documents
Books
WEB Code
Develop Tools
Other resource
Search - Chinese segmentation - List
[
Other
]
汉语切分
DL : 0
VC平台下的汉语切分小程序,是计算语言学最基础的,初学者可看一下.-VC platform under the Chinese segmentation small program, computational linguistics is the most basic, beginners can look at.
Update
: 2008-10-13
Size
: 28.02kb
Publisher
:
刘志
[
Other resource
]
segmentor_Perl
DL : 0
中文分词算法。Perl语言编写。wordlist.txt为词库。-Chinese Segmentation. Perl language. Wordlist.txt for the thesaurus.
Update
: 2008-10-13
Size
: 358.34kb
Publisher
:
kevinmou
[
Mathimatics-Numerical algorithms
]
findkey.c
DL : 0
此程序解决的问题:较好的, 并适应短字符串的中文分词算法.根据词库 发现以换行符分隔的众多标题中的 top N 关键字并以此更新词库.是一个分类分词算法 -this procedure to solve the problem : better, and adapt to the short string of Chinese Segmentation. According thesaurus found in the many separate newline heading the top key N this update and the word thesaurus. it is a classification algorithm Word
Update
: 2008-10-13
Size
: 8.55kb
Publisher
:
刘红周
[
MultiLanguage
]
dedesplit
DL : 0
中文切词,非常优秀特此推荐。是目前分词效率较高的算法-Chinese segmentation, hereby commend outstanding. Segmentation is more efficient algorithm
Update
: 2008-10-13
Size
: 662.04kb
Publisher
:
wu guangyin
[
MultiLanguage
]
stemming(porter edition)
DL : 0
中文切词程序和相关代码-Chinese segmentation procedures and related code
Update
: 2008-10-13
Size
: 4.06kb
Publisher
:
张化强
[
Other resource
]
ChSeg
DL : 0
Chinese segmentation in C
Update
: 2008-10-13
Size
: 826.66kb
Publisher
:
dongfeng
[
ELanguage
]
ICTCLAS
DL : 0
计算所汉语词法分析系统ICTCLAS.分词正确率高达97.58%(973专家组评测),未登录词识别召回率均高于90%,其中中国人名的识别召回率接近98%处理速度为31.5Kbytes/s。ICTCLAS的特色还在于:可以根据需要输出多个高概率结果,有多种输出格式,支持北大词性标注集,973专家组给出的词性标注集合。-Calculate the Chinese Lexical Analysis System ICTCLAS. Segmentation correct rate of 97.58 percent (973 Expert Group on Evaluation), the recall rate of identification of unknown words were higher than 90 percent, of which China s name to identify the recall rate of nearly 98 percent processing speed for 31.5Kbytes/s. Also features ICTCLAS is: can output a number of high probability that there are a variety of output formats, to support the North-of-speech tagging sets, 973 expert group is given a collection of-speech tagging.
Update
: 2025-02-17
Size
: 3mb
Publisher
:
站长
[
MultiLanguage
]
stemming(porter edition)
DL : 0
中文切词程序和相关代码-Chinese segmentation procedures and related code
Update
: 2025-02-17
Size
: 4kb
Publisher
:
张化强
[
AI-NN-PR
]
WordSeg
DL : 0
利用最大匹配法进行汉语句子的分词 最大匹配算法是最常用的分词算法,简单实用正确率可达到80%以上-the maximum matching method for the Chinese Sentence Word maximum matching algorithm is the most commonly used word segmentation algorithm, simple and practical accuracy rate can reach more than 80%
Update
: 2025-02-17
Size
: 72kb
Publisher
:
廖剑
[
CSharp
]
汉语分词统计
DL : 0
分词,针对汉语的分词,根据统计来实现的,可以直接使用目录即可,里面针对联合早报进行的测试,分次统计中可以包括任意目录(系统能承受得了就行),这是帮一个同学做的作业:)用asp。net + xml-Segmentation for Chinese word segmentation, according to statistics to be achieved, direct access to the directory can be, which for Lianhe test, sub-sub-statistics can include arbitrary directory (the system can accept the deregulation on the line), which is to help a fellow student to do the operation:) with asp. net+ xml
Update
: 2025-02-17
Size
: 42kb
Publisher
:
[
Other
]
汉语切分
DL : 0
VC平台下的汉语切分小程序,是计算语言学最基础的,初学者可看一下.-VC platform under the Chinese segmentation small program, computational linguistics is the most basic, beginners can look at.
Update
: 2025-02-17
Size
: 28kb
Publisher
:
刘志
[
AI-NN-PR
]
segmentor_Perl
DL : 0
中文分词算法。Perl语言编写。wordlist.txt为词库。-Chinese Segmentation. Perl language. Wordlist.txt for the thesaurus.
Update
: 2025-02-17
Size
: 358kb
Publisher
:
kevinmou
[
Mathimatics-Numerical algorithms
]
findkey.c
DL : 0
此程序解决的问题:较好的, 并适应短字符串的中文分词算法.根据词库 发现以换行符分隔的众多标题中的 top N 关键字并以此更新词库.是一个分类分词算法 -this procedure to solve the problem : better, and adapt to the short string of Chinese Segmentation. According thesaurus found in the many separate newline heading the top key N this update and the word thesaurus. it is a classification algorithm Word
Update
: 2025-02-17
Size
: 8kb
Publisher
:
刘红周
[
VC/MFC
]
ChSeg
DL : 0
Chinese segmentation in C
Update
: 2025-02-17
Size
: 826kb
Publisher
:
dongfeng
[
MultiLanguage
]
TextFeatureExtractor
DL : 0
集成了中科院切词技术的中文切词工具,可以进行文档处理-Integration of the Chinese Academy of Sciences of the Chinese segmentation technology segmentation tool can document processing
Update
: 2025-02-17
Size
: 2.1mb
Publisher
:
hanwangzhang
[
CSharp
]
Segmentation
DL : 0
用HMM实现的中文分词程序,用C#实现的。-HMM to achieve with the Chinese word segmentation
Update
: 2025-02-17
Size
: 4.13mb
Publisher
:
dauberfly123
[
MultiLanguage
]
imdict-chinese-analyzer
DL : 1
imdict-chinese-analyzer 是 imdict智能词典 的智能中文分词模块,算法基于隐马尔科夫模型(Hidden Markov Model, HMM),是中国科学院计算技术研究所的ictclas中文分词程序的重新实现(基于Java),可以直接为lucene搜索引擎提供简体中文分词支持。-imdict-chinese-analyzer is a smart imdict Chinese Dictionary smart module segmentation algorithm based on Hidden Markov Model (Hidden Markov Model, HMM), the Chinese Academy of Sciences Institute of Computing Technology of Chinese word segmentation ictclas process re-implement (based on Java ), can be directly provided for the lucene search engine support for Simplified Chinese word segmentation.
Update
: 2025-02-17
Size
: 3.11mb
Publisher
:
王同
[
MultiLanguage
]
Chinese-Segmentation
DL : 0
自己编写的中文分词源程序,用vc++编写,附有完整的文档,以及标准的分词数据库-I have written the source code of the Chinese word segmentation, using vc++ to prepare, with complete documentation, as well as sub-standard speech database
Update
: 2025-02-17
Size
: 8.58mb
Publisher
:
tanyi
[
Other
]
Chinese-segmentation
DL : 0
有关中文分词很不错的论文哦。。基于中文信息处理的古代汉语分词研究-A very good on paper about Chinese word segmentation
Update
: 2025-02-17
Size
: 101kb
Publisher
:
myx
[
JSP/Java
]
chinese-_segmentation
DL : 0
中文分词算法介绍,正向最大匹配。word-word for chinese segmentation algrithm
Update
: 2025-02-17
Size
: 59kb
Publisher
:
pud
«
1
2
3
4
5
6
7
8
9
10
...
34
»
CodeBus
is one of the largest source code repositories on the Internet!
Contact us :
1999-2046
CodeBus
All Rights Reserved.