Location:
Search - ICTclas
Search list
Description: 计算所汉语词法分析系统ICTCLAS.分词正确率高达97.58%(973专家组评测),未登录词识别召回率均高于90%,其中中国人名的识别召回率接近98%处理速度为31.5Kbytes/s。ICTCLAS的特色还在于:可以根据需要输出多个高概率结果,有多种输出格式,支持北大词性标注集,973专家组给出的词性标注集合。-Calculate the Chinese Lexical Analysis System ICTCLAS. Segmentation correct rate of 97.58 percent (973 Expert Group on Evaluation), the recall rate of identification of unknown words were higher than 90 percent, of which China s name to identify the recall rate of nearly 98 percent processing speed for 31.5Kbytes/s. Also features ICTCLAS is: can output a number of high probability that there are a variety of output formats, to support the North-of-speech tagging sets, 973 expert group is given a collection of-speech tagging.
Platform: |
Size: 3140608 |
Author: 站长 |
Hits:
Description: 含有多个分词算法。毕业设计的时候获得的,希望对大家在汉字处理中能用的到。
ICTCLAS算法,中科院,对名字识别能力很强。VC开发。
CSharp分词,向前匹配加向后最大匹配,C#开发,容易扩展。
小叮咚分词,由后向前最大匹配,C#开发。
xerdoc分词,基于ICTCLAS的Java版本分词。
文本分词词典,分词的词典,可以提供分词数据源。-containing more than Segmentation. Graduation was the time of hope for all the Chinese Processing usable to. ICTCLAS algorithm, Chinese Academy of Sciences, the strong name recognition. VC development. CSharp-term, forward backward matching increase maximum matching, C# development, easy expansion. Small Columbus Word, the largest forward after the match, C# development. Xerdoc-term ICTCLAS Java-based version of Word. Text Word dictionary, Word dictionary, the word can provide a breakdown of data sources.
Platform: |
Size: 20833280 |
Author: 阿瑟多幅 |
Hits:
Description: 计算所汉语词法分析系统ICTCLAS介绍
词是最小的能够独立活动的有意义的语言成分。 但汉语是以字为基本的书写单位,词语之间没有明显的区分标记,因此,中文词语分析是中文信息处理的基础与关键。为此,我们中国科学院计算技术研究所在多年研究基础上,耗时一年研制出了汉语词法分析系统ICTCLAS(Institute of Computing Technology, Chinese Lexical Analysis System),该系统的功能有:中文分词;词性标注;未登录词识别。分词正确率高达97%以上,未登录词识别召回率均高于90%,其中中国人名的识别召回率接近98%处理速度为31.5Kbytes/s。ICTCLAS的特色还在于:可以根据需要输出多个高概率结果,有多种输出格式,支持北大词性标注集,973专家组给出的词性标注集合。该系统得到了专家的好评,并有多篇论文在国内外发表。
计算所汉语词法分析系统ICTCLAS同时还提供一套完整的动态连接库ICTCLAS.dll和相应的概率词典,开发者可以完全忽略汉语词法分析,直接在自己的系统中调用ICTCLAS,ICTCLAS可以根据需要输出多个高概率的结果,输出格式也可以定制,开发者在分词和词性标注的基础上继续上层开发。-calculation Chinese lexical analysis system ICTCLAS introduced the term is the smallest independent of meaningful activities language components. It is Chinese characters written for the basic unit, the word no clear distinction between markers, therefore, the Chinese term analysis of the Chinese information processing infrastructure and key. To this end, we CAS Institute of Computing Technology based on years of research, 976,000 developed the Chinese lexical analysis system ICTCLAS (Institute of Compu Hosiery Technology, Chinese Lexical Analysis System), the system functions : the Chinese word; tagging; Unknown word recognition. Word accuracy rate of as high as 97%, unknown word recognition recall rate is higher than 90%. these names identify the recall rate of nearly 98% for the proce
Platform: |
Size: 112640 |
Author: 郑昀 |
Hits:
Description: ICTCLAS的JNI调用接口文件:
Title:ICTCLAS Caller
* <p>Description:do chinese word segmentation.don t change the pakage and CLASS name, orelse you can t use it.
* 请不要改变包名、类名以及native的方法名,否则调用将失效。
* 由于ICTCLAS本身存在很多鲁棒性问题,调用segSentence时,string参数请保证不要过长或带有乱码。调用次数过多(如处理几十G的数据)会有可能造成内存溢出。
* 故基本只能用于较小规模数据(相对几十G来说)。
* 请运行时设置jvm足够的堆栈空间。-ICTCLAS calling the JNI interface documentation : Title : Caller* ICTCLAS
Platform: |
Size: 2048 |
Author: 郑昀 |
Hits:
Description: 采用MyEclipse3.1+JDK1.5编译通过
中文词法分析-MyEclipse3.1 JDK1.5 compiler used by the Chinese lexical analysis
Platform: |
Size: 2467840 |
Author: 张磊 |
Hits:
Description: 基于中科院的ICTCLAS实现中文分词系统
开发工具是JAVA.经测试,效果很好.-ICTCLAS based on the realization of the Chinese Academy of Sciences Chinese word segmentation system is the Java development tools. Tested, good results.
Platform: |
Size: 1049600 |
Author: 木鱼 |
Hits:
Description: 中科院分词,可供直接使用,源于中科院中文分词网-CAS segmentation for direct use, and from the Chinese Academy of the Chinese word segmentation Network
Platform: |
Size: 65536 |
Author: 李华 |
Hits:
Description: Sharp ICTCLAS分词系统,对ICTCLAS分词系统进行了详尽的分析,并将代码移植到C#。-Sharp ICTCLAS segmentation system, right ICTCLAS segmentation system for a detailed analysis, and code to C# transplantation.
Platform: |
Size: 2910208 |
Author: 郝云龙 |
Hits:
Description: 对ICTCLAS源码进行了详尽的分析,对系统的理解相当有帮助。-right ICTCLAS source for a detailed analysis to understand the system very helpful.
Platform: |
Size: 696320 |
Author: 郝云龙 |
Hits:
Description: 汉语词法分析系统ICTCLAS windows版本,我将FreeICTCLASLinux移植到windows下,开发环境VS2005-Chinese lexical analysis system ICTCLAS windows version I will FreeICTCLASLinux transplant windows, VS2005 Development Environment
Platform: |
Size: 2746368 |
Author: zhoushusen |
Hits:
Description: 中文分词, N-最短路径算法
ICTCLAS研究学习组
http://groups.google.com/group/ictclas?msg=subscribe-Chinese word segmentation, N-shortest path algorithm ICTCLAS Studies Group http :// groups.google.com/group/sub ictclas msg = scribe
Platform: |
Size: 72704 |
Author: Jian Ye |
Hits:
Description: k Shortest Paths
David Eppstein s method
ICTCLAS研究学习组
http://groups.google.com/group/ictclas?msg=subscribe-k Shortest Paths David Eppstein's method IC TCLAS Studies Group http :// groups.google.com/group/sub ictclas msg = scribe
Platform: |
Size: 19456 |
Author: Jian Ye |
Hits:
Description: ictclas java实现 很不错的 包含全部所需的文件包 能对句子进行详细的词法分析.-ictclas java achieve very good contain all the necessary documents to the sentence package detailed Lexical analysis.
Platform: |
Size: 3646464 |
Author: wanglch |
Hits:
Description: 计算所最新的2007.03新出的包含多种语言的切词算法ICTCLAS-latest calculation of 2007.03 contains new multilingual segmentation algorithm ICTCLAS
Platform: |
Size: 3289088 |
Author: godtroop |
Hits:
Description: 附件是c++版本的ICTCLAS向.net的移植,如果你要使用C#编写索引程序,这是个不错的选择-c annex to the version of the ICTCLAS. Net transplant. If you want to use C# indexing, this is a good choice
Platform: |
Size: 80896 |
Author: Edgard |
Hits:
Description: ICTCLAS系统的vc++和java 源代码和调用示例-ICTCLAS system vc++ And java source code and call the sample
Platform: |
Size: 65536 |
Author: 奚士佳 |
Hits:
Description: Java中lucene分词需要的ICTCLAS.dll文件和data词库-Java Lucene participle in need ICTCLAS.dll documents and data thesaurus
Platform: |
Size: 2225152 |
Author: suncity |
Hits:
Description: 中科院的分词系统ictclas源码,自由开放的源码,很好-ictclas code
Platform: |
Size: 80896 |
Author: shiyc |
Hits:
Description: ICTCLAS分词,来自中科院的分词器,可以提取指纹的等功能-ICTCLAS
Platform: |
Size: 7545856 |
Author: robert |
Hits:
Description: 中科院ictclas分词系统的LINUX版本,非常不错的免费中文分词工具。-CAS ictclas Segmentation System LINUX version.
Platform: |
Size: 2806784 |
Author: |
Hits: