Search - SogouW.20061127

Search list

[Search Engine] SogouW.20061127

Description: This Internet lexicon is derived from a statistical analysis of the Chinese-language web corpus indexed by the SOGOU search engine. The statistics were compiled in October 2006 and cover a corpus of more than 100 million web pages. The lexicon contains about 150,000 high-frequency entries; each entry is marked with its word frequency and its commonly used part-of-speech (POS) tags. Significance of the corpus statistics: they reflect word frequency and POS usage in the Chinese-language Internet environment. Applications: Chinese POS tagging, word frequency analysis, and similar tasks. POS categories: N noun, V verb, ADJ adjective, ADV adverb, CLAS classifier (measure word), ECHO onomatopoeia, STRU structural particle, AUX auxiliary particle, COOR coordinating conjunction, CONJ conjunction, SUFFIX suffix, PREFIX prefix, PREP preposition, PRON pronoun, QUES interrogative, NUM numeral, IDIOM idiom.
Platform: | Size: 1259141 | Author: 17521 | Hits:

[Search Engine] SogouW.20061127

Description: This Internet lexicon is derived from a statistical analysis of the Chinese-language web corpus indexed by the SOGOU search engine. The statistics were compiled in October 2006 and cover a corpus of more than 100 million web pages. The lexicon contains about 150,000 high-frequency entries; each entry is marked with its word frequency and its commonly used part-of-speech (POS) tags. Significance of the corpus statistics: they reflect word frequency and POS usage in the Chinese-language Internet environment. Applications: Chinese POS tagging, word frequency analysis, and similar tasks. POS categories: N noun, V verb, ADJ adjective, ADV adverb, CLAS classifier (measure word), ECHO onomatopoeia, STRU structural particle, AUX auxiliary particle, COOR coordinating conjunction, CONJ conjunction, SUFFIX suffix, PREFIX prefix, PREP preposition, PRON pronoun, QUES interrogative, NUM numeral, IDIOM idiom.
Platform: | Size: 1258496 | Author: 17521 | Hits:
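
The descriptions above name word frequency analysis as an application, so here is a minimal Python sketch of how such a lexicon might be read. The listing does not state the file layout; the tab-separated "word, frequency, comma-separated POS tags" record assumed below, the function names, and the sample frequencies are all hypothetical and used only for illustration. Only the tag abbreviations come from the description.

```python
from collections import Counter

# Tag abbreviations taken from the description; everything else here is assumed.
KNOWN_TAGS = {
    "N", "V", "ADJ", "ADV", "CLAS", "ECHO", "STRU", "AUX", "COOR",
    "CONJ", "SUFFIX", "PREFIX", "PREP", "PRON", "QUES", "NUM", "IDIOM",
}


def parse_line(line):
    """Parse one record in the ASSUMED layout: word<TAB>frequency<TAB>TAG1,TAG2,"""
    parts = line.rstrip("\n").split("\t")
    word = parts[0]
    freq = int(parts[1])
    # The tag field may be missing or end with a trailing comma; drop empty items.
    tags = [t for t in parts[2].split(",") if t] if len(parts) > 2 else []
    return word, freq, tags


def tag_totals(lines):
    """Sum the frequencies of all entries carrying each known POS tag."""
    totals = Counter()
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        _, freq, tags = parse_line(line)
        for tag in tags:
            if tag in KNOWN_TAGS:
                totals[tag] += freq
    return totals


# Invented sample records; the frequencies are NOT real corpus values.
sample = ["的\t100\tSTRU,", "是\t50\tV,", "研究\t20\tN,V,"]

if __name__ == "__main__":
    for tag, total in tag_totals(sample).most_common():
        print(tag, total)
```

For the real file, the same functions could be fed an open file object instead of the sample list, provided the actual layout and text encoding are checked first.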

CodeBus www.codebus.net