Location:
Search - 信息抽取
Search list
Description: 介绍信息抽取领域的发展。第2.1.节比较了信息抽取和信息检索的区别;第2.2.节介绍IE的历史。接下来两节解释评价IE系统的指标和常用的两派技术方法。信息抽取技术所处理的文本类型将在第2.5.节中说明。第2.6.节描述信息抽取技术可利用的网页特征。
Platform: |
Size: 155648 |
Author: aaaccceee@126.com |
Hits:
Description: 概率句法分析器对于统计自然语言处理的很多高层应用,如统计机器翻译、问答系统、信息抽取、文本挖掘等都是至关重要的,直接决定这些应用系统的最终性能。本系统是一个概率型的Chart分析器。系统的分析算法是采用了多种优化策略。分析结果是概率最大的一棵分析树。在概率模型方面,本系统在一定程度上突破了pcfg的上下文无关假设,引入了结构上下文条件,使得分析结果正确率有了明显提高。在使用宾州中文树库进行的实验中,我们的分析器的标记召回率和标记精确率平均在75%-80%左右。在使用一个短句树库进行的实验中,两个指标都在90%以上。概率句法分析既需要建立合理的概率模型,又需要积累树库等语言资源。我们把所做的一点工作进行开放,就是希望抛弃闭门造车的做法,集思广益,推动这个基础领域的发展,使汉语的句法分析尽早实现实用化-probability syntax analyzer for statistical natural language processing of many senior applications, such as statistical machine translation, quiz systems, information extraction, text mining are essential, these applications directly determine the final performance. The system is a probability- based Chart analyzer. Systematic analysis algorithm is optimized using a variety of strategies. Results of the analysis is the greatest probability of a tree. The probability model, the system to some extent breakthrough in the context of pcfg unrelated to the assumption that the introduction of the context of the structural conditions, making results of the analysis accuracy rate has markedly improved. The use of Chinese tree of Pennsylvania library experiments, the analyzer markers recall rate a
Platform: |
Size: 565248 |
Author: 江鹏 |
Hits:
Description: 从预料中抽取汉字数字变成英文数字(作信息抽取用)-taken from the expected number of Chinese characters into English figures (used for information extraction)
Platform: |
Size: 27648 |
Author: 古月 |
Hits:
Description: 通过将Visio图另存为XML文件,并采用DOM的方式对其进行解析,实现将VISIO中的有用信息抽取出来。欢迎下载!-Visio plans by Save as XML documents, and use the DOM its analytical approach, the realization of VISIO the useful information extracted. Welcome to download!
Platform: |
Size: 2048 |
Author: 贝蒂 |
Hits:
Description: java实现的,基于gnu.regexp正则表达式包实现的html信息抽取程序,可以解析CiteSeer网站中的论文、作者、会议以及期刊信息。-java achieved, gnu.regexp is based on the regular expression package to achieve the html information extraction procedures, Analysis can CiteSeer site papers, authors, information meetings and journals.
Platform: |
Size: 98304 |
Author: 张志 |
Hits:
Description: 网上信息抽取技术纵览,详细介绍当前的信息抽取技术-online information extraction technology overview, detailing the current information extraction technology
Platform: |
Size: 32768 |
Author: ewewewe |
Hits:
Description: 文本分类概述 王斌老师的经典PPT。信息抽取教程-text classification outlined Bin teachers classic PPT. Information Extraction Directory
Platform: |
Size: 124928 |
Author: ewewewe |
Hits:
Description: 贝叶斯公式,在信息检索以及信息抽取中有着重要的应用,需要的下载,有问题联系我-Bayesian formula, in the information retrieval and information extraction has important applications, the need for download, there are problems contact me
Platform: |
Size: 3715072 |
Author: 刘磊 |
Hits:
Description: 利用Lixto进行可视化的信息抽取
Visual Web Information Extraction with Lixto-Lixto for the use of visual information extraction Visual Web Information Extraction with Lixto
Platform: |
Size: 200704 |
Author: math |
Hits:
Description: 基于最大熵的隐马尔可夫模型文本信息抽取,林亚平!刘云中!周顺先!陈治平!蔡立军"湖南大学计算机与通信学院!湖南长沙#$%%&-Based on Maximum Entropy of Hidden Markov Model Text Information Extraction, Ya-Ping Lin! Liu in!廃?first! Chen Zhiping! Cai-jun,
Platform: |
Size: 171008 |
Author: 刘鹏飞 |
Hits:
Description: web信息抽取技术 web信息抽取技术 web信息抽取技术 web信息抽取技术-web information extraction technology web information extraction technology web information extraction technology web information extraction technology web information extraction technology web information extraction technology
Platform: |
Size: 135168 |
Author: howard |
Hits:
Description: web信息抽取技术参考1 -web information extraction Technical Reference 1 web information extraction technology reference 1
Platform: |
Size: 273408 |
Author: howard |
Hits:
Description: web信息抽取技术参考2web信息抽取技术参考1
Platform: |
Size: 30720 |
Author: howard |
Hits:
Description: web信息抽取技术参考3web信息抽取技术参考1 web信息抽取技术参考1-web information extraction information extraction 3web Technical Reference Technical Reference 1 web information extraction technology reference 1
Platform: |
Size: 30720 |
Author: howard |
Hits:
Description: W4F 工具包,用于web信息抽取,可以自动生成wrapper-W4F toolkit for web information extraction, you can automatically generate wrapper
Platform: |
Size: 583680 |
Author: xielang |
Hits:
Description: 一个经典的页面数据采集工具RoadRunner.其关键思想是通过处理页面比较得到的mismatch来不断地修改当前的模板,最终推导出能够覆盖例子页面的模板,然后根据模板来实现对类似
页面的信息抽取。
-A classic page data collection tool for RoadRunner. The key idea is to be compared through the pages deal with the mismatch to continue to modify the current template, and ultimately derived from examples to cover page template, and then come to realize in accordance with the template page for similar information extraction.
Platform: |
Size: 2313216 |
Author: 陈伟 |
Hits:
Description: 一款十分好用的网页信息抽取工具。利用了已经存在的诸如XSLT,Xquery等技术,很好地实现了基于xml/html的网页的数据抽取。-A very useful tool for information extraction page. Use of already existing, such as XSLT, Xquery, such as technology, realize very well based on the xml/html pages of data extraction.
Platform: |
Size: 5747712 |
Author: 陈伟 |
Hits:
Description: 近几年各种信息抽取资料汇集,全是国外资料,比较先进的资料,值得一看!-In recent years, brings together a variety of information collected information, all data are foreign, more advanced information, worth a visit!
Platform: |
Size: 2666496 |
Author: 王花 |
Hits:
Description: 近几年各种信息抽取资料汇集,全是国外资料,比较先进的资料,这是第二部分.-In recent years, brings together a variety of information collected information, all data are foreign, more advanced information, This is the second part.
Platform: |
Size: 5462016 |
Author: 王花 |
Hits: