- 1、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。。
- 2、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 3、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
- 4、该文档为VIP文档,如果想要下载,成为VIP会员后,下载免费。
- 5、成为VIP后,下载本文档将扣除1次下载权益。下载后,不支持退款、换文档。如有疑问请联系我们。
- 6、成为VIP后,您将拥有八大权益,权益包括:VIP文档下载权益、阅读免打扰、文档格式转换、高级专利检索、专属身份标志、高级客服、多端互通、版权登记。
- 7、VIP文档为合作方或网友上传,每下载1次, 网站将根据用户上传文档的质量评分、类型等,对文档贡献者给予高额补贴、流量扶持。如果你也想贡献VIP文档。上传文档
查看更多
Hsin-Hsi Chen (NTU) Chapter 13Chinese Information Extraction Technologies Hsin-Hsi Chen Department of Computer Science and Information Engineering National Taiwan University Taipei, Taiwan E-mail: hh_chen@.tw Outline Introduction to Information Extraction (IE) Chinese IE Technologies Tagging Environment for Chinese IE Applications Summary Introduction Introduction Information Extraction the extraction or pulling out of pertinent information from large volumes of texts Information Extraction System an automated system to extract pertinent information from large volumes of text Information Extraction Technologies techniques used to automatically extract specified information from text An Example in Air Vehicle Launch Original Document Named-Entity-Tagged Document Equivalence Classes Co-Reference Tagged Document IE Evaluation in MUC-7 (1998) Named Entity Task [NE]: Insert SGML tags into the text to mark each string that represents a person, organization, or location name, or a date or time stamp, or a currency or percentage figure Multi-lingual Entity Task [MET]: NE task for Chinese and Japanese Co-reference Task [CO]: Capture information on co-referring expressions: all mentions of a given entity, including those tagged in NE, TE tasks IE Evaluation in MUC-7 (cont.) Template Element Task [TE]: Extract basic information related to organization, person, and artifact entities, drawing evidence from anywhere in the text Template Relation Task [TR]: Extract relational information on employee_of, manufacture_of, and location_of relations Scenario Template Task [ST]: Extract pre-specified event information and relate the event information to particular organization, person, or artifact entities involved in the event. Chinese IE Technologies Segmentation Named Entity Extraction Part of Speech/Sense Tagging Full/Partial Parsing Co-Reference Resolution Segmentation Segmentation Problem A Chinese sentence is composed of characters without word boundary 這名記者會說國語。 這 名 記者
您可能关注的文档
最近下载
- 千级无尘室工程施工方案(3篇).docx VIP
- 深度解析《GBT 44037-2024焦炭溶损率及溶损后强度试验方法》.pptx
- 2025 中级注册安全工程师《金属非金属矿山安全》速记口诀.pdf
- 2025年中国吸顶式车载显示器数据监测研究报告.docx
- 九年级化学酸、碱、盐、氧化物知识小结 “三表一图”(二)天津版.doc VIP
- 部编版六年级上册语文第一周(草原-丁香结)达标测评卷 含答案.docx VIP
- 建筑电气安装工程管线预留预埋阶段质量管理.doc VIP
- 激光原理 全套课件.ppt
- 第1.2课《宁夏闽宁镇:昔日干沙滩,今日金沙滩》(课件)-【中职专用】高二语文同步精品课件(高教版2023·职业模块).pptx VIP
- 部编版语文六年级上册 周测卷(一)1草原+2丁香结(含答案).pdf VIP
文档评论(0)