Perspectives on Crowdsourcing Annotations for
Natural Language Processing1
Aobo Wang Cong Duy Vu Hoang
Min-Yen Kan
Computing 1, 13 Computing Drive
National University of Singapore
Singapore 117417
wangaobo,hcdvu,.sg
July 24, 2010
1 The authors gratefully acknowledge the China-Singapore Institute of Digital
Media’s support of this work through the “Co-training NLP systems and Language
Learners” grant R 252-002-372-490.
Abstract
Crowdsourcing has emerged as a new method for obtaining annotations for training
machine learning models. While many variants of this process exist, they largely
differ in how they motivate subjects to contribute and in the scale of their
applications. To date, however, no study has helped practitioners decide
what form an annotation application should take to best reach its objectives within the
constraints of a project. We first provide a faceted analysis of existing crowdsourcing
annotation applications. We then use this analysis to recommend how practitioners
can take advantage of crowdsourcing, and we discuss our view on
potential opportunities in this area.
0.1 Introduction
It is an accepted tradition in natural language processing (NLP) to use annotated
corpora to obtain machine-learned models for performing many tasks, such as machine
translation, parsing, and summarization. Given that machine learners can only perform
tasks as well as their input annotation allows, much work in annotation has centered on
defining high-quality standards that are reliable and reproducible, and on finding
appropriately trained personnel to carry out such tasks. The Penn Treebank and WordNet
are probably the most visible exampl